023 Intermediate

023 — Extract a Given Number of Randomly Selected Elements

Functional Programming

Tutorial Video

Text description (accessibility)

This video demonstrates the "023 — Extract a Given Number of Randomly Selected Elements" functional Rust example. Difficulty level: Intermediate. Key concepts covered: Functional Programming. Selecting k elements uniformly at random without replacement from a list (OCaml 99 Problems #23) is one of the fundamental sampling problems in computer science. Key difference from OCaml: 1. **Random number generation**: Rust requires the `rand` crate (not in stdlib). OCaml's `Random` module is in the standard library. Both use `gen_range`

Tutorial

The Problem

Selecting k elements uniformly at random without replacement from a list (OCaml 99 Problems #23) is one of the fundamental sampling problems in computer science. It underlies A/B testing, bootstrapping in statistics, Monte Carlo simulations, shuffling in card games, and random forest sampling in machine learning.

The naive approach — pick a random index, remove it, repeat — is O(k·n) due to repeated removal from the middle. The efficient approach, Fisher-Yates partial shuffle, is O(k): shuffle only the first k positions of the array, then take them. This is the algorithm used in numpy.random.choice and most production sampling implementations.

🎯 Learning Outcomes

• Use rand::thread_rng() and Rng::gen_range for uniform random index generation

• Implement random selection via repeated random removal (naive, O(k·n))

• Understand the Fisher-Yates partial shuffle as the O(k) alternative

• Handle the constraint k <= n (cannot select more items than exist)

• Return a new Vec<T> of the selected elements

• Use Fisher-Yates partial shuffle: swap the first k positions with random positions to select k items in O(k)

• Use a seeded RNG for reproducible tests: StdRng::seed_from_u64(42) from the rand crate

Code Example

#![allow(clippy::all)]
// Placeholder — pending conversion

(* Random Select *)
(* OCaml 99 Problems #23 *)

(* Implementation for example 23 *)

(* Tests *)
let () =
  (* Add tests *)
  print_endline "✓ OCaml tests passed"

Key Differences

Random number generation: Rust requires the rand crate (not in stdlib). OCaml's Random module is in the standard library. Both use gen_range-style APIs.

Array vs list: Fisher-Yates requires O(1) random access. Both implementations convert to array/Vec first. OCaml uses Array.of_list; Rust starts with Vec.

**swap_remove**: Rust's Vec::swap_remove(i) replaces element i with the last element — O(1) but changes order. This is the key to efficient random deletion.

Seeding: Rust's rand::rngs::StdRng::seed_from_u64(seed) produces deterministic output. OCaml uses Random.init seed. Both are important for reproducible tests.

**rand crate:** Rust requires an external crate (rand) for randomness; OCaml's Random module is in the standard library.

**remove vs filter:** Rust's Vec::remove(k) is O(n). The Fisher-Yates shuffle avoids this by swapping instead of removing.

Reproducibility: Both languages support seeded RNGs for reproducible results — important for testing. StdRng::seed_from_u64(42) in Rust; Random.init 42 in OCaml.

OCaml Approach

OCaml's version using Random: let random_select lst n = let arr = Array.of_list lst in let len = Array.length arr in for i = 0 to n - 1 do let j = i + Random.int (len - i) in let tmp = arr.(i) in arr.(i) <- arr.(j); arr.(j) <- tmp done; Array.to_list (Array.sub arr 0 n). This is Fisher-Yates partial shuffle: swap the i-th element with a random element in [i, len), building the selected prefix.

OCaml's version: let rec rand_select list n = if n = 0 then [] else let k = Random.int (List.length list) in let elem = List.nth list k in let rest = List.filteri (fun i _ -> i <> k) list in elem :: rand_select rest (n-1). This is O(k·n) because List.nth and List.filteri each traverse the list. For small k on short lists this is acceptable.

Full Source

#![allow(clippy::all)]
// Placeholder — pending conversion

(* Random Select *)
(* OCaml 99 Problems #23 *)

(* Implementation for example 23 *)

(* Tests *)
let () =
  (* Add tests *)
  print_endline "✓ OCaml tests passed"

Deep Comparison

OCaml vs Rust: Random Select

Overview

See the example.rs and example.ml files for detailed implementations.

Key Differences

Aspect	OCaml	Rust
Type system	Hindley-Milner	Ownership + traits
Memory	GC	Zero-cost abstractions
Mutability	Explicit ref	mut keyword
Error handling	Option/Result	Result<T, E>

See README.md for detailed comparison.

Exercises

With replacement: Write random_select_with_replacement(v: &[i32], k: usize) -> Vec<i32> that allows the same element to be selected multiple times. This is simpler — just pick k random indices independently.

Weighted sampling: Write weighted_select(v: &[i32], weights: &[f64], k: usize) -> Vec<i32> that samples proportionally to the given weights. Research the alias method or rejection sampling.

Reservoir sampling: Implement reservoir sampling, which selects k elements from a stream of unknown length in O(n) time and O(k) space. This is used in distributed log sampling.

Reservoir sampling: Implement reservoir sampling to select k elements uniformly at random from a stream of unknown length — processing each element once without knowing n in advance.

Weighted selection: Implement weighted random selection where each element has an associated probability weight, using the alias method or a simple cumulative probability array.

Open Source Repos

functional-rust

View the source for this example on GitHub — OCaml and Rust side by side in the repo.

Rust