Pseudo-Random Number Generation

At the heart of every Monte Carlo simulation lies a fundamental paradox: we use deterministic algorithms to generate sequences that appear random. Pseudo-random number generators (PRNGs) resolve this paradox by producing deterministic sequences that pass statistical tests for randomness, providing the foundation upon which all computational statistics and simulation methods rest.

The challenge of generating high-quality pseudo-random numbers extends far beyond simple uniformity. Modern applications demand generators that are fast, have enormous periods, pass batteries of statistical tests, and maintain their properties even when used in parallel computing environments. A poor PRNG can silently corrupt simulation results, making the understanding of these algorithms essential for any serious practitioner of computational methods.

Road Map 🧭

Understand the mathematical structure of PRNGs and their deterministic nature
Implement classical algorithms like Linear Congruential Generators and Mersenne Twister
Evaluate PRNG quality using statistical test suites
Apply best practices for seed management and parallel random number generation

Foundations of Pseudo-Randomness

True randomness is a luxury computers cannot afford. Instead, we create the illusion of randomness through carefully designed deterministic algorithms.

Mathematical Definition

Definition: A uniform pseudo-random number generator is an algorithm which, starting from an initial value u₀ (the seed) and a transformation D, produces a sequence:

\[u_i = D^i(u_0), \quad i = 1, 2, 3, ...\]

For all practical n, the values u₁, u₂, …, uₙ should reproduce the behavior of an i.i.d. sample of uniform random variables on [0, 1].

More formally, we test the hypothesis:

\[H_0: \text{Joint law of } (U_1, ..., U_n) \sim \text{Uniform}[0,1]^n\]

against:

\[H_a: \text{The joint law differs from i.i.d. uniform}\]

The Deterministic Paradox

PRNGs are entirely deterministic—given the same seed, they produce identical sequences. This property is both a blessing and a curse:

Advantages: - Reproducibility for debugging and verification - Ability to replay simulations exactly - Control over random streams in parallel computing

Challenges: - Finite period (sequences eventually repeat) - Correlations between successive values - Vulnerability to poor seed choices

Python Implementation Framework

import numpy as np
from abc import ABC, abstractmethod
import matplotlib.pyplot as plt
from typing import Union, List, Tuple

class PRNG(ABC):
    """
    Abstract base class for pseudo-random number generators.
    """

    def __init__(self, seed: int = None):
        """
        Initialize PRNG with seed.

        Parameters
        ----------
        seed : int
            Initial seed value. If None, use system time.
        """
        if seed is None:
            import time
            seed = int(time.time() * 1000000) % (2**32)

        self.initial_seed = seed
        self.state = seed
        self.counter = 0

    @abstractmethod
    def next_int(self) -> int:
        """Generate next integer in sequence."""
        pass

    def next_uniform(self) -> float:
        """Generate next uniform random number in [0, 1)."""
        return self.next_int() / self.max_int

    def generate(self, n: int) -> np.ndarray:
        """Generate n uniform random numbers."""
        return np.array([self.next_uniform() for _ in range(n)])

    def reset(self):
        """Reset generator to initial seed."""
        self.state = self.initial_seed
        self.counter = 0

    @property
    @abstractmethod
    def max_int(self) -> int:
        """Maximum integer value generator can produce."""
        pass

    @property
    @abstractmethod
    def period(self) -> int:
        """Period of the generator."""
        pass

Linear Congruential Generators

Linear Congruential Generators (LCGs) are the workhorses of pseudo-random generation, simple yet surprisingly effective when properly parameterized.

Mathematical Structure

An LCG is defined by the recurrence:

\[X_{n+1} = (a \cdot X_n + c) \bmod m\]

where: - m > 0 is the modulus - 0 < a < m is the multiplier - 0 ≤ c < m is the increment - 0 ≤ X₀ < m is the seed

The period is at most m, achieved when parameters satisfy the Hull-Dobell conditions: 1. c and m are relatively prime 2. a - 1 is divisible by all prime factors of m 3. If m is divisible by 4, then a - 1 is divisible by 4

Implementation and Analysis

class LCG(PRNG):
    """
    Linear Congruential Generator.

    Classic parameters from various implementations:
    - MINSTD: a=48271, c=0, m=2^31-1 (Park & Miller)
    - RANDU: a=65539, c=0, m=2^31 (historically bad!)
    - Numerical Recipes: a=1664525, c=1013904223, m=2^32
    """

    def __init__(self, seed=None, a=1664525, c=1013904223, m=2**32):
        super().__init__(seed)
        self.a = a
        self.c = c
        self.m = m

        # Validate parameters
        assert 0 < a < m, "Multiplier must be in (0, m)"
        assert 0 <= c < m, "Increment must be in [0, m)"
        assert 0 <= self.state < m, "Seed must be in [0, m)"

    def next_int(self) -> int:
        """Generate next integer using LCG formula."""
        self.state = (self.a * self.state + self.c) % self.m
        self.counter += 1
        return self.state

    @property
    def max_int(self) -> int:
        return self.m - 1

    @property
    def period(self) -> int:
        """Theoretical maximum period."""
        if self.c == 0:
            return self.m // 4  # Typical for multiplicative LCG
        return self.m  # Full period if Hull-Dobell conditions met

    def detect_period(self, max_iterations=10**7) -> int:
        """Empirically detect the actual period."""
        self.reset()
        initial_state = self.state

        for i in range(1, max_iterations):
            self.next_int()
            if self.state == initial_state:
                return i

        return -1  # Period not found within max_iterations

Example 💡: The Infamous RANDU Generator

RANDU was widely used in the 1960s-70s but has catastrophic flaws. Let’s visualize why it fails:

def visualize_lcg_correlations():
    """Demonstrate correlations in LCGs using 3D plots."""

    fig = plt.figure(figsize=(15, 5))

    # RANDU - historically bad
    randu = LCG(seed=1, a=65539, c=0, m=2**31)

    # Good LCG - Numerical Recipes
    good_lcg = LCG(seed=1, a=1664525, c=1013904223, m=2**32)

    # Generate triplets
    n_points = 10000

    for idx, (lcg, name) in enumerate([(randu, 'RANDU (Bad)'),
                                       (good_lcg, 'Good LCG')]):
        lcg.reset()
        points = lcg.generate(n_points * 3).reshape(-1, 3)

        ax = fig.add_subplot(1, 2, idx+1, projection='3d')
        ax.scatter(points[:1000, 0], points[:1000, 1],
                  points[:1000, 2], alpha=0.3, s=1)
        ax.set_title(f'{name}: 3D Structure')
        ax.set_xlabel('$U_i$')
        ax.set_ylabel('$U_{i+1}$')
        ax.set_zlabel('$U_{i+2}$')

    plt.tight_layout()
    plt.show()

    # Spectral test results
    print("RANDU: Points lie on only 15 parallel planes in 3D!")
    print("This catastrophic correlation makes it unsuitable for Monte Carlo.")

Result: RANDU’s points fall on just 15 parallel planes in 3D space, causing severe correlations.

Mersenne Twister

The Mersenne Twister MT19937 is the gold standard for non-cryptographic applications, offering an enormous period and excellent statistical properties.

Algorithm Structure

The Mersenne Twister is based on a matrix linear recurrence over a finite binary field:

\[x_{k+n} = x_{k+m} \oplus (x_k^u | x_{k+1}^l) A\]

where: - n = 624 is the degree of recurrence - m = 397 is the middle word - A is a special matrix for efficient computation - Period: 2^{19937} - 1 (a Mersenne prime)

class MersenneTwister(PRNG):
    """
    Mersenne Twister MT19937 implementation.
    """

    def __init__(self, seed=None):
        super().__init__(seed)

        # MT19937 parameters
        self.w, self.n, self.m, self.r = 32, 624, 397, 31
        self.a = 0x9908B0DF
        self.u, self.d = 11, 0xFFFFFFFF
        self.s, self.b = 7, 0x9D2C5680
        self.t, self.c = 15, 0xEFC60000
        self.l = 18
        self.f = 1812433253

        # Initialize state array
        self.MT = [0] * self.n
        self.index = self.n + 1
        self.lower_mask = (1 << self.r) - 1
        self.upper_mask = (~self.lower_mask) & 0xFFFFFFFF

        self.seed_mt(seed)

    def seed_mt(self, seed):
        """Initialize generator state from seed."""
        self.MT[0] = seed
        for i in range(1, self.n):
            self.MT[i] = (self.f * (self.MT[i-1] ^ (self.MT[i-1] >> (self.w-2))) + i) & 0xFFFFFFFF
        self.index = self.n

    def extract_number(self) -> int:
        """Extract tempered pseudorandom number."""
        if self.index >= self.n:
            if self.index > self.n:
                raise Exception("Generator not seeded")
            self.twist()

        y = self.MT[self.index]
        y = y ^ ((y >> self.u) & self.d)
        y = y ^ ((y << self.s) & self.b)
        y = y ^ ((y << self.t) & self.c)
        y = y ^ (y >> self.l)

        self.index += 1
        return y & 0xFFFFFFFF

    def twist(self):
        """Generate next n values."""
        for i in range(self.n):
            x = (self.MT[i] & self.upper_mask) + (self.MT[(i+1) % self.n] & self.lower_mask)
            xA = x >> 1
            if x & 0x1:
                xA = xA ^ self.a
            self.MT[i] = self.MT[(i + self.m) % self.n] ^ xA
        self.index = 0

    def next_int(self) -> int:
        return self.extract_number()

    @property
    def max_int(self) -> int:
        return 2**32 - 1

    @property
    def period(self) -> int:
        return 2**19937 - 1

Statistical Testing of PRNGs

A PRNG must pass rigorous statistical tests to be considered suitable for Monte Carlo simulations.

Test Suite Framework

class PRNGTester:
    """
    Statistical test suite for PRNGs.
    """

    def __init__(self, generator: PRNG):
        self.generator = generator
        self.test_results = {}

    def chi_square_uniformity(self, n_samples=10000, n_bins=100):
        """
        Chi-square test for uniformity.

        H0: Samples are uniformly distributed on [0,1]
        """
        from scipy import stats

        samples = self.generator.generate(n_samples)
        observed, _ = np.histogram(samples, bins=n_bins, range=(0, 1))
        expected = n_samples / n_bins

        chi2_stat = np.sum((observed - expected)**2 / expected)
        p_value = 1 - stats.chi2.cdf(chi2_stat, df=n_bins-1)

        self.test_results['uniformity'] = {
            'statistic': chi2_stat,
            'p_value': p_value,
            'pass': p_value > 0.01
        }

        return p_value > 0.01

    def serial_correlation_test(self, n_samples=10000, lag=1):
        """
        Test for serial correlation at specified lag.

        H0: No correlation between U_i and U_{i+lag}
        """
        samples = self.generator.generate(n_samples + lag)

        # Compute correlation
        x = samples[:-lag]
        y = samples[lag:]
        correlation = np.corrcoef(x, y)[0, 1]

        # Standard error under null hypothesis
        se = 1 / np.sqrt(n_samples)
        z_stat = correlation / se
        p_value = 2 * (1 - stats.norm.cdf(abs(z_stat)))

        self.test_results[f'correlation_lag{lag}'] = {
            'correlation': correlation,
            'p_value': p_value,
            'pass': p_value > 0.01
        }

        return p_value > 0.01

    def runs_test(self, n_samples=10000):
        """
        Wald-Wolfowitz runs test for independence.

        H0: Sequence is random (no patterns)
        """
        samples = self.generator.generate(n_samples)
        median = np.median(samples)

        # Convert to binary sequence
        binary = (samples > median).astype(int)

        # Count runs
        runs = 1
        for i in range(1, len(binary)):
            if binary[i] != binary[i-1]:
                runs += 1

        # Expected runs and variance under H0
        n1 = np.sum(binary)
        n2 = n_samples - n1

        expected_runs = (2 * n1 * n2) / n_samples + 1
        variance = (2 * n1 * n2 * (2 * n1 * n2 - n_samples)) / \
                   (n_samples**2 * (n_samples - 1))

        # Standardized test statistic
        z_stat = (runs - expected_runs) / np.sqrt(variance)
        p_value = 2 * (1 - stats.norm.cdf(abs(z_stat)))

        self.test_results['runs'] = {
            'runs': runs,
            'expected': expected_runs,
            'p_value': p_value,
            'pass': p_value > 0.01
        }

        return p_value > 0.01

    def spectral_test(self, dimension=2, n_samples=1000):
        """
        Simplified spectral test - looks for hyperplane structure.

        Tests if points are too regularly distributed.
        """
        samples = self.generator.generate(n_samples * dimension)
        points = samples.reshape(-1, dimension)

        # Compute distances to nearest neighbors
        from scipy.spatial import distance_matrix

        dist_matrix = distance_matrix(points[:100], points[:100])
        np.fill_diagonal(dist_matrix, np.inf)
        nearest_distances = np.min(dist_matrix, axis=1)

        # Test if distances are too regular (bad)
        cv = np.std(nearest_distances) / np.mean(nearest_distances)

        # For true random points, CV should be around 0.5
        self.test_results[f'spectral_{dimension}d'] = {
            'cv': cv,
            'pass': 0.3 < cv < 0.7  # Rough bounds
        }

        return 0.3 < cv < 0.7

    def birthday_spacing_test(self, n_samples=1000, year_length=365):
        """
        Birthday spacing test for detecting patterns.
        """
        # Generate "birthdays"
        samples = self.generator.generate(n_samples)
        birthdays = np.floor(samples * year_length).astype(int)
        birthdays = np.sort(birthdays)

        # Compute spacings
        spacings = np.diff(birthdays)
        spacings = spacings[spacings > 0]  # Remove zeros

        # Test if spacings follow expected distribution
        from scipy import stats
        _, p_value = stats.kstest(spacings,
                                 lambda x: 1 - np.exp(-x/year_length))

        self.test_results['birthday'] = {
            'p_value': p_value,
            'pass': p_value > 0.01
        }

        return p_value > 0.01

    def run_all_tests(self, verbose=True):
        """Run complete test suite."""
        tests = [
            ('Uniformity', self.chi_square_uniformity),
            ('Serial Correlation', self.serial_correlation_test),
            ('Runs Test', self.runs_test),
            ('Spectral Test 2D', lambda: self.spectral_test(2)),
            ('Spectral Test 3D', lambda: self.spectral_test(3)),
            ('Birthday Spacing', self.birthday_spacing_test)
        ]

        if verbose:
            print("PRNG Statistical Test Suite")
            print("=" * 50)

        all_pass = True
        for name, test_func in tests:
            passed = test_func()
            all_pass = all_pass and passed

            if verbose:
                status = "✓ PASS" if passed else "✗ FAIL"
                print(f"{name:.<30} {status}")

        if verbose:
            print("=" * 50)
            print(f"Overall: {'PASS' if all_pass else 'FAIL'}")

        return all_pass

NIST Test Suite

The NIST Statistical Test Suite (SP 800-22) provides comprehensive randomness testing:

Common Pitfall ⚠️

Passing statistical tests is necessary but not sufficient for cryptographic security. PRNGs suitable for Monte Carlo may be completely broken for cryptographic use.

Monte Carlo Requirements: - Statistical uniformity and independence - Long period - Computational efficiency

Additional Cryptographic Requirements: - Unpredictability (cannot determine next value from previous values) - Backtracking resistance - Side-channel resistance

Modern PRNGs and Best Practices

Contemporary applications demand PRNGs that work well in parallel environments and provide independent streams.

PCG Family

The Permuted Congruential Generator (PCG) family offers excellent statistical properties with small state:

class PCG32(PRNG):
    """
    PCG32 - Fast, space-efficient, statistically excellent.
    """

    def __init__(self, seed=None, stream=None):
        super().__init__(seed)
        self.state = seed
        self.inc = (stream or 1) * 2 + 1  # Must be odd

    def next_int(self) -> int:
        """PCG32 generation algorithm."""
        # LCG step
        oldstate = self.state
        self.state = oldstate * 6364136223846793005 + self.inc

        # Permutation step (XSH RR)
        xorshifted = ((oldstate >> 18) ^ oldstate) >> 27
        rot = oldstate >> 59

        result = (xorshifted >> rot) | (xorshifted << ((-rot) & 31))
        return result & 0xFFFFFFFF

    @property
    def max_int(self) -> int:
        return 2**32 - 1

    @property
    def period(self) -> int:
        return 2**64  # Per stream

Parallel Random Number Generation

class ParallelPRNG:
    """
    Manager for parallel random number generation.
    """

    def __init__(self, base_seed=12345, n_streams=4):
        """
        Initialize parallel PRNG streams.

        Uses different techniques:
        1. Random spacing
        2. Leapfrog
        3. Independent streams (PCG)
        """
        self.base_seed = base_seed
        self.n_streams = n_streams
        self.generators = []

        # Method 1: Random spacing (skip-ahead)
        # Good for LCGs with known skip-ahead formulas

        # Method 2: Different seeds (simple but risky)
        # Can lead to overlapping sequences

        # Method 3: Independent streams (recommended)
        for i in range(n_streams):
            gen = PCG32(seed=base_seed, stream=i)
            self.generators.append(gen)

    def parallel_monte_carlo(self, func, n_samples_per_stream):
        """
        Run Monte Carlo in parallel with independent streams.
        """
        from multiprocessing import Pool

        def worker(stream_id):
            """Worker function for one stream."""
            gen = self.generators[stream_id]
            samples = gen.generate(n_samples_per_stream)
            return np.mean([func(x) for x in samples])

        with Pool(self.n_streams) as pool:
            results = pool.map(worker, range(self.n_streams))

        # Combine results
        return np.mean(results), np.std(results) / np.sqrt(self.n_streams)

Seed Management

class SeedManager:
    """
    Centralized seed management for reproducible simulations.
    """

    def __init__(self, master_seed=None):
        """Initialize with master seed."""
        if master_seed is None:
            master_seed = np.random.randint(2**32)

        self.master_seed = master_seed
        self.seed_counter = 0
        self.seed_history = {'master': master_seed}

    def get_seed(self, identifier=None):
        """
        Get a unique seed for a simulation component.

        Parameters
        ----------
        identifier : str
            Optional identifier for this seed
        """
        # Generate seed from master + counter
        self.seed_counter += 1
        seed = (self.master_seed + self.seed_counter * 1000000007) % (2**32)

        if identifier:
            self.seed_history[identifier] = seed

        return seed

    def get_numpy_rng(self, identifier=None):
        """Get a numpy RandomState with managed seed."""
        return np.random.RandomState(self.get_seed(identifier))

    def save_state(self, filename):
        """Save seed state for reproducibility."""
        import json
        with open(filename, 'w') as f:
            json.dump({
                'master_seed': self.master_seed,
                'counter': self.seed_counter,
                'history': self.seed_history
            }, f)

    @classmethod
    def load_state(cls, filename):
        """Load seed state from file."""
        import json
        with open(filename, 'r') as f:
            data = json.load(f)

        manager = cls(data['master_seed'])
        manager.seed_counter = data['counter']
        manager.seed_history = data['history']
        return manager

Practical Considerations

Quality vs Speed Trade-offs

def benchmark_prngs(n_samples=10**7):
    """Benchmark different PRNG implementations."""
    import time

    generators = [
        ('Python random', lambda: np.random.random(n_samples)),
        ('NumPy MT19937', lambda: np.random.RandomState(42).random(n_samples)),
        ('LCG', lambda: LCG(42).generate(n_samples)),
        ('PCG32', lambda: PCG32(42).generate(n_samples)),
    ]

    print(f"Generating {n_samples:,} random numbers...")
    print("-" * 50)

    for name, gen_func in generators:
        start = time.time()
        samples = gen_func()
        elapsed = time.time() - start

        # Basic quality check - mean and std
        mean = np.mean(samples)
        std = np.std(samples)

        print(f"{name:.<25} {elapsed:>6.3f}s")
        print(f"  Mean: {mean:.6f} (expect 0.5)")
        print(f"  Std:  {std:.6f} (expect {1/np.sqrt(12):.6f})")
        print(f"  Rate: {n_samples/elapsed/1e6:.1f} M samples/sec")
        print()

Common Mistakes to Avoid

Common Pitfall ⚠️

Never use system time as seed in production! This makes results non-reproducible and can lead to correlation in distributed systems started simultaneously.

Instead: Use a seed management system:

# Bad
np.random.seed(int(time.time()))

# Good
seed_manager = SeedManager(master_seed=42)
rng = seed_manager.get_numpy_rng('experiment_1')

Testing Your Implementation

def visual_prng_test(generator, n_points=10000):
    """Visual tests for PRNG quality."""

    fig, axes = plt.subplots(2, 3, figsize=(15, 10))

    # Test 1: 2D scatter
    generator.reset()
    points = generator.generate(n_points * 2).reshape(-1, 2)
    axes[0, 0].scatter(points[:, 0], points[:, 1], alpha=0.3, s=1)
    axes[0, 0].set_title('2D Uniform Scatter')
    axes[0, 0].set_xlabel('$U_i$')
    axes[0, 0].set_ylabel('$U_{i+1}$')

    # Test 2: Histogram
    generator.reset()
    samples = generator.generate(n_points)
    axes[0, 1].hist(samples, bins=50, density=True, alpha=0.7)
    axes[0, 1].axhline(y=1, color='r', linestyle='--', label='Expected')
    axes[0, 1].set_title('Uniformity Test')
    axes[0, 1].set_xlabel('Value')
    axes[0, 1].set_ylabel('Density')
    axes[0, 1].legend()

    # Test 3: Autocorrelation
    from statsmodels.graphics.tsaplots import plot_acf
    generator.reset()
    samples = generator.generate(1000)
    plot_acf(samples, ax=axes[0, 2], lags=50)
    axes[0, 2].set_title('Autocorrelation Function')

    # Test 4: Gap test
    generator.reset()
    samples = generator.generate(n_points)
    gaps = np.diff(np.sort(samples))
    axes[1, 0].hist(gaps * n_points, bins=50, density=True, alpha=0.7)

    # Theoretical exponential
    x = np.linspace(0, 5, 100)
    axes[1, 0].plot(x, np.exp(-x), 'r--', label='Exponential(1)')
    axes[1, 0].set_title('Gap Test')
    axes[1, 0].set_xlabel('Scaled Gap')
    axes[1, 0].set_ylabel('Density')
    axes[1, 0].legend()

    # Test 5: Running mean
    generator.reset()
    samples = generator.generate(n_points)
    running_mean = np.cumsum(samples) / np.arange(1, n_points + 1)
    axes[1, 1].plot(running_mean, alpha=0.7)
    axes[1, 1].axhline(y=0.5, color='r', linestyle='--')
    axes[1, 1].fill_between(range(n_points),
                            0.5 - 1.96/np.sqrt(np.arange(1, n_points+1)),
                            0.5 + 1.96/np.sqrt(np.arange(1, n_points+1)),
                            alpha=0.2, color='red', label='95% CI')
    axes[1, 1].set_title('Convergence of Mean')
    axes[1, 1].set_xlabel('Sample Size')
    axes[1, 1].set_ylabel('Running Mean')
    axes[1, 1].legend()

    # Test 6: Spectral test (3D)
    generator.reset()
    points_3d = generator.generate(3000).reshape(-1, 3)
    ax = fig.add_subplot(2, 3, 6, projection='3d')
    ax.scatter(points_3d[:, 0], points_3d[:, 1], points_3d[:, 2],
              alpha=0.3, s=1)
    ax.set_title('3D Structure')

    plt.tight_layout()
    plt.show()

Bringing It All Together

Pseudo-random number generation forms the bedrock of Monte Carlo methods. While perfect randomness remains computationally unattainable, modern PRNGs provide sequences that are indistinguishable from true randomness for practical purposes. The key lies in choosing appropriate generators for specific applications and managing them correctly.

Key Takeaways 📝

Core concept: PRNGs are deterministic algorithms producing sequences that mimic randomness
Computational insight: Quality involves period length, uniformity, independence, and dimensional distribution
Practical application: Use established generators (MT19937, PCG) rather than rolling your own
Connection: Seed management and parallel streams are crucial for reproducible, scalable simulations

Exercises

Conceptual Understanding: Prove that an LCG with parameters a = 1, c = 1, m = 2^n has period m. Show that despite having full period, this is a terrible PRNG. What patterns would you observe?

Implementation: Implement the Middle Square Method (von Neumann, 1946):

def middle_square(seed, n_digits=4):
    """
    Von Neumann's middle square method.

    1. Square the seed
    2. Extract middle n_digits as next number
    3. Use as new seed
    """
    # Your implementation here
    pass

Demonstrate why this method fails (hint: find seeds that lead to 0 or short cycles).

Analysis: Compare the spectral properties of different LCGs:
1. Plot successive pairs (u_i, u_{i+1}) for RANDU and a good LCG
2. Perform 3D spectral analysis by finding the minimum number of parallel planes
3. Explain why multipliers of the form a = 2^k ± 1 often perform poorly

Extension: Implement a cryptographically secure PRNG (CSPRNG) using AES in counter mode:

from cryptography.hazmat.primitives.ciphers import Cipher, algorithms, modes

class AES_CSPRNG:
    """AES-based CSPRNG suitable for cryptographic applications."""
    # Your implementation here
    pass

Compare its speed and statistical properties with MT19937. When would you use each?