.. _exam1_spring2026:

STAT 350 — Exam 1 — Spring 2026 (V1)
======================================

.. admonition:: Exam Information
   :class: info

   | **Course:** STAT 350 — Introduction to Statistics
   | **Semester:** Spring 2026
   | **Version:** V1
   | **Total Points:** 105
   | **Time Allowed:** 60 minutes

   .. list-table::
      :header-rows: 1
      :widths: 50 25 25

      * - Problem
        - Total Possible
        - Topic
      * - Problem 1 (True/False, 2 pts each)
        - 12
        - Continuous RVs, Binomial, Normal
      * - Problem 2 (Multiple Choice, 3 pts each)
        - 15
        - Discrete/Continuous RVs, Normal
      * - Problem 3
        - 15
        - Descriptive Statistics / Boxplots
      * - Problem 4
        - 26
        - Binomial Distribution, LOTUS
      * - Problem 5
        - 17
        - Conditional Probability / Independence
      * - Problem 6
        - 20
        - PDFs and CDFs
      * - **Total**
        - **105**
        -

----

Problem 1 — True/False  (12 points, 2 points each)
----------------------------------------------------

.. admonition:: Question 1.1  (2 pts)
   :class: note

   Let :math:`X` denote a continuous random variable with a PDF :math:`f_X(x)`. For any
   interval such that :math:`[a, b] \subset \text{Support}(X)`, such that :math:`a < b`,

   **True or False:** :math:`P(a < X < b)` must be less than or equal to :math:`P(X < b)`.

   .. dropdown:: Solution
      :class-container: sd-border-success

      **Answer: TRUE**

      Since :math:`a < b`, the event :math:`\{a < X < b\}` is a subset of the event
      :math:`\{X < b\}`. Specifically,

      .. math::

         P(X < b) = P(X < a) + P(a \leq X < b) \geq P(a < X < b)

      because :math:`P(X < a) \geq 0`. For any :math:`A \subseteq B`, we always have
      :math:`P(A) \leq P(B)`, so :math:`P(a < X < b) \leq P(X < b)`.

.. admonition:: Question 1.2  (2 pts)
   :class: note

   Suppose :math:`X` is a Binomial random variable with parameters :math:`n` and :math:`p`.

   **True or False:** Holding the number of trials :math:`n` constant, the shape of the
   distribution shifts from positively skewed to negatively skewed as :math:`p` changes
   from 0.9 to 0.1.

   .. dropdown:: Solution
      :class-container: sd-border-success

      **Answer: FALSE**

      The direction of skew is reversed. When :math:`p` is close to 1 (e.g., :math:`p = 0.9`),
      most outcomes are large (near :math:`n`), so the distribution is **negatively skewed**
      (long left tail). When :math:`p` is close to 0 (e.g., :math:`p = 0.1`), most outcomes
      cluster near 0, so the distribution is **positively skewed** (long right tail). Therefore,
      as :math:`p` changes from 0.9 to 0.1, the distribution shifts from **negatively** to
      **positively** skewed — the opposite of what the statement claims.

.. admonition:: Question 1.3  (2 pts)
   :class: note

   Regarding the properties of a Binomial random variable :math:`X \sim \text{Binomial}(n, p)`.

   **True or False:** The variance of :math:`X` cannot exceed the number of independent
   trials :math:`n`.

   .. dropdown:: Solution
      :class-container: sd-border-success

      **Answer: TRUE**

      The variance of a Binomial random variable is :math:`\text{Var}(X) = np(1-p)`. Since
      :math:`0 < p < 1`, we have :math:`p(1-p) \leq \tfrac{1}{4}`, which gives

      .. math::

         \text{Var}(X) = np(1-p) \leq n \cdot \tfrac{1}{4} \leq n.

      Therefore, the variance cannot exceed :math:`n`.

.. admonition:: Question 1.4  (2 pts)
   :class: note

   Suppose :math:`V \sim \text{Binomial}(n, p)` and :math:`W \sim \text{Poisson}(\lambda)`.

   **True or False:** Then for any positive integer :math:`n`, the support of :math:`V` is a
   subset of the support of :math:`W`.

   .. dropdown:: Solution
      :class-container: sd-border-success

      **Answer: TRUE**

      The support of :math:`V \sim \text{Binomial}(n, p)` is :math:`\{0, 1, 2, \ldots, n\}`,
      a finite set. The support of :math:`W \sim \text{Poisson}(\lambda)` is
      :math:`\{0, 1, 2, 3, \ldots\}`, the set of all non-negative integers. Since
      :math:`\{0, 1, \ldots, n\} \subseteq \{0, 1, 2, \ldots\}` for any positive integer
      :math:`n`, the support of :math:`V` is indeed a subset of the support of :math:`W`.

.. admonition:: Question 1.5  (2 pts)
   :class: note

   When converting a value :math:`x` from a normal distribution into a :math:`z`-score.

   **True or False:** A negative :math:`z`-score indicates that the original :math:`x` is
   smaller than the population mean :math:`\mu`.

   .. dropdown:: Solution
      :class-container: sd-border-success

      **Answer: TRUE**

      The :math:`z`-score is defined as :math:`z = \dfrac{x - \mu}{\sigma}`. Since
      :math:`\sigma > 0`, we have :math:`z < 0` if and only if :math:`x - \mu < 0`, i.e.,
      :math:`x < \mu`. A negative :math:`z`-score therefore indicates the original value
      :math:`x` lies below the population mean.

.. admonition:: Question 1.6  (2 pts)
   :class: note

   Suppose :math:`X` and :math:`Y` are Normally distributed random variables sharing the
   same mean :math:`\mu = 10`. It is also known that :math:`\text{Var}(X) < \text{Var}(Y)`.

   **True or False:** Then it follows that :math:`P(X \leq 12)` is larger than
   :math:`P(Y \leq 12)`.

   .. dropdown:: Solution
      :class-container: sd-border-success

      **Answer: TRUE**

      Since :math:`\text{Var}(X) < \text{Var}(Y)`, we have :math:`\sigma_X < \sigma_Y`.
      Converting 12 to a :math:`z`-score for each:

      .. math::

         z_X = \frac{12 - 10}{\sigma_X} = \frac{2}{\sigma_X}, \qquad
         z_Y = \frac{12 - 10}{\sigma_Y} = \frac{2}{\sigma_Y}.

      Since :math:`\sigma_X < \sigma_Y`, we have :math:`z_X > z_Y > 0`. Because the standard
      normal CDF :math:`\Phi` is increasing, :math:`\Phi(z_X) > \Phi(z_Y)`, which means
      :math:`P(X \leq 12) > P(Y \leq 12)`.

----

Problem 2 — Multiple Choice  (15 points, 3 points each)
---------------------------------------------------------

.. admonition:: Question 2.1  (3 pts)
   :class: note

   Let :math:`X` and :math:`Y` be discrete random variables that are **not** independent.
   Choose the statement about :math:`X` and :math:`Y` that **always** holds.

   (A) :math:`\text{Var}(X + Y) = \text{Var}(X) + \text{Var}(Y)`

   (B) For any :math:`x \in \text{Support}(X)` and :math:`y \in \text{Support}(Y)`,
   :math:`P(X = x, Y = y) = P(X = x)P(Y = y)`.

   (C) :math:`E[XY] = E[X]E[Y]`

   (D) :math:`E[X^3 + Y^{-2}] = E[X^3] + E[Y^{-2}]`

   (E) :math:`\text{Cov}(X, Y) > 0`

   .. dropdown:: Solution
      :class-container: sd-border-success

      **Answer: (D)**

      - **(A) FALSE.** In general, :math:`\text{Var}(X+Y) = \text{Var}(X) + \text{Var}(Y) + 2\,\text{Cov}(X,Y)`. Since :math:`X` and :math:`Y` are not independent, :math:`\text{Cov}(X,Y)` need not be zero, so this equality does not always hold.
      - **(B) FALSE.** This is the definition of independence, which is explicitly stated to be false here.
      - **(C) FALSE.** :math:`E[XY] = E[X]E[Y]` holds when :math:`\text{Cov}(X,Y) = 0` (uncorrelated), but dependence does not guarantee this. In fact, :math:`E[XY] - E[X]E[Y] = \text{Cov}(X,Y)`, which may be nonzero.
      - **(D) TRUE.** By the **linearity of expectation**, :math:`E[aX + bY] = aE[X] + bE[Y]` holds for **all** random variables, regardless of dependence. Here, :math:`E[X^3 + Y^{-2}] = E[X^3] + E[Y^{-2}]` always.
      - **(E) FALSE.** Dependent random variables can have positive, negative, or zero covariance.

.. admonition:: Question 2.2  (3 pts)
   :class: note

   On days when STAT 350 homework is due, suppose Professor Reese receives extension
   requests according to a Poisson process at an average rate of 0.05 requests per 10
   minutes. Compute the probability that he receives **more than 1** extension request in
   a randomly selected 2-hour period.

   (A) 0.0012 |quad| (B) 0.0488 |quad| (C) 0.1219 |quad| (D) 0.4512 |quad| (E) 0.8781

   .. |quad| unicode:: U+00A0 U+00A0 U+00A0 U+00A0

   .. dropdown:: Solution
      :class-container: sd-border-success

      **Answer: (C)**

      A 2-hour period contains :math:`120 \div 10 = 12` intervals of 10 minutes. The rate
      is 0.05 requests per 10-minute interval, so the expected number of requests in 2 hours
      is:

      .. math::

         \lambda = 0.05 \times 12 = 0.6 \text{ requests}

      Let :math:`X \sim \text{Poisson}(\lambda = 0.6)`. Then:

      .. math::

         P(X > 1) &= 1 - P(X = 0) - P(X = 1) \\
                  &= 1 - e^{-0.6} - 0.6\,e^{-0.6} \\
                  &= 1 - e^{-0.6}(1 + 0.6) \\
                  &= 1 - (0.5488)(1.6) \\
                  &= 1 - 0.8781 \\
                  &= \boxed{0.1219}

.. admonition:: Question 2.3  (3 pts)
   :class: note

   For some constant :math:`k`, define a PDF :math:`f_X(x) = k \cdot (x-5)^2` for
   :math:`x \in [4, 6]` and zero elsewhere. Which of the following statements correctly
   describes this distribution?

   (A) The distribution is bimodal.

   (B) The distribution is positively skewed.

   (C) The normalizing constant :math:`k` can be negative.

   (D) The median is larger than the mean.

   (E) None of the above statements correctly describes the distribution.

   .. dropdown:: Solution
      :class-container: sd-border-success

      **Answer: (E)**

      First, find :math:`k`. Requiring :math:`\int_4^6 k(x-5)^2\,dx = 1`:

      .. math::

         k \int_4^6 (x-5)^2\,dx = k\left[\frac{(x-5)^3}{3}\right]_4^6
         = k\left(\frac{1}{3} + \frac{1}{3}\right) = \frac{2k}{3} = 1
         \implies k = \frac{3}{2}.

      Evaluate each option:

      - **(A) FALSE.** The function :math:`f_X(x) = \tfrac{3}{2}(x-5)^2` achieves its minimum value of 0 at :math:`x = 5` and increases toward both endpoints :math:`x = 4` and :math:`x = 6`. While the PDF is U-shaped, this does not constitute a bimodal distribution in the standard sense (bimodal requires two distinct interior local maxima).
      - **(B) FALSE.** The PDF is symmetric about :math:`x = 5` (since :math:`(x-5)^2 = (5-x)^2`), so the distribution is **symmetric** — neither positively nor negatively skewed.
      - **(C) FALSE.** Since :math:`(x-5)^2 \geq 0` and :math:`f_X(x) \geq 0` is required, :math:`k` must be **positive** (:math:`k = \tfrac{3}{2} > 0`).
      - **(D) FALSE.** By symmetry of :math:`f_X(x)` about :math:`x = 5`, the mean equals the median equals 5. The median is **not** larger than the mean.
      - **(E) TRUE.** None of options (A)–(D) correctly describes this distribution.

.. admonition:: Question 2.4  (3 pts)
   :class: note

   The weights of packages shipped from a warehouse are Normally distributed with mean
   :math:`\mu = 50` pounds (lbs) and standard deviation :math:`\sigma = 4` lbs. A package
   is considered "light" if its weight is in the bottom 2.5% of the distribution. What is
   the cutoff weight to be considered a "light" package?

   (A) 38 lbs |quad| (B) 40 lbs |quad| (C) 42 lbs |quad| (D) 44 lbs |quad| (E) 46 lbs |quad| (F) 48 lbs

   .. dropdown:: Solution
      :class-container: sd-border-success

      **Answer: (C)**

      We need the 2.5th percentile of :math:`\text{Normal}(\mu = 50, \sigma = 4)`. Let
      :math:`c` be the cutoff weight. Then :math:`P(X \leq c) = 0.0250`.

      From the :math:`z`-table: :math:`\Phi(-1.96) = 0.0250`, so :math:`z = -1.96`.

      Converting back to the original scale:

      .. math::

         c = \mu + z\sigma = 50 + (-1.96)(4) = 50 - 7.84 = 42.16 \approx \boxed{42 \text{ lbs}}

.. admonition:: Question 2.5  (3 pts)
   :class: note

   Let :math:`T` represent the "Triage Window" (in minutes) for resolving IT tickets at
   Purdue University. Suppose :math:`T` follows a Normal distribution where it is known
   that :math:`P(T > 15) = 0.0668` and :math:`P(T < 5) = 0.1587`. What is the mean
   :math:`\mu` and standard deviation :math:`\sigma` of this distribution?

   (A) :math:`\mu = 8,\ \sigma = 3.5` |quad| (B) :math:`\mu = 9,\ \sigma = 4` |quad| (C) :math:`\mu = 10,\ \sigma = 2` |quad| (D) :math:`\mu = 10,\ \sigma = 5`

   .. dropdown:: Solution
      :class-container: sd-border-success

      **Answer: (B)**

      **Setting up two equations.** Convert each tail probability to a :math:`z`-score using
      the :math:`z`-table:

      - :math:`P(T > 15) = 0.0668 \Rightarrow P(T \leq 15) = 0.9332`. From the table, :math:`\Phi(1.50) = 0.9332`, so :math:`z = 1.50`.
      - :math:`P(T < 5) = 0.1587`. From the table, :math:`\Phi(-1.00) = 0.1587`, so :math:`z = -1.00`.

      This gives the system:

      .. math::

         \frac{15 - \mu}{\sigma} = 1.50 \quad \Rightarrow \quad 15 - \mu = 1.5\sigma \tag{1}

      .. math::

         \frac{5 - \mu}{\sigma} = -1.00 \quad \Rightarrow \quad 5 - \mu = -\sigma \tag{2}

      **Solving.** Subtract equation (2) from equation (1):

      .. math::

         (15 - \mu) - (5 - \mu) = 1.5\sigma - (-\sigma) \implies 10 = 2.5\sigma \implies \sigma = 4.

      Substituting back into (2): :math:`5 - \mu = -4 \Rightarrow \mu = 9`.

      .. math::

         \boxed{\mu = 9, \quad \sigma = 4}

----

Problem 3  (15 points) — Screen Time Boxplot
----------------------------------------------

.. admonition:: Problem 3 Setup
   :class: important

   A psychological research group studies the change in university students' screen time
   and how this affects their studying patterns. As part of the study, they collected the
   screen time, in minutes, of 47 students. Below is a partial data table containing the
   sorted observations and a corresponding partial modified boxplot.

   .. list-table::
      :header-rows: 1
      :widths: 20 10 10 10 10 10 10 10 10 10 10

      * - Index
        - 1
        - 2
        - ⋯
        - 23
        - 24
        - 25
        - 26
        - ⋯
        - 45
        - 46
        - 47
      * - Observation
        - 42.5
        - 54.4
        - ⋯
        - 132.0
        - 145.3
        - 147.9
        - 158.2
        - ⋯
        - 335.6
        - 342.8
        - 487.2

   .. figure:: https://yjjpfnblgtrogqvcjaon.supabase.co/storage/v1/object/public/stat-350-assets/images/Exams/Exam1/SPRING2026/image1.png
      :alt: Partial modified boxplot of screen time data for 47 students. The box spans Q1 = 98.1 to Q3 = 226.6, with a vertical line at the median. Blank labels (i), (ii), (iii) are shown above the box at Q1, median, and Q3 respectively. Blank value boxes (iv), (v), (vi) are shown below for the lower whisker, median value, and upper whisker. An explicit outlier point is plotted to the right of the upper whisker.
      :align: center
      :width: 85%

.. admonition:: Question 3a  (6 pts)
   :class: note

   Fill in the blank spaces **(i) – (vi)** corresponding to the boxplot. For boxes **(i)**,
   **(ii)**, and **(iii)**, provide the correct statistical terminology. For boxes **(iv)**,
   **(v)**, and **(vi)**, provide the exact numerical value.

   .. dropdown:: Solution
      :class-container: sd-border-success

      **Terminology labels (top of boxplot):**

      - **(i):** First quartile :math:`Q_1`
      - **(ii):** Median :math:`Q_2`
      - **(iii):** Third quartile :math:`Q_3`

      **Numerical values (bottom of boxplot):**

      - **(iv):** :math:`42.5` (lower whisker / minimum — no observations fall below the lower fence)
      - **(v):** :math:`145.3` (the median, which is the 24th observation in a sorted sample of :math:`n = 47`)
      - **(vi):** :math:`342.8` (upper whisker — 487.2 is an explicit outlier plotted beyond the whisker)

      **Justification for (v):** The median position for :math:`n = 47` is
      :math:`\tfrac{47+1}{2} = 24`, so the median is the 24th sorted observation = **145.3**.

      **Justification for (vi):** The upper fence is
      :math:`Q_3 + 1.5 \times \text{IQR} = 226.6 + 1.5(128.5) = 226.6 + 192.75 = 419.35`.
      Since :math:`487.2 > 419.35`, the value 487.2 is plotted as an explicit outlier and the
      upper whisker extends to the largest non-outlier observation, **342.8**.

.. admonition:: Question 3b  (3 pts)
   :class: note

   Based on the distribution shown, where is the mean for this dataset most likely to exist?

   (A) Between the minimum and first quartile.
   (B) Between the first quartile and the median.
   (C) Between the median and third quartile.
   (D) Between the third quartile and maximum.
   (E) No single option is more likely than others.

   .. dropdown:: Solution
      :class-container: sd-border-success

      **Answer: (D)**

      The boxplot reveals strong **right skew**: the right whisker is considerably longer
      than the left whisker, and there is a high outlier at 487.2. In a right-skewed
      distribution, the mean is pulled toward the long right tail and typically exceeds the
      median. The extreme outlier at 487.2 (far above the upper whisker of 342.8) exerts
      substantial upward leverage on the mean, pulling it above :math:`Q_3 = 226.6` and
      into the region between the third quartile and the maximum.

.. admonition:: Question 3c  (3 pts)
   :class: note

   Compute the interquartile range (IQR). Explain its significance strictly in the context
   of the students' screen time data.

   .. dropdown:: Solution
      :class-container: sd-border-success

      .. math::

         \text{IQR} = Q_3 - Q_1 = 226.6 - 98.1 = \boxed{128.5 \text{ minutes}}

      **Interpretation:** The IQR of 128.5 minutes means that the middle 50% of students'
      screen times span a range of approximately 128.5 minutes. In other words, the student
      at the 75th percentile of screen time watches about 128.5 more minutes per day than
      the student at the 25th percentile.

.. admonition:: Question 3d  (3 pts)
   :class: note

   Approximately how many data points are at least 98.1 and at most 226.6?

   .. dropdown:: Solution
      :class-container: sd-border-success

      The interval :math:`[98.1,\ 226.6] = [Q_1, Q_3]` contains the **middle 50%** of the
      observations. Therefore:

      .. math::

         0.50 \times 47 \approx \boxed{24} \text{ data points}

----

Problem 4  (26 points) — Defective Components
-----------------------------------------------

.. admonition:: Problem 4 Setup
   :class: important

   Due to recent severe mechanical failures on the assembly line, a production facility is
   experiencing an unusually high rate of errors. A quality inspector examines a small batch
   of :math:`n = 4` electronic components from a massive production line. Each component
   independently has a probability :math:`p = 0.3` of being defective. Let :math:`X` denote
   the number of defective components in the batch.

.. admonition:: Question 4a  (4 pts)
   :class: note

   Identify the distribution of :math:`X`, including its parameter(s). Write out the exact
   PMF formula for :math:`P(X = x)` and state the support of :math:`X`.

   .. dropdown:: Solution
      :class-container: sd-border-success

      :math:`X \sim \text{Binomial}(n = 4,\ p = 0.3)`

      **Support:** :math:`X \in \{0, 1, 2, 3, 4\}`

      **PMF:**

      .. math::

         P(X = x) = \binom{4}{x} (0.3)^x (0.7)^{4-x}, \quad x \in \{0, 1, 2, 3, 4\}

      The conditions for a Binomial model are satisfied: fixed number of trials
      (:math:`n = 4`), each component is independently defective with the same probability
      :math:`p = 0.3`, and each trial results in success (defective) or failure
      (non-defective).

.. admonition:: Question 4b  (5 pts)
   :class: note

   Compute the probability of observing **at least two** defective components in the batch.

   .. dropdown:: Solution
      :class-container: sd-border-success

      .. math::

         P(X \geq 2) &= 1 - P(X = 0) - P(X = 1) \\[6pt]
         P(X = 0) &= (0.7)^4 = 0.2401 \\[4pt]
         P(X = 1) &= \binom{4}{1}(0.3)^1(0.7)^3 = 4(0.3)(0.343) = 0.4116 \\[8pt]
         P(X \geq 2) &= 1 - 0.2401 - 0.4116 = \boxed{0.3483}

.. admonition:: Question 4c  (5 pts)
   :class: note

   Determine the expected number of defective components, the expected number of
   non-defective components, and the variance.

   .. dropdown:: Solution
      :class-container: sd-border-success

      **Expected number of defective components:**

      .. math::

         E[X] = np = 4(0.3) = \boxed{1.2}

      **Expected number of non-defective components:**

      .. math::

         E[n - X] = n - E[X] = n(1 - p) = 4(0.7) = \boxed{2.8}

      **Variance** (the variance of both :math:`X` and :math:`n - X` is the same):

      .. math::

         \text{Var}(X) = np(1-p) = 4(0.3)(0.7) = \boxed{0.84}

.. admonition:: Question 4d  (12 pts)
   :class: note

   Suppose the automated QA machine scans the batches. If a batch has many defects, the
   automated QA machine halts early and rejects it. The diagnostic time (in minutes) spent
   on a batch is modeled by the function :math:`D = \dfrac{60}{X + 1}`. Calculate the
   expected diagnostic time, :math:`E[D]`. *(Hint: The LOTUS flower brings clarity.)*

   .. dropdown:: Solution
      :class-container: sd-border-success

      Using the **Law of the Unconscious Statistician (LOTUS)**:

      .. math::

         E[D] = E\!\left[\frac{60}{X+1}\right] = \sum_{x=0}^{4} \frac{60}{x+1} \cdot P(X = x)

      **Step 1 — Compute all PMF values:**

      .. list-table::
         :header-rows: 1
         :widths: 15 30 25 30

         * - :math:`x`
           - :math:`P(X = x)`
           - :math:`\dfrac{60}{x+1}`
           - :math:`\dfrac{60}{x+1} \cdot P(X=x)`
         * - 0
           - :math:`(0.7)^4 = 0.2401`
           - 60
           - 14.4060
         * - 1
           - :math:`4(0.3)(0.7)^3 = 0.4116`
           - 30
           - 12.3480
         * - 2
           - :math:`6(0.3)^2(0.7)^2 = 0.2646`
           - 20
           - 5.2920
         * - 3
           - :math:`4(0.3)^3(0.7) = 0.0756`
           - 15
           - 1.1340
         * - 4
           - :math:`(0.3)^4 = 0.0081`
           - 12
           - 0.0972

      **Step 2 — Sum:**

      .. math::

         E[D] = 14.4060 + 12.3480 + 5.2920 + 1.1340 + 0.0972 = \boxed{33.2772 \text{ minutes}}

----

Problem 5  (17 points) — Meredith the Cat
------------------------------------------

.. admonition:: Problem 5 Setup
   :class: important

   Meredith 🐱 follows a daily routine in the following order:
   **eat → drink → poop → cuddle → sleep**. If Meredith successfully completes the first
   four steps, she falls asleep and is happy 😺. If any of the steps are broken, Meredith
   is guaranteed to get mad 😾.

   Let :math:`(M)` be the event that Meredith gets mad 😾; otherwise she is happy 😺
   (falls asleep). Let :math:`E`, :math:`D`, :math:`P`, and :math:`C` be the events that
   the routine is broken at the eat, drink, poop, and cuddle step, respectively. From an
   observational study, Meredith's owner, Heekyung, learned that :math:`P(M) = 0.2`. When
   Meredith gets mad, the cause of the broken routine is 50% eat, 25% drink, 10% poop, and
   15% cuddle.

   **Known probabilities:**

   .. math::

      P(M) = 0.2, \quad P(H) = 0.8

   .. math::

      P(E \mid M) = 0.50, \quad P(D \mid M) = 0.25, \quad P(P \mid M) = 0.10, \quad P(C \mid M) = 0.15

.. admonition:: Question 5a  (5 pts)
   :class: note

   What is the probability that Meredith gets mad **and** the broken routine is "poop"?

   .. dropdown:: Solution
      :class-container: sd-border-success

      Using the Multiplication Rule:

      .. math::

         P(M \cap P) = P(P \mid M) \cdot P(M) = 0.10 \times 0.2 = \boxed{0.0200}

.. admonition:: Question 5b  (5 pts)
   :class: note

   What is the probability that the broken routine is "eat" given that Meredith is happy?

   .. dropdown:: Solution
      :class-container: sd-border-success

      **Answer:** :math:`P(E \mid H) = \boxed{0}`

      If Meredith is happy (i.e., she falls asleep), then by definition **no** step in her
      routine was broken. Therefore, the event :math:`E` (eat routine is broken) and the
      event :math:`H` (Meredith is happy) are mutually exclusive: :math:`E \cap H = \emptyset`.
      It follows that :math:`P(E \mid H) = 0`.

.. admonition:: Question 5c  (5 pts)
   :class: note

   What is the probability that Meredith gets mad given that the "poop" routine is broken?

   .. dropdown:: Solution
      :class-container: sd-border-success

      **Answer:** :math:`P(M \mid P) = \boxed{1}`

      The problem states: *"If any of the steps are broken, Meredith is guaranteed to get
      mad."* Therefore, breaking the poop routine is a sufficient condition for getting mad.
      Since the poop step being broken (:math:`P`) implies mad (:math:`M`), we have
      :math:`P \subseteq M` and thus :math:`P(M \mid P) = 1`.

.. admonition:: Question 5d  (2 pts)
   :class: note

   Determine whether the events :math:`M` and :math:`P` are independent or not.

   (A) The two events are independent.
   (B) Two events are dependent.

   .. dropdown:: Solution
      :class-container: sd-border-success

      **Answer: (B) — Two events are dependent.**

      Two events are independent if and only if
      :math:`P(M \cap P) = P(M) \cdot P(P)`.

      First compute :math:`P(P)` using the Law of Total Probability:

      .. math::

         P(P) = P(P \mid M)\,P(M) + P(P \mid H)\,P(H) = 0.10(0.2) + 0(0.8) = 0.02

      Check independence:

      .. math::

         P(M) \cdot P(P) = 0.2 \times 0.02 = 0.004

      But from part (a): :math:`P(M \cap P) = 0.02 \neq 0.004`.

      Since :math:`P(M \cap P) \neq P(M) \cdot P(P)`, the events :math:`M` and :math:`P`
      are **dependent**.

      *(Equivalently: :math:`P(M \mid P) = 1 \neq 0.2 = P(M)`)*

----

Problem 6  (20 points) — GPU Thermal Stress Test
--------------------------------------------------

.. admonition:: Problem 6 Setup
   :class: important

   A data science lab is running a mandatory 10-hour thermal stress test on a new cluster
   of machine learning GPUs. Let :math:`T` be the time (in hours) until a defective GPU
   fails during the test.

   - **Phase 1** (:math:`0 \leq t \leq 1`): The thermal load ramps up linearly for the first hour.
   - **Phase 2** (:math:`1 < t \leq 10`): The probability of failure decays smoothly according to an inverse-square law.
   - **The Cutoff:** The stress test is automatically halted at exactly 10 hours.

   The probability density function (PDF) for the failure time is modeled by:

   .. math::

      f_T(t) = \begin{cases}
         \dfrac{5}{7} \cdot t & 0 \leq t \leq 1 \\[8pt]
         \dfrac{5}{7} \cdot \dfrac{1}{t^2} & 1 < t \leq 10 \\[8pt]
         0 & \text{otherwise}
      \end{cases}

.. admonition:: Question 6a  (10 pts)
   :class: note

   The partially completed cumulative distribution function (CDF) is given below. Find
   the missing equation for the region between 1 and 10.

   .. math::

      F_T(t) = \begin{cases}
         0 & t < 0 \\[4pt]
         \dfrac{5}{14}\,t^2 & 0 \leq t < 1 \\[8pt]
         \text{[MISSING]} & 1 \leq t < 10 \\[4pt]
         1 & t \geq 10
      \end{cases}

   .. dropdown:: Solution
      :class-container: sd-border-success

      For :math:`1 \leq t < 10`, the CDF must accumulate probability from both
      regions. We carry forward the probability already accumulated through Phase 1,
      then add the integral over Phase 2 up to :math:`t`:

      .. math::

         P(T \leq 1) = F_T(1) = \frac{5}{14}(1)^2 = \frac{5}{14}

      .. math::

         F_T(t) = \underbrace{F_T(1)}_{\text{Phase 1}} + \int_1^t \frac{5}{7} \cdot \frac{1}{x^2}\,dx

      Compute the integral explicitly:

      .. math::

         \int_1^t \frac{5}{7} \cdot \frac{1}{x^2}\,dx
         = \frac{5}{7}\left[-\frac{1}{x}\right]_1^t
         = \frac{5}{7}\left(-\frac{1}{t} + 1\right)
         = \frac{5}{7} - \frac{5}{7t}

      Combining:

      .. math::

         F_T(t) = \frac{5}{14} + \frac{5}{7} - \frac{5}{7t}
                = \frac{5}{14} + \frac{10}{14} - \frac{5}{7t}
                = \boxed{\frac{15}{14} - \frac{5}{7t}}, \quad 1 \leq t < 10

      **Verification:** At :math:`t = 10`:
      :math:`\dfrac{15}{14} - \dfrac{5}{70} = \dfrac{15}{14} - \dfrac{1}{14} = \dfrac{14}{14} = 1` ✓

----

.. math::

   F_T(t) = \begin{cases}
      0 & t < 0 \\[4pt]
      \dfrac{5}{14}\,t^2 & 0 \leq t < 1 \\[8pt]
      \dfrac{15}{14} - \dfrac{5}{7t} & 1 \leq t < 10 \\[4pt]
      1 & t \geq 10
   \end{cases}

----

.. admonition:: Question 6b  (10 pts)
   :class: note

   Calculate the **median** failure time for a defective GPU.

   .. dropdown:: Solution
      :class-container: sd-border-success

      We need :math:`\tilde{t}` such that :math:`F_T(\tilde{t}) = 0.5`.

      **Step 1 — Identify which region contains the median.**

      At the boundary :math:`t = 1`:

      .. math::

         F_T(1) = \frac{5}{14} \approx 0.3571 < 0.5

      Since we do not reach probability 0.5 within the first phase (:math:`0 \leq t \leq 1`),
      the median must lie in the **Phase 2 region** (:math:`1 \leq t < 10`).

      **Step 2 — Solve for the median using the Phase 2 CDF.**

      .. math::

         \frac{15}{14} - \frac{5}{7\tilde{t}} = \frac{1}{2}

      .. math::

         \frac{5}{7\tilde{t}} = \frac{15}{14} - \frac{1}{2} = \frac{15}{14} - \frac{7}{14} = \frac{8}{14} = \frac{4}{7}

      .. math::

         \tilde{t} = \frac{5}{7} \cdot \frac{7}{4} = \frac{5}{4} = \boxed{1.25 \text{ hours}}

      **Verification:** :math:`F_T(1.25) = \dfrac{15}{14} - \dfrac{5}{7(1.25)} = \dfrac{15}{14} - \dfrac{5}{8.75} = \dfrac{15}{14} - \dfrac{4}{7} = \dfrac{15}{14} - \dfrac{8}{14} = \dfrac{7}{14} = 0.5` ✓