STAT 350 — Exam 1 — Spring 2026 (V1)

Exam Information

Course: STAT 350 — Introduction to Statistics
Semester: Spring 2026
Version: V1
Total Points: 105
Time Allowed: 60 minutes

Problem	Total Possible	Topic
Problem 1 (True/False, 2 pts each)	12	Continuous RVs, Binomial, Normal
Problem 2 (Multiple Choice, 3 pts each)	15	Discrete/Continuous RVs, Normal
Problem 3	15	Descriptive Statistics / Boxplots
Problem 4	26	Binomial Distribution, LOTUS
Problem 5	17	Conditional Probability / Independence
Problem 6	20	PDFs and CDFs
Total	105

Problem 1 — True/False (12 points, 2 points each)

Question 1.1 (2 pts)

Let \(X\) denote a continuous random variable with a PDF \(f_X(x)\). For any interval such that \([a, b] \subset \text{Support}(X)\), such that \(a < b\),

True or False: \(P(a < X < b)\) must be less than or equal to \(P(X < b)\).

Question 1.2 (2 pts)

Suppose \(X\) is a Binomial random variable with parameters \(n\) and \(p\).

True or False: Holding the number of trials \(n\) constant, the shape of the distribution shifts from positively skewed to negatively skewed as \(p\) changes from 0.9 to 0.1.

Question 1.3 (2 pts)

Regarding the properties of a Binomial random variable \(X \sim \text{Binomial}(n, p)\).

True or False: The variance of \(X\) cannot exceed the number of independent trials \(n\).

Question 1.4 (2 pts)

Suppose \(V \sim \text{Binomial}(n, p)\) and \(W \sim \text{Poisson}(\lambda)\).

True or False: Then for any positive integer \(n\), the support of \(V\) is a subset of the support of \(W\).

Question 1.5 (2 pts)

When converting a value \(x\) from a normal distribution into a \(z\)-score.

True or False: A negative \(z\)-score indicates that the original \(x\) is smaller than the population mean \(\mu\).

Question 1.6 (2 pts)

Suppose \(X\) and \(Y\) are Normally distributed random variables sharing the same mean \(\mu = 10\). It is also known that \(\text{Var}(X) < \text{Var}(Y)\).

True or False: Then it follows that \(P(X \leq 12)\) is larger than \(P(Y \leq 12)\).

Problem 2 — Multiple Choice (15 points, 3 points each)

Question 2.1 (3 pts)

Let \(X\) and \(Y\) be discrete random variables that are not independent. Choose the statement about \(X\) and \(Y\) that always holds.

\(\text{Var}(X + Y) = \text{Var}(X) + \text{Var}(Y)\)

(B) For any \(x \in \text{Support}(X)\) and \(y \in \text{Support}(Y)\), \(P(X = x, Y = y) = P(X = x)P(Y = y)\).

\(E[XY] = E[X]E[Y]\)
\(E[X^3 + Y^{-2}] = E[X^3] + E[Y^{-2}]\)
\(\text{Cov}(X, Y) > 0\)

Question 2.2 (3 pts)

On days when STAT 350 homework is due, suppose Professor Reese receives extension requests according to a Poisson process at an average rate of 0.05 requests per 10 minutes. Compute the probability that he receives more than 1 extension request in a randomly selected 2-hour period.

0.0012 (B) 0.0488 (C) 0.1219 (D) 0.4512 (E) 0.8781

Question 2.3 (3 pts)

For some constant \(k\), define a PDF \(f_X(x) = k \cdot (x-5)^2\) for \(x \in [4, 6]\) and zero elsewhere. Which of the following statements correctly describes this distribution?

The distribution is bimodal.
The distribution is positively skewed.
The normalizing constant \(k\) can be negative.
The median is larger than the mean.
None of the above statements correctly describes the distribution.

Question 2.4 (3 pts)

The weights of packages shipped from a warehouse are Normally distributed with mean \(\mu = 50\) pounds (lbs) and standard deviation \(\sigma = 4\) lbs. A package is considered “light” if its weight is in the bottom 2.5% of the distribution. What is the cutoff weight to be considered a “light” package?

38 lbs (B) 40 lbs (C) 42 lbs (D) 44 lbs (E) 46 lbs (F) 48 lbs

Question 2.5 (3 pts)

Let \(T\) represent the “Triage Window” (in minutes) for resolving IT tickets at Purdue University. Suppose \(T\) follows a Normal distribution where it is known that \(P(T > 15) = 0.0668\) and \(P(T < 5) = 0.1587\). What is the mean \(\mu\) and standard deviation \(\sigma\) of this distribution?

\(\mu = 8,\ \sigma = 3.5\) (B) \(\mu = 9,\ \sigma = 4\) (C) \(\mu = 10,\ \sigma = 2\) (D) \(\mu = 10,\ \sigma = 5\)

Problem 3 (15 points) — Screen Time Boxplot

Problem 3 Setup

A psychological research group studies the change in university students’ screen time and how this affects their studying patterns. As part of the study, they collected the screen time, in minutes, of 47 students. Below is a partial data table containing the sorted observations and a corresponding partial modified boxplot.

Index	1	2	⋯	23	24	25	26	⋯	45	46	47
Observation	42.5	54.4	⋯	132.0	145.3	147.9	158.2	⋯	335.6	342.8	487.2

Partial modified boxplot of screen time data for 47 students. The box spans Q1 = 98.1 to Q3 = 226.6, with a vertical line at the median. Blank labels (i), (ii), (iii) are shown above the box at Q1, median, and Q3 respectively. Blank value boxes (iv), (v), (vi) are shown below for the lower whisker, median value, and upper whisker. An explicit outlier point is plotted to the right of the upper whisker.

Question 3a (6 pts)

Fill in the blank spaces (i) – (vi) corresponding to the boxplot. For boxes (i), (ii), and (iii), provide the correct statistical terminology. For boxes (iv), (v), and (vi), provide the exact numerical value.

Question 3b (3 pts)

Based on the distribution shown, where is the mean for this dataset most likely to exist?

Between the minimum and first quartile.
Between the first quartile and the median.
Between the median and third quartile.
Between the third quartile and maximum.
No single option is more likely than others.

Question 3c (3 pts)

Compute the interquartile range (IQR). Explain its significance strictly in the context of the students’ screen time data.

Question 3d (3 pts)

Approximately how many data points are at least 98.1 and at most 226.6?

Problem 4 (26 points) — Defective Components

Problem 4 Setup

Due to recent severe mechanical failures on the assembly line, a production facility is experiencing an unusually high rate of errors. A quality inspector examines a small batch of \(n = 4\) electronic components from a massive production line. Each component independently has a probability \(p = 0.3\) of being defective. Let \(X\) denote the number of defective components in the batch.

Question 4a (4 pts)

Identify the distribution of \(X\), including its parameter(s). Write out the exact PMF formula for \(P(X = x)\) and state the support of \(X\).

Question 4b (5 pts)

Compute the probability of observing at least two defective components in the batch.

Question 4c (5 pts)

Determine the expected number of defective components, the expected number of non-defective components, and the variance.

Question 4d (12 pts)

Suppose the automated QA machine scans the batches. If a batch has many defects, the automated QA machine halts early and rejects it. The diagnostic time (in minutes) spent on a batch is modeled by the function \(D = \dfrac{60}{X + 1}\). Calculate the expected diagnostic time, \(E[D]\). (Hint: The LOTUS flower brings clarity.)

Solution

Using the Law of the Unconscious Statistician (LOTUS):

\[E[D] = E\!\left[\frac{60}{X+1}\right] = \sum_{x=0}^{4} \frac{60}{x+1} \cdot P(X = x)\]

Step 1 — Compute all PMF values:

\(x\)	\(P(X = x)\)	\(\dfrac{60}{x+1}\)	\(\dfrac{60}{x+1} \cdot P(X=x)\)
0	\((0.7)^4 = 0.2401\)	60	14.4060
1	\(4(0.3)(0.7)^3 = 0.4116\)	30	12.3480
2	\(6(0.3)^2(0.7)^2 = 0.2646\)	20	5.2920
3	\(4(0.3)^3(0.7) = 0.0756\)	15	1.1340
4	\((0.3)^4 = 0.0081\)	12	0.0972

Step 2 — Sum:

\[E[D] = 14.4060 + 12.3480 + 5.2920 + 1.1340 + 0.0972 = \boxed{33.2772 \text{ minutes}}\]

Problem 5 (17 points) — Meredith the Cat

Problem 5 Setup

Meredith 🐱 follows a daily routine in the following order: eat → drink → poop → cuddle → sleep. If Meredith successfully completes the first four steps, she falls asleep and is happy 😺. If any of the steps are broken, Meredith is guaranteed to get mad 😾.

Let \((M)\) be the event that Meredith gets mad 😾; otherwise she is happy 😺 (falls asleep). Let \(E\), \(D\), \(P\), and \(C\) be the events that the routine is broken at the eat, drink, poop, and cuddle step, respectively. From an observational study, Meredith’s owner, Heekyung, learned that \(P(M) = 0.2\). When Meredith gets mad, the cause of the broken routine is 50% eat, 25% drink, 10% poop, and 15% cuddle.

Known probabilities:

\[P(M) = 0.2, \quad P(H) = 0.8\]

\[P(E \mid M) = 0.50, \quad P(D \mid M) = 0.25, \quad P(P \mid M) = 0.10, \quad P(C \mid M) = 0.15\]

Question 5a (5 pts)

What is the probability that Meredith gets mad and the broken routine is “poop”?

Question 5b (5 pts)

What is the probability that the broken routine is “eat” given that Meredith is happy?

Question 5c (5 pts)

What is the probability that Meredith gets mad given that the “poop” routine is broken?

Question 5d (2 pts)

Determine whether the events \(M\) and \(P\) are independent or not.

The two events are independent.
Two events are dependent.

Problem 6 (20 points) — GPU Thermal Stress Test

Problem 6 Setup

A data science lab is running a mandatory 10-hour thermal stress test on a new cluster of machine learning GPUs. Let \(T\) be the time (in hours) until a defective GPU fails during the test.

Phase 1 (\(0 \leq t \leq 1\)): The thermal load ramps up linearly for the first hour.
Phase 2 (\(1 < t \leq 10\)): The probability of failure decays smoothly according to an inverse-square law.
The Cutoff: The stress test is automatically halted at exactly 10 hours.

The probability density function (PDF) for the failure time is modeled by:

\[\begin{split}f_T(t) = \begin{cases} \dfrac{5}{7} \cdot t & 0 \leq t \leq 1 \\[8pt] \dfrac{5}{7} \cdot \dfrac{1}{t^2} & 1 < t \leq 10 \\[8pt] 0 & \text{otherwise} \end{cases}\end{split}\]

Question 6a (10 pts)

The partially completed cumulative distribution function (CDF) is given below. Find the missing equation for the region between 1 and 10.

\[\begin{split}F_T(t) = \begin{cases} 0 & t < 0 \\[4pt] \dfrac{5}{14}\,t^2 & 0 \leq t < 1 \\[8pt] \text{[MISSING]} & 1 \leq t < 10 \\[4pt] 1 & t \geq 10 \end{cases}\end{split}\]

\[\begin{split}F_T(t) = \begin{cases} 0 & t < 0 \\[4pt] \dfrac{5}{14}\,t^2 & 0 \leq t < 1 \\[8pt] \dfrac{15}{14} - \dfrac{5}{7t} & 1 \leq t < 10 \\[4pt] 1 & t \geq 10 \end{cases}\end{split}\]

Question 6b (10 pts)

Calculate the median failure time for a defective GPU.