STAT 350 — Exam 2 — Fall 2024

Exam Information

Course: STAT 350 — Introduction to Statistics
Semester: Fall 2024
Total Points: 105
Time Allowed: 60 minutes

Problem	Total Possible	Topic
Problem 1 (True/False, 2 pts each)	12	Statistics, p-values, Power, Paired Design, Confounding
Problem 2 (Multiple Choice, 3 pts each)	15	CLT, Confidence Intervals, Power, Estimation
Problem 3	26	CLT, Sampling Distribution of \(\bar{X}\)
Problem 4	25	Two-Sample Inference
Problem 5	27	One-Sample z-test, Confidence Bound
Total	105

Problem 1 — True/False (12 points, 2 points each)

Question 1.1 (2 pts)

A researcher collects various values from a dataset, including the sample mean \(\bar{x}\), the sample variance \(s^2\), the t-test statistic \(T_\text{TS}\) and the \(p\)-value.

True or False: Each of these values is an example of a statistic.

Question 1.2 (2 pts)

The \(p\)-value can be considered a continuous random variable as it is a function of the test statistic, which itself is a function of the data,

True or False: and therefore it must follow a normal distribution, as all data-derived quantities do.

Question 1.3 (2 pts)

A researcher conducts a hypothesis test at a significance level \(\alpha = 0.01\) and fails to reject the null hypothesis.

True or False: This result indicates that the null hypothesis is true at a 99% confidence level.

Question 1.4 (2 pts)

In a study let \(X_{A1}, X_{A2}, \ldots, X_{An}\) represent the first set of measurements and \(X_{B1}, X_{B2}, \ldots, X_{Bn}\) represent the second set of measurements, where each pair \((X_{Ai}, X_{Bi})\) for \(i \in \{1, 2, \ldots, n\}\) is taken from the same subject and are dependent. Let the difference between measurements for each subject be \(D_i = X_{Ai} - X_{Bi}\) be normally distributed and let \(\sigma_A^2 = \text{Var}(X_{Ai})\) and \(\sigma_B^2 = \text{Var}(X_{Bi})\) for all \(i \in \{1, 2, \ldots, n\}\).

True or False: If the covariance between pairs is a positive constant for all \(i \in \{1, 2, \ldots, n\}\), i.e., \(\text{Cov}(X_{Ai}, X_{Bi}) = \sigma_{AB} > 0\), meaning that when one measurement is higher (or lower) than average, the other measurement is likely to be similarly higher (or lower) than average. Therefore,

\[\text{Var}(\bar{D}) = \frac{\sigma_D^2}{n} < \frac{\sigma_A^2 + \sigma_B^2}{n}.\]

Question 1.5 (2 pts)

A researcher is studying the relationship between physical activity and cholesterol levels. However, they also collect data on participants’ diets, which are known to influence both physical activity and cholesterol levels.

True or False: Diet is considered a confounding variable in this study.

Question 1.6 (2 pts)

A researcher designed an experiment to test the effect of a new fertilizer on crop yield. They randomly assign the fertilizer treatment to half of the plots and leave the other half untreated. However, they notice that plots receiving fertilizer are closer to a water source.

True or False: The random assignment is sufficient to ensure that the experiment is free from confounding variables.

Problem 2 — Multiple Choice (15 points, 3 points each)

Question 2.1 (3 pts)

Assume \(W_1, W_2, \ldots, W_n\) are independent samples drawn from some unknown distribution \(f_W(w)\) with a population mean \(\mu = 10\) and population standard deviation \(\sigma = 10\). Which of the following statements is FALSE regarding the distribution of \(\bar{W}\)?

Line graph showing three curves plotted against sample size n (x-axis, 0 to 100) and Value (y-axis, 0 to 20). Curve A (dotted red) starts near 10 and increases toward approximately 20, approaching a horizontal asymptote. Curve B (solid black) is a horizontal line at value 10 across all sample sizes. Curve C (dashed blue) starts near 10 at small n and decreases toward 0 as n increases.

(A) If the distribution \(f_W(w)\) is heavily skewed, a larger sample is required to apply the central limit theorem.

(B) Curve A represents the value of \(sd(\bar{W})\) when the central limit theorem is not applicable.

(C) Curve B represents the value of the \(E[\bar{W}]\) for different sample sizes \(n\).

(D) Curve C indicates that the inference on \(\mu_W\) is more accurate as the sample size increases.

Question 2.2 (3 pts)

In the context of a one-sample procedure for constructing a 99% confidence interval for the population mean \(\mu\), assuming all conditions for inference are met, which quantity is guaranteed to be within the interval?

0 (B) \(\mu\) (C) \(\sigma\) (D) \(\bar{x}\) (E) None of the above

Question 2.3 (3 pts)

Consider an experiment in which a sample of size 100 is drawn from a population with unknown mean (\(\mu\)) and unknown standard deviation (\(\sigma\)). The experiment is repeated using ten different samples of the same size, and a 99% confidence interval is constructed for the unknown mean from each sample. Once all the intervals are computed, which of the following is always true?

(A) The critical value used to calculate the confidence intervals is the same across the 10 replications of the experiment.

(B) The numerical value at the center of the confidence interval is the same across the 10 replications of the experiment.

The margin of error is the same across the 10 replications of the experiment.

(D) Each of the 10 computed confidence intervals contain the true mean (\(\mu\)) with a probability of 0.99.

Two or more of the above statements are correct.

Question 2.4 (3 pts)

Which of the following strategies can a researcher use to increase the power of a statistical hypothesis test?

Increase the sample size \(n\).

(B) Increase the distance between the null value \(\mu_0\) and the alternative mean \(\mu_A\).

(C) Reduce the population standard deviation \(\sigma\) by controlling extraneous variables.

Increase the significance level \(\alpha\) (Acceptable Type I error rate).
All of the above.

Question 2.5 (3 pts)

Suppose you are estimating a population parameter using two different estimators: Estimator A is unbiased but has high variance, while Estimator B is biased but has low variance. Which of the following statements is TRUE?

Estimator A is always preferred because it is unbiased.
Estimator B is always preferred because it has low variance.

(C) Neither estimator is useful because both fail to provide accurate estimates of the true population parameter.

(D) Depending on the context, Estimator B may be preferred if its bias is small, and variance is significantly lower than Estimator A’s.

Both estimators are equally effective if the sample size is small enough.

Problem 3 (26 points) — Auto-Insurance CLT

Problem 3 Setup

An auto-insurance company plans to adjust policyholders’ premiums based on historical data. According to the data, the claim amounts follow a gamma distribution with a mean of 5k and a standard deviation of 2.25k. The company expects 900 claims to be filed in the upcoming month. These 900 claims can be thought of as a random sample of identically distributed claims drawn from the population distribution of claim amounts. Assume the claims are independent.

An enthusiastic statistician at the company conducted a simulation using 1,500 simple random samples, each of size 900, where each observation was randomly drawn from the gamma distribution. The following graphs are provided to support their analysis of \(\bar{X}\), the average claim amount:

Three side-by-side panels. Left panel titled "The Gamma Distribution" shows a right-skewed histogram of individual claim amounts with a red kernel density curve and blue normal density overlay, x-axis labeled Data ranging 0 to 15. Middle panel titled "Sampling Distribution X-bar" shows a symmetric, bell-shaped histogram of 1500 sample means with overlapping red kernel and blue normal curves, x-axis ranging 4.8 to 5.2. Right panel titled "Normal Probability Plot of X-bar" shows a QQ plot with points closely following the red reference line, sample quantiles on y-axis (4.8 to 5.2), theoretical quantiles on x-axis (-2 to 2).

Question 3a (10 pts)

Describe the approximate distribution of \(\bar{X}\), the average claim amount of 900 auto-insurance claims. Provide a detailed justification for why this approximation is valid, including the important theoretical principle that supports your result.

Question 3b (3 pts)

Find the mean and standard deviation of the sampling distribution of \(\bar{X}\).

Question 3c (3 pts)

Select the correct code for determining the probability that the average of 900 claims would be greater than 5.15k.

# (A) pnorm(5.15, mean = 5, sd = 2.25, lower.tail = FALSE)
# (B) pnorm(5.15, mean = 5, sd = 2.25, lower.tail = TRUE)
# (C) pnorm(5.15, mean = 5, sd = 0.075, lower.tail = FALSE)
# (D) pnorm(5.15, mean = 5, sd = 0.075, lower.tail = TRUE)
# (E) pgamma(5.15, shape = 5, rate = 2.25, lower.tail = FALSE)
# (F) pgamma(5.15, shape = 5, rate = 2.25, lower.tail = TRUE)

Question 3d (10 pts)

Suppose only 10 claims are expected in the upcoming month. Can the same inference about the average claim be made in this case? Justify your answer based on relevant theoretical principles.

Problem 4 (25 points) — GreatNotes Algorithm Generalization

Problem 4 Setup

GreatNotes is developing software that converts handwritten mathematical notations into typed text. To evaluate its performance, the company used 200 images of handwritten mathematical equations. Of these, 100 images were included in the training dataset, paired with their correct typed formats. The remaining 100 images were withheld from training to serve as a test set of new, unseen data.

After training, the company tested the algorithm on all 200 images, comparing each algorithm-generated output to its corresponding correct typed format to assess accuracy.

Each output was scored for accuracy, with scores ranging from 0 to 100. A numerical summary of the results is provided below:

	Training Data	Withheld Data	Training − Withheld
\(n\)	100	100	100
Sample Mean	96.4	95.2	1.2
Sample Standard Deviation	2.3	4.5	4.3

It is common for handwriting recognition algorithms to achieve higher accuracy on data used during training. However, for commercial success, GreatNotes must ensure that the algorithm performs comparably on new, unseen data. They will conclude that the algorithm fails to generalize if the true mean accuracy on training data is significantly higher than the true mean accuracy on withheld data.

Using a 91% confidence level, perform a hypothesis test to determine whether the algorithm fails to generalize.

Question 4a (2 pts)

Which two-sample method should be used?

Two-sample Independent Procedure
Two-sample Paired Procedure

Question 4b (5 pts)

Perform the first two steps of the four-step hypothesis test.

Question 4c (8 pts)

Compute the test statistic. Show work.

Question 4d (3 pts)

Select the R code that would correctly compute the \(p\)-value.

# (A) pnorm(test_statistic, lower.tail = TRUE)
# (B) pt(test_statistic, df=147.42, lower.tail = TRUE)
# (C) pt(test_statistic, df=199,    lower.tail = TRUE)
# (D) pnorm(test_statistic, lower.tail = FALSE)
# (E) pt(test_statistic, df=147.42, lower.tail = FALSE)
# (F) pt(test_statistic, df=199,    lower.tail = FALSE)

Question 4e (7 pts)

The resulting \(p\)-value was approximately 0.009. Provide the formal decision and interpret the conclusion in the context of the problem.

Problem 5 (27 points) — Coyote Lengths in Georgia

Problem 5 Setup

Urbanization has been associated with an increase in coyote sightings in Georgia (Mowry et al., 2020). This has raised concerns about the role of coyotes in urban ecosystems, particularly regarding human-coyote conflicts and negative interactions with pets.

Residents of Atlanta, GA, believe that coyotes are large on average, with a mean length of at least 94 cm. However, the Georgia Department of Natural Resources (DNR) suspects that the true mean length is less than 94 cm. To investigate, the DNR staff used Geographic Information Systems (GIS) to identify and randomly select sampling locations, supplemented by satellite imagery to capture 29 images of coyotes. From these images, they measured the lengths of the coyotes.

The sample mean length was 89.17 cm. Based on historical data, the DNR has determined that coyote lengths follow a normal distribution with a standard deviation of 9 cm.

Question 5a (8 pts)

Calculate an appropriate 90% confidence interval or bound to assess the belief of the true mean length of coyotes by the Georgian DNR. Clearly specify which R output from the last page of the exam you used.

Question 5b (5 pts)

Interpret the results obtained from part (a) within the context of the problem.

Question 5c (14 pts)

Carry out a hypothesis test on whether the data supports the claim made by the DNR staff. Use the information from above and on the last page of the exam to perform the four-step hypothesis test. Clearly specify which R output from the last page of the exam was used to obtain your conclusion. Test at \(\alpha = 0.1\).

Question 5 Code/Output:

# Output 1
t.test(coyote_data, conf.level = 0.90, alternative = "greater", mu = 89.17)
# t = 0.0012646, df = 28, p-value = 0.4995

# Output 2
t.test(coyote_data, conf.level = 0.90, alternative = "less", mu = 94)
# t = -2.5293, df = 29, p-value = 0.008671

# Output 3
t.test(coyote_data, conf.level = 0.90, alternative = "two.sided", mu = 94)
# t = -2.5293, df = 28, p-value = 0.01734

# Output 4
z_TS <- (89.17 - 94) / 9
# Test Statistic is: -0.5366667
p_value <- pnorm(z_TS, lower.tail = TRUE)
# p-value is: 0.2957489

# Output 5
z_TS <- (89.17 - 94) / (9 / sqrt(29))
# Test Statistic is: -2.890038
p_value <- pnorm(z_TS, lower.tail = TRUE)
# p-value is: 0.001925974

# Output 6
z_TS <- (89.17 - 94) / 9
# Test Statistic is: -0.5366667
p_value <- 2 * pnorm(z_TS, lower.tail = TRUE)
# p-value is: 0.5914979

# Output 7
z_TS <- (89.17 - 94) / (9 / sqrt(29))
# Test Statistic is: -2.890038
p_value <- 2 * pnorm(z_TS, lower.tail = TRUE)
# p-value is: 0.003851947

# Output 8
# qnorm(p=0.05, lower.tail = TRUE)   -> -1.644854
# qt(p=0.05, df=28, lower.tail=FALSE) -> 1.701131
# qnorm(p=0.1, lower.tail=FALSE)     -> 1.281552
# qt(p=0.1, df=28, lower.tail=FALSE)  -> 1.312527
# qnorm(p=0.1, lower.tail=TRUE)      -> -1.281552
# qt(p=0.1, df=28, lower.tail=TRUE)   -> -1.312527
# qnorm(p=0.05, lower.tail=FALSE)    -> 1.644854
# qt(p=0.05, df=28, lower.tail=TRUE)  -> -1.701131