Slides 📊
9.3. Precision of a Confidence Interval
In the previous section, we established that confidence intervals convey the degree of uncertainty through their width. A narrow interval suggests high precision, while a wide interval reflects greater uncertainty. This naturally raises two key questions:
What factors determine the width of a confidence interval?
What can we do to make an interval more precise (narrower)?
We will explore the answers to these questions in this lesson.
Road Map 🧭
Identify the three building blocks of a margin of error and understand how each influences the precision of a confidence interval.
Understand why the sample size is the only component of an ME that researchers can actively control.
For a desired level of precision, compute the mininum required sample size, and incorporate this into experiment planning.
9.3.1. What Makes a Confidence Interval Precise?
Fig. 9.5 The width of a confidence interval is equal to 2ME
The width of a confidence interval is entirely determined by its margin of error (ME) as shown in Fig. 9.5. Specifically,
From its mathematical definition, we can identify three elements that determine the size of an ME: \(\alpha\) (and thus \(C\)), \(\sigma\), and \(n\). Let us examine how each affects the width of a confidence interval.
Component of ME |
Effect on precision |
|---|---|
Confidence level \(C\) |
Higher \(C \to\) the central region of area \(C\) expands \(\to\) \(z_{\alpha/2}\) pushed to the right \(\to\) increased ME & lower precision
Fig. 9.6 Standard normal PDF |
Population standard deviation \(\sigma\) |
Higher \(\sigma \to\) increaseed ME & lower precision |
Sample size \(n\) |
Higher \(n \to\) decreased ME & higher precision |
Controlling the Precision
Of the three components of a margin of error, researchers can practically control only the sample size, \(n\). The population standard deviation \(\sigma\) is a fixed property of the population, and the confidence level \(C\) must be chosen to reflect the desired rigor of the experiment; it is not meaningful to lower confidence merely to obtain a narrower interval.
This leads to an important practical question: “How do we control the sample size to meet a target precision?”
9.3.2. Sample Size Planning for a Target Precision
Maximum Width
Suppose we want to construct a confidence interval whose width is at most \(w_{max}\). That is, we want
Rearranging to isolate \(n\),
Square both sides and bring \(n\) to the left hand side:
Since \(n\) must be a whole number, we take the the smallest integer on or above the lower bound. That is, the smallest required sample size \(n^*\) for a target width \(w_{max}\) is computed as:
Maximum Margin of Error
The desired level of precision can also be given as a maximum margin of error, \(\text{ME}_{max}\). In this case, we take \(n^*\) as:
through a similar flow of logic. You are encouraged to confirm this result as a quick exercise.
Example 💡: Weights of American Adult Males, Continued
We continue with the study of weights of American male adults from Section 9.2. Suppose the researchers want to compute a 99% confidence interval for the mean weight with a margin of error of at most 1. What is the smallest sample size they need?
The building blocks
The maximum ME: \(\text{ME}_{max} = 1\).
The population standard deviation: \(\sigma=49.26\) lbs
Confidence level: \(99\% \, (C=0.99, \alpha=0.01)\)
The z-critical value: \(z_{\alpha/2} = z_{0.005} = 2.5758\)
The goal
We would like to compute the smallest integer \(n\) satisfying
Rearranging and isolating \(n\),
The smallest sample size needed for the desired level of precison is \(n^* = \lceil 16099.53\rceil =16,100\).
9.3.3. Bringing It All Together
Key Takeaways 📝
The margin of error quantifies the precision of a confidence interval. The three factors of margin of error that control precision are: confidence level \(C\), population variance \(\sigma^2\) (or population sd \(\sigma\)), and sample size \(n\).
The smallest required sample size for a given maximum ME can be calculated using: \(n^* = \Big\lceil \left(\frac{z_{\alpha/2}\,\sigma}{\text{ME}_{max}}\right)^2\Big\rceil\).
Learning Objectives 🎯
These exercises will help you:
Identify the three components that determine margin of error
Understand how each component affects CI precision
Calculate the minimum sample size for a desired margin of error
Apply sample size planning to experiment design
Recognize the diminishing returns of increasing sample size
Key Formulas 📐
Margin of Error:
Sample Size Formula (for target ME):
CI Width: \(\text{Width} = 2 \times ME\)
R Function:
# Sample size for target ME
n_star <- ceiling((qnorm(alpha/2, lower.tail=FALSE) * sigma / ME_max)^2)
9.3.4. Exercises
Exercise 1: Components of Margin of Error
The margin of error for a confidence interval when σ is known is \(ME = z_{\alpha/2} \frac{\sigma}{\sqrt{n}}\).
List the three components that determine the margin of error.
For each component, describe the effect of increasing it on the margin of error (increases, decreases, or no effect).
Which component(s) can researchers typically control when planning a study?
Why is it generally not appropriate to lower the confidence level just to achieve a narrower interval?
Solution
Part (a): Three components
Confidence level \(C\) (through \(z_{\alpha/2}\) where \(\alpha = 1 - C\))
Population standard deviation \(\sigma\)
Sample size \(n\)
Part (b): Effects of increasing each
Increasing \(C\): Increases \(z_{\alpha/2}\) → increases ME → wider CI, less precision
Increasing \(\sigma\): Directly increases ME → wider CI, less precision
Increasing \(n\): Decreases \(\frac{\sigma}{\sqrt{n}}\) → decreases ME → narrower CI, more precision
Part (c): Controllable components
Researchers directly control sample size \(n\) and choose confidence level \(C\).
\(\sigma\) is a fixed property of the population (though design choices can sometimes reduce variability)
\(C\) is chosen based on the desired level of confidence—while it can be lowered to reduce ME, this is generally not appropriate (see part d)
Part (d): Why lowering confidence level is problematic
It undermines the reliability of the inference
A narrower interval at lower confidence is “cheap” precision—you’re less certain it contains μ
The appropriate confidence level should reflect the study’s scientific requirements, not be chosen for convenience
Reporting a 70% CI just to look more precise would be misleading
Exercise 2: Sample Size Calculation - Basic
A materials engineer wants to estimate the hardness of steel samples with a 95% confidence interval. From historical data, the population standard deviation is \(\sigma = 15\) Rockwell units.
Calculate the minimum sample size needed to achieve a margin of error of:
ME = 3 Rockwell units
ME = 2 Rockwell units
ME = 5 Rockwell units
What pattern do you notice? How does the required sample size change as the desired precision increases (ME decreases)?
Solution
Given: \(\sigma = 15\) Rockwell units, 95% confidence (\(z_{0.025} = 1.96\))
Formula: \(n^* = \left\lceil \left(\frac{z_{\alpha/2} \cdot \sigma}{ME_{max}}\right)^2 \right\rceil\)
Part (a): ME = 3
Part (b): ME = 2
Part (c): ME = 5
Part (d): Pattern
Desired ME |
Required n |
|---|---|
5 |
35 |
3 |
97 |
2 |
217 |
As ME decreases (precision increases), the required sample size increases quadratically. Reducing ME from 3 to 2 (a 33% reduction) requires increasing n from 97 to 217 (a 124% increase). Halving ME quadruples the required n.
R verification:
sigma <- 15
z <- qnorm(0.025, lower.tail = FALSE)
ceiling((z * sigma / 3)^2) # 97
ceiling((z * sigma / 2)^2) # 217
ceiling((z * sigma / 5)^2) # 35
Exercise 3: Sample Size for CI Width
A pharmaceutical researcher wants to estimate the mean drug concentration in blood samples. The population standard deviation is \(\sigma = 8.4\) ng/mL. The researcher wants a 95% CI with total width no more than 4 ng/mL.
What is the maximum margin of error corresponding to a width of 4 ng/mL?
Calculate the minimum sample size needed.
If the researcher can only afford to collect 50 samples, what would be the resulting CI width?
What width could be achieved with 100 samples?
Solution
Given: \(\sigma = 8.4\) ng/mL, 95% confidence (\(z_{0.025} = 1.96\))
Part (a): Maximum ME
Width = 2 × ME, so:
Part (b): Minimum sample size
Part (c): CI width with n = 50
Part (d): CI width with n = 100
R verification:
sigma <- 8.4
z <- 1.96
# Part (b)
ceiling((z * sigma / 2)^2) # 68
# Part (c): Width with n = 50
2 * z * sigma / sqrt(50) # 4.66
# Part (d): Width with n = 100
2 * z * sigma / sqrt(100) # 3.29
Exercise 4: Diminishing Returns
A quality engineer monitors the tensile strength of wire cables with \(\sigma = 25\) N. She wants to understand how sample size affects precision.
Complete the table below for a 95% confidence interval:
Sample Size (n)
Standard Error
Margin of Error
CI Width
25
?
?
?
100
?
?
?
400
?
?
?
1600
?
?
?
Each row quadruples the sample size. By what factor does the CI width decrease each time?
What is the practical implication of this pattern for study planning?
Solution
Given: \(\sigma = 25\) N, \(z_{0.025} = 1.96\)
Part (a): Completed table
n |
SE = σ/√n |
ME = 1.96 × SE |
Width = 2 × ME |
|---|---|---|---|
25 |
5.00 |
9.80 |
19.60 |
100 |
2.50 |
4.90 |
9.80 |
400 |
1.25 |
2.45 |
4.90 |
1600 |
0.625 |
1.225 |
2.45 |
Part (b): Factor of decrease
Each time n quadruples, the CI width is halved (factor of 2).
This is because \(\text{Width} \propto 1/\sqrt{n}\), and \(\sqrt{4} = 2\).
Part (c): Practical implication
Diminishing returns: To achieve each additional halving of CI width requires quadrupling the sample size. This means:
Early increases in n are very cost-effective
Later increases become increasingly expensive relative to the gain in precision
There’s often a practical “sweet spot” where the cost of additional samples outweighs the marginal improvement in precision
Exercise 5: Different Confidence Levels
An aerospace engineer needs to estimate mean thrust with \(\sigma = 500\) N. She wants a margin of error no larger than 100 N.
Calculate the minimum sample size for:
90% confidence
95% confidence
99% confidence
Create a table summarizing n, z-critical value, and resulting ME for each confidence level. What tradeoff does this illustrate?
Solution
Given: \(\sigma = 500\) N, \(ME_{max} = 100\) N
Part (a): 90% confidence (\(z_{0.05} = 1.645\))
Part (b): 95% confidence (\(z_{0.025} = 1.96\))
Part (c): 99% confidence (\(z_{0.005} = 2.576\))
Part (d): Summary table and tradeoff
Confidence |
z-critical |
Required n |
|---|---|---|
90% |
1.645 |
68 |
95% |
1.96 |
97 |
99% |
2.576 |
166 |
Tradeoff: Higher confidence requires larger sample sizes to maintain the same precision. Going from 95% to 99% confidence (a 4 percentage point increase) requires 71% more samples (97 → 166).
Exercise 6: Budget Constraints
A clinical researcher has a budget for testing at most 80 patient samples. Each test measures drug clearance rate with \(\sigma = 12\) mL/min.
At 95% confidence with n = 80, what margin of error will be achieved?
What is the resulting CI width?
If the researcher had hoped for ME ≤ 2 mL/min, how many samples would be needed? Is this feasible?
What ME could be achieved with a 90% CI using the same n = 80?
Solution
Given: \(\sigma = 12\) mL/min, budget allows n = 80
Part (a): ME with n = 80 at 95% confidence
Part (b): CI width
Part (c): Required n for ME ≤ 2
Not feasible with budget of 80 samples. Would need 139 samples.
Part (d): ME with 90% CI, n = 80
Switching to 90% confidence reduces ME from 2.63 to 2.21 mL/min, but still doesn’t achieve the target of 2.0 mL/min.
Exercise 7: Planning a Manufacturing Study
A manufacturing process produces components with \(\sigma = 1.2\) g. An engineer wants a 95% CI with width no more than 0.5 g.
Calculate the minimum sample size needed.
If the engineer uses systematic sampling by measuring every 10th item off the production line, what potential problem could arise?
If each measurement costs $15, what is the minimum budget needed for this precision?
Solution
Given: \(\sigma = 1.2\) g, width ≤ 0.5 g, 95% confidence
Part (a): Required sample size
\(ME_{max} = 0.5/2 = 0.25\) g
Part (b): Systematic sampling problem
Measuring every 10th item creates a systematic (non-random) sample. If the production process has any cyclical patterns (e.g., machine warm-up effects, batch variations), this could introduce bias. Random sampling is preferred for valid inference.
If systematic sampling must be used, randomize the starting point (e.g., randomly select a number 1-10 to determine which item to start with), and verify there is no periodicity in the production process that aligns with the sampling interval.
Part (c): Minimum budget
Exercise 8: Unknown σ Planning
When planning a study, σ is often unknown. A researcher estimates σ from a pilot study or uses a conservative estimate.
Suppose a researcher plans to estimate mean reaction time with ME ≤ 10 ms at 95% confidence. Consider three estimates of σ:
Conservative estimate: σ = 50 ms
Moderate estimate: σ = 40 ms
Optimistic estimate: σ = 30 ms
Calculate the required sample size for each estimate of σ.
If the true σ turns out to be 45 ms and the researcher used the optimistic estimate (σ = 30), what ME will actually be achieved?
Why is it often recommended to use a conservative (larger) estimate of σ when planning?
Solution
Given: \(ME_{max} = 10\) ms, 95% confidence (\(z_{0.025} = 1.96\))
Part (a): Required sample sizes
Conservative (σ = 50):
Moderate (σ = 40):
Optimistic (σ = 30):
Part (b): Actual ME if true σ = 45 but used n = 35
The actual ME (14.9 ms) exceeds the target (10 ms) by 49%!
Part (c): Why use conservative estimates
Using a larger σ estimate leads to a larger sample size
If the true σ is smaller than estimated, you get better precision than planned (ME < target)
If the true σ is larger than estimated (as in part b), you fail to meet your precision goal
It’s better to collect “too many” samples than to end up with insufficient precision
Exercise 9: Comparing Precision Requirements
Three different applications require estimating a mean with \(\sigma = 20\):
Application A: Rough estimate needed; ME ≤ 5 at 90% confidence
Application B: Moderate precision; ME ≤ 3 at 95% confidence
Application C: High precision; ME ≤ 1 at 99% confidence
Calculate the required sample size for each application.
Application C requires how many times more samples than Application A?
If each sample costs $50, calculate the cost for each application.
Solution
Given: \(\sigma = 20\)
Part (a): Required sample sizes
Application A (90%, ME ≤ 5): \(z_{0.05} = 1.645\)
Application B (95%, ME ≤ 3): \(z_{0.025} = 1.96\)
Application C (99%, ME ≤ 1): \(z_{0.005} = 2.576\)
Part (b): Ratio of C to A
Application C requires about 60 times more samples than Application A.
Part (c): Costs
Application A: 44 × $50 = $2,200
Application B: 171 × $50 = $8,550
Application C: 2654 × $50 = $132,700
9.3.5. Additional Practice Problems
True/False Questions (1 point each)
Doubling the sample size will halve the margin of error.
Ⓣ or Ⓕ
The population standard deviation σ can be reduced by collecting more samples.
Ⓣ or Ⓕ
To halve the CI width, you must quadruple the sample size.
Ⓣ or Ⓕ
Higher confidence levels require larger sample sizes to maintain the same ME.
Ⓣ or Ⓕ
The sample size formula always gives a whole number.
Ⓣ or Ⓕ
CI width is equal to twice the margin of error.
Ⓣ or Ⓕ
Multiple Choice Questions (2 points each)
If \(\sigma = 10\), what sample size is needed for ME = 2 at 95% confidence?
Ⓐ 25
Ⓑ 49
Ⓒ 97
Ⓓ 196
The sample size formula \(n = \left(\frac{z \sigma}{ME}\right)^2\) shows that n is:
Ⓐ Directly proportional to ME
Ⓑ Inversely proportional to ME
Ⓒ Inversely proportional to ME²
Ⓓ Independent of ME
To decrease ME by a factor of 3 (e.g., from 6 to 2), sample size must increase by a factor of:
Ⓐ 3
Ⓑ 6
Ⓒ 9
Ⓓ 27
Which statement about the ceiling function is correct?
Ⓐ ⌈67.01⌉ = 67
Ⓑ ⌈67.99⌉ = 68
Ⓒ ⌈67.00⌉ = 68
Ⓓ ⌈67.5⌉ = 67.5
A researcher has σ = 15 and wants ME ≤ 3 at 95% confidence. If budget allows only n = 50, the actual ME will be:
Ⓐ Less than 3
Ⓑ Exactly 3
Ⓒ Greater than 3
Ⓓ Cannot be determined
The “diminishing returns” principle means:
Ⓐ Larger samples always give the same ME
Ⓑ Each additional sample contributes less to precision than the previous one
Ⓒ Sample size has no effect on CI width
Ⓓ Higher confidence gives narrower intervals
Answers to Practice Problems
True/False Answers:
False — Doubling n reduces ME by factor of √2 ≈ 1.41, not 2.
False — σ is a fixed population property; it cannot be changed by sampling.
True — Width ∝ 1/√n, so halving width requires n × 4.
True — Higher confidence means larger z-critical value, requiring more samples for same ME.
False — The formula often gives non-integers; we round up (ceiling function).
True — CI is x̄ ± ME, so width = (x̄ + ME) − (x̄ − ME) = 2ME.
Multiple Choice Answers:
Ⓒ — n = ⌈(1.96 × 10/2)²⌉ = ⌈96.04⌉ = 97.
Ⓒ — n = (zσ/ME)² means n ∝ 1/ME².
Ⓒ — If ME decreases by factor of 3, n increases by 3² = 9.
Ⓑ — Ceiling rounds up to the nearest integer, so ⌈67.99⌉ = 68.
Ⓒ — Required n = 97, but only 50 available. ME = 1.96 × 15/√50 = 4.16 > 3.
Ⓑ — As n increases, the marginal improvement in precision decreases.