STAT 350: Introduction to Statistics
Orientation
Course Introduction & Overview
Why Statistics? Why Now?
Course Roadmap
A Clear Note on Using AI
Getting Started: A Checklist
Homework Assignments on Edfinity
Miscellaneous Tips
Exam Information
Course Examinations
General Exam Policies
Exam 1
Exam Locations by Section
Exam Coverage
Preparation Materials
Exam 2
Exam Locations by Section
Exam Coverage
Additional Resources
Preparation Materials
Final Exam
About the Final Exam
Required Review Materials
Post Exam 2 Preparation Materials
Study Guide Resource
Worksheets
Course Worksheets
Pedagogical Philosophy
Implementation Guidelines
Why These Worksheets Matter
The Critical Role of Simulation
Worksheets
Worksheet 1: Exploring Data with R
Introduction
Part 1: Loading and Understanding the Dataset
Part 2: Initial Data Exploration
Part 3: Frequency Tables
Part 4: Univariate Analysis of Uptake
Part 5: Grouped Statistics with tapply
Part 6: Comparative Visualization by Type
Part 7: Exploring the Concentration Effect
Part 8: Advanced Visualization with Multiple Categories
Reference: Key Functions
Troubleshooting Guide
Additional Resources
Worksheet 2: Set Theory and Probability Fundamentals
Introduction
Part 1: Set Theory Foundations
Part 2: Probability Axioms
Part 3: Applying Probability Rules
Part 4: The Inclusion-Exclusion Principle
Key Takeaways
Submission Guidelines
Worksheet 3: Conditional Probability and Bayes’ Theorem
Introduction
Part 1: Understanding Conditional Probability
Part 2: Tree Diagrams and Sequential Sampling
Part 3: Bayes’ Theorem and Sequential Updating
Key Takeaways
Worksheet 4: Independence and Random Variables
Part 1: Independence Property
Part 2: Independent vs. Mutually Exclusive Events
Part 3: Introduction to Random Variables
Part 4: Probability Mass Functions
Part 5: Joint Probability Mass Functions
Key Takeaways
Worksheet 5: Expected Value and Variance
Introduction
Part 1: Expected Value and LOTUS
Part 2: Variance and Its Properties
Part 3: Sums of Random Variables
Part 4: Joint Probability Mass Functions
Key Takeaways
Worksheet 6: Named Discrete Distributions
Introduction
Part 1: The Bernoulli and Binomial Distributions
The Binomial Distribution
Part 2: The Poisson Distribution
Part 3: Other Named Discrete Distributions
Key Takeaways
Worksheet 7: Continuous Random Variables
Introduction
Part 1: Probability Density Functions
Part 2: Finding Constants for Valid PDFs
Part 3: Expected Value and Variance
Part 4: Cumulative Distribution Functions
Key Takeaways
Worksheet 8: Uniform and Exponential Distributions
Introduction
Part 1: The Uniform Distribution
Part 2: The Exponential Distribution
Key Takeaways
Worksheet 9: The Normal Distribution
Introduction
Part 1: The Normal Distribution
Part 2: The Standard Normal Table
Part 3: Z-Score Transformation
Key Takeaways
Worksheet 10: Checking Normality and Introduction to Sampling Distributions
Part 1: Checking Normality
Part 2: Introduction to Sampling Distributions
Key Takeaways
Worksheet 11: The Central Limit Theorem
Introduction
Tutorial: Generating the Sampling Distribution of
\(\bar{X}\)
Part 1: Exploring CLT with Skewed Distributions
Part 2: Application of the CLT
Part 3: Beyond Mean and Sum
Part 4: Exploring CLT Generalizability with AI Assistance (Exploration)
Key Takeaways
Worksheet 12: Point Estimators and Unbiased Estimation
Introduction
Part 1: Estimating Parameters of the Exponential Distribution
Part 2: Estimating the Maximum of a Uniform Distribution
Part 3: Minimum Variance Unbiased Estimators (MVUE)
Key Takeaways
Worksheet 13: Introduction to Confidence Intervals
Introduction
Part 1: Pivotal Quantities and Deriving Confidence Intervals
Part 2: Sample Size Determination
Part 3: One-Sided Confidence Bounds
Part 4: Confidence Intervals When σ is Unknown
Key Takeaways
Worksheet 14: Student’s t-Distribution and Statistical Power
Introduction
Assumptions for t-Distribution Inference
Part 1: Analyzing Chick Growth with Confidence Intervals
Part 2: Introduction to Hypothesis Testing
Part 3: Sample Size Determination and Power Analysis
Key Takeaways
The Hypothesis Testing Framework
The Test Statistic (Known σ)
Distribution of the Test Statistic
The p-value
Part 1: Simulating Test Statistics and P-values
Question 1a: Simulation When Null Hypothesis is True
Question 1b: Simulation When Null Hypothesis is False
Question 1c: Comparing the Two Scenarios
The t-Test When σ is Unknown
The t-Test Statistic
Properties of the t-Distribution
Relationship Between Confidence Intervals and Hypothesis Tests
Part 2: EPA Ozone Concentration Analysis
Question 2a: Assumptions and Exploratory Analysis
Question 2b: Full Hypothesis Testing Procedure
Question 2c: Manual Verification
Question 2d: Confidence Bound
Question 2e: Power Calculation
Key Takeaways
Worksheet 16: Two-Sample Inference
Introduction
Part 1: Independent vs. Paired Designs
Part 2: Independent Samples with Known Variances
Part 3: Pooled Two-Sample t-Test (Equal Variances)
Part 4: Welch’s Two-Sample t-Test (Unequal Variances)
Key Takeaways
Worksheet 17: Paired Sample Inference
Introduction
Part 1: Theory and Procedure for Paired Samples
Part 2: Sleep Study Analysis
Part 3: Sample Size Calculation for Paired Designs
Key Takeaways
Computer Assignments
R / RStudio Guide and Function Reference
Overview
Getting Started with R and RStudio
Quick Start: R / RStudio Setup
RStudio Orientation
Packages / Libraries (Course Set)
Getting Started with swirl
Alternative R Learning Resources
Quick Reference Table
Computer Assignments and Tutorials
Overview
Assignment Structure by Session
Assignment Tutorials (Links)
Course Pipeline (At a Glance)
Function Reference Part 1
Data I/O & Housekeeping
Data Structures & Creation
Data Wrangling & Utilities
Descriptive Statistics & Correlation
Probability & Distributions
Simulation Functions
Function Reference Part 2: Inference Functions
Diagnostic Plots for Assumptions
Graphics (ggplot2)
Core Components
Geoms (Geometric Objects)
Categorical Data Visualization
Plot Customization
Tables & Reporting
Best Practices & Common Pitfalls
Data Import & Validation
Statistical Assumptions
Common Errors to Avoid
Workflow Template
Course Datasets
Primary Course Dataset - AppRating
Tutorial Support Datasets
Loading Datasets in R
Built-in R Datasets Used in Course
Data Download and Organization
Chapters
1. Introduction to Statistics
1.1. What Is Statistics?
1.2. Probability & Statistical Inference: How Are They Associated?
2. Graphical Summaries
2.1. Data Set Structure and Variable Types
2.2. Tools for Categorical (Qualitative) Data
2.3. Tools for Numerical (Quantitative) Data
2.4. Exploring Quantitative Distributions: Modality, Skewness & Outliers
3. Numerical Summaries
3.1. Introduction to Numerical Summaries: Notation and Terminology
3.2. Measures of Central Tendency
3.3. Measures of Variability - Range, Variance, and Standard Deviation
3.4. Measures of Variability - Interquartile Range and Five-Number Summary
3.5. Choosing the Right Measure & Comparing Measures Across Data Sets
4. Probability
4.1. Basic Set Theory
4.2. Probability
4.3. Conditional Probability
4.4. Law of Total Probability and Bayes’ Rule
4.5. Sequential Bayesian Updating
4.6. Independence of Events
5. Discrete Distributions
5.1. Discrete Random Variables and Probability Mass Distributions
5.2. Joint Probability Mass Functions
5.3. Expected Value of a Discrete Random Variable
5.4. Varianace of a Discrete Random Variable
5.5. Covariance of Dependent Random Variables
5.6. The Binomial Distribution
5.7. The Poisson Distribution
6. Continuous Distributions
6.1. Continuous Random Variables and Probability Density Functions
6.2. Expected Value and Variance of Continuous Random Variables
6.3. Cumulative Distribution Functions
6.4. Normal Distribution
6.5. Uniform Distribution
6.6. Exponential Distribution
7. Sampling Distributions
7.1. Statistics and Sampling Distributions
7.2. Sampling Distribution for the Sample Mean
7.3. The Central Limit Theorem (CLT)
7.4. Understanding Binomial and Poisson Distributions through CLT
8. Experimental Design
8.1. Experimental and Sampling Designs
8.2. Experimental Design Principles
8.3. Basic Types of Experimental Design
8.4. Addressing Potential Flaws in Experimental Design
8.5. Examples of Experimental Design
8.6. Sampling Design
8.7. Sampling Bias
9. Confidence Intervals and Bounds
9.1. Introduction to Statistical Inference
9.2. Confidence Intervals for the Population Mean, When σ is Known
9.3. Precision of a Confidence Interval
9.4. Confidence Bounds for the Poulation Mean When σ is Known
9.5. Confidence Intervals and Bounds When σ is Unknown
10. Hypothesis Testing
10.1. The Foundation of Hypothesis Testing
10.2. Hypothesis Test for the Population Mean When σ is Known
10.3. Connecting CI and HT;
t
-Test for μ When σ Is Unknown
10.4. The Four Steps to Hypothesis Testing and Understanding the Result
11. Two Sample Procedures
11.1. Statistical Inference for Two Samples
11.2. Independent Two-Sample Analysis When Population Variances Are Known
11.3. Independent Two-Sample Analysis - Pooled Variance Estimator
11.4. Independent Two-Sample Analysis - No Equal Variance Assumption
11.5. Paired Two-Sample Analysis
12. ANOVA
12.1. Introduction to One-Way ANOVA
12.2. Different Sources of Variability in ANOVA
12.3. ANOVA F-Test and Its Relationship to Two-Sample
t
-Tests
12.4. Multiple Comparison Procedures
13. Simple Linear Regression
13.1. Introduction to Linear Regression: Correlation and Scatter Plots
13.2. Simple Linear Regression
13.3. Model Diagnostics and Statistical Inference
13.4. Prediction, Robustness, and Applied Examples
STAT 350: Introduction to Statistics
12.
ANOVA
View page source
12.
ANOVA
12.1. Introduction to One-Way ANOVA
12.2. Different Sources of Variability in ANOVA
12.3. ANOVA F-Test and Its Relationship to Two-Sample
t
-Tests
12.4. Multiple Comparison Procedures