1. Collecting Data

1.1 Experimental Design

1.2 Sampling Methods & Bias

1.2.1 Introduction to Sampling

1.2.2 Simple Random Sampling (SRS)

1.2.3 Random Sampling Methods

1.2.4 Types of Bias

1.2.5 Non-random (Biased) Sampling Methods

2. Inference

2.1 Inference for Regression Slopes

2.1.1 Sampling Distributions for Sample Slopes

2.1.2 Hypothesis Tests for Slopes of Regression Lines

2.1.3 Confidence Intervals for Slopes of Regression Lines

2.2 Errors in Hypothesis Tests

2.2.1 Type I & Type II Errors

2.2.2 Probabilities of Errors

2.2.3 Power of a Test

2.3 Introduction to Inference

2.3.1 Tails on a Normal Distribution

2.3.2 Introduction to Hypothesis Testing

2.3.3 Introduction to Confidence Intervals

2.4 Inference for Proportions

2.4.1 Hypothesis Tests for Population Proportions

2.4.2 Confidence Intervals for Population Proportions

2.4.3 Hypothesis Tests for Differences in Population Proportions

2.4.4 Confidence Intervals for Differences in Population Proportions

2.5 Inference for Means

2.5.1 The t-distribution

2.5.2 Hypothesis Tests for Population Means

2.5.3 Confidence Intervals for Population Means

2.5.4 Hypothesis Tests for Differences in Population Means

2.5.5 Confidence Intervals for Differences in Population Means

2.5.6 t-scores versus z-scores

2.5.7 Hypothesis Tests for Differences in Matched Pairs

2.5.8 Confidence Intervals for Differences in Matched Pairs

2.6 Goodness of Fit (Chi-Square)

2.6.1 The Chi-Square Distribution

2.6.2 Hypothesis Tests for Goodness of Fit

2.7 Independence & Homogeneity (Chi-Square)

2.7.1 Tests for Independence

2.7.2 Tests for Homogeneity

3. Probability, Random Variables and Probability Distributions

3.1 Probability

3.1.1 Estimating Probability using Relative Frequency

3.1.2 Probabilities of Single Events

3.1.3 Introduction to Combined Events

3.1.4 Addition Rule & Mutually Exclusive Events

3.1.5 Conditional Probability

3.1.6 Multiplication Rule & Independent Events

3.1.7 Probabilities of Combined Events using Tree Diagrams

3.1.8 Probabilities of Combined Events using the Rules

3.2 Discrete Random Variables

3.2.1 Probability Distributions for Discrete Random Variables

3.2.2 Cumulative Probability Distributions for Discrete Random Variables

3.2.3 Mean & Standard Deviation of a Discrete Random Variable

3.2.4 Linear Transformations of Random Variables

3.2.5 Linear Combinations of Random Variables

3.3 Binomial & Geometric Distributions

3.3.1 Introduction to Binomial Distributions

3.3.2 Probabilities for Binomial Distributions

3.3.3 Introduction to Geometric Distributions

3.3.4 Probabilities for Geometric Distributions

4. Exploring One-Variable Data

4.1 Summary Statistics

4.1.1 Describing Variables

4.1.2 Parameters & Statistics

4.1.3 Measures of Center

4.1.4 Measures of Position

4.1.5 Measures of Variability

4.1.6 Tables & Relative Frequency

4.1.7 Grouped Data

4.1.8 Outliers & Resistant Measures

4.1.9 Five-Number Summary & Boxplots

4.1.10 Skewness of Data

4.1.11 Comparing Data using Summary Statistics

4.2 Graphical Representations

4.2.1 Shape of Distributions

4.2.2 Bar Charts & Histograms

4.2.3 Dotplots & Stemplots

4.2.4 Cumulative Graphs

4.2.5 Comparing Univariate Graphs

4.3 Normal Distribution

4.3.1 Properties of Normal Distributions

4.3.2 Standardized z-scores

4.3.3 Comparing Normal Distributions

4.3.4 Finding Proportions from Normal Distributions

4.3.5 Inverse Normal Calculations

4.3.6 Estimating Parameters of Normal Distributions

5. Sampling Distributions

5.1 Sampling Distributions

5.1.1 Introduction to Sampling Distributions

5.1.2 Sampling Distributions for Sample Means

5.1.3 The Central Limit Theorem

5.1.4 Sampling Distributions for Differences in Sample Means

5.1.5 Sampling Distributions for Sample Proportions

5.1.6 Sampling Distributions for Differences in Sample Proportions

5.1.7 Biased & Unbiased Estimators

6. Exploring Two-Variable Data

6.1 Tables & Graphs

6.1.1 Two-Way Tables & Relative Frequencies

6.1.2 Bar Graphs & Mosaic Plots

6.2 Scatterplots & Regression

6.2.1 Two-Way Tables & Relative Frequencies

6.2.2 Bar Graphs & Mosaic Plots

6.2.3 Explanatory & Response Variables

6.2.4 Scatterplots

6.2.5 Association & Correlation Coefficients

6.2.6 Interpolation & Extrapolation using Linear Models

6.2.7 Residuals

6.2.8 The Least-Squares Regression Line

6.2.9 Residual Plots

6.2.10 The Coefficient of Determination

6.2.11 Outliers, High-Leverage & Influential Points

6.2.12 Linearization of Bivariate Data

Hypothesis Tests for Goodness of Fit

Topic 2/3

Revision Notes
Flashcards
Past Paper Analysis
Questions
Videos

Your Flashcards are Ready!

15 Flashcards in this deck.

Hypothesis Tests for Goodness of Fit

Introduction

Hypothesis tests for goodness of fit are fundamental statistical tools used to determine how well observed data align with an expected distribution. In the context of the College Board AP Statistics curriculum, mastering these tests equips students with the ability to assess the validity of theoretical models against real-world data, fostering critical thinking and analytical skills essential for statistical inference.

Key Concepts

Understanding Goodness of Fit

Goodness of fit tests evaluate whether sample data fit a distribution from a certain population. It assesses the discrepancy between observed frequencies and expected frequencies under a specific hypothesis, typically the null hypothesis.

Types of Goodness of Fit Tests

The most common goodness of fit test is the Chi-Square ($\chi^2$) test. There are also other tests like the Kolmogorov-Smirnov test and the Anderson-Darling test, but the Chi-Square test is predominantly used in categorical data analysis.

Chi-Square Goodness of Fit Test

The Chi-Square Goodness of Fit test evaluates whether the distribution of observed categorical data matches an expected distribution. It is particularly useful for testing hypotheses about the distribution of frequencies across different categories.

Hypotheses in Goodness of Fit Tests

Null Hypothesis ($H_0$): Assumes that there is no significant difference between the observed and expected frequencies.
Alternative Hypothesis ($H_a$): Suggests that there is a significant difference between the observed and expected frequencies.

Test Statistic

The test statistic for the Chi-Square Goodness of Fit test is calculated using the formula: $$ \chi^2 = \sum \frac{(O_i - E_i)^2}{E_i} $$ where:

$O_i$ = Observed frequency for category $i$
$E_i$ = Expected frequency for category $i$

Degrees of Freedom

The degrees of freedom (df) for a Chi-Square Goodness of Fit test are determined by the number of categories minus one, and minus the number of parameters estimated from the data: $$ df = k - 1 - p $$ where:

$k$ = Number of categories
$p$ = Number of parameters estimated

Calculating Expected Frequencies

Expected frequencies ($E_i$) are calculated based on the null hypothesis. For example, if testing whether a die is fair, the expected frequency for each face is: $$ E_i = \frac{\text{Total Observations}}{\text{Number of Faces}} $$

Assumptions of the Chi-Square Test

Data should be in the form of frequencies or counts of cases.
Observations should be independent of each other.
Expected frequency for each category should be at least 5 to ensure the validity of the test.

Step-by-Step Procedure

State the Hypotheses: Define $H_0$ and $H_a$ based on the research question.
Choose the Significance Level: Commonly, $\alpha = 0.05$.
Calculate Expected Frequencies: Based on the null hypothesis.
Compute the Chi-Square Statistic: Using the formula $\chi^2 = \sum \frac{(O_i - E_i)^2}{E_i}$.
Determine Degrees of Freedom: $df = k - 1 - p$.
Find the Critical Value or P-Value: Using Chi-Square distribution tables or software.
Make a Decision: Compare the test statistic to the critical value or assess the p-value against $\alpha$ to accept or reject $H_0$.

Interpreting Results

If the calculated Chi-Square statistic exceeds the critical value or if the p-value is less than the chosen significance level, the null hypothesis is rejected. This indicates that there is a significant difference between the observed and expected frequencies.

Example Problem

*Suppose a six-sided die is rolled 60 times, and the observed frequencies for each face are as follows: 1: 8, 2: 12, 3: 10, 4: 10, 5: 10, 6: 10. Test at $\alpha = 0.05$ whether the die is fair.*

State the Hypotheses: <
- $H_0$: The die is fair; each face has an expected frequency of 10.
- $H_a$: The die is not fair; at least one face has a different expected frequency.
Calculate Expected Frequencies: $E_i = 10$ for each face.
Compute Chi-Square Statistic: $$ \chi^2 = \frac{(8-10)^2}{10} + \frac{(12-10)^2}{10} + \frac{(10-10)^2}{10} + \frac{(10-10)^2}{10} + \frac{(10-10)^2}{10} + \frac{(10-10)^2}{10} = \frac{4}{10} + \frac{4}{10} = 0.8 $$
Degrees of Freedom: $df = 6 - 1 = 5$.
Find Critical Value: For $df=5$ and $\alpha=0.05$, $\chi^2_{critical} \approx 11.070$.
Decision: $0.8 < 11.070$, so we fail to reject $H_0$.
Conclusion: There is no significant evidence to suggest the die is unfair.

Applications of Goodness of Fit Tests

Testing the fairness of dice or games of chance.
Evaluating the distribution of categorical survey responses.
Assessing the fit of observed genetic trait distributions in biology.

Advantages of Goodness of Fit Tests

Simple to perform and interpret.
Applicable to various types of categorical data.
Does not require large sample sizes if expected frequencies are adequate.

Limitations of Goodness of Fit Tests

Requires a sufficiently large sample size to ensure expected frequencies are reliable.
Not suitable for small sample sizes or when expected frequencies are less than 5.
Sensitive to violations of the test assumptions, such as independence of observations.

Comparison Table

Aspect	Chi-Square Goodness of Fit	Other Goodness of Fit Tests
Data Type	Categorical	Continuous (e.g., Kolmogorov-Smirnov)
Assumptions	Expected frequencies ≥ 5, independent observations	Depends on the test; e.g., K-S requires continuous distribution
Sensitivity	Less sensitive to deviations in large samples	More sensitive to deviations in specific areas (e.g., tail behavior)
Common Uses	Testing categorical distributions like dice fairness	Evaluating distribution fit for continuous data, such as normality tests
Advantages	Simple, widely applicable for categorical data	Can handle different types of data and distributions
Disadvantages	Not suitable for small samples or low expected frequencies	May require more complex calculations or assumptions

Summary and Key Takeaways

Goodness of fit tests assess how well observed data match expected distributions.
The Chi-Square test is the most common method for categorical data.
Proper calculation of expected frequencies and adherence to assumptions are crucial.
Understanding degrees of freedom aids in accurate hypothesis testing.
Goodness of fit tests have wide applications but also specific limitations.

Examiner Tip

Tips

Remember the acronym CHi-FREE to recall the steps of the Chi-Square test: Clarify hypotheses, Head significance level, Input expected frequencies, Formulate statistic, Review degrees of freedom, Evaluate results, and Explain conclusions. Additionally, practice with diverse examples to strengthen your understanding and ensure success on the AP exam.

Did You Know

The Chi-Square test was first introduced by the German mathematician Karl Pearson in 1900. It's not only used in statistics but also plays a crucial role in machine learning algorithms, especially in feature selection for classification problems. Additionally, goodness of fit tests can help in validating models in fields like genetics, marketing, and even sports analytics to ensure the models accurately reflect real-world scenarios.

Common Mistakes

Mistake 1: Ignoring the expected frequency requirement. For example, using the Chi-Square test when some expected frequencies are below 5 can lead to inaccurate results.
Correction: Always ensure that all expected frequencies are at least 5 or consider combining categories.

Mistake 2: Miscalculating degrees of freedom. Students often forget to subtract the number of estimated parameters.
Correction: Use the formula $df = k - 1 - p$ to accurately determine degrees of freedom.

FAQ

What is the purpose of a goodness of fit test?

A goodness of fit test assesses how well observed data match an expected distribution, helping determine if a theoretical model is appropriate for the data.

When should I use the Chi-Square goodness of fit test?

Use the Chi-Square test when dealing with categorical data and you want to compare observed frequencies with expected frequencies under a specific hypothesis.

How do I calculate the Chi-Square statistic?

The Chi-Square statistic is calculated by summing the squared difference between observed and expected frequencies, divided by the expected frequencies: $\chi^2 = \sum \frac{(O_i - E_i)^2}{E_i}$.

What are the assumptions of the Chi-Square test?

The assumptions include having categorical data, independent observations, and expected frequencies of at least 5 in each category.

How do degrees of freedom affect the Chi-Square test?

Degrees of freedom determine the critical value from the Chi-Square distribution table. It is calculated as the number of categories minus one minus the number of estimated parameters.

Can I use the Chi-Square test for small sample sizes?

Generally, the Chi-Square test is not suitable for small sample sizes, especially if expected frequencies in some categories are less than 5. In such cases, consider combining categories or using alternative tests.

1. Collecting Data

1.1 Experimental Design

1.1.1 Completely Randomized Design

1.1.2 Randomized Block & Matched Pairs Design

1.1.3 Introduction to Experiments

1.1.4 Well-Designed Experiments

1.1.5 Control Groups, Placebos & Blind Experiments

1.2 Sampling Methods & Bias

1.2.1 Introduction to Sampling

1.2.2 Simple Random Sampling (SRS)

1.2.3 Random Sampling Methods

1.2.4 Types of Bias