Introduction to Hypothesis Testing

Introduction

Hypothesis testing is a fundamental statistical method used to make inferences about populations based on sample data. In College Board AP Statistics, understanding hypothesis testing is crucial for analyzing data, drawing conclusions, and making informed decisions. This article introduces hypothesis testing, covering the essential concepts, methods, and practical applications relevant to students preparing for the AP Statistics exam.

Key Concepts

1. What is Hypothesis Testing?

Hypothesis testing is a systematic procedure used to evaluate statements or claims about a population parameter. It involves making an initial assumption (the null hypothesis) and determining whether there is sufficient evidence in the sample data to reject this assumption in favor of an alternative hypothesis.

2. Types of Hypotheses

Null Hypothesis ($H_0$): This is the default or skeptical assertion that there is no effect or no difference. It represents a statement of no change or no association.
Alternative Hypothesis ($H_a$): This hypothesis contradicts the null hypothesis. It represents the outcome that the researcher aims to support, indicating an effect or a difference.

3. Steps in Hypothesis Testing

State the Hypotheses: Clearly define the null and alternative hypotheses based on the research question.
Select the Significance Level ($\alpha$): Common choices are 0.05, 0.01, or 0.10, representing the probability of rejecting the null hypothesis when it is true.
Choose the Appropriate Test: Depending on the data type and sample size, select tests such as z-test, t-test, chi-square test, or ANOVA.
Compute the Test Statistic: Calculate the test statistic using relevant formulas. For example, the z-test statistic is computed as:

$$ z = \frac{\bar{x} - \mu_0}{\sigma / \sqrt{n}} $$ where $\bar{x}$ is the sample mean, $\mu_0$ is the population mean under the null hypothesis, $\sigma$ is the population standard deviation, and $n$ is the sample size.

Determine the P-Value or Critical Value: The p-value is the probability, assuming the null hypothesis is true, of obtaining a test statistic at least as extreme as the one observed. Alternatively, a critical value marks the boundary of the rejection region: the null hypothesis is rejected when the test statistic falls beyond it.

Make a Decision: Compare the p-value with the significance level to decide whether to reject or fail to reject the null hypothesis.

Draw a Conclusion: Interpret the results in the context of the research question.
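The steps above can be sketched in a few lines of Python. This is a minimal illustration of a two-sided one-sample z-test, not a reference implementation: the function name `z_test` and the sample values (mean 52, $\mu_0 = 50$, $\sigma = 6$, $n = 36$) are hypothetical and chosen only to make the arithmetic easy to follow.

```python
import math

def z_test(sample_mean, mu0, sigma, n):
    """Two-sided one-sample z-test; returns (z, p_value)."""
    standard_error = sigma / math.sqrt(n)
    z = (sample_mean - mu0) / standard_error
    # Standard normal CDF evaluated at |z|, written with the error function
    cdf_abs_z = 0.5 * (1 + math.erf(abs(z) / math.sqrt(2)))
    p_value = 2 * (1 - cdf_abs_z)  # area in both tails
    return z, p_value

# Hypothetical data: H0: mu = 50 versus Ha: mu != 50
alpha = 0.05
z, p = z_test(sample_mean=52.0, mu0=50.0, sigma=6.0, n=36)
decision = "reject H0" if p <= alpha else "fail to reject H0"
print(f"z = {z:.2f}, p-value = {p:.4f}, decision: {decision}")
```

Here the standard error is $6 / \sqrt{36} = 1$, so $z = 2$, and the decision follows from comparing the p-value with $\alpha$.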

4. Types of Errors

Type I Error ($\alpha$): Rejecting the null hypothesis when it is actually true. The probability of committing a Type I error is equal to the significance level.
Type II Error ($\beta$): Failing to reject the null hypothesis when the alternative hypothesis is true.
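The link between $\alpha$ and the Type I error rate can be checked by simulation. The sketch below repeatedly draws samples from a population where the null hypothesis really is true and counts how often a two-sided z-test rejects it anyway. All settings (5000 trials, $n = 25$, a standard normal population, the seed) are arbitrary assumptions for illustration.

```python
import math
import random

def two_sided_p(z):
    """Two-sided p-value for a standard normal test statistic."""
    return 2 * (1 - 0.5 * (1 + math.erf(abs(z) / math.sqrt(2))))

def type_i_error_rate(trials=5000, n=25, mu0=0.0, sigma=1.0, alpha=0.05, seed=1):
    """Fraction of simulated samples (drawn with H0 true) where H0 is rejected."""
    rng = random.Random(seed)
    rejections = 0
    for _ in range(trials):
        sample = [rng.gauss(mu0, sigma) for _ in range(n)]
        sample_mean = sum(sample) / n
        z = (sample_mean - mu0) / (sigma / math.sqrt(n))
        if two_sided_p(z) <= alpha:
            rejections += 1  # a Type I error: H0 is true but was rejected
    return rejections / trials

rate = type_i_error_rate()
print(f"Estimated Type I error rate: {rate:.3f}")
```

The estimated rate should land close to the chosen $\alpha = 0.05$, with some simulation noise.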

5. One-Tailed vs. Two-Tailed Tests

One-Tailed Test: Used when the alternative hypothesis is directional, specifying that a parameter is either greater than or less than the null hypothesis value.
Two-Tailed Test: Used when the alternative hypothesis is non-directional, stating that a parameter is simply different from the null hypothesis value without specifying the direction.
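To see how the choice of tail changes the p-value, the following sketch computes it for the same z statistic under each form of the alternative hypothesis. The helper name `p_value`, the labels `"greater"`, `"less"`, and `"two-sided"`, and the value $z = 1.80$ are assumptions made for this example.

```python
from statistics import NormalDist

def p_value(z, alternative):
    """p-value of a z statistic for a given form of the alternative hypothesis."""
    std_normal = NormalDist()  # mean 0, standard deviation 1
    if alternative == "greater":      # Ha: parameter > null value
        return 1 - std_normal.cdf(z)
    if alternative == "less":         # Ha: parameter < null value
        return std_normal.cdf(z)
    if alternative == "two-sided":    # Ha: parameter != null value
        return 2 * (1 - std_normal.cdf(abs(z)))
    raise ValueError(f"unknown alternative: {alternative}")

z = 1.80  # hypothetical test statistic
for alt in ("greater", "less", "two-sided"):
    print(f"{alt:>9}: p-value = {p_value(z, alt):.4f}")
```

With this statistic, the upper-tailed p-value falls below 0.05 while the two-tailed p-value (twice as large) does not, which is why the direction of the alternative must be fixed before looking at the data.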

6. P-Value Interpretation

The p-value measures the strength of evidence against the null hypothesis. A smaller p-value indicates stronger evidence to reject the null hypothesis. If the p-value is less than or equal to the significance level ($\alpha$), the null hypothesis is rejected.

7. Power of a Test

The power of a test is the probability that it correctly rejects a false null hypothesis (i.e., it detects an effect when there is one). Power is calculated as $1 - \beta$ and is influenced by factors such as sample size, effect size, and significance level.
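As a rough illustration of how sample size affects power, the sketch below computes the power of an upper-tailed z-test with known $\sigma$. It assumes the true mean is $\mu_1$, so the test statistic is normally distributed with mean $(\mu_1 - \mu_0) / (\sigma / \sqrt{n})$ and standard deviation 1. The function name and the numbers ($\mu_0 = 50$, $\mu_1 = 52$, $\sigma = 6$) are hypothetical.

```python
import math
from statistics import NormalDist

def power_upper_tail_z(mu0, mu1, sigma, n, alpha=0.05):
    """Power of the z-test of H0: mu = mu0 versus Ha: mu > mu0 when the true mean is mu1."""
    std_normal = NormalDist()
    z_crit = std_normal.inv_cdf(1 - alpha)          # reject H0 when z > z_crit
    shift = (mu1 - mu0) / (sigma / math.sqrt(n))    # mean of z under the true mean
    return 1 - std_normal.cdf(z_crit - shift)       # P(z > z_crit | mu = mu1)

for n in (10, 30, 100):
    power = power_upper_tail_z(mu0=50, mu1=52, sigma=6, n=n)
    print(f"n = {n:>3}: power = {power:.3f}, beta = {1 - power:.3f}")
```

Power rises as $n$ grows, and when $\mu_1 = \mu_0$ the same formula returns $\alpha$, since rejecting a true null hypothesis is exactly a Type I error.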

8. Effect Size

Effect size quantifies the magnitude of the difference or relationship being tested. It provides context to the statistical significance, helping to understand the practical importance of the results.
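One common effect-size measure for a single mean is the standardized difference $d = (\bar{x} - \mu_0) / s$, often called Cohen's d. The text above does not name a specific measure, so treat this choice, and the numbers used, as assumptions for illustration. The sketch shows that the same effect size gives very different t statistics at different sample sizes, because $t = d\sqrt{n}$.

```python
import math

def cohens_d_one_sample(sample_mean, mu0, s):
    """Standardized difference between a sample mean and the null value."""
    return (sample_mean - mu0) / s

# Hypothetical summary statistics
d = cohens_d_one_sample(sample_mean=78, mu0=75, s=10)
print(f"effect size d = {d:.2f}")

# The same effect size produces a larger t statistic as n grows
for n in (30, 300):
    t = d * math.sqrt(n)
    print(f"n = {n:>3}: t = {t:.2f}")
```

A large sample can make a small effect statistically significant, which is why effect size is reported alongside the p-value.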

9. Assumptions of Hypothesis Testing

Different tests have specific assumptions that must be met for the results to be valid. Common assumptions include:

Independence of observations
Normality of the population distribution, or a sample large enough for the sampling distribution to be approximately normal
Homogeneity of variances (for certain tests like ANOVA)

Violating these assumptions can lead to inaccurate conclusions.

10. Example of Hypothesis Testing

**Scenario:** A teacher claims that the average score of her students on a standardized test is 75. A student believes that the true average score is different.

**Steps:**

State the Hypotheses:
- $H_0$: $\mu = 75$
- $H_a$: $\mu \neq 75$
Significance Level: $\alpha = 0.05$
Choose the Test: Two-tailed t-test (assuming the population standard deviation is unknown)
Compute the Test Statistic: $$ t = \frac{\bar{x} - \mu_0}{s / \sqrt{n}} $$ Suppose $\bar{x} = 78$, $s = 10$, and $n = 30$: $$ t = \frac{78 - 75}{10 / \sqrt{30}} = \frac{3}{1.826} \approx 1.64 $$
Determine the P-Value: Using t-distribution tables or software, the two-tailed p-value for $t = 1.64$ with 29 degrees of freedom is approximately 0.11.
Decision: Since $0.11 > 0.05$, fail to reject the null hypothesis.
Conclusion: There is insufficient evidence to conclude that the average score differs from 75.
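The arithmetic in this example can be recomputed from the stated values ($\bar{x} = 78$, $s = 10$, $n = 30$, $\mu_0 = 75$). The sketch below uses only the Python standard library: it evaluates the t statistic and approximates the two-tailed p-value by integrating the t density with Simpson's rule. The helper names and the number of integration steps are assumptions of this sketch. Statistical software or a calculator would normally be used instead.

```python
import math

def t_pdf(x, df):
    """Density of Student's t-distribution with df degrees of freedom."""
    coef = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return coef * (1 + x * x / df) ** (-(df + 1) / 2)

def two_tailed_t_p_value(t, df, steps=2000):
    """Two-tailed p-value via Simpson's rule on [0, |t|]; steps must be even."""
    b = abs(t)
    h = b / steps
    total = t_pdf(0.0, df) + t_pdf(b, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    central_area = 2 * (h / 3) * total  # area between -|t| and |t|
    return 1 - central_area

x_bar, mu0, s, n = 78, 75, 10, 30
t_stat = (x_bar - mu0) / (s / math.sqrt(n))
p = two_tailed_t_p_value(t_stat, df=n - 1)
alpha = 0.05
decision = "reject H0" if p <= alpha else "fail to reject H0"
print(f"t = {t_stat:.2f}, df = {n - 1}, p-value = {p:.3f}, decision: {decision}")
```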

11. Common Mistakes in Hypothesis Testing

Confusing Significance Level and P-Value: The significance level is a threshold, while the p-value is a calculated probability.
Ignoring Assumptions: Not verifying whether the data meets the assumptions required for the chosen test can invalidate the results.
Misinterpreting the P-Value: A p-value does not measure the probability that the null hypothesis is true.
Multiple Comparisons: Conducting multiple tests increases the likelihood of Type I errors if adjustments are not made.
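The multiple-comparisons point can be made concrete. If $m$ independent tests are each run at level $\alpha$ and every null hypothesis is true, the chance of at least one Type I error is $1 - (1 - \alpha)^m$. The sketch below tabulates this, and also shows the effect of testing each hypothesis at $\alpha / m$ (the Bonferroni adjustment, which is one common correction and is not named in the text above). Independence of the tests is an assumption.

```python
def familywise_error_rate(alpha, m):
    """P(at least one Type I error) across m independent tests, all nulls true."""
    return 1 - (1 - alpha) ** m

alpha = 0.05
for m in (1, 5, 20):
    unadjusted = familywise_error_rate(alpha, m)
    # Bonferroni adjustment: run each individual test at level alpha / m
    adjusted = familywise_error_rate(alpha / m, m)
    print(f"m = {m:>2}: unadjusted = {unadjusted:.3f}, Bonferroni = {adjusted:.3f}")
```

With 20 unadjusted tests, the chance of at least one false rejection is well over one half, while the adjusted procedure keeps it at or below $\alpha$.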

12. Applications of Hypothesis Testing

Hypothesis testing is widely used in various fields, including:

Medicine: Testing the efficacy of new drugs or treatments.
Business: A/B testing for marketing strategies.
Education: Evaluating the impact of teaching methods on student performance.
Engineering: Quality control and process improvement.

13. Advanced Topics

Confidence Intervals: Provide a range of plausible values for population parameters, complementing hypothesis testing.
Non-Parametric Tests: Used when data does not meet the assumptions required for parametric tests, such as the Mann-Whitney U test.
Bayesian Hypothesis Testing: Incorporates prior beliefs and evidence to update the probability of hypotheses.
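To illustrate how a confidence interval complements a two-tailed test, the sketch below builds a 95% t-interval from the summary statistics of the test-score example in section 10 ($\bar{x} = 78$, $s = 10$, $n = 30$). The critical value $t^* = 2.045$ for 29 degrees of freedom is taken from a standard t-table and hard-coded here as an assumption of this sketch.

```python
import math

x_bar, s, n = 78, 10, 30
mu0 = 75          # value claimed under the null hypothesis
t_crit = 2.045    # t* for 95% confidence with df = 29 (from a t-table)

margin = t_crit * s / math.sqrt(n)
lower, upper = x_bar - margin, x_bar + margin
print(f"95% CI for the mean: ({lower:.2f}, {upper:.2f})")

# A two-tailed test at alpha = 0.05 fails to reject H0 exactly when mu0 is inside the interval
print("mu0 inside interval:", lower <= mu0 <= upper)
```

Because 75 lies inside the interval, it remains a plausible value for the population mean, which agrees with failing to reject $H_0$ at the 5% level.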

Comparison Table

| Aspect | Null Hypothesis ($H_0$) | Alternative Hypothesis ($H_a$) |
| --- | --- | --- |
| Definition | Statement of no effect or no difference. | Statement indicating the presence of an effect or difference. |
| Purpose | Serves as a starting point for statistical testing. | Represents what the researcher aims to support. |
| Acceptance | Cannot be proven true; it can only fail to be rejected. | Supported when there is sufficient evidence against $H_0$. |
| Examples | $\mu = 50$, no difference between groups. | $\mu \neq 50$, group A has a higher mean than group B. |
| Type of Test | Basis for determining critical regions. | Determines the direction of the test (one-tailed or two-tailed). |

Summary and Key Takeaways

Hypothesis testing allows for making inferences about population parameters using sample data.
Understanding the null and alternative hypotheses is essential for setting up tests.
Proper selection of the test type and verification of assumptions ensure valid results.
Interpreting p-values and understanding Type I and Type II errors are critical for decision-making.
Applications of hypothesis testing span various fields, highlighting its importance in statistical analysis.

Examiner Tip

Tips

To excel in hypothesis testing for the AP exam, remember the mnemonic "SHoP-CT" to recall the steps: State hypotheses, Highlight significance level, Choose the test, Calculate the statistic, and Take action based on p-value or critical value. Additionally, always visualize your test using a graph to better understand the rejection regions and the impact of different significance levels.

Did You Know

Did you know that modern hypothesis testing grew out of work by Ronald Fisher, Jerzy Neyman, and Egon Pearson in the early 20th century? Fisher developed significance testing, while Neyman and Pearson jointly formalized the framework of competing hypotheses and error rates. Together, these approaches laid the foundation for modern statistical inference. Hypothesis testing also plays a crucial role in vaccine development, where it is used to assess the effectiveness of new vaccines before they are approved for public use.

Common Mistakes

Many students confuse the significance level ($\alpha$) with the p-value: $\alpha$ is a threshold chosen before the data are analyzed, while the p-value is calculated from the sample. A related error is interpreting a p-value of 0.03 as a 3% probability that the null hypothesis is true. The p-value actually represents the probability of observing the data, or something more extreme, assuming the null hypothesis is true. Another common mistake is neglecting to check the assumptions of the test, which can lead to invalid conclusions.

FAQ

What is the difference between the null and alternative hypotheses?

The null hypothesis ($H_0$) states that there is no effect or difference, while the alternative hypothesis ($H_a$) suggests that there is an effect or difference.

How do you choose the significance level ($\alpha$)?

Common significance levels are 0.05, 0.01, and 0.10. The choice depends on the context of the study and the acceptable risk of making a Type I error.

What happens if the p-value is greater than $\alpha$?

If the p-value is greater than the significance level ($\alpha$), you fail to reject the null hypothesis, indicating insufficient evidence to support the alternative hypothesis.

Can you always reject the null hypothesis?

No, you can only reject the null hypothesis if there is sufficient evidence (i.e., the p-value is less than or equal to $\alpha$). Otherwise, you fail to reject it.

What is a Type II error?

A Type II error occurs when you fail to reject the null hypothesis even though the alternative hypothesis is true.

Why are assumptions important in hypothesis testing?

Assumptions ensure the validity of the test results. Violating these assumptions can lead to incorrect conclusions.
