Hypothesis Tests for Differences in Matched Pairs
Introduction
A matched pairs hypothesis test compares two related measurements taken on the same (or matched) subjects and asks whether the mean of the paired differences equals zero. This section covers how to set up the hypotheses, check the assumptions, compute the test statistic, and interpret the results.
Key Concepts
Understanding Matched Pairs
Matched pairs involve two related sets of observations. Each pair consists of measurements taken from the same subject under different conditions or from matched subjects. This pairing controls for variability between subjects, making it easier to detect differences attributable to the conditions being tested.
Setting Up Hypotheses
In matched pairs hypothesis testing, the null hypothesis ($H_0$) typically states that there is no difference between the paired observations. The alternative hypothesis ($H_a$) posits that a significant difference exists. Formally, this can be expressed as:
$$ \begin{aligned} H_0 &: \mu_d = 0 \\ H_a &: \mu_d \neq 0 \quad (\text{two-tailed}), \\ H_a &: \mu_d > 0 \quad (\text{right-tailed}), \\ H_a &: \mu_d < 0 \quad (\text{left-tailed}) \end{aligned} $$
where $\mu_d$ represents the mean difference between paired observations.
Assumptions of the Test
For the hypothesis test to be valid, several assumptions must be met:
- Random Sampling: Pairs should be randomly selected to ensure generalizability.
- Independence: Each pair must be independent of others.
- Normality: The distribution of differences should be approximately normal; this is especially important for small sample sizes (a quick check is sketched below).
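One quick way to check the normality assumption is to inspect the differences directly, for example with a histogram or a Shapiro-Wilk test. A minimal sketch using SciPy, with hypothetical paired differences:

```python
import numpy as np
from scipy import stats

# Hypothetical paired differences (e.g., post minus pre scores for 12 subjects)
diffs = np.array([2.1, -0.5, 3.0, 1.2, 0.0, 2.4, -1.1, 1.8, 0.9, 2.2, 1.5, 0.3])

# Shapiro-Wilk test: a small p-value suggests the differences are not approximately normal
stat, p = stats.shapiro(diffs)
print(f"Shapiro-Wilk W = {stat:.3f}, p-value = {p:.3f}")
```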
Calculating Differences
Begin by calculating the difference ($d_i$) for each pair:
$$ d_i = X_{i1} - X_{i2} $$
where $X_{i1}$ and $X_{i2}$ are the two related measurements for the $i^{th}$ pair.
Descriptive Statistics of Differences
Compute the mean difference ($\bar{d}$) and the standard deviation of differences ($s_d$):
$$ \bar{d} = \frac{1}{n} \sum_{i=1}^{n} d_i $$ $$ s_d = \sqrt{\frac{\sum_{i=1}^{n} (d_i - \bar{d})^2}{n - 1}} $$
These statistics summarize the central tendency and variability of the differences.
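These calculations translate directly into code; a minimal NumPy sketch with hypothetical before/after measurements:

```python
import numpy as np

# Hypothetical paired measurements for 8 subjects
before = np.array([72.0, 88.0, 64.0, 91.0, 75.0, 83.0, 69.0, 78.0])
after  = np.array([75.0, 89.0, 70.0, 93.0, 74.0, 88.0, 72.0, 81.0])

d = after - before        # d_i, using the convention "after minus before"
n = len(d)
d_bar = d.mean()          # mean difference
s_d = d.std(ddof=1)       # sample standard deviation (n - 1 in the denominator)

print(f"n = {n}, d_bar = {d_bar:.3f}, s_d = {s_d:.3f}")
```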
Test Statistic
The test statistic for matched pairs is calculated using the t-distribution:
$$ t = \frac{\bar{d} - \mu_{d0}}{s_d / \sqrt{n}} $$
where $\mu_{d0}$ is the hypothesized mean difference (usually 0), and $n$ is the number of pairs.
Under $H_0$, the test statistic follows a t-distribution with $n-1$ degrees of freedom.
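In practice the statistic can be computed by hand from $\bar{d}$ and $s_d$, or delegated to scipy.stats.ttest_rel, which performs the paired t-test in one call. A minimal sketch with hypothetical before/after data:

```python
import numpy as np
from scipy import stats

before = np.array([72.0, 88.0, 64.0, 91.0, 75.0, 83.0, 69.0, 78.0])
after  = np.array([75.0, 89.0, 70.0, 93.0, 74.0, 88.0, 72.0, 81.0])

d = after - before
n = len(d)
d_bar, s_d = d.mean(), d.std(ddof=1)

# Test statistic by hand: t = (d_bar - mu_d0) / (s_d / sqrt(n)), with mu_d0 = 0
t_manual = d_bar / (s_d / np.sqrt(n))

# Equivalent one-call version (two-tailed by default)
t_scipy, p_scipy = stats.ttest_rel(after, before)

print(f"manual t = {t_manual:.3f}, scipy t = {t_scipy:.3f} (df = {n - 1})")
```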
Determining the P-value
The p-value represents the probability of observing a test statistic as extreme as, or more extreme than, the one calculated, assuming $H_0$ is true. It is determined based on the t-distribution and the directionality of $H_a$ (two-tailed, left-tailed, or right-tailed).
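Once $t$ and the degrees of freedom are known, the p-value is a tail area of the t-distribution, and which tail (or tails) you use depends on $H_a$. A short sketch where the values of $t$ and $n$ are placeholders:

```python
from scipy import stats

t = 2.45    # hypothetical test statistic
n = 8       # hypothetical number of pairs
df = n - 1

p_two   = 2 * stats.t.sf(abs(t), df)   # H_a: mu_d != 0 (two-tailed)
p_right = stats.t.sf(t, df)            # H_a: mu_d > 0  (right-tailed)
p_left  = stats.t.cdf(t, df)           # H_a: mu_d < 0  (left-tailed)

print(f"two-tailed = {p_two:.4f}, right-tailed = {p_right:.4f}, left-tailed = {p_left:.4f}")
```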
Decision Rule
Compare the p-value to the chosen significance level ($\alpha$, typically 0.05):
- If p-value ≤ α: Reject $H_0$. There is sufficient evidence to support $H_a$.
- If p-value > α: Fail to reject $H_0$. There is insufficient evidence to support $H_a$.
Confidence Intervals
A confidence interval for the mean difference provides a range of plausible values for $\mu_d$. It is calculated as:
$$ \bar{d} \pm t^* \left( \frac{s_d}{\sqrt{n}} \right) $$
where $t^*$ is the critical value from the t-distribution based on the desired confidence level and degrees of freedom.
If a confidence interval does not contain the hypothesized value (e.g., 0), it suggests rejecting $H_0$.
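A minimal sketch of the interval calculation, using scipy.stats.t.ppf for $t^*$ (the differences below are hypothetical):

```python
import numpy as np
from scipy import stats

d = np.array([3.0, 1.0, 6.0, 2.0, -1.0, 5.0, 3.0, 3.0])   # hypothetical differences
n = len(d)
d_bar, s_d = d.mean(), d.std(ddof=1)

conf = 0.95
t_star = stats.t.ppf(1 - (1 - conf) / 2, df=n - 1)   # two-sided critical value
margin = t_star * s_d / np.sqrt(n)

print(f"{conf:.0%} CI for mu_d: ({d_bar - margin:.2f}, {d_bar + margin:.2f})")
```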
Example Scenario
Consider a study investigating the effectiveness of a new teaching method. A teacher records students' test scores before and after implementing the method. The data form matched pairs as each student's performance is measured twice.
Steps:
- Calculate the differences in scores for each student.
- Compute $\bar{d}$ and $s_d$.
- Formulate $H_0$ and $H_a$.
- Calculate the test statistic $t$.
- Determine the p-value.
- Make a decision based on the p-value and $\alpha$.
If the p-value is less than 0.05, the teacher can conclude that the new teaching method has significantly affected student performance.
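As a concrete illustration, the whole procedure can be scripted end to end. The scores below are invented for the sketch, not data from the scenario above:

```python
import numpy as np
from scipy import stats

# Hypothetical scores for 10 students before and after the new teaching method
pre  = np.array([65, 70, 58, 80, 74, 69, 77, 62, 71, 68], dtype=float)
post = np.array([70, 74, 61, 82, 79, 71, 83, 66, 75, 70], dtype=float)

# Paired t-test of H0: mu_d = 0 against Ha: mu_d != 0 (two-tailed)
t_stat, p_value = stats.ttest_rel(post, pre)

alpha = 0.05
decision = "Reject H0" if p_value <= alpha else "Fail to reject H0"
print(f"t = {t_stat:.3f}, p = {p_value:.4f} -> {decision}")
```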
Power of the Test
The power is the probability of correctly rejecting $H_0$ when $H_a$ is true. It depends on factors like sample size, effect size, and significance level. Higher power increases the test's ability to detect true differences.
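Because the paired test is equivalent to a one-sample t-test on the differences, its power can be approximated with a one-sample power routine. A sketch using statsmodels, where the standardized effect size ($\mu_d / \sigma_d = 0.5$) and the 30 pairs are illustrative assumptions:

```python
from statsmodels.stats.power import TTestPower

# Approximate power of a two-sided paired t-test at alpha = 0.05,
# assuming a standardized effect size of 0.5 and 30 pairs
analysis = TTestPower()
power = analysis.solve_power(effect_size=0.5, nobs=30, alpha=0.05, alternative='two-sided')
print(f"approximate power: {power:.2f}")
```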
Common Mistakes to Avoid
- Ignoring Pairing: Treating paired data as independent can lead to incorrect conclusions.
- Violating Assumptions: Not checking the normality of differences, especially with small samples.
- Misinterpretation: Confusing correlation with causation or misinterpreting the direction of difference.
Extensions and Applications
Matched pairs tests are widely used in various fields:
- Medicine: Comparing patient health metrics before and after treatment.
- Psychology: Assessing behavioral changes due to interventions.
- Education: Evaluating the impact of teaching methods on student performance.
Non-parametric Alternatives
When the normality assumption is violated, non-parametric tests like the Wilcoxon Signed-Rank Test can be used. These tests do not assume a specific distribution and are based on the ranks of the differences.
The Wilcoxon test proceeds in these steps (a code sketch follows the list):
- Calculate differences and remove pairs with zero differences.
- Rank the absolute differences.
- Assign signs to the ranks based on the direction of differences.
- Sum the positive and negative ranks.
- Determine the test statistic and p-value based on rank sums.
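SciPy bundles these steps into scipy.stats.wilcoxon, which accepts the two paired samples (or the precomputed differences) and drops zero differences by default. A minimal sketch with hypothetical measurements:

```python
import numpy as np
from scipy import stats

before = np.array([12.0, 15.5, 9.8, 11.2, 14.0, 13.3, 10.7, 12.9])
after  = np.array([13.1, 15.5, 11.0, 12.4, 13.6, 14.8, 11.9, 13.5])

# Wilcoxon signed-rank test on the paired differences; pairs with a zero
# difference are discarded by default, matching the steps above
stat, p = stats.wilcoxon(after, before)
print(f"W = {stat:.1f}, p-value = {p:.4f}")
```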
Comparing Independent and Matched Pairs Tests
While both tests assess differences in means, matched pairs tests account for subject-level variability by pairing related observations, increasing the test's sensitivity to detect differences.
Comparison Table
| Aspect | Matched Pairs Test | Independent Samples Test |
|---|---|---|
| Data Structure | Paired observations from the same subject or matched subjects | Two independent groups |
| Control of Variability | Reduces variability by pairing, increasing test sensitivity | Higher variability due to independent groups |
| Assumptions | Normality of differences, independence of pairs | Normality in each group, equal variances (for some tests) |
| Example Applications | Pre-test and post-test scores, before-and-after measurements | Comparing test scores between two different classes |
| Pros | Increased sensitivity, controls for subject variability | Simple to implement, widely applicable |
| Cons | Requires paired data, more complex analysis | Less sensitive to differences, especially with high variability |
Summary and Key Takeaways
- Matched pairs tests compare related observations to detect significant differences.
- Proper hypothesis formulation and assumption checking are crucial for valid results.
- Calculating the mean and standard deviation of differences forms the basis of the test statistic.
- Understanding the test's power and potential pitfalls ensures reliable conclusions.
- Non-parametric alternatives like the Wilcoxon test provide flexibility when assumptions are unmet.
Tips
Remember the acronym PAIRS: Plot your data, Assess assumptions, Identify differences, Run calculations, and Summarize results. Additionally, practice interpreting p-values and confidence intervals to strengthen your understanding. For the AP exam, focus on understanding the underlying concepts rather than memorizing formulas.
Did You Know
The concept of matched pairs originated from agricultural experiments where researchers paired similar plants to test the effects of fertilizers. Additionally, matched pairs designs are extensively used in clinical trials to compare patient outcomes before and after treatments, enhancing the precision of results by controlling individual variability.
Common Mistakes
One frequent error is treating paired data as independent, which overlooks the inherent relationship between observations. For example, assuming pre-test and post-test scores are independent can distort results. Another mistake is neglecting to verify the normality of differences, leading to invalid test conclusions. The correct approach is to acknowledge the paired nature of the data and check the underlying assumptions before running the test.