Document

Download Report

Transcript Document

Lesson 11 - 1

Inference about Two Means Dependent Samples

Objectives

•

Distinguish between independent and dependent sampling

•

Test claims made regarding matched pairs data

•

Construct and interpret confidence intervals about the population mean difference of matched pairs

Vocabulary

•

Robust – minor deviations from normality will not affect results

•

Independent – when the individuals selected for one sample do not dictate which individuals are in the second sample

•

Dependent – when the individuals selected for one sample determine which individuals are in the second sample; often referred to as

matched pairs

samples

Now What

●

Chapter 10 covered a variety of models dealing with one population

  

The mean parameter for one population The proportion parameter for one population The standard deviation parameter for one population

●

However, many real-world applications need techniques to compare two populations



Our Chapter 10 techniques do not do these

Two Population Examples



We want to test whether a certain treatment helps or not … the measurements are the “before” measurement and the “after” measurement



We want to test the effectiveness of Drug A versus Drug B … we give 40 patients Drug A and 40 patients Drug B … the measurements are the Drug A and Drug B responses



Two precision manufacturers are bidding for our contract … they each have some precision (standard deviation) … are their precisions significantly different

Types of Two Samples

An independent sample is when individuals selected for one sample have no relationship to the individuals selected for the other

●

Examples

 

50 samples from one store compared to 50 samples from another 200 patients divided at random into two groups of 100 each A dependent sample is one when each individual in the first sample is directly matched to one individual in the second

●

Examples

 

Before and after measurements (a specific person’s before and the same person’s after) Experiments on identical twins (twins matched with each other

Match Pair Designs

•

Remember back to Chapter 1 discussions on design of experiments: the dependent samples were often called matched-pairs

●

Matched-pairs is an appropriate term because each observation in sample 1 is matched to exactly one in sample 2

  

The person before



the person after One twin



the other twin An experiment done on a person’s left eye



the same experiment done on that person’s right eye

Terms

•

d-bar or d – the mean of the differences of the two samples

x 1 – x 2 = d 30 – 25 = 5 23 – 27 = - 4

•

s

is the standard deviation of the differenced data

Requirements

Testing a claim regarding the difference of two means using

matched pairs

•

Sample is obtained using simple random sampling

•

Sample data are matched pairs

•

Differences are normally distributed with no outliers or the sample size, n, is large (n ≥ 30)

Classical and P-Value Approach – Matched Pairs P-Value is the area highlighted t 0 -t α -|t 0 | |t 0 | -t α/2 t α/2 Critical Region Remember to add the areas in the two-tailed!

Test Statistic: t 0 d = -------- s d /√n Left-Tailed Reject null hypothesis, if P-value < α Two-Tailed Right-Tailed t 0 < - t α t 0 < - t α/2 or t 0 > t α/2 t 0 > t α t α t 0

Confidence Interval – Matched Pairs

Lower Bound: d – t α/2 · s d /√n Upper Bound: d + t α/2 · s d /√n t α/2 is determined using n - 1 degrees of freedom d is the mean of the differenced data s d is the standard deviation of the differenced data Note: The interval is exact when population is normally distributed and approximately correct for nonnormal populations, provided that n is large.

Two-sample, dependent, T-Test on TI

•

If you have raw data:

– –

enter data in L1 and L2 define L3 = L1 – L2 (or vice versa – depends on alternative Hypothesis)

•

L1 – L2 STO



•

Press STAT, TESTS, select T-Test

– –

raw data: List set to L3 and freq to 1 summary data: enter as before

Example Problem

Carowinds quality control manager feels that people are waiting in line for the new roller coaster too long. To determine is a new loading and unloading procedure is effective in reducing wait time, she measures the amount of time people are waiting in line for 7 days and obtains the following data.

Day

Old New

Mon

11.6

10.7

Tue

25.9

28.3

Wed

20.0

19.2

Thu

38.2

35.9

Fri

57.3

59.2

Sat

32.1

31.8

Sat

81.8

75.3

Sun

57.1

54.9

Sun

62.8

62.0

A normality plot and a box plot indicate that the differences are apx normal with no outliers. Test the claim that the new procedure reduces wait time at the α=0.05 level of significance.

Example Problem Cont.

•

Requirements: seem to be met from problem info

•

Hypothesis H 0 : H 1 : Mean wait time the same (d-bar = 0, new-old) Mean wait time reduced (d-bar < 0, new-old)

• •

Test Statistic: t 0 d-bar - 0 = --------------------- s d /



n = -1.220, p = 0.1286

Critical Value: t c (9-1,0.05) = -1.860, α = 0.05

•

Conclusion: Fail to Reject H 0 : not enough evidence to show that new procedure reduces wait times

Summary and Homework

•

Summary

–

Two sets of data are dependent, or matched-pairs, when each observation in one is matched directly with one observation in the other

–

In this case, the differences of observation values should be used

–

The hypothesis test and confidence interval for the difference is a “mean with unknown standard deviation” problem, one which we already know how to solve

•

Homework

–

pg 582-587; 1, 2, 4-8, 12, 15, 18, 19

6) independent

HW Answers

8) dependent 12a) your task 12b) d-bar = -1.075

s d = 3.833

12c) Fail to reject H 0 12d) [-5.82, 3.67] 18) example problem in class