Transcript Chapter 4: Collecting, Displaying, and Analyzing Data
Chapter 4: Collecting, Displaying, and Analyzing Data
Regular Math
Section 4.1: Samples and Surveys
Population – the entire group being studied Sample – part of the population Biased Sample – not a good representation Random Sample – every member has an equal chance Systematic Sample – according to a rule or formula Stratified Sample – at random from a randomly subgroup
Example 1: Identifying Biased Samples Identify the population and sample. Give a reason why the sample could be biased.
A radio station manager chooses 1500 people from the local phone book to survey about their listening habits.
Population = people in the local area Sample = up to 1500 people that take the survey Biased = not everyone is in the phone book An advice columnist asks her readers to write in with their opinions about how to hang the toilet paper on the roll.
Population = readers of the column Sample = readers who wrote in Biased = only readers with strong opinions would write in Surveyors in a mall choose shoppers to ask about their product preferences.
Population = all shoppers at the mall Sample = people who are polled Biased = surveyors are more likely to approach people who look agreeable
Try these on your own…
Identify the population and sample. Give a reason why the sample could be biased.
A record store manager asks customers who make a purchase how many hours of music they listen to each day. Population = record store customers Sample = customers who make a purchase Biased = Customers who make a purchase may be more interested in music than others who are in the store.
An eighth-grade student council member polls classmates about a new school mascot.
Population = students in the school Sample = classmates Biased = She polls more eighth graders than students in other grades.
Sampling Method How Members Are Chosen
Random By chance Systematic Stratified According to a rule or formula At random from randomly chosen subgroups
Example 2: Identifying Sampling Methods Identify the sampling method used.
An exit poll is taken of every tenth voter.
systematic In a statewide survey, five counties are randomly chosen and 100 people are randomly chosen from each county.
stratified Students in a class write their names on strips of paper and put them in a hat. The teacher draws five names.
random
Try these on your own…
Identify the sampling method used.
In a county survey, Democratic Party members whose names begin with the letter D are chosen.
systematic A telephone company randomly chooses customers to survey about its service.
random A high school randomly chooses three classes from each grade and then draws three random names from each class to poll about lunch menus.
stratified
Section 4.2: Organizing Data
Stem-and-Leaf Plot Back –to-Back Stem-and-Leaf Plot
Example 1: Organizing Data in Tables Use the given data to make a table.
Greg has received job offers as a mechanic at three airlines. The first has a salary range of $20,000 $34,000, benefits worth $12,000, and 10 days’ vacation. The second has 15 days’ vacation, benefits worth $10,500, and salary range of $18,000-$50,000. The third has benefits worth $11,400, a salary range of $14,000-$40,000, and 12 days’ vacation.
Job 1 Job 2 Job 3 Salary Range
$20,000 $34,000 $18,000 $50,000 $14,000 $40,000
Benefit s
$12,000 $10,500 $11,400
Vacatio n Days
10 15 12
Try this one on your own…
Day
Monday Tuesday Wednesday
To School To Home
7 min 5 min 8 min 9 min 9 min 7 min Use the given data to make a table.
Jack timed his bus rides to and from school. On Monday, it took 7 minutes to get to school and 9 minutes to get home. On Tuesday, it took 5 minutes and 9 minutes respectively, and on Wednesday, it took 8 minutes and 7 minutes.
Example 2: Reading Stem-and-Leaf Plots List the data values in the stem-and-leaf plot.
0 1 2 3 2 5 3 3 7 8 0 2 6 1 7 Try this one on your own… 1 2 5 2 3 0 1 1 2 7 9 12, 15, 40, 41, 41, 52, 57, 59 Key: 3 I 1 means 31
Example 3: Organizing Data in Stem-and Leaf Plots Ash 47 Beech 40 Black Maple 40 Cedar 67 Cherry 42 Douglas Fir 91 Elm 38 Grand Fir 77 Hemlock 74 Hickory 58 Oak 61 Pecan 44 Red Maple 55 Sequoia 84 Spruce 63 Sycamore 40 Western Pine 48 Willow 35 Use the data set on heights of trees in the U.S. (m) to make a stem-and-leaf plot.
Try this one on your own…
Use the data on top speeds of animals (mi/h) to make a stem-and-leaf plot.
Cheetah Wildebeest Lion 64 61 50 Elk Coyote Gray Fox 45 43 42
Example 4: Organizing Data in Back-to Back Stem-and-Leaf Plots Use the given data on Super Bowl scores, 1990 2000, to make a back-to-back stem-and-leaf plot.
1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 Winning 55 20 37 52 30 49 27 35 31 34 23 Losing 10 19 24 17 13 26 17 21 24 19 16
Try this one on your own…
1950 2000 IL 25 19 MA 14 10 MI 18 15 NY 43 29 PA 31 19 Use the data on US. Representatives for Selected States, 1950 and 2000, to make a back-to-back stem-and-leaf plot.
Section 4.3: Measures of Central Tendency Definition Use to Answer Mean – the sum of the values, divided by the number of values Median – the middle number in an ordered set of data “What is the average?” “What single number best represents the data?” “What is the halfway point of the data?” Mode – the value or values that occur most often “What is the most common value?”
Section 4.4: Variability
Variability – how spread out the data is Range – largest number minus the smaller number Quartile – divide data parts into four equal parts Box-and-Whisker Plot – shows the distribution of data
Example 1: Finding Measures of Variability Find the range and the first and third quartiles for each data set.
85, 92, 78, 88, 90, 88, 89 78, 85, 88, 88, 89, 90, 92 Range: 92-78 = 14 1 st Quartile = 85 3 rd Quartile = 90 14, 12, 15, 17, 15, 16, 17, 18, 15, 19, 20, 17 12, 14, 15, 15, 15, 16, 17, 17, 17, 18, 19, 20 Range: 20-12 = 8 1 st Quartile = (15 + 15) / 2 = 15 3 rd Quartile = (17 + 18) / 2 = 17.5
Try these on your own…
Find the range and the first and third quartiles for each data set.
15, 83, 75, 12, 19, 74, 21 Range: 71 1 st Quartile: 15 3 rd Quartile: 75 75, 61, 88, 79, 79, 99, 62, 77 Range: 38 1 st Quartile: 69 3 rd Quartile: 83.5
Example 2: Making a Box-and-Whisker Plot Use the given data to make a box-and whisker plot.
22, 17, 22, 49, 55, 21, 49, 62, 21, 16, 18, 44, 42, 48, 40, 33, 45 Find the smallest value, first quartile, median, third quartile, and largest value.
Smallest: 16 1 st Quartile: (21+21) / 2 = 21 Median: 40 3 rd Quartile: (48+49) / 2 = 48.5
Largest: 62
Try this one on your own…
Use the given data to make a box-and whisker plot.
21, 25, 15, 13, 17, 19, 19, 21 Smallest: 13 1 st Quartile: 16 Median: 18 3 rd Quartile: 21 Largest: 25
Example 3: Comparing Data Sets Using Box-and-Whisker Plots These box-and-whisker plots compare the number of home runs Babe Ruth hit during his 15-year career from 1920 to 1934 with the number Mark McGwire hit during the 15 years from 1986 to 2000.
Compare the medians and ranges.
Compare the ranges of the middle half of the data for each.
Try these on your own…
These box-and-whisker plots compare the ages of the first ten U.S. presidents with the ages of the last ten presidents (through George W. Bush) when they took office.
Compare the medians and ranges.
Compare the differences between the third quartile and first quartile for each.
Section 4.5: Displaying Data
Bar Graph – display data that can be grouped in categories Frequency Table – use with data that is given in list Histogram – type of bar graph; groups by using intervals Line Graph – show trends or to make estimates
Example 1: Displaying Data in a Bar Graph Organize the data into a frequency table and make a bar graph.
Age License Received 16 17 18
The following are the ages when a randomly chosen group of 20 teenagers received their driver’s licenses: 18, 17, 16, 16, 17, 16, 16, 16, 19, 16, 16, 17, 16, 17, 18, 16, 18, 16, 19, 16 Frequency 11 4 3
19
2
Try this one on your own…
Hours
0 1 2 3 4 5
Frequency
2 3 1 3 0 1 Organize the data into a frequency table and a bar graph.
The following data set reflects the number of hours of television watched every day by members of a sixth grade class: 1,1,3,0,2,0,5,3,1,3
Example 2: Displaying Data in a Histogram John surveyed 15 people to find out how many pages were in the last book they read. Use the data to make a histogram.
368, 153, 27, 187, 240, 636, 98, 114, 64, 212, 302, 144, 76, 195, 200
Pages
0-99 100-199 200-299 300-399 400-499 500-599 600-699
Frequency
Make a frequency table first.
Then, use intervals of 100 to make a histogram.
Try this one on your own…
Dollars
0-0.99
1.00 – 1.99
2.00 – 2.99
3.00 – 3.99
Frequency
Jimmy surveyed 12 children to find out how much money they received from the tooth fairy. Use the data set to make a histogram.
0.35, 2.00, 0.75, 2.50, 1.50, 3.00, 0.25, 1.00, 1.00, 3.50, 0.50, 3.00
Example 3: Displaying Data in a Line Graph
Year
Make a line graph of the given data. Use the graph to estimate the number of polio cases in 1993.
1975 1980 1985 1990 1995 2000
Number of Polio Cases Worldwide
49,293 52,552 38,637 23,484 7,035 2,880
Try this one on your own…
Year
1985 1990 1995 2000
Salary ($)
42,000 49,000 58,000 69,000 Make a line graph of the given data.
Use the graph to estimate Mr. Yi’s salary in 1992.
Section 4.6: Misleading Graphs and Statistics Explain why each graph is misleading.
Try this one on your own…
Explain why the graphs are misleading.
Example 2: Identifying Misleading Statistics Explain why each statistic is misleading.
A small business has 5 employees with the following salaries: $90,000 (owner); $18,000; $22,000; $20,000; $23,000. The owner places an ad that reads: “Help Wanted – average salary $34,600” Only one person in the company makes more than $23,000 and that is the owner. It is not likely that a new person’s salary would be close to $34,600.
A market researcher randomly selects 8 people to focus-test three brands, labeled A, B, and C. Of these, 4 chose brand A, 2 chose brand B, and 2 chose brand C. An ad for brand A states: Preferred 2 to 1 over leading brands!” The sample size is too small. The researcher needs to ask more people to get a true representation.
The total revenue at Worthman’s for the three-month period from June 1 to September 1 was $72,000. The total revenue at Meilleure for the three-month period from October 1 to January 1 was $108,000.
They are comparing two different times of the year.
Try these on your own…
Explain why each statistic is misleading.
Sam scored 43 goals for his soccer team during the season, and Jacob scored only 2.
Four out of five dentists surveyed preferred Ultraclean toothpaste.
Shopping at Save-a-Lot can save you up to $100 a month!
Section 4.7: Scatter Plots
Scatter Plots – show relationships between two data sets
Example 1: Making a Scatter Plot of a Data Set
Compound ZIA Average Effects
Zinc Gluconate 100 Reduced cold 7 days Zinc Gluconate Zinc Orotate Zinc Gluconate Zinc Gluconate Zinc Aspartate Zinc Acetate-tartarate-glycine Zinc Gluconate 44 0 25 13.4
0 -55 -11 Reduced cold 4.8 days None Reduced cold 1.6 days None None Increased cold 4.4 days Increased cold 1 day A scientist studying the effects of zinc lozenges on colds has gathered the following data. Zinc ion availability (ZIA) is a measure of the strength of the lozenge. Use the data to make a scatter plot.
Try this one on your own…
Use the given data to make a scatter plot of the weight and height of each member of a basketball team.
Height (in)
71 68 70 73 74
Weight (lb)
170 160 175 180 190
Correlations
Example 2: Identifying the Correlation of Data
Do the data sets have positive, a negative, or no correlation?
The population of a state and the number of representatives positive The number of weeks a movie has been out and weekly attendance negative A person’s age and the number of siblings they have No correlation
Try these on your own…
Do the data sets have a positive, negative, or no correlation?
The size of a jar of baby food and the number of jars a baby eats negative The speed of a runner and the number of races she wins positive The size of a person and the number of fingers he has No correlation
Example 3: Using a Scatter Plot to Make Predictions
Hours Studied
5 9 3 12 1
Exam Grade
80 95 75 98 70 Use the data to predict the exam grade for a student who studies 10 hours per week.
About 95
Try this one on your own…
Use the date to predict how much a worker will earn in tips in 10 hours.
Hours
4 8 3 2 11 Approximately $24
Tips ($)
12 20 7 7 26