Transcript Document
Lesson 1 - 1 Displaying Distribution with Graphs Knowledge Objectives • What is meant by exploratory data analysis • What is meant by the distribution of a variable • Differentiate between categorical variables and quantitative variables • What is meant by the mode of a distribution • What is meant by an outlier in a stemplot or histogram Construction Objectives • Construct bar graphs and pie charts for a set of categorical data • Construct a stemplot for a set of quantitative data • Construct a back-to-back stemplot to compare two related distributions • Construct a stemplot using split stems • Construct a histogram for a set of quantitative data, and discuss how changing the class width can change the impression of the data given by the histogram Construction Objectives cont • Describe the overall pattern of a distribution by its shape, center, and spread • Recognize and identify symmetric and skewed distributions • Construct and interpret an ogive (relative cumulative frequency graph) from a relative frequency table • Construct a time plot for a set of data collected over time Vocabulary • Roundoff error – errors associated with decimal inaccuracies • Pie chart – chart that emphasize each category’s relation to the whole • Bargraph – displays the distribution of a categorical variable • Stemplot – includes actual numerical values in a plot that gives a quick picture of the distribution • Back-to-back stemplot – two distributions plotted with a common stem • Splitting stems – divides step into 0-4 and 5-9 • Trimming – removes the last digit or digits before making a stemplot • Histogram – breaks range of values into classes and displays their frequencies • Frequency – counts of data in a class • Frequency table – table of frequencies Vocabulary • • • • • • • • Modes – major peaks in a distribution Unimodal – a distribution whose shape with a single peak (mode) Bimodal – a distribution whose shape has two peaks (modes) Symmetric – if values smaller and larger of the center are mirror images of each other Skewed – if smaller or larger values from the center form a tail Ogive – relative cumulative frequency graph Time plot – plots a variable against time on the horizontal scale of the plot Seasonal variation – a regular rise and fall in a time plot Categorical Data • Categorical Variable: – Values are labels or categories – Distributions list the categories and either the count or percent of individuals in each • Displays: BarGraphs and PieCharts Categorical Data Example Body Part Frequency Relative Frequency Back 12 0.4 Wrist 2 0.0667 Elbow 1 0.0333 Hip 2 0.0667 Shoulder 4 0.1333 Knee 5 0.1667 Hand 2 0.0667 Groin 1 0.0333 Neck 1 0.0333 Total 30 1.0000 Physical Therapist’s Rehabilitation Sample Categorical Data • Items are placed into one of several groups or categories (to be counted) • Typical graphs of categorical data: – Pie Charts; emphasizes each category’s relation to the whole – Bar Charts; emphasizes each category’s relation with other categories Groin Neck Hand 3% 3% 7% Bar Chart 14 Rehab Pie Chart 12 10 Back 40% Knee 17% 8 6 4 Neck Groin Hand Knee Shoulder Hip Elbow Wrist 0 Back 2 Rehab Shoulder 13% Hip 7% Elbow 3% Wrist 7% Charts for Both Data Types Pareto Chart 1.2 Rehab 1 0.8 0.6 0.4 0.2 Neck Groin Hand Knee Shoulder Hip Elbow Wrist Back 0 Neck Groin Elbow Hip Hand Wrist Knee Back Neck Groin Hand Rehab Cumulative Frequency Chart Percent Knee Shoulder Hip Elbow Wrist 0.45 0.4 0.35 0.3 0.25 0.2 0.15 0.1 0.05 0 Shoulder Rehab Percent 0.45 0.4 0.35 0.3 0.25 0.2 0.15 0.1 0.05 0 Back Percent Relative Frequency Chart Example 1 Construct a pie chart and a bar graph. Radio Station Formats Format Nr of Stations Percentage Adult contemporary 1,556 11.2 Adult standards 1.196 8.6 569 4.1 Country 2,066 14.9 News/Talk/Info 2,179 15.7 Oldies 1,060 7.7 Religious 2,014 14.6 Rock 869 6.3 Spanish Language 750 5.4 1,579 11.4 13,838 99.9 Contemporary Hits Other formats Total Why not 100%? Example 1 Pie Chart Example 1 Bar Graph Quantitative Data • Quantitative Variable: – Values are numeric - arithmetic computation makes sense (average, etc.) – Distributions list the values and number of times the variable takes on that value • Displays: – Dotplots – Stemplots – Histograms – Boxplots Dot Plot • Small datasets with a small range (max-min) can be easily displayed using a dotplot – Draw and label a number line from min to max – Place one dot per observation above its value – Stack multiple observations evenly • First type of graph under STATPLOT 34 values ranging from 0 to 8 Stem Plots • A stemplot gives a quick picture of the shape of a distribution while including the numerical values – Separate each observation into a stem and a leaf eg. 14g -> 1|4 256 -> 25|6 32.9oz -> 32|9 – Write stems in a vertical column and draw a vertical line to the right of the column – Write each leaf to the right of its stem • Note: – Stemplots do not work well for large data sets – Not available on calculator Stem & Leaf Plots Review Given the following values, draw a stem and leaf plot 20, 32, 45, 44, 26, 37, 51, 29, 34, 32, 25, 41, 56 Ages Occurrences -----------------------------------------------------------------2 | 0, 6, 9, 5 | 3 | 2, 3, 4, 2 | 4 | 5, 4, 1 | 5 | 1, 6 Splitting Stems • Double the number of stems, writing 0-4 after the first and 5-9 after second. Back-to-Back Stemplots • Back-to-Back Stemplots: Compare datasets Example1.4, pages 42-43 Literacy Rates in Islamic Nations Example 1 The ages (measured by last birthday) of the employees of Dewey, Cheatum and Howe are listed below. Office A Office B 22 31 21 49 26 42 42 30 28 31 39 39 20 37 32 36 35 33 45 47 49 38 28 48 a) Construct a stem graph of the ages b) Construct a back-to-back comparing the offices c) Construct a histogram of the ages Example 1a: Stem and Leaf 22 31 21 49 26 42 42 30 28 31 39 39 20 37 32 36 35 33 45 47 49 38 28 48 Ages of Personnel 2 0, 1, 2, 6, 8, 8, 3 0, 1, 1, 2, 3, 5, 6, 7, 8, 9, 9, 4 2, 2, 5, 7, 8, 9, 9, Example 1b: Back-to-Back Stem 22 31 21 49 26 42 42 30 28 31 39 39 20 37 32 36 35 33 45 47 49 38 28 48 Office A: Ages of Personnel 1, 2, 6, 8 2 0, 1, 1, 9, 9 3 2, 2, 9 4 Office B: Ages of Personnel 0, 8 2, 3, 5, 6, 7, 8, 5, 7, 8, 9, Example 2 Below are times obtained from a mail-order company's shipping records concerning time from receipt of order to delivery (in days) for items from their catalogue? 3 7 10 5 14 12 6 2 9 22 25 11 5 7 12 10 22 23 14 8 5 4 7 13 27 31 13 21 6 8 3 10 19 12 11 8 a) Construct a stem plot of the delivery times b) Construct a split stem plot of the delivery times c) Construct a histogram of the delivery times Example 2: Stem and Leaf Part 3 7 10 5 14 12 6 2 9 22 25 11 5 7 12 10 22 23 14 8 5 4 7 13 27 31 13 21 6 8 3 10 19 12 11 8 Days to Deliver 0 2, 3, 3, 4, 5, 5, 5, 6, 6, 7, 7, 7, 8, 8, 8, 9 1 0, 0, 0, 1, 1, 2, 2, 2, 3, 3, 4, 4, 9 2 1, 2, 2, 3, 5, 7 3 1 Example 2b: Split Stem and Leaf 3 7 10 5 14 12 6 2 9 22 25 11 5 7 12 10 22 23 14 8 5 4 7 13 27 31 13 21 6 8 3 10 19 12 11 8 Days to Deliver 0 0 1 1 2 2 3 2, 3, 3, 4 5, 5, 5, 6, 6, 7, 7, 7, 8, 8, 8, 9 0, 0, 0, 1, 1, 2, 2, 2, 3, 3, 4, 4 9 1, 2, 2, 3 5, 7 1 Day 1 Summary and Homework • Summary – Categorical data • Data where adding/subtracting makes no sense • Pie charts and bar graphs – Quantitative data • Data where arithmetic operations make sense • Stem plots and histograms – Some graphs can work for both types of data • Frequency and dot plots • Ogive and Pareto • Homework – pg 46 – 48 problems 1-5