Tables, Charts, Graphs - City University of New York

Download Report

Transcript Tables, Charts, Graphs - City University of New York

Organizing and Presenting Data

GTECH 201 Session 11

Terminology

      Classes  Categories for grouping data Frequency  Number of observations that fall in a class (frequency is a count) Frequency Distribution  A listing of all classes along with their frequencies Relative Frequency  The ratio of the frequency of a class to the total number of observations Relative Frequency Distribution  A listing of all classes along with their relative frequencies Width/Class Interval  The difference between the upper and lower cut points (breaks) of a class

Organizing Data

   Classification Rules Aim is to create categories or classes  First step is to compute range  Range = Largest Value – Smallest Value  Interval or Ratio Scale data only Class Intervals  Width of Class Interval     Equal based on range Unequal based on range Quantile (Quartile or Quintile) Natural

Classification Methods

 Natural breaks  Equal interval  Quantile  Manual

How to Decide (on a classification scheme)

  Rule of thumb: 3 - 7 classes Classification histogram (see later today)

Classification method

Natural breaks Equal interval Quantile Manual

How to Decide, part II

When to use

When attributes are distributed unevenly across the overall range of values When you want all classes to have the same range

How many classes to have

Look for natural groups Easily understood interval, such as 2, 50, 1000, etc.

When attributes are distributed in a linear fashion When you want classes to break at specific values Determined by purpose of the map

 Line graph  Bar graph  Scatterplots

Graphs

250 200 150 100 50 0 1 4 7 10 13 16 19 22 25 28 31 34 37 40 43 46 49 52 49 41 33 25 17 9 1 100.0

0 50 100 150 200 250 80.0

60.0

40.0

20.0

0.0

0 50 100 150 200 250

Creating a Line Graph

 The growth of the population of students at a Midwestern university is as follows Year 1960 1970 1980 1990 2000 Population Growth 1500 5000 14670 18923 24000

Line Graph

Population Growth 30000 25000 20000 15000 10000 5000 0 1 2 3 4 5 Population Growth

Bar Graphs

 Here are data on the percent of females among people earning doctoral degrees in 1990, in several different fields of study All fields Biological Sciences Eduation Engineering Physical Sciences Psychology 29.5

26 44.2

3.8

12.5

41.7

Bar Graph

50 45 40 35 30 25 20 15 10 5 0 All fields Biological Sciences Eduation Engineering Physical Sciences Psychology

Scatter Plots

   Graph bi-variate data when both variables are measured in an interval/ratio or ordinal scale Units for one variable are marked on the horizontal axis Independent variable should always go on the horizontal, x axis

Scatterplots

Calories in Common Foods

450 400 350 300 250 200 150 100 50 0 -50 0 2 4 6 8 10 12  Survey of 3368 people asking them to estimate number of calories in common foods.

Example

0 3 0 0 3 0 2 2 00 1 2 1 0 0 1 2 4 0 4 2 1 0 1 0 0 2 0 1 3 2

  A city planner collected data on the number of school age children in each of 30 families. Construct a grouped data table using classes based on a single value

Computing Frequency

Number of Children 0 1 2 3 4 Frequency Relative Frequency 12 6 7 3 2

30

0.400

0.200

0.233

0.100

0.067

1.000

 There are three ways you can create classes  a < but not equal to b   b < but not equal to c a – b, c – d, e - f single value grouping

Distributions

      Histograms Difference between histograms and bar graphs Bars in a histogram are always vertical Base scale is marked off in equal units; there is no base scale in a bar graph Width of bars in a histogram have meaning Bars in a histogram touch each other

Constructing a Histogram

  Class Intervals Frequency 30 -39 40-49 50-59 60-69 70-79 80-89 90-99 3 1 8 10 7 7 4

40

Relative Frequency 0.075

0.025

0.2

0.25

0.175

0.175

0.1

1

Midpoint 35 45 55 65 75 85 95 Histogram – height of bar equal to frequency of class represented Bar extends from lowest value to highest value of the class

20 15 10 5 0

Histogram Chart

150% 100% 50% 0% Frequency Cumulative %

Frequency Polygons

 Similar to a histogram  Midpoint of the class is indicated  Points connected by straight lines  Cumulative frequency polygon, ogive

250 200 150 100 50 0 S1