STATISTICAL DATA ANALYSIS SOFTWARE

Download Report

Transcript STATISTICAL DATA ANALYSIS SOFTWARE

STATISTICAL DATA
ANALYSIS SOFTWARE
By
Johnson Lubega Kagugube
Director, District Statistics and Capacity Development
Uganda Bureau of Statistics
1
OUTLINE OF THE PRESENTATION

Meaning of data analysis

Purpose for data analysis

Reason for statistical analysis

Issues to consider in data analysis

Statistical data analysis softwares

Issues to consider when choosing a
Statistical Package

Conclusion
MEANING OF STATISTICAL DATA
ANALYSIS

Collection of methods used to process raw
data and report the overall trends.

Process of systematically applying statistical
and/or logical techniques to describe and
illustrate, condense and recap, and evaluate
data.
REASON FOR STATISTICAL ANALYSIS


Transform raw data into information
The general purpose of statistical analysis is to
provide meaning to what otherwise would be a
collection of numbers and/or values.
 Provide a way of drawing inductive inferences
from data and distinguishing the signal (the
phenomenon of interest) from the noise
(statistical fluctuations) present in the data
 Statistical analysis procedures are categorized
according to the type of statistics generated;
i.e descriptive, associative, and inferential.
REASON FOR STATISTICAL ANALYSISCont..
1.
2.
Descriptive statistics portray individuals or events in
terms of some predefined characteristics, like
measure of central tendency and dispersion –Mean,
Median, Range, Standard Deviation, etc.
Associative or relative statistics seek to identify
meaningful interrelationships between or among data.
Such statistics include; univariate, bivariate and
multivariate analysis. For instance, "Is there a
relationship between salt intake and diastolic blood
pressure among middle-age women?" is a problem
definition suitable for analysis by associative
statistics.
REASON FOR STATISTICAL ANALYSISCont..
3.
Inferential statistics seek to assess the
characteristics of a sample in order to make
more general statements about the parent
population, or about the relationship between
different samples or populations.


Measures of differences of the means and
measures of statistical significance
For Example; "Does a low sodium diet lower the
diastolic blood pressure of middle-age women?"
represents a problem definition suitable for
inferential statistics.
ISSUES TO CONSIDER IN DATA ANALYSIS

There are a number of issues to consider with
respect to data analysis. These include:

Having the necessary skills to analyze

Following acceptable norms for data analysis
and presentation

Choosing the appropriate statistical software

Providing honest and accurate analysis

Manner of presenting data

Extent of data analysis
A Statistical package is a computer programme that
specializes in statistical data analysis.
WHAT STATISTICAL SOFTWARES CAN
DO IN RELATION TO DATA ANALYSIS

Input data into the computer

Organise data

Compare data

Manage data

Summarise data (transform raw data into information)

Generate tables and graphs

Facilitate presentation of information and preparation
of analytical reports
SOME OF THE STATISTICAL PACKAGES
BY SOURCE
OPEN
SOURCE
PUBLIC
DOMAIN
FREEWARE
PROPRIETARY
ADD-INS
OpenEpi
BrightStat
BV4.1
SAS
ANALYSE-IT
PSPP
CSPro
GeoDA
STATA
SIGMA XL
R
Epi Info
WinBUGS
SPSS
STATEL
R Commander
X-12-ARIMA
WINPEPI
S-PLUS
SUDAAN
Shogun
INSTAT
WINIDAMS
MINITAB
TOTAL
ACCESS
Statistics
ZAITUN Time
Series
GENSTAT
SSC-STATA
Ploticus
Simfit
E-VIEWS
Statistical Lab
STATISTICA
MAJOR STATISTICAL DATA ANALYSIS
PACKAGES

In terms of the wide usageare;
 STATA
 SAS
–Statistical Analysis System
 SPSS- Statistical Package for Social Sciences
MAJOR STATISTICAL DATA ANALYSIS
PACKAGES –Cont..

Licensing policies
STATA
SAS
SPSS
COST
In US Dollars
295
6000
1599
DURATION
Purchase and
own the version
Annual
Annual
INSTALLATION
Multiple
installations
allowed
One license per
CPU
One license per
CPU
EXTRA COST
No extra pay for
separate modules
No extra cost
Extra Modules like
Survey data, and time
series paid for
MAJOR STATISTICAL DATA ANALYSIS
PACKAGES –Cont..

Other Issues
Installation and
Updates
STATA
SAS
SPSS
Simple
Complicated
Quick and easy
All customers
entitled to
technical support
Students buying
Gradpack are not
entitled to
technical support
Availability of
All customers
Technical support entitled to
from developer
technical support
Web Site
http:/www.stata.com/
http:/www.sas.com
http://w.w.w.spss.com
Add-on-programs
Users permitted to
create new commands
that integrated in the
system
Macros are developed
but cannot be
integrated in the
system
Little space to accept
new macros
ISSUES TO CONSIDER WHEN
CHOOSING A STATISTICAL PACKAGE






Important to know more than one statistical software
package
Analyse your needs with respect to data management
and analysis; and choose a package that addresses
the needs
Ease of importing and exporting data to other
computer programmes
Ease of transferring the output into word processing
facilities
Licensing facility-Purchase to own Vs hire
General Vs Specialized purpose statistical software
UBOS’ EXPERIENCE

UBOS is currently using STATA. Recently STATA Ver 10 and
Statransfer Ver 8 were procured for UBOS, sector ministries and
the Higher Local Governments

Why STATA


It is more sustainable

Cost

One time license
Is it more useful?

Handling data

Graphics for exploration and reports

Capacity for programming

Latest version is Windows based to a great extent

Technical capacity already available for the stakeholders in the NSS
CONCLUSION

Statistical capacity building is necessary in terms of training and mentoring
to;

enable countries assist and also learn from each other

enable all stakeholders involved in the respective countries NSS to
acquire expertise to determine their statistical data analysis needs

enable staff handling statistics in Africa to acquire knowledge to
use statistical packages to process, and analyse data to support
planning and monitoring of development programmes

Choosing a statistical package to use requires analysis of the cost, data
analysis needs and the licensing policy.

The National Statistical Offices should establish collaborative arrangements
with the Statistical Training Institutions to ensure that the graduates train in
the selected statistical packages.
E
ND
THANK YOU
17