United Nations Economic Commission for Europe Statistical Division The Importance of Databases in the Dissemination Process Steven Vale, UNECE.


United Nations Economic Commission for Europe
Statistical Division
The Importance of Databases in
the Dissemination Process
Steven Vale, UNECE
Contents
• How are data currently disseminated?
• Advantages and disadvantages of different approaches
• Introduction to data cubes
• Good practices

06 November 2015
Steven Vale - UNECE Statistical Division
Slide 2
Dissemination Practices
• Web sites of statistical agencies for all 56 UNECE member countries checked during spring 2008.
• Data dissemination systems and formats recorded.
• Not possible to check all national language versions of websites.
Results

Internet Dissemination Tools      Number of Countries    %
Static html / pdf / word pages    29                     51.8%
Excel spreadsheets                12                     21.4%
National database software        17                     30.4%
PC-Axis                           12                     21.4%
Statbank / PC-Axis                3                      5.4%
SuperWEB                          2                      3.6%
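Each percentage in the Results table is the country count divided by the 56 member countries checked. The counts sum to 75, more than 56, which suggests that some countries were recorded under more than one tool. A minimal Python sketch of the arithmetic (the variable names are illustrative and not part of the survey):

```python
# Counts copied from the Results table; 56 is the number of UNECE member
# countries whose websites were checked.
TOTAL_COUNTRIES = 56

counts = {
    "Static html / pdf / word pages": 29,
    "Excel spreadsheets": 12,
    "National database software": 17,
    "PC-Axis": 12,
    "Statbank / PC-Axis": 3,
    "SuperWEB": 2,
}

# Share of the 56 countries recorded for each tool, to one decimal place.
shares = {tool: round(100 * n / TOTAL_COUNTRIES, 1) for tool, n in counts.items()}

for tool, pct in shares.items():
    print(f"{tool}: {pct}%")
```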
Static html / pdf / word Pages

Advantages
• Quick, easy and cheap to prepare
• Data at a glance
• Possible to combine tables, graphics and text
• HTML and PDF viewers are free

Disadvantages
• Only a picture - users cannot easily download or manipulate data
• Manual updates
Excel Spreadsheets

Advantages
• Users can download and customize data
• Most common format for basic data analysis

Disadvantages
• Excel software is not cheap!
• Manual updates
• User has to download the whole file
Output Databases

Advantages
• Interactive with flexible outputs
• User friendly (usually!)
• Can be tailored to national requirements
• Some generic systems available

Disadvantages
• Can be expensive to develop and maintain, particularly if you develop your own system
What Do Users Want?
• Depends on the type of user
• Quick access to key figures
• Options to select and manipulate data
• Easy export to own analysis packages
• Graphic visualizations (maps, charts, ...)
• Appropriate metadata
• Multiple languages
What is a Data Cube?
• A multi-dimensional structure containing data points that represent unique combinations of several classifications
• A flexible way of storing and disseminating data

Two-dimensional Cube

           Year
Country    2000       2001       2002       2003
AAA        123 456    124 567    125 678    126 789
BBB        987 654    988 654    989 654    999 654
CCC        35 789     36 789     37 789     38 789
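One way to picture the two-dimensional cube in code is as a mapping from each unique (country, year) combination to a value. The following is a minimal Python sketch, not a description of any particular dissemination system; the function name `values_for_country` is made up for illustration, and the figures are those shown in the table.

```python
# Each cell is addressed by a unique (country, year) combination.
cube = {
    ("AAA", 2000): 123_456, ("AAA", 2001): 124_567,
    ("AAA", 2002): 125_678, ("AAA", 2003): 126_789,
    ("BBB", 2000): 987_654, ("BBB", 2001): 988_654,
    ("BBB", 2002): 989_654, ("BBB", 2003): 999_654,
    ("CCC", 2000): 35_789, ("CCC", 2001): 36_789,
    ("CCC", 2002): 37_789, ("CCC", 2003): 38_789,
}


def values_for_country(cube, country):
    """Return one row of the table as {year: value}."""
    return {year: value for (c, year), value in cube.items() if c == country}


# A single cell, and a whole row of the table.
print(cube[("CCC", 2002)])
print(values_for_country(cube, "AAA"))
```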
Three-dimensional Cube
More dimensions are possible,
but not easy to display!
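A common way around the display problem is to fix all but two dimensions and show the remaining two-dimensional slice. The sketch below assumes four hypothetical dimensions (country, year, sex, age group) with made-up category codes and placeholder values; `slice_2d` is an illustrative helper, not part of any named tool.

```python
from itertools import product

# Hypothetical dimensions; the category codes are made up for illustration.
dimensions = {
    "country": ["AAA", "BBB"],
    "year": [2000, 2001],
    "sex": ["F", "M"],
    "age_group": ["0-14", "15+"],
}
names = list(dimensions)

# One cell per unique combination of categories, filled with placeholder values.
cube = {key: i for i, key in enumerate(product(*dimensions.values()))}


def slice_2d(cube, names, rows, cols, fixed):
    """Return {row: {col: value}} for the cells matching the fixed categories."""
    r, c = names.index(rows), names.index(cols)
    table = {}
    for key, value in cube.items():
        if all(key[names.index(dim)] == cat for dim, cat in fixed.items()):
            table.setdefault(key[r], {})[key[c]] = value
    return table


# Country-by-year table for one sex and one age group.
table = slice_2d(cube, names, "country", "year", {"sex": "F", "age_group": "15+"})
print(table)
```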
Why Data Cubes are Important
• Many statistical data management models and systems are based on cubes
• Users can select just those data that are of interest
• Cubes can easily be expanded, e.g. for extra years, countries, or other categories
• At least in theory, cubes can have an infinite number of dimensions
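Selection and expansion can be illustrated with a small Python sketch. It assumes the cube is held as a mapping keyed by (country, year), which is only one possible representation; the `select` helper is hypothetical.

```python
cube = {
    ("AAA", 2000): 123_456, ("AAA", 2001): 124_567,
    ("BBB", 2000): 987_654, ("BBB", 2001): 988_654,
}


def select(cube, countries=None, years=None):
    """Keep only cells whose categories were requested (None means all)."""
    return {
        (c, y): v
        for (c, y), v in cube.items()
        if (countries is None or c in countries) and (years is None or y in years)
    }


# Select just the data of interest.
subset = select(cube, countries={"AAA"})

# Expand the cube with an extra year; existing cells are left untouched.
cube[("AAA", 2002)] = 125_678
cube[("BBB", 2002)] = 989_654

print(subset)
print(len(cube))
```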
Good Practices
• Static tables can be useful for key figures
• For detailed or large datasets, allow users to create and manipulate their own tables
• Store data as multi-dimensional cubes
• Offer graphic visualizations
• Allow users to download data in a range of formats (including SDMX)
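Offering several download formats is straightforward once the data are stored as a cube, because each format is just a different serialization of the same cells. The sketch below uses only the Python standard library to produce CSV and JSON; SDMX output is not shown, and the helper names and the column names `country`, `year` and `value` are assumptions made for illustration.

```python
import csv
import io
import json

cube = {("AAA", 2000): 123_456, ("AAA", 2001): 124_567, ("BBB", 2000): 987_654}


def to_records(cube):
    """Flatten the cube into one record per cell, in a stable order."""
    return [{"country": c, "year": y, "value": v} for (c, y), v in sorted(cube.items())]


def to_csv(cube):
    """Serialize the cube as CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=["country", "year", "value"], lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(to_records(cube))
    return buffer.getvalue()


def to_json(cube):
    """Serialize the cube as a JSON array of records."""
    return json.dumps(to_records(cube), indent=2)


print(to_csv(cube))
print(to_json(cube))
```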
Good Practices (2)
• Link data and metadata
• Share development in an open-source environment or network, with an electronic forum for discussions and questions
• Don’t try to re-invent the wheel!
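As one illustration of linking data and metadata, the cells and the information needed to interpret them can be kept in a single object so that they are always delivered together. This is a minimal Python sketch under that assumption; the `Dataset` class, the metadata fields `unit` and `country_labels`, and their values are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Dataset:
    """A cube stored together with the metadata needed to interpret it."""

    cells: dict  # (country, year) -> value
    metadata: dict = field(default_factory=dict)

    def describe(self, key):
        """Return a cell as readable text, using labels and units from the metadata."""
        country, year = key
        label = self.metadata["country_labels"].get(country, country)
        return f"{label}, {year}: {self.cells[key]} {self.metadata['unit']}"


# Hypothetical unit and label, for illustration only.
dataset = Dataset(
    cells={("AAA", 2000): 123_456},
    metadata={"unit": "persons", "country_labels": {"AAA": "Country AAA"}},
)

print(dataset.describe(("AAA", 2000)))
```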
Thank you for listening
Questions?