United Nations Economic Commission for Europe Statistical Division The Importance of Databases in the Dissemination Process Steven Vale, UNECE.
Download ReportTranscript United Nations Economic Commission for Europe Statistical Division The Importance of Databases in the Dissemination Process Steven Vale, UNECE.
United Nations Economic Commission for Europe Statistical Division The Importance of Databases in the Dissemination Process Steven Vale, UNECE Contents How are data currently disseminated? Advantages and disadvantages of different approaches Introduction to data cubes Good practices 06 November 2015 Steven Vale - UNECE Statistical Division Slide 2 Dissemination Practices Web sites of statistical agencies for all 56 UNECE member countries checked during spring 2008. Data dissemination systems and formats recorded. Not possible to check all national language versions of websites. 06 November 2015 Steven Vale - UNECE Statistical Division Slide 3 Results Number of Countries % Static html / pdf / word pages 29 51.8% Excel spreadsheets 12 21.4% National database software 17 30.4% PC-Axis 12 21.4% Statbank / PC-Axis 3 5.4% SuperWEB 2 3.6% Internet Dissemination Tools 06 November 2015 Steven Vale - UNECE Statistical Division Slide 4 Static html / pdf / word Pages 06 November 2015 Steven Vale - UNECE Statistical Division Slide 5 Static html / pdf / word Pages Advantages • Quick, easy and cheap to prepare • Data at a glance • Possible to combine tables, graphics and text • Html and pdf viewers are free Disadvantages • Only a picture - users can not easily download or manipulate data • Manual updates 06 November 2015 Steven Vale - UNECE Statistical Division Slide 6 Excel Spreadsheets 06 November 2015 Steven Vale - UNECE Statistical Division Slide 7 Excel Spreadsheets Advantages • Users can download and customize data • Most common format for basic data analysis Disadvantages • Excel software is not cheap! • Manual updates • User has to download the whole file 06 November 2015 Steven Vale - UNECE Statistical Division Slide 8 Output Databases 06 November 2015 Steven Vale - UNECE Statistical Division Slide 9 Output Databases Advantages • Interactive with flexible outputs • User friendly (usually!) • Can be tailored to national requirements • Some generic systems available Disadvantages • Can be expensive to develop and maintain, particularly if you develop your own system 06 November 2015 Steven Vale - UNECE Statistical Division Slide 10 What Do Users Want? Depends on the type of user Quick access to key figures Options to select and manipulate data Easy export to own analysis packages Graphic visualizations (maps, charts, ..) Appropriate metadata Multiple languages 06 November 2015 Steven Vale - UNECE Statistical Division Slide 11 What is a Data Cube? A multi-dimensional structure containing data points that represent unique combinations of several classifications A flexible way of storing and disseminating data 06 November 2015 Steven Vale - UNECE Statistical Division Slide 12 Two-dimensional Cube Year Country 2000 2001 2002 2003 AAA 123 456 124 567 125 678 126 789 BBB 987 654 988 654 989 654 999 654 CCC 35 789 06 November 2015 36 789 37 789 Steven Vale - UNECE Statistical Division 38 789 Slide 13 Threedimensional Cube 06 November 2015 Steven Vale - UNECE Statistical Division Slide 14 More dimensions are possible, but not easy to display! 06 November 2015 Steven Vale - UNECE Statistical Division Slide 15 Why Data Cubes are Important Many statistical data management models and systems are based on cubes Users can select just those data that are of interest Cubes can easily be expanded, e.g. for extra years, countries, or other categories At least in theory, cubes can have an infinite number of dimensions 06 November 2015 Steven Vale - UNECE Statistical Division Slide 16 Good Practices Static tables can be useful for key figures For detailed or large datasets, allow users to create and manipulate their own tables Store data as multi-dimensional cubes Offer graphic visualizations Allow users to download data in a range of formats (including SDMX) 06 November 2015 Steven Vale - UNECE Statistical Division Slide 17 Good Practices (2) Link data and metadata Share development in an open-source environment or network, with an electronic forum for discussions and questions Don’t try to re-invent the wheel! 06 November 2015 Steven Vale - UNECE Statistical Division Slide 18 Thank you for listening Questions? 06 November 2015 Steven Vale - UNECE Statistical Division Slide 19