Disseminating Statistics: Internet and Publications
INE – Madrid, 3-5 March 2008
How to link publications and
Internet in order to create
complementary dissemination
Ulrich Wieland, Eurostat
Agenda
A. The Eurostat communication strategy
B. Website vs. paper publications
C. Integration from the user's perspective
D. Integration from the producer's perspective
A. The Eurostat communication strategy
Adopted in February 2006
Follows the free dissemination policy
From dissemination to communication
Objectives of the Eurostat communication
strategy
To be the first-choice source for European statistical
data
To increase the level of service to users
To present the Eurostat figures attractively and, as
far as possible, to accompany them with clear and
objective comments
Some elements of the communication
strategy
The dissemination of statistics must be an integral
part of the work of the units which produce statistics
The website must be the main
dissemination/communication tool
Publications should concentrate on analytical
publications and on short publications
(pocketbooks) aiming to increase Eurostat's
visibility
B. Website vs. paper publications
What is the role of paper publications in the age of
the Internet?
Advantages of the Website
Easy update
Easy distribution
Easy access
Sophisticated search functions
Re-usability
Cheap production (?)
The need for publications
Analytical publications are easier to read on paper
Books can be an important image factor
Publicity handed out on paper reaches a different
audience than electronic publicity
Publications are usually better archived
Complementarity
Both the Internet and the paper publications have
their role to play in dissemination
Consider the Web site and paper as two different media
of the same dissemination process. Make sure that
you use the right medium for the objective of your
dissemination; don't hesitate to disseminate the
same information on different media
Recommendations
Make all public information available on the Web
site
Use publications for:
– Flagship publications (e.g. yearbook)
– Analytical publications (more than 50% of text)
– Small publications for publicity purposes (press
releases, pocketbooks, leaflets)
– Publications with a large readership
C. Integration from the user’s perspective
Accessing publications from the Website: PDF files
and the Web search
Linking from publications to the Web site
Look and feel
Books on the Internet
Offer all your paper publications as PDF files on
your Website
Use user-friendly features such as a table of contents,
hyperlinks etc.
Pay attention to the use of colour (user might print
in b&w)
Pay attention to file sizes for downloading
Books on the Internet - Example
The Internet Search
Make sure that all books can easily be found
through the search on your website
Allow for search by title, abstract, collection, release
date, keyword
Offer on-line ordering of paper copies (and of pdf
files if they are not available free of charge)
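The search criteria listed above can be sketched as a small filter over a publication catalogue. This is only an illustration: the `Publication` record, the `search` function and the two catalogue entries are hypothetical and do not describe the Eurostat website.

```python
from dataclasses import dataclass, field
from datetime import date
from typing import List


@dataclass
class Publication:
    # Fields mirror the search criteria named on the slide.
    title: str
    abstract: str
    collection: str
    release_date: date
    keywords: List[str] = field(default_factory=list)


def search(catalogue, text=None, collection=None, released_from=None, keyword=None):
    """Return the publications that satisfy every criterion that was given."""
    results = []
    for pub in catalogue:
        # Free-text match against title and abstract, case-insensitive.
        if text and text.lower() not in (pub.title + " " + pub.abstract).lower():
            continue
        if collection and pub.collection != collection:
            continue
        if released_from and pub.release_date < released_from:
            continue
        if keyword and keyword.lower() not in [k.lower() for k in pub.keywords]:
            continue
        results.append(pub)
    return results


# Made-up catalogue entries, for illustration only.
CATALOGUE = [
    Publication("Sample yearbook", "Overview of key figures", "Yearbooks",
                date(2007, 9, 1), ["overview", "population"]),
    Publication("Sample pocketbook", "Short selection of tables", "Pocketbooks",
                date(2008, 1, 15), ["population"]),
]

for pub in search(CATALOGUE, keyword="population", released_from=date(2008, 1, 1)):
    print(pub.title)
```

Criteria that are left out are simply ignored, so the same function serves a title-only search and a combined search.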
Pricing policy
Agree on a clear pricing policy (e.g. for paper
publications, pdf files, data base access and other
services)
Often, the pricing policy reflects the printing and
distribution cost; other conflicting criteria include
maximising income or supporting a large
distribution
Harmonise the Look and Feel
Use your logo extensively
Use the same symbols and abbreviations
Use the same terminology
Use similar layout and fonts
-> develop a style guide for both publications and
web site
The DOI (Digital Object Identifier)
A system for identifying content objects in the digital
environment.
Provides a framework for persistent identification of
intellectual content.
Managed by the International DOI Foundation
Recently accepted for standardisation within
ISO.
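As a rough illustration of persistent identification: a DOI consists of a prefix beginning with "10." and a suffix, separated by a slash, and it can be turned into a web address through the public resolver at doi.org. The sketch below assumes that resolver; the helper names are hypothetical and the validity check is deliberately simplified.

```python
from urllib.parse import quote

# Public DOI resolver (assumed here; not mentioned on the slide).
RESOLVER = "https://doi.org/"


def split_doi(doi):
    """Split a DOI into (prefix, suffix); simplified syntax check only."""
    prefix, sep, suffix = doi.strip().partition("/")
    if not (prefix.startswith("10.") and len(prefix) > 3 and sep and suffix):
        raise ValueError(f"not a DOI: {doi!r}")
    return prefix, suffix


def doi_to_url(doi):
    """Build a resolver URL; the suffix is percent-encoded where needed."""
    prefix, suffix = split_doi(doi)
    return RESOLVER + prefix + "/" + quote(suffix, safe="/")


# Sample DOI, used here purely as a syntactically valid example.
print(doi_to_url("10.1000/182"))
```

The identifier stays the same even if the resolver later redirects to a different location, which is what makes it persistent.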
The Eurostat code I
A Eurostat-specific code to identify tables and data
sets on the web site
In contrast to the DOI, it refers to dynamic (regularly
updated) data sets rather than to static data sets
The codes remain stable over a long time period
The Eurostat code II
All tables and graphs in a publication will receive a
code (e.g. TPS00021) which refers to the
corresponding data set or table on the web site
Data sets and tables can be accessed either by
typing the code into the Eurostat search field or (for
pdf versions) by clicking on the hyperlink
The Eurostat code III
Structure of the code:
Example: TPS00021
T = Table
PS = <theme> (Population and social
statistics)
00021 = <sequential number>
The code system can be extended to cover other
objects such as meta data, definitions, nodes in
data trees, etc.
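The code structure described above can be sketched as a small parser. Only "T" (Table) and "PS" (Population and social statistics) come from the slide. The fixed pattern of one letter, two letters and five digits is an assumption generalised from the single example TPS00021, and the names `parse_code` and `EurostatCode` are hypothetical.

```python
import re
from typing import NamedTuple

# Only these two entries are documented on the slide; anything else
# would be an addition to the lookup tables.
OBJECT_TYPES = {"T": "Table"}
THEMES = {"PS": "Population and social statistics"}

# Assumed pattern: 1 letter (object type), 2 letters (theme), 5 digits.
CODE_PATTERN = re.compile(r"^([A-Z])([A-Z]{2})(\d{5})$")


class EurostatCode(NamedTuple):
    object_type: str
    theme: str
    number: int


def parse_code(code: str) -> EurostatCode:
    """Split a code such as TPS00021 into its three parts."""
    match = CODE_PATTERN.match(code.strip().upper())
    if match is None:
        raise ValueError(f"not a recognised code: {code!r}")
    type_letter, theme_letters, digits = match.groups()
    # Unknown letters are passed through unchanged rather than guessed.
    return EurostatCode(
        object_type=OBJECT_TYPES.get(type_letter, type_letter),
        theme=THEMES.get(theme_letters, theme_letters),
        number=int(digits),
    )


print(parse_code("TPS00021"))
```

Extending the system to other objects (metadata, definitions, tree nodes) would then amount to adding entries to the lookup tables.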
The Eurostat code IV
Implemented in the Eurostat yearbook and the
Eurostat pocketbook
In the future, all publications will use the Eurostat
code
Currently, no attempts are made in Eurostat to
implement the DOI
D. Integration from the producer’s
perspective
How to avoid separate production processes and
redundant work
when creating publications and web content?
XML publishing: Where are we now?
XML publishing
XML publishing is a method of creating from the
same content various types of output, in particular
pdf and html
It uses a standardised format for representing
content (text, tables, graphs etc.)
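A minimal sketch of the single-source idea, using only the Python standard library: one XML content document is rendered into two outputs. The element names (`publication`, `title`, `para`) are made up for illustration, and plain text stands in for PDF, which would need additional tooling. A real set-up would more likely use XSL style sheets than hand-written functions.

```python
import xml.etree.ElementTree as ET

# Hypothetical content document: it describes content, not layout.
SOURCE = """
<publication>
  <title>Population</title>
  <para>Figures for the EU.</para>
</publication>
"""


def to_html(root):
    """Render the content as a minimal HTML page."""
    html = ET.Element("html")
    body = ET.SubElement(html, "body")
    ET.SubElement(body, "h1").text = root.findtext("title")
    for para in root.findall("para"):
        ET.SubElement(body, "p").text = para.text
    return ET.tostring(html, encoding="unicode")


def to_text(root):
    """Render the same content as underlined plain text."""
    title = root.findtext("title")
    lines = [title, "=" * len(title)]
    lines += [para.text for para in root.findall("para")]
    return "\n".join(lines)


root = ET.fromstring(SOURCE)
print(to_html(root))
print(to_text(root))
```

Because the layout decisions live in the two rendering functions, a correction to the content only has to be made once.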
XML publishing: Where do we want to get?
[Diagram: a database, MS Office and other documents feed an XML document;
an XSL processor combines it with an XSL style sheet document to produce
HTML, PDF or any other structured format]
What is XML?
A standardised language to represent content (not
layout)
Supported by many suppliers including Microsoft
Standard software to convert from and to XML is
available
See www.xml.net
Predefined tables
Example of an XML document (very simplified)
<AxisY name="geo">
  <Position value="eu15">
    <AxisX name="time">
      <Position value="1996">
        <Cell value="109.6" />
      </Position>
      <Position value="1997">
        <Cell value="109.5" />
      </Position>
    </AxisX>
  </Position>
</AxisY>
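To show how such a document can be read back into rows, the sketch below parses the fragment with Python's standard library. The fragment is reproduced with its open elements closed so that it is well-formed; the function name `read_cells` and the shape of the output rows are assumptions made for illustration.

```python
import xml.etree.ElementTree as ET

# The slide's fragment, completed with closing tags so that it parses.
FRAGMENT = """
<AxisY name="geo">
  <Position value="eu15">
    <AxisX name="time">
      <Position value="1996"><Cell value="109.6" /></Position>
      <Position value="1997"><Cell value="109.5" /></Position>
    </AxisX>
  </Position>
</AxisY>
"""


def read_cells(xml_text):
    """Flatten the nested axes into one dictionary per cell."""
    axis_y = ET.fromstring(xml_text)
    rows = []
    for y_pos in axis_y.findall("Position"):
        for axis_x in y_pos.findall("AxisX"):
            for x_pos in axis_x.findall("Position"):
                cell = x_pos.find("Cell")
                rows.append({
                    axis_y.get("name"): y_pos.get("value"),   # e.g. geo
                    axis_x.get("name"): x_pos.get("value"),   # e.g. time
                    "value": float(cell.get("value")),
                })
    return rows


for row in read_cells(FRAGMENT):
    print(row)
```

Each row carries its own axis names, so the same function would work for a table whose axes are not "geo" and "time".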
XML in Eurostat – predefined tables
[Diagram: data and dictionaries are combined into XML, which the XSLT
process transforms into HTML using an XSL style sheet]
The problem: Where do we want to get?
[Diagram: a database, MS Office and other documents feed an XML document;
an XSL processor combines it with an XSL style sheet document to produce
HTML, PDF or any other structured format]
Strategic questions:
When is XML publishing a good choice?
Costs and benefits of an XML-based approach?
How and when should the dissemination
programme be modified to better fit the technology?
Technical question 1: Standards
Which XML language to use:
– DocBook (common format for publications)
– SDMX-ML (standard format for statistical data
and meta data)
– CoSSI (developed by Statistics Finland)
– ODF (Open Document Format)
Eurostat has chosen the Open Document Format
Technical question 2: XML content creation
Native XML editor or standard Office software
Database output in XML
Conversion software
Practical experiences
Statistics Finland
Statec Luxembourg (Yearbook)
Eurostat activities
Development of SDMX-ML together with other
international statistical bodies
Workshop on XML publishing in October 2007
Study to determine the best way to implement XML
publishing in Eurostat
Attempts will be made to work closely with some
NSOs in the development of XML publishing
Pilot implementation using the Eurostat pocketbook
Contact
Ulrich Wieland
Head of Sector Publications
EUROSTAT - Unit B6
BECH A3-100
European Commission
L-2920 Luxembourg
Tel. +352 4301 33644
E-mail: [email protected]