Publishing Official Datasets Toby Green OECD Publishing 4th Bloomsbury Conference on e-Publishing and e-Publications 24th and 25th June, 2010


Publishing Official
Datasets
Toby Green
OECD Publishing
4th Bloomsbury Conference on
e-Publishing and e-Publications
24th and 25th June, 2010
Publishing Official Data
in cool ways since 1961
Climategate!
“investigation reveals scientific
concern about missing tree
ring data”. The Guardian, January 2010
Would it have been lost had it
been properly published and
curated?
Should we rely on authors to
self-publish data?
Data is not second class stuff.
It should be just as easy to:
• peer review
• publish
• cite
as research articles.
We simply need the existing
scholarly publishing ‘toolkit’:
• review mechanism
• metadata
• doi identifiers
• CrossRef
So, whereas for books we have this:
(Here’s one OECD prepared earlier . . .)
“Serial” level complete with DOI
“Book or issue” level complete with DOI
“Chapter or article” level complete with DOI
For datasets we could have this:
“Serial” level complete with DOI
“Book or issue” level complete with DOI
“Chapter or article” level complete with DOI
But data is not the same as an
article or book chapter.
Sub-sets can be published.
Sub-sets: each has unique identifier,
with links to the ‘mother’ dataset
Data subset series
Homepage
DOI: 1234.56/Series
Subset 1
Homepage
DOI link to: Main dataset
DOI: 1234.56/Subset#1
Subset 2
Homepage
DOI link to: Main dataset
DOI: 1234.56/Subset#2
Subset 3
Homepage
DOI link to: Main dataset
DOI: 1234.56/Subset#3
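The subset scheme above can be sketched as a small data model. This is only an illustration in Python, not an OECD schema: the names `Dataset`, `parent_doi` and `register_subset` are hypothetical, the DOIs are the placeholder values from the slide, and it assumes the series homepage stands for the main ('mother') dataset.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Dataset:
    """A published dataset or subset (hypothetical model)."""
    title: str
    doi: str
    parent_doi: Optional[str] = None  # link back to the 'mother' dataset
    subset_dois: List[str] = field(default_factory=list)


def register_subset(parent: Dataset, title: str, doi: str) -> Dataset:
    """Create a subset with its own identifier and a link to its parent."""
    subset = Dataset(title=title, doi=doi, parent_doi=parent.doi)
    parent.subset_dois.append(doi)
    return subset


# Placeholder DOIs copied from the slide, not real identifiers.
series = Dataset(title="Data subset series", doi="1234.56/Series")
subsets = [
    register_subset(series, f"Subset {n}", f"1234.56/Subset#{n}")
    for n in (1, 2, 3)
]

for s in subsets:
    print(f"{s.title}: DOI {s.doi} -> main dataset {s.parent_doi}")
```

Each subset carries its own identifier plus a pointer to the parent, so a citation of a subset can always be traced back to the main dataset.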
The same data can have a
different rendition or
graphical interface
Datasets with
multiple renditions: same identifier
Dataset
‘Homepage’
Rendition 1
Rendition 2
Rendition 3
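A minimal sketch of the "multiple renditions, same identifier" idea, assuming the identifier points at a dataset homepage that lists every rendition. The class name, the DOI and the example.org URLs are all hypothetical placeholders.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class DatasetHomepage:
    """One identifier, many renditions (hypothetical model)."""
    doi: str
    renditions: Dict[str, str] = field(default_factory=dict)  # name -> URL

    def add_rendition(self, name: str, url: str) -> None:
        # A new rendition never gets its own DOI; it is listed under the
        # dataset's existing identifier.
        self.renditions[name] = url

    def rendition_names(self) -> List[str]:
        return sorted(self.renditions)


# Placeholder identifier and URLs, for illustration only.
homepage = DatasetHomepage(doi="1234.56/Dataset")
homepage.add_rendition("Rendition 1", "https://example.org/dataset/table")
homepage.add_rendition("Rendition 2", "https://example.org/dataset/chart")
homepage.add_rendition("Rendition 3", "https://example.org/dataset/map")

print(homepage.doi, homepage.rendition_names())
```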
Datasets can grow.
Our current solution is to give
them the same identifier and
explain the growth in the
metadata
Datasets can change.
Our current solution is to give
them a NEW identifier, explain
the change in the metadata,
and provide a link back to the
original dataset.
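The two rules above (growth keeps the identifier, change gets a new one with a link back) can be sketched as follows. This is an illustrative Python sketch, not OECD's actual system: `DatasetRecord`, `record_growth`, `record_change` and the DOIs are hypothetical.

```python
from dataclasses import dataclass, replace
from typing import Optional


@dataclass(frozen=True)
class DatasetRecord:
    """Minimal metadata record (hypothetical fields)."""
    doi: str
    notes: str = ""                     # free-text explanation in the metadata
    replaces_doi: Optional[str] = None  # link back to the original dataset


def record_growth(record: DatasetRecord, note: str) -> DatasetRecord:
    """Dataset grew: keep the same identifier, explain the growth in the metadata."""
    return replace(record, notes=(record.notes + " " + note).strip())


def record_change(record: DatasetRecord, new_doi: str, note: str) -> DatasetRecord:
    """Dataset changed: new identifier, explanation, and a link back to the original."""
    if new_doi == record.doi:
        raise ValueError("a changed dataset needs a new identifier")
    return DatasetRecord(doi=new_doi, notes=note, replaces_doi=record.doi)


# Placeholder identifiers, for illustration only.
original = DatasetRecord(doi="1234.56/Dataset")
grown = record_growth(original, "Added observations for 2010.")
changed = record_change(grown, "1234.56/Dataset-v2", "Revised methodology.")

print(grown)
print(changed)
```

Keeping the record immutable means the original metadata is never overwritten; each growth or change produces a new record that can be stored alongside the old one.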
Read all about it!
http://doi.org/abr
OECD’s
“stuffdata
machine”
(2010)
Jim Gray’s
‘era’ (2008)
Publications
Processed data
Data Presentations
Data
Data publishing workflow at OECD
Statistician and Researcher Responsibility
• Data producer (author)
Publisher Responsibility
• Data Editor: Selection, Quality Assurance, Metadata, Acronym killing, Packaging
• Data Production Editor: DOI allocation, Technical checks
• Data Operations: Hosting, Infrastructure
• Data Marketing & Support: Promotion, Training, Support, Discovery optimisation
End User and Librarian Feedback
I can end it here, or
is there time for
more?
[email protected]
http://statlinks.oecdcode.org/
Great visualisations tell stories
Charles Minard's 1869 chart showing the losses in men, their movements,
and the temperature of Napoleon's 1812 Russian campaign.
TOYS FOR BOYS?
OECD Toys
OECD Factbook iPhone App
http://itunes.apple.com/us/app/oecdfactbook-2010/id327348502?mt=8&uo=6
OECD Regional Statistics eXplorer
http://stats.oecd.org/OECDregionalstatistics/
OECD Factblog
https://community.oecd.org/community/factblog/blog/2010/05/11/tax-who-pays-what
OECD graph generator
http://viz.oecdcode.org/ts/20755104table1/latest
Pimp my data
Facebook privacy (not any more):
http://mattmckeon.com/facebook-privacy/
Why I can’t get a cab outside the UN
building in NY?
http://www.nytimes.com/interactive/2010/04/02/nyregion/taxi-map.html
Why my musician brother grows his own
food
http://www.informationisbeautiful.net/2010/how-much-do-music-artists-earn-online/
How they spend your money
www.wheredoesmymoneygo.org
PIMP KITS and SITES FOR
SHARING DATA
http://statlinks.oecdcode.org/
Thank-you and er…
[email protected]