Transcript Dia 1

Use of semantic technologies for publishing
and re-using cultural and scientific heritage
data
Antoine Isaac
SEMIC conference
June 18 2012, Brussels
?
Europeana facts
“Single, direct and multilingual access point to the
European cultural heritage.”
European Parliament, 27 September 2007
23.5 million objects
more than 2.200 institutions
33 countries
Who submits data to Europeana?
Horizontal Aggregators
Vertical Aggregators
National Aggregators
Archives
Culture Grid
APEnet
Libraries
GLAMs
The European Library
Regional Aggregators
Dark Aggregators
Film archives
ATHENA
Flanders museums
ELocal
European Film Gateway
Museums
GLAMs
Mn;kl;k;klj;lkj;lkj;jh;lkj;klj;klj;
GLAMs
klj;klj
What is submitted to Europeana?
1. Thumbnails
2. Metadata
3. Links to digital objects online
TEXT
IMAGE
VIDEO
AUDIO
Making metadata work for Europeana
Building a search engine on top of metadata is difficult
Traditional metadata quality problems: correctness, coverage
Especially when data is so heterogeneous
100s of formats, multilingual data
We currently use a simple flat interoperability format
(ESE)
More semantics-enabled services
Enhance access by semantics
Query expansion, clustering of results
Exploiting various relations: "located in", “more specific concept"…
Goal: to make richer data and services available to us and
others
Semantics are already there, in original metadata
Thesauri, classifications…
ESE loses information
Building a "semantic layer” context
Matches interest for linked data in libraries,
archives and museums
LOD-LAM
Available Linked Library Data
http://www.w3.org/2005/Incubator/lld/XGR-lld-vocabdataset/
Available Library Linked Data
• Element sets/schemas/ontologies
SKOS, Dublin Core, OAI-ORE…
• Value vocabularies/thesauri/authority lists
LCSH, VIAF…
• Datasets
British Library, Chronicling America…
Europeana and Linked Data
Provide trusted, reference data for cultural objects
Promote the use of the technology
Promoting the exchange of data in the community and
with third parties: Open (meta)data!
Europeana and Linked Data
http://vimeo.com/36752317
Some steps in production services
Re-use and linking
Currently: GeoNames, GEMET…
Data re-use can be serendipitous!
From our domain (VIAF, UDC) or others (Eurovoc)
Multilingual resources are key for us
Europeana Data Model
• Representing objects & others: persons, places...
• Linking to internal or external data sources
• Separating original data from enrichments
• Enabling domain-specific data profiles
• Model re-uses existing vocabularies
http://pro.europeana.eu/edm-documentation
data.europeana.eu
Europeana Linked Open Data Pilot
• Fully open metadata
• 2.4 M objects
• 200 individual providers
• 15 countries
Challenges of semantic technology
Really big impact on processes IF you wish so
Requires a lot of education/evangelisation
More complex data modeling is an art
finding the right balance & linking to requirements
Linking datasets remain difficult
needs tooling, involvement of stakeholders
Ongoing work
EDM implementation in
 Data harvesting
 Search, browse etc.
 Data publishing interfaces
Search API, Linked Open Data, data dumps, OAI-PMH…
Metadata enrichment
Summary: benefits of semantic
technologies for Europeana
Vocabularies and datasets to re-use
Flexible approach to building & re-using standards
More flexible approach to interoperability
custom vocabularies co-existing with standard ones
No constraints on the granularity of the data model
Technical ease of connecting and publishing data
Vision relates to open data strategies
ISA?
Contribution to data modeling and exchange
Core vocabularies give good hints on what is needed
Source of data for re-use
Helping our data to be re-used
ADMS
Thank you
Antoine Isaac
[email protected]