CLARINO
WP2 National Registry and Long-Term Archiving
Freddy Wetjen and Oddrun Pauline Ohren
National Library of Norway
Bergen, 12 September 2013
National Registry of metadata
• Goal
– Joint metadata registry of resources in all CLARINO centres
• Harvest data from all CLARINO centres
• Exchange data with other national CLARIN centres
• Status – current situation
• On-going and planned activities
National Registry of metadata
Status (1)
• Metadata registry version 1 is running
– Search/browse, editing and management, but no harvesting facilities
– Infrastructure:
• META-SHARE infrastructure 3.0
– http://metashare.nb.no/, proxied by the managing node http://metashare.tilde.com/
– Metadata complying with the META-SHARE metadata format 3.0
– No harvesting facilities
– Metadata content:
• 71 resources
– Usage:
• As of 11 September 2013: 37 of the resources had been downloaded between 1 and 17 times each
– Norwegian Wordnet (Bokmål) at the top
– Top download locations: Norway, Germany, Greece, Sweden
National Registry of metadata
Status (2)
• Decision made: Migrate to CMDI (CLARIN platform)
– Uncertain future for META-SHARE
• 2 years guaranteed life span
– Need for more adaptability and expressivity in the metadata model
– Increased involvement with the CLARIN community
National Registry of metadata
Planned activities
• Build a basic CMDI infrastructure
– Repository, editor, search service, PID scheme, harvesting
• Convert metadata from META-SHARE to CMDI
– Use META-SHARE profile as specified in Component Registry
• Extend/adapt metadata model according to need
– In collaboration with the other CLARINO centres
CMDI Metadata framework
[Diagram. Labels: Definitions of concepts used in metadata components; ISOcat Concept Registry; Relation Registry; Other trusted concept registries; CLARIN Component Registry; Component editor; META-SHARE components, among others; Metadata modeler; «My profile»; Joint Metadata Repository; Search Service; Metadata editor; Metadata creator; User; Centres: Språkbanken, TextLab, EDD, Bergen Centre, LAP, other centres]
Adaptation of Broeder, D. A Data Category Registry- and Component-based Metadata Framework. LREC 2010.
National Registry of metadata: Services
[Diagram. Labels: CLARIN common infrastructure; «Our profiles»; OAI/PMH harvesting; Repository; Search Services; Weblicht; VLO; FCS?; Metadata Editor (Arbil..?); Metadata creator; CMDI]
Long term archiving
[Diagram. Labels: Data Repository; Metadata editor; Resources; Data Delivery client; Processing and adaptation for long term storage (checksum, PID, metadata etc.); NB long term storage (preservation)]
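The processing step in the archiving diagram mentions a checksum and a PID attached to each delivery. A minimal sketch of what such a step could look like follows. The choice of SHA-256, the record layout, the function names and the PID value are all assumptions made for illustration; the slides do not specify any of them.

```python
import hashlib
import io


def sha256_of_stream(stream, chunk_size=65536):
    """Return the hex SHA-256 digest of a binary stream, read in chunks."""
    digest = hashlib.sha256()
    while True:
        chunk = stream.read(chunk_size)
        if not chunk:
            break
        digest.update(chunk)
    return digest.hexdigest()


def build_archive_record(pid, stream):
    """Bundle a (hypothetical) PID with the checksum of the delivered data.

    The dictionary layout is an assumption, not the archive's real format.
    """
    return {
        "pid": pid,
        "checksum_algorithm": "SHA-256",
        "checksum": sha256_of_stream(stream),
    }


# Usage with an in-memory stand-in for a delivered file.
record = build_archive_record("hdl:0000/example", io.BytesIO(b"abc"))
```

Reading in chunks keeps memory use flat for large resources; the stored digest can later be recomputed to check that the preserved copy is unchanged.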
Time perspective
• Metadata registry version 2: early 2014
– Basic CMDI infrastructure
• Existing metadata converted from META-SHARE
• OAI/PMH endpoint, but no harvesting from other centres
• Metadata registry version 3: mid 2015
– Extended/adapted metadata model
– Harvesting from other CLARINO centres
• Long term archiving: mid 2014, with both data and metadata
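The timeline separates exposing an OAI/PMH endpoint (version 2) from harvesting other centres (version 3). As an illustration of the harvester side, the sketch below builds OAI-PMH `ListRecords` request URLs and reads record identifiers and the resumption token from a response. The base URL, the `cmdi` metadata prefix and the sample response are assumptions for illustration only; no network call is made.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Namespace defined by the OAI-PMH 2.0 protocol.
OAI_NS = {"oai": "http://www.openarchives.org/OAI/2.0/"}


def build_list_records_url(base_url, metadata_prefix="cmdi", resumption_token=None):
    """Build a ListRecords request URL.

    In OAI-PMH, resumptionToken is an exclusive argument, so it replaces
    metadataPrefix on follow-up requests.
    """
    if resumption_token:
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    return base_url + "?" + urlencode(params)


def parse_list_records(xml_text):
    """Return (record identifiers, resumption token or None) from a response."""
    root = ET.fromstring(xml_text)
    identifiers = [
        el.text
        for el in root.findall(".//oai:record/oai:header/oai:identifier", OAI_NS)
    ]
    token_el = root.find(".//oai:resumptionToken", OAI_NS)
    token = token_el.text if token_el is not None and token_el.text else None
    return identifiers, token


# Hypothetical response from a centre's endpoint, trimmed to the parts used here.
SAMPLE_RESPONSE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record><header><identifier>oai:example.org:1</identifier></header></record>
    <record><header><identifier>oai:example.org:2</identifier></header></record>
    <resumptionToken>page2</resumptionToken>
  </ListRecords>
</OAI-PMH>"""

first_url = build_list_records_url("https://example.org/oai")
identifiers, token = parse_list_records(SAMPLE_RESPONSE)
next_url = build_list_records_url("https://example.org/oai", resumption_token=token)
```

A real harvester would fetch each URL, repeat until no resumption token is returned, and store the CMDI payload of every record in the joint repository.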