www.driver-support.eu


Towards International Repositories
Infrastructures
Workshop 16/17 March, 2009
Norbert Lossau,
Director Göttingen State and University Library
& Scientific Coordinator DRIVER
This work is licensed under a Creative Commons License
Attribution Non-commercial ShareAlike 2.0 Germany
Topics
Objectives of the Workshop
Visions - use cases – infrastructure and
components
International Repositories Infrastructure(s) –
Where do we stand today?
Challenges
Global Data Network: a model for the
International Repositories Infrastructure?
How do we proceed: our next two days (and
beyond)
Objectives of the Workshop
1. Identify and establish relationships with key thought
leaders, major projects/activities and services, and
leading practitioners from around the world
2. Suggest commonalities between infrastructures, points of
possible collaboration and pathways that might take the
collaboration forward
3. To come to a shared vision of an international repositories
infrastructure or, at least, the infrastructure components
that might best be developed internationally
4. To identify the essential components of an international
repositories infrastructure
5. To review the approaches to sustainability, scalability and
interoperability being taken by these components, bearing
in mind the wider research infrastructure
6. To consider ways in which the progress might be
coordinated and reviewed over time
7. Focus the agenda to achieve tangible outcomes
High-level Vision …
Free and unrestricted access
to sciences and human knowledge
representation
worldwide,
incl. cultural heritage
Berlin Declaration, October 2003
International Repositories or Knowledge
Infrastructure Vision …
To support the complete research cycle, working with
scientific information:
Discovery & Access
Dissemination & Publishing
Collaboration & Sharing
Management
Usage & manipulation
High-level use cases & possible infrastructure
components
Preservation actions
file format registries
validation tools
representation information registries
Ingest
SWORD
shared metadata services
name / factual authority services
automatic metadata creation services
Access
widespread OAI-ORE implementation
common text-mining API?
Online Reputation and reporting
effective, real-time, automatic forward and backward
citation mechanisms
factual authority (common tagging of objects with
funder / grant number metadata)
Infrastructure components
non-technical factors
Discovery & Access
Established search & browsing behaviours and
pathways
Online Reputation and reporting
Established evaluation mechanisms (impact factor)
Preservation actions
Additional (manual) effort on the author side required?
Ingest
Additional (manual) effort on the author side required?
Use cases - "behind the scenes" infrastructures –
essential components
A ‘component’ might be:
A service (e.g. SHERPA/RoMEO, Connotea, BASE, funder
repository, institutional repository)
A service environment (e.g. Amazon S3, Microsoft Azure,
Facebook)
A technical success factor (e.g. consistent use of DC to
point from a metadata record to the ‘full text’, use of OAI-ORE,
the DRIVER Guidelines), or
a non-technical success factor (e.g. filling repositories
through OA-agreements with publishers).
These components will form the focus for the workshop and
the action plans that will be its principal output.
International Repositories Infrastructure –
Where do we stand today?
„Briefings“
Author identification
Copyright and licensing
Global harvesters (other than search engines)
Harvesters – subject or discipline based
Ingest – selected issues
Institution identifiers
Peer review
Persistent identifiers
Preservation
Prestige and profiling services
Registries
Repository software
Repository support organisations
Storage
Usage reporting and metrics
User services
Validation and certification of repositories
Versioning
Development status of components - for
discussion
Advanced?
Information on the repository landscape
Global harvesters
Preservation of research papers
Repository software
Storage
Validation & certification of repositories
A brief insight into some components =>…
International Repositories Infrastructure –
Where do we stand?
Repositories
• Informal survey carried out by SURF earlier in 2005
• DRIVER Inventory Study 2007
1. Produced 7 studies in 3 publications
• Inventory study into the present type and level of
OAI compliant Digital Repository activities in the EU
• A DRIVER's Guide to European Repositories
• The Investigative Study of Standards for Digital
Repositories and Related Services
2. Disseminated through the DRIVER website (in Open Access) +
as 3 books (Amsterdam University Press)
International Repositories Infrastructure –
Where do we stand?
Repositories
• “Research Repositories in Europe:
the 2008 DRIVER Inventory study”, Maurits van
der Graaf (on behalf of SURF, DRIVER)
=>…
Research Repositories in Europe
Topics
Growth, total number and situation
Contents, coverage and depositing
Technical issues and standards
Services on top of repositories
Steady increase of number of Digital
Repositories
Total of 280, yearly increase by 25-30
Large part of universities in half of European countries
Conclusions content, coverage and depositing
More flexibility in access forms
Trend to more OA
Version of deposited full text articles
Trend towards depositing postprint stage
Work processes for depositing
No harmonisation
Growing (partly) mandatory depositing
32% in 2008, while 25% in 2006
(Still) Coverage of a third
33% of Researchers delivering in repositories
35% of Research output of an institution deposited
Technical issues
Various technical issues                 2008   2006
persistent identifier                     84%    75%
long-term availability secured            52%    73%
statistical data on access and usage      72%    70%
some form of subject indexing             86%    93%
author identifier                         31%    33%
[Chart: repository software in use. Legend: locally developed, OPUS,
CDSware, DiVA, ARNO, DigiTool, Fedora, GNU EPrints, iTOR, DSpace,
MyCoRe, VITAL, other]
Technical issues
Is your repository technically prepared for
Enhanced Publications?
Yes: 46.1%
No, but we have plans to prepare our repository: 32.6%
No, no plans: 21.3%
Metadata standards on the rise:
DIDL, MODS and OAI-ORE
International Repositories Infrastructure –
Where do we stand?
Repositories
• OpenDOAR – a comprehensive register of digital
repositories worldwide
• More than 1300 repositories listed
=>…
OpenDOAR
Repository Type
=> 1072 institutional and 177 disciplinary
Content Type
=> 815 hold journal articles, 318 multimedia / audiovisual, …,
69 datasets, 27 software etc.
OpenDOAR
=>Languages:
1133 English
155 German
96 Spanish
86 French
73 Japanese
…
3 Afrikaans
…
2 Pashto, Pushto
…
1 Bulgarian
1 Romanian
OpenDOAR
=>Disciplines
763 Multidisciplinary
86 Science General
…
99 Health and
medicine
…
98 History and
Archaeology
…
75 Social Sciences
General
Where do we stand?
Repository platforms
& their (international) communities
EPrints*, DSpace*, Fedora Commons*, OPUS
(GE), DiVA (SE), CDS Invenio (CERN),
Where do we stand?
Country organisations and/or repository infrastructures
Australia, Belgium, Brazil, Canada, France,
Germany, Hungary, Ireland, Italy, Japan, The
Netherlands, Nordic countries, Portugal, Spain,
UK, ???
Where do we stand?
Global Harvesters
OAIster, BASE, Scientific Commons
Where do we stand?
Cross-Country
organisations & repository infrastructures
DRIVER
eIFL
Where do we stand?
Repository Infrastructure Architectures
DRIVER
Existing Solutions: Repository Aggregation
Systems (RAS)
RAS aggregate content from OAI-PMH
Repositories, form an Information Space and
provide community-specific functionalities via
Web User Interfaces
Well known examples
BASE (DE)
DAREnet (NL)
OAIster (USA)
Others…
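The aggregation pattern described on this slide can be sketched in a few lines. This is a minimal illustration only, not the implementation used by BASE, DAREnet or OAIster: the two "institution sites" are hypothetical, their OAI-PMH ListRecords responses are generated in memory instead of being harvested over HTTP, and the helper names (`make_response`, `harvest`, `build_index`, `search`) are assumptions made for this example.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Namespaces used by OAI-PMH responses carrying Dublin Core metadata.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}


def make_response(site, titles):
    """Build a tiny, hypothetical ListRecords response for one institution site."""
    records = "".join(
        f"<record><header><identifier>oai:{site}:{i}</identifier></header>"
        f"<metadata><oai_dc:dc><dc:title>{t}</dc:title></oai_dc:dc></metadata></record>"
        for i, t in enumerate(titles, start=1)
    )
    return (
        '<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" '
        'xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" '
        'xmlns:dc="http://purl.org/dc/elements/1.1/">'
        f"<ListRecords>{records}</ListRecords></OAI-PMH>"
    )


def harvest(xml_text):
    """Aggregator step: extract (identifier, title) pairs from one response."""
    root = ET.fromstring(xml_text)
    pairs = []
    for record in root.iterfind(".//oai:record", NS):
        identifier = record.findtext("oai:header/oai:identifier", namespaces=NS)
        title = record.findtext(".//dc:title", default="", namespaces=NS)
        pairs.append((identifier, title))
    return pairs


def build_index(pairs):
    """Index step: map each lower-cased title word to the records containing it."""
    index = defaultdict(set)
    for identifier, title in pairs:
        for word in title.lower().split():
            index[word].add(identifier)
    return index


def search(index, word):
    """Search step: return matching record identifiers in sorted order."""
    return sorted(index.get(word.lower(), set()))


# The "Information Space" is the union of what was harvested from all sites.
information_space = []
for response in (
    make_response("site-a.example", ["Open access repositories in Europe"]),
    make_response("site-b.example", ["Digital repositories and preservation",
                                     "Usage statistics"]),
):
    information_space.extend(harvest(response))

index = build_index(information_space)
print(search(index, "repositories"))
```

A real harvester would additionally issue HTTP requests, follow resumption tokens and handle deleted records; those parts are left out here on purpose.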
[Diagram: RAS architecture. Institution sites expose their content via
OAI-PMH; an Aggregator harvests it into an Information Space; Indexes,
Search and a UI are built on top.]
Service Open Infrastructures (SOI), DRIVER
Inspired by component-oriented systems
Components provide specific functionality in isolation
Components can be provided by different Service
Providers and be shared between applications
Applications are formed by combining independent
components under the control of System Managers
Service Open Infrastructure
Components are distributed services running on the
network at different sites
Open to new instances and types of services: instances or
new functionality can be added or removed at any time
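The idea that service instances and service types can come and go at run time can be sketched with a small registry. This is an illustrative toy under stated assumptions, not the DRIVER enabling layer: the class `ServiceRegistry`, its methods and the site names are all hypothetical, and the "services" are plain Python callables rather than distributed web services.

```python
class ServiceRegistry:
    """Toy registry: service type -> {site: handler}. All names are hypothetical."""

    def __init__(self):
        self._instances = {}

    def register(self, service_type, site, handler):
        # A new instance, or an entirely new type of service, can join at any time.
        self._instances.setdefault(service_type, {})[site] = handler

    def unregister(self, service_type, site):
        # An instance can leave at any time; unknown entries are ignored.
        self._instances.get(service_type, {}).pop(site, None)

    def sites(self, service_type):
        return sorted(self._instances.get(service_type, {}))

    def call(self, service_type, *args):
        # Applications ask for a type of service, not for a specific site.
        instances = self._instances.get(service_type, {})
        if not instances:
            raise LookupError(f"no running instance of {service_type!r}")
        site = sorted(instances)[0]
        return instances[site](*args)


registry = ServiceRegistry()
registry.register("index", "site-a", lambda q: f"site-a results for {q}")
registry.register("index", "site-b", lambda q: f"site-b results for {q}")

first = registry.call("index", "open access")    # served by site-a
registry.unregister("index", "site-a")           # site-a goes away
second = registry.call("index", "open access")   # site-b takes over

# A new type of service is added while the system is running.
registry.register("validator", "site-c", lambda record: "title" in record)
```

The application code never changes when an instance disappears; only the registry content does, which is the property the slide attributes to a Service Open Infrastructure.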
Infrastructure architecture
[Diagram: DRIVER infrastructure architecture with a Functionality Layer,
an Enabling Layer and a Data Layer on top of the repositories. Services
shown: Information, ResultSet, User Interface, Recommendation, Community,
Search, User, Manager, Authz&Authn, OAI-PMH, Index, Browse, Text Engine,
Store, Aggregator, Validator, Collection.]
Reuse
Functionality sharing
[Diagram: functionality sharing. Several applications, each combining UI,
Search, Index, Store and Aggregator components (plus User Profiling and
others), reuse dynamic, distributed services through the Enabling Layer
middleware and its run-time infrastructure. Content resources come from
institution sites via OAI-PMH.]
Where do we stand?
Repository Infrastructure Interoperability
DINI Certificate
DRAMBORA
TRAC project
DRIVER Guidelines, Maurice Vanderfeesten,
SURF + DRIVER partners (some of the following
slides were presented by Maurice on 29 August 2007
at TICER Digital Libraries à la Carte)
Interoperability pragmatics
- Guidelines
- Validate
- Workflow
Guidelines
- Chapter 1: Use of OAI-PMH
- Chapter 2: Use of Metadata OAI_DC
- Chapter 3: Use of Best Practices for OAI_DC
- Chapter 4: Use of Compound Object Wrapping
- Chapter 5: Use of Vocabularies and Semantics
- Chapter 6: Use of Quality labels
- Chapter 7: Use of Persistent Identifiers
- Chapter 8: Use of Usage Statistics Exchange
- Chapter 9: Use of Intellectual Property Rights (IPR)
DRIVER Guidelines in various languages
Guidelines
From the inventory study:
Does your repository follow the DRIVER guidelines?
                                                          n      %
We do not know about the DRIVER guidelines               49   27.5
We know about the DRIVER guidelines,
but do not follow them                                   32   18.0
We know about the DRIVER guidelines and
(make every effort) to follow them                       97   54.5
72.5% know the DRIVER Guidelines; 54.5% try to follow
them
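The percentages on this slide follow directly from the three response counts. The short calculation below reproduces them; the dictionary keys are labels chosen for this example, and the total of 178 respondents is simply the sum of the three counts shown on the slide.

```python
# Response counts as given on the slide (keys are illustrative labels).
responses = {
    "do_not_know": 49,
    "know_but_do_not_follow": 32,
    "know_and_try_to_follow": 97,
}

total = sum(responses.values())


def share(count):
    """Percentage of all respondents, rounded to one decimal place."""
    return round(100 * count / total, 1)


shares = {answer: share(count) for answer, count in responses.items()}

# "Knows the guidelines" combines the two groups that are aware of them.
aware = share(responses["know_but_do_not_follow"]
              + responses["know_and_try_to_follow"])

print(total, shares, aware)
```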
Validator
- Detects interoperability failures
- Goes deep into the metadata content
- Provides explanation about guideline principles per
interoperability feature.
- Offers recommendations on how to modify
your repository to meet interoperability standards
- Creates a report for future reference
=>Developed at the University of Athens
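What such a validator does can be illustrated with a small sketch. It is not the DRIVER validator and does not encode the DRIVER Guidelines: the three checks below (required elements present, at least one identifier that is a URL, dates in ISO 8601 form) are assumptions picked for illustration, as are the function name `validate` and the sample records.

```python
import re
import xml.etree.ElementTree as ET

DC = "{http://purl.org/dc/elements/1.1/}"


def values(root, name):
    """All non-empty text values of a given Dublin Core element."""
    return [e.text.strip() for e in root.iter(DC + name)
            if e.text and e.text.strip()]


def validate(record_xml):
    """Return a report: one finding per failed (illustrative) rule."""
    root = ET.fromstring(record_xml)
    report = []
    # Illustrative rule 1: some elements must be present.
    for name in ("title", "creator", "identifier"):
        if not values(root, name):
            report.append({
                "rule": f"{name}-present",
                "problem": f"dc:{name} is missing",
                "recommendation": f"add a dc:{name} element to the record",
            })
    # Illustrative rule 2: at least one identifier should be a resolvable URL.
    identifiers = values(root, "identifier")
    if identifiers and not any(i.startswith(("http://", "https://"))
                               for i in identifiers):
        report.append({
            "rule": "identifier-is-url",
            "problem": "no dc:identifier is a URL",
            "recommendation": "include a URL pointing to the record or full text",
        })
    # Illustrative rule 3: dates should look like YYYY, YYYY-MM or YYYY-MM-DD.
    for date in values(root, "date"):
        if not re.fullmatch(r"\d{4}(-\d{2}(-\d{2})?)?", date):
            report.append({
                "rule": "date-iso8601",
                "problem": f"dc:date {date!r} is not in ISO 8601 form",
                "recommendation": "write dates as YYYY-MM-DD",
            })
    return report


WRAPPER = ('<oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" '
           'xmlns:dc="http://purl.org/dc/elements/1.1/">{}</oai_dc:dc>')

GOOD = WRAPPER.format(
    "<dc:title>A study</dc:title><dc:creator>Doe, Jane</dc:creator>"
    "<dc:identifier>http://repository.example.org/record/1</dc:identifier>"
    "<dc:date>2008-11-03</dc:date>")
BAD = WRAPPER.format(
    "<dc:title>A study</dc:title>"
    "<dc:identifier>record-1</dc:identifier>"
    "<dc:date>03/11/2008</dc:date>")

for finding in validate(BAD):
    print(finding["rule"], "->", finding["recommendation"])
```

Each finding pairs the detected failure with an explanation and a recommendation, mirroring the report structure listed on the slide.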
Where do we stand?
Technology Components – 2008 Study
Slides from the „Technology Watch“, Karen Van
Godtsenhoven, University of Gent (+ DRIVER
partners)
Structure of Technology Watch Report
Chapters
DRIVER-GRID interaction
Interoperability
Long Term Preservation (LTP)
+
DRIVER-CRIS interaction (added later)
Result: two main parts
New communities and technologies (GRID, CRIS,
LTP)
Interoperability of EPs (5 types)
Structure of each chapter
Theory - Case studies - Outcomes for DRIVER
Interoperability Enhanced Publications
Interoperability in DRIVER context:
Exchange and dissemination of EPs as complex,
compound objects, based on textual publication
Focus on five types of representing and
publishing enhanced publications (relationship
of files within objects)
Envelope models or packaging formats
Overlays, maps, feeds
Embedding formats
New/Old publishing formats
Web services
NOT focus on ingest or descriptive metadata
(Russell, Vanderfeesten, Hochstenbach, Van Godtsenhoven)
Envelopes
Access to metadata, structural data, identifiers,
and binary streams of publications all in one
package
(= envelope)
MPEG 21-DIDL in DARE context
METS
IMS – CP
ODF packages
OOXML/ Package convention
Open e-book package
Comparison: table with all features in doc
Overlays, maps and feeds
SWAP, ORE and POWDER all qualify as good
formats / models for the dissemination of EPs
SWAP uptake by community is very low (high
complexity)
OAI-ORE very popular in community and used in
DRIVER demonstrator for EPs
POWDER: recent W3C standard, viable alternative to
ORE (when the aggregations are of a very dynamic
nature or cannot be simply enumerated)
New Publishing Formats
ODF (ISO/IEC 26300:2006) versus OOXML (ISO/IEC 29500-1:2008)
File format ISO standards for saving & exchanging
office documents (alternative to proprietary formats
e.g. doc, ppt)
Open up access to structured content which can
be reused by other services e.g. DRIVER
Guarantee long term accessibility
Controversy surrounding development of OOXML:
DRIVER should adopt approach that is capable of
using both ODF and OOXML
Plus: many disciplinary xml types, structured and
crawlable data
Where do we stand?
Infrastructure
Technology Components in Practice
Automating and monitoring harvesting, data
processing and indexing processes
DRIVER Repository Map
DRIVER Admin (Internal) Control Panel I
Monitor repository landscape I
DRIVER Admin (Internal) Control Panel II
Monitor repository landscape II
DRIVER Admin (Internal) Control Panel III
Monitor & Process
Repository Data
DRIVER Admin (Internal) Control Panel V
Check repository
index profile updates
Where do we stand?
Linking
publications to datasets (Enhanced Publications)
DRIVER – Enhanced Publications
Technology
The demonstrator aggregates scientific web
resources via OAI-ORE v0.9 and RDF. XSLT is
used to transform these into XHTML. CSS and
JavaScript do the rest of the presentation. A Java
applet is used to dynamically display the relations
between resources. Although these relations can
be fed to the applet as parameters, they are not
yet automatically interpreted from the RDF serialisation.
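Reading the relations straight from the RDF serialisation, the step described above as not yet automated, could look roughly like the sketch below. It is not the demonstrator's code. It assumes a resource map serialised as RDF/XML with `rdf:Description` elements and `ore:aggregates` properties from the OAI-ORE terms namespace; the example URIs are hypothetical, and a general solution would use a proper RDF parser because RDF/XML allows other equivalent forms.

```python
import xml.etree.ElementTree as ET

RDF = "{http://www.w3.org/1999/02/22-rdf-syntax-ns#}"
ORE = "{http://www.openarchives.org/ore/terms/}"

# Hypothetical resource map for one enhanced publication.
RESOURCE_MAP = """
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:ore="http://www.openarchives.org/ore/terms/">
  <rdf:Description rdf:about="http://example.org/ep/1#aggregation">
    <ore:aggregates rdf:resource="http://example.org/ep/1/article.pdf"/>
    <ore:aggregates rdf:resource="http://example.org/ep/1/dataset.csv"/>
  </rdf:Description>
</rdf:RDF>
"""


def aggregated_resources(rdf_xml):
    """Return (aggregation, resource) pairs found in the RDF/XML text."""
    root = ET.fromstring(rdf_xml)
    relations = []
    for description in root.iter(RDF + "Description"):
        subject = description.get(RDF + "about")
        for link in description.findall(ORE + "aggregates"):
            relations.append((subject, link.get(RDF + "resource")))
    return relations


relations = aggregated_resources(RESOURCE_MAP)
for aggregation, resource in relations:
    print(aggregation, "aggregates", resource)
```

The resulting pairs are the kind of relation list that could be handed to a visualisation component instead of hand-written parameters.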
Driver II Definition Enhanced Publication
An Enhanced Publication (EP) is:
a textual publication enhanced with:
research data (evidence of the research) and/or
extra materials (to illustrate or to clarify) and/or
post-publication data (commentaries, ranking)
So: ever developing
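The definition above maps naturally onto a small data structure. The sketch below is one possible reading, not a DRIVER data model: the class and field names are chosen for this example, and resources are represented simply as URL strings.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class EnhancedPublication:
    """A textual publication plus optional enhancements (illustrative model)."""
    textual_publication: str
    research_data: List[str] = field(default_factory=list)          # evidence of the research
    extra_materials: List[str] = field(default_factory=list)        # to illustrate or clarify
    post_publication_data: List[str] = field(default_factory=list)  # commentaries, ranking

    def is_enhanced(self) -> bool:
        # The definition is "and/or": any one kind of enhancement is enough.
        return bool(self.research_data or self.extra_materials
                    or self.post_publication_data)


ep = EnhancedPublication("http://example.org/article.pdf")
before = ep.is_enhanced()

# "Ever developing": enhancements can be attached after publication.
ep.research_data.append("http://example.org/dataset.csv")
ep.post_publication_data.append("http://example.org/comment/1")
after = ep.is_enhanced()
```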
Challenges towards an International Repositories
Infrastructure
Complex matrix, addresses (min.) five main dimensions
Countries (political, finance, organisational, legal issues etc.)
Academic disciplines
Content access & usage
Multiple content resource types
Data Models & Technology
[Diagram: the five dimensions Countries, Disciplines, Content access &
usage, Data Models & Technology, Content type]
Essential: keep all stakeholders and their
perspectives in mind
Researchers/disciplines
Research managers
Library Managers
Repository Managers (technical & content)
Computer Scientists
Publishers & further content providers
Service & Infrastructure providers
Funders
The Diversity & Wealth of academic
disciplines
EC, Framework 6 Programme: 46 pages, c. 40
entries each
Discipline Schema (Keywords): European
Commission
7 main areas
Content type
Text, Manuscript
Drawing
Painting
Photo
Film
Radio, TV Broadcasts
Papyri
Cuneiform tablets
Artefacts
Buildings
Maps
Language audio
recordings
Discipline Schema: Deutsche
Forschungsgemeinschaft, Germany
4 main domains
HSS
Life Sciences
Natural Sciences
Engineering
14 subdomains
Disciplines and their International Repositories
Infrastructures
Examples
arXiv – Physics, Mathematics, Informatics
PubMed - Life Sciences
CLARIN, OLAC – Linguistics, language archives
(datasets, international)
CESSDA – Social Sciences (datasets, international)
DARIAH – Humanities (datasets, international)
RePEc; NEEO – Economics (pre-/postprint
publications, international)
METAFOR – Meteorology, Climate research
(publications + datasets, international)
MACE - Architecture
IVOA - Astronomy
Issues to be addressed on the way towards an
international repositories infrastructure, e.g.
Each discipline – one infrastructure?
Each information type – one infrastructure?
Same data models, technology, same services –
different implementation
Same data models and syntax- different
semantics
Project-specific goals & funding – external liaison
& collaboration
Focus on a specific community, a country, a
region – cross community, cross-country
initiatives
Further issues
Content! Filling repositories
Publications: business models publishers
Research data: culture of sharing data
Global Data Network: a model for
the International Repositories
Infrastructure?
Global Data Networks vs. Global Repository
Infrastructure?
Data networks are „neutral carriers“ of
information - Digital repositories contain the
actual information
Content resources – multiple semantics and
formats
Data networks are generic – knowledge
infrastructures are discipline-specific
Cultural issues for disciplines: „You share the
network – but not your research data“
Financing: hardware vs. service, business
cases
Further, architectural models for
infrastructures?
JISC Information Environment architecture
[Diagram: content providers (JISC-funded, institutional, external);
brokers, aggregators, catalogues and indexes; OpenURL link servers,
media-specific, institutional and subject portals, and learning management
systems; end-user desktop/browser. Shared infrastructure:
authentication/authorisation (Athens), service registries, metadata schema
registries, identifier services, institutional profiling services,
terminology services.]
© Andy Powell (UKOLN, University of Bath), 2005
This work is licensed under a Creative Commons License
Attribution-ShareAlike 2.0
[Diagram, translated from German: tools and services requested by
researchers, grouped by Humanities and Social Sciences, Natural and Life
Sciences, Engineering Sciences and further sciences, with a count (0x to 7x)
attached to each item.
Discipline-specific tools and services: 3-D reconstruction of artefacts,
manuscript transcription, analysis of speech recordings, data
visualisation, ...
Cross-disciplinary services and tools: shared workspace and collaboration
services; "Open Office" programme suite (scholarly workbench); search,
navigation, visualisation, AAR; publication and communication services
(e.g. wikis); data converters, raw data analysis and reference; usage
statistics, citations; semantic social interaction; information extraction
and semantic linking; discipline-specific navigation and visualisation;
data aggregation and linking; data transfer and workflow integration;
image processing and annotation; rapid prototyping.
Basic services: definition of standards (metadata, formats, etc.);
long-term archiving and availability.
Content: document servers, digitised collections, television/radio,
CAD/CAE&S/CAM, research data, multimedia servers, mail archives,
repositories, catalogues/databases.]
D-Grid and links4science workshop, 29 March 2007, Göttingen
[Diagram, translated from German: layered architecture.
Scientific communities, institutions
Discipline-specific tools and services ...
Cross-disciplinary tools and infrastructure: information extraction,
visualisation, Grid/VO search, service catalogue and service registry,
ontology registry and services, metadata registry and services, persistent
identifier resolver, long-term archiving services, repository systems,
content
Virtualised hardware resources]
Learn from other sectors, e.g.
logistics industry?
openID-center
An open platform for the integration
of identification systems
Fraunhofer Institute of Material Flow
and Logistics, www.openID-center.de
How do we proceed?
Four action plans
Organisational structures (Norbert)
Sharing citation data (Les)
‚Repository handshake‘ (Peter)
Identification Infrastructure (Andrew)
=>Aimed to stimulate discussions (drafts have been
circulated beforehand)
Purpose of the action plans
The action plans are not necessarily about
building infrastructure, but about whatever action
needs to be taken so that the components form
an infrastructure capable of supporting the use
cases.
Organisational structures
Suggestions…
Define a clear Statement of Intent/Code of Conduct of the
Confederation in relation to Open Access and repository
developments
Launch the nucleus of an International Repository
Confederation, unifying diverse stakeholders from country
networks, disciplinary networks, technology, research
managers, funders etc.
Commission an international Inventory Study on
disciplinary repository infrastructures
Start a systematic consultation process with discipline
representatives, selected national research funders etc.
from representative regions all over the world
Draft a roadmap for an International Repository
Infrastructure
Sharing citation data
Suggestions …
Revive Citebase, and expand its scope to cover a fuller
range of open access material, including from OA journals
and institutional repositories
Define and implement a common API for citation services
such as Citebase, to enable machine query of the data
Implement the updated “CLADDIER trackback protocol” in
major repository software as part of the core release
Learn lessons from above that impact on repository and
journal practice, eg on metadata consistency. Act on
those lessons
‚Repository handshake‘
Suggestions…
1. Establish working group incl. major interested
parties
2. Define/refine priority use cases
3. Describe negotiations needed for each use case
4. Identify minimum set of tools and mechanisms
5. Identify test partners
6. Implementation
Identification Infrastructure
Suggestions…
Identify relevant (inter)national activities (see
briefing materials)
Define in the abstract who are the trusted sources
of authority for each of the named entities (eg, a
funder is trusted to assert the title of a project)
Identify relevant (inter)national naming and
resolution practice (DOIs, Handles, URNs, etc)
Based on above, and relevant trends / plans,
define a practical roadmap with milestones
Implement roadmap!
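As a small illustration of why naming and resolution practice needs surveying first, the sketch below tells the three identifier schemes mentioned above apart by their surface syntax. The patterns are deliberately simple heuristics and an assumption of this example, not the normative syntax of any scheme; the function name `classify` and the sample identifiers are hypothetical.

```python
import re

# Heuristic patterns only; real schemes have richer, formally defined syntax.
PATTERNS = [
    # DOIs use the "10." prefix; checked first because a DOI is also a Handle.
    ("doi", re.compile(r"^(?:doi:)?10\.\d{4,9}/\S+$", re.IGNORECASE)),
    # URNs look like urn:<namespace>:<namespace-specific string>.
    ("urn", re.compile(r"^urn:[a-z0-9][a-z0-9-]{0,31}:\S+$", re.IGNORECASE)),
    # Handles look like <prefix>/<suffix>, optionally written with "hdl:".
    ("handle", re.compile(r"^(?:hdl:)?[0-9][0-9.]*/\S+$", re.IGNORECASE)),
]


def classify(identifier):
    """Return 'doi', 'urn', 'handle' or 'unknown' for an identifier string."""
    candidate = identifier.strip()
    for scheme, pattern in PATTERNS:
        if pattern.match(candidate):
            return scheme
    return "unknown"


for sample in ("10.1000/182", "urn:nbn:de:gbv:7-webdoc-1234",
               "hdl:1887/4501", "http://example.org/x"):
    print(sample, "->", classify(sample))
```

Recognising the scheme is only the first step; deciding which resolver and which authority to trust for each scheme is the organisational question raised in the action plan.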
In addition:
Organisational structures
Promote complementary, non-technical actions to
technological strands
Sharing citation data <= Engage researchers, learned
societies, research managers, research funders to
discuss new models for evaluation and reputation
schemas
Repository handshake <= Bring together existing and
future initiatives (such as the PEER project) to discuss
policy and legal frameworks, business models and
organisational issues
Identification infrastructures <= Explore how identifiers
will be used in practice in research processes, in
different disciplines, on a broad scale
Outlook: a global network of repository infrastructure hubs?!
[email protected]
www.driver-community.eu