Digital repositories as
research infrastructure:
a UK perspective
Dr Liz Lyon
Director
UKOLN is supported by:
This work is licensed under a Creative Commons Licence
Attribution-ShareAlike 2.0
www.ukoln.ac.uk
A centre of expertise in digital information management
[Figure: The scholarly knowledge cycle. Liz Lyon, Ariadne, July 2003. © Liz Lyon (UKOLN, University of Bath), 2005]
Diagram labels:
• Data creation / capture / gathering: laboratory experiments, Grids, fieldwork, surveys, media
• Data analysis, transformation, mining, modelling
• Research & e-Science workflows
• Learning & Teaching workflows
• Learning object creation, re-use
• Deposit / self-archiving
• Repositories: institutional, e-prints, subject, data, learning objects
• Harvesting metadata
• Aggregator services: national, commercial
• Searching, harvesting, embedding
• Presentation services: subject, media-specific, data, commercial portals
• Institutional presentation services: portals, Learning Management Systems, u/g, p/g courses, modules
• Resource discovery, linking, embedding
• Validation
• Publication
• Peer-reviewed publications: journals, conference proceedings
• Quality assurance bodies
“JISC Vision”: a global landscape of federated repositories
• Multi-disciplinary, cross-sectoral
• e-Framework and Information Environment context
• National, institutional
• Define common + domain-specific + repository “services”
• Different platforms
• Many format types: data, eprints, images, geospatial
• Interoperability based on open standards, software tools
[Diagram labels: repositories; fusion layer ‘repository federator’; portals; heterogeneous - metadata formats, content formats, identifiers, packaging standards; homogeneous - metadata formats, content formats, identifiers, packaging standards]
From Andy Powell: http://www.ukoln.ac.uk/distributed-systems/jiscie/arch/presentations/jiie-jcs-2005/
[Figure: JISC Information Environment architecture. © Andy Powell (UKOLN, University of Bath), 2005]
Diagram labels:
• Content providers: JISC-funded, institutional, external
• Brokers, aggregators, catalogues, indexes
• OpenURL link servers
• Portals: media-specific, institutional, subject
• Learning management systems
• Shared infrastructure: authentication/authorisation (Athens), service registries, metadata schema registries, identifier services, institutional profiling services, terminology services
• End-user desktop/browser
Update on JISC DR activity 1
• Commissioned reports: Review (Feb 2005), Roadmap (April
2006), Linking UK Repositories (June 2006)
• £4M DR Programme 2005
– 21 Projects: some working with data, VERSIONS (of eprints)
• DR support at UKOLN: wiki
http://www.ukoln.ac.uk/repositories/digirep/index/JISC_Digital_Repository_Wiki
– Advocacy Package (autumn 2006)
– Project synthesis, collecting user scenarios, developing use cases,
scoping/evaluating reference models: OAIS?
– Standards (and harmonisation)
– ePrints Dublin Core Application Profile Working Group
– “Remote deposit” API Working Group (Mellon New York meeting)
• UK IR cross search service (eprints)
e-Research: understanding business process
• Project StORe: Source-to-Output
Repositories (Edinburgh)
– primary data : research publications
– Survey questionnaire
• RepoMMan: Repository Metadata and
Management (Hull)
– Survey questionnaire and interviews
– Activity diagram
• R4L: Repository for the Laboratory
(Southampton)
– Crystallography workflow analysis, automated
data capture, user deposit scenarios
[Diagram labels: RAW DATA; DERIVED DATA; RESULTS DATA]
eBank UK Project
http://www.ukoln.ac.uk/projects/ebank-uk/
• Promote open access crystallography data
• Aggregator service harvests OAI metadata from institutional
data repository (e-Crystals archive)
• Service linking from data to derived research publication
• Embedding eBank service in learning workflows: pedagogy
• Future federation plans for crystallography data repositories
UKOLN (lead), University of Southampton, University of
Manchester
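The harvesting step mentioned above can be sketched as follows. This is a minimal illustration, assuming the data repository exposes a standard OAI-PMH 2.0 endpoint that serves unqualified Dublin Core (`oai_dc`); the base URL, the sample identifier and the sample title are placeholders, not details of the e-Crystals archive or the eBank aggregator.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

# Namespaces defined by the OAI-PMH 2.0 and Dublin Core specifications.
OAI = "{http://www.openarchives.org/OAI/2.0/}"
DC = "{http://purl.org/dc/elements/1.1/}"


def list_records_url(base_url, metadata_prefix="oai_dc"):
    """Build an OAI-PMH ListRecords request URL for a repository endpoint."""
    query = urlencode({"verb": "ListRecords", "metadataPrefix": metadata_prefix})
    return base_url + "?" + query


def parse_records(xml_text):
    """Return (identifier, title) pairs from a ListRecords response."""
    root = ET.fromstring(xml_text)
    records = []
    for record in root.iter(OAI + "record"):
        identifier = record.findtext(f"{OAI}header/{OAI}identifier")
        title = record.findtext(f".//{DC}title")
        records.append((identifier, title))
    return records


# Hand-written sample response (placeholder values) so the sketch runs offline.
SAMPLE_RESPONSE = """
<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:repository.example.org:145</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Example crystal structure</dc:title>
        </oai_dc:dc>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>
"""

print(list_records_url("http://repository.example.org/oai"))
print(parse_records(SAMPLE_RESPONSE))
```

A real harvester would fetch the URL over HTTP and follow `resumptionToken` elements to page through large result sets; both are omitted here to keep the sketch short.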
eBank Metadata Publication
• Using simple Dublin Core
– Crystal structure
– Title (Systematic IUPAC Name)
– Authors
– Affiliation
– Creation Date
• Additional chemical information through Qualified Dublin Core
– Empirical formula
– International Chemical Identifier (InChI)
– Compound Class & Keywords
• Specifies which ‘datasets’ are present in an entry
• Application Profile: http://www.ukoln.ac.uk/projects/ebank-uk/schemas/
• DOIs, data citation: http://dx.doi.org/10.1594/ecrystals.chem.soton.ac.uk/145
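As a rough illustration of the record structure listed above, the sketch below builds a metadata record that mixes simple Dublin Core elements with additional chemical fields. The `dc` namespace is the standard Dublin Core one; the `chem` namespace and its element names (`empiricalFormula`, `inchi`) are hypothetical stand-ins, since the actual terms are defined in the eBank application profile, and all field values are placeholders rather than metadata from a real entry.

```python
import xml.etree.ElementTree as ET

DC_NS = "http://purl.org/dc/elements/1.1/"
# Hypothetical namespace standing in for the chemistry-specific terms.
CHEM_NS = "http://example.org/hypothetical-chem-terms/"

ET.register_namespace("dc", DC_NS)
ET.register_namespace("chem", CHEM_NS)


def build_record(title, creators, date, identifier, formula, inchi):
    """Build a record with simple DC fields plus chemistry-specific fields."""
    record = ET.Element("record")
    ET.SubElement(record, f"{{{DC_NS}}}title").text = title
    for creator in creators:
        ET.SubElement(record, f"{{{DC_NS}}}creator").text = creator
    ET.SubElement(record, f"{{{DC_NS}}}date").text = date
    ET.SubElement(record, f"{{{DC_NS}}}identifier").text = identifier
    # Element names below are illustrative, not taken from the eBank schemas.
    ET.SubElement(record, f"{{{CHEM_NS}}}empiricalFormula").text = formula
    ET.SubElement(record, f"{{{CHEM_NS}}}inchi").text = inchi
    return record


# Placeholder values only.
example = build_record(
    title="ethanol",
    creators=["Author, A.", "Author, B."],
    date="2005-01-01",
    identifier="http://dx.doi.org/10.0000/placeholder",
    formula="C2H6O",
    inchi="InChI=1S/C2H6O/c1-2-3/h3H,2H2,1H3",
)
print(ET.tostring(example, encoding="unicode"))
```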
Discovering data:
• Domain identifier: International Chemical Identifier (InChI) code
• Google molecule using InChI
Slide from Simon Coles
Coles, S.J., Day, N.E., Murray-Rust, P., Rzepa, H.S., Zhang, Y., Org. Biomol. Chem., 2005, (10), 1832-1834. DOI: 10.1039/b502828k
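An InChI works as a discovery key because it is a plain text string built from slash-separated layers, so it can be indexed and searched like any other text. The sketch below splits an InChI into its version, formula and prefixed layers. The function name `split_inchi` is our own, and the example string is the standard InChI for ethanol, chosen for illustration; it does not come from the slide.

```python
def split_inchi(inchi):
    """Split an InChI string into version, formula and prefixed layers."""
    prefix = "InChI="
    if not inchi.startswith(prefix):
        raise ValueError("expected a string starting with 'InChI='")
    version, *rest = inchi[len(prefix):].split("/")
    formula = None
    # The formula layer has no letter prefix; other layers start with a
    # lowercase letter such as 'c' (connectivity) or 'h' (hydrogens).
    if rest and rest[0] and not rest[0][0].islower():
        formula = rest.pop(0)
    # Keep layers as an ordered list, since a prefix may occur more than once.
    layers = [(layer[0], layer[1:]) for layer in rest if layer]
    return {"version": version, "formula": formula, "layers": layers}


ethanol = split_inchi("InChI=1S/C2H6O/c1-2-3/h3H,2H2,1H3")
print(ethanol)
```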
Data descriptions
• Validation, publication & discovery of data models & schema
• Metadata packaging standards
– METS
– MPEG 21 DIDL
– Complex object model?
• Semantic descriptions
– Formal controlled vocabularies
– High-level and domain ontologies
– Inter-disciplinary discovery
• Informal social network approaches: “folksonomies”
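To make the idea of a metadata packaging standard concrete, here is a minimal sketch of a METS wrapper that ties one descriptive metadata section to one data file. It uses the core METS elements (`dmdSec`, `mdWrap`, `fileSec`, `FLocat`, `structMap`, `fptr`); the IDs, title and file URL are placeholders, and the output is not validated against the METS schema.

```python
import xml.etree.ElementTree as ET

METS = "http://www.loc.gov/METS/"
XLINK = "http://www.w3.org/1999/xlink"
DC = "http://purl.org/dc/elements/1.1/"
for prefix, uri in (("mets", METS), ("xlink", XLINK), ("dc", DC)):
    ET.register_namespace(prefix, uri)


def q(namespace, tag):
    """Return a namespace-qualified tag in ElementTree's {uri}tag form."""
    return f"{{{namespace}}}{tag}"


def build_mets(title, file_url):
    """Wrap one Dublin Core title and one file reference in a METS document."""
    mets = ET.Element(q(METS, "mets"))
    # Descriptive metadata, embedded as Dublin Core.
    dmd = ET.SubElement(mets, q(METS, "dmdSec"), ID="dmd1")
    wrap = ET.SubElement(dmd, q(METS, "mdWrap"), MDTYPE="DC")
    xml_data = ET.SubElement(wrap, q(METS, "xmlData"))
    ET.SubElement(xml_data, q(DC, "title")).text = title
    # File inventory, pointing at the content by URL.
    file_sec = ET.SubElement(mets, q(METS, "fileSec"))
    group = ET.SubElement(file_sec, q(METS, "fileGrp"))
    file_el = ET.SubElement(group, q(METS, "file"), ID="file1")
    ET.SubElement(file_el, q(METS, "FLocat"),
                  {"LOCTYPE": "URL", q(XLINK, "href"): file_url})
    # Structural map linking the metadata section to the file.
    struct_map = ET.SubElement(mets, q(METS, "structMap"))
    div = ET.SubElement(struct_map, q(METS, "div"), DMDID="dmd1")
    ET.SubElement(div, q(METS, "fptr"), FILEID="file1")
    return mets


package = build_mets("Example dataset", "http://repository.example.org/data/1.cif")
print(ET.tostring(package, encoding="unicode"))
```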
Adding value: repository services
• Tools: for deposit, normalisation, manipulation, transformation…
• Linking, annotation, visualisation
• Aggregators: generic, (sub-)disciplinary
• Knowledge extraction:
– Mining (data, text, structures); National Centre for Text Mining (NaCTeM)
– Modelling (economic, climate, mathematical, biological…)
– Analysis (statistical, lexical, gene…)
JISC DR update 2
• OpenDOAR Directory of Open Access Repositories: Universities of Nottingham and Lund
• “Interim” Repository
• Access management systems integration: Shibboleth
• New funding 2006: Capital Programme Roadmap, Repositories & Preservation Programme
– £14M over 3 years but current Call:
– Repositories Support Project
– Tools & Innovation Strand
– Discovery to Delivery Strand
• Data Curation and Preservation
Digital repositories, OA & preservation
• Long-term access: trust, responsibility, policy
• Trusted DR Audit Checklist for Certification (draft): Research Libraries Group-NARA Taskforce
• Defined criteria under 4 categories
– Organisation
– Functions, processes & procedures
– Designated community & usability
– Technologies & technical infrastructure
• UK Digital Curation Centre: advice, tools & services
• RepInfo Registry
http://www.dcc.ac.uk/
• CASPAR Preservation Framework
Political, cultural, socio-legal, IPR
• Funding bodies position on OA: Research Councils RCUK
statement, Research Assessment Exercise (RAE), IRRA
• Institutional OA position:
– Business drivers? University of Southampton Self-Archiving Policy and
a mandate (not a recommendation)
– Legal responsibilities as publisher, IPR, TrustDR, licences, automated
Digital Rights Management DRM
• Culture & human factors:
– “Sharing culture?”
– Multidisciplinary teams: computer scientists, domain scientists, digital
library experts, statisticians/modellers e.g. eBank project
– Lessons learnt: e-Science Human Factors Audit Report (to be
published 2006), Roy Kalawsky, Loughborough