Digital Libraries and e-Research:
new horizons, new challenges?
Dr Liz Lyon, Director
UKOLN, University of Bath, UK
8th International Bielefeld Conference
February 2006.
This work is licensed under a Creative Commons Licence
Attribution-ShareAlike 2.0
UKOLN, a centre of expertise in digital information management: www.ukoln.ac.uk
University of Bath: www.bath.ac.uk
Overview
1. Data-intensive science - contextual drivers
• Scientific: e-Research process
• Socio-political: open access to data-sets
• Technical: data curation and repository infrastructure
2. An update and exemplars from the UK
3. Some issues for libraries
• Engagement and advocacy
• Skills and expertise
• Strategic position and profile
(Very simple) e-Research Cycle and Data Curation
[Diagram: a cycle of stages linked by "Data processing" steps; centre labels: e-Infrastructure, Open access, Collaboration]
• Formulate hypothesis / ideas, test, experiment, observe: data creation, collection & capture
• Data management, storage & validation: description, deposit, self-archiving, preservation, certification
• Scholarly communications: data disclosure, publication, citation, discovery, re-use
• Adding value: data linking, annotation, visualisation, simulation
• (New) knowledge extraction: data mining, modelling, analysis, synthesis
Engineering Product Information
EPSRC Grand Challenge Project, Prof Chris McMahon, University of Bath
– Access Grid
– Collaborative telematic art
– Modify spaces for performers
– Interplay: Hallucinations
Library issues 1: Data capture & integration into research workflows
• R4L Repository for the Laboratory Project (JISC-funded): automated data capture from instrumentation, deposit of results (chemistry)
• SMART TEA electronic laboratory notebook + annotations
• How is primary research data captured in faculty and academic departments?
• Where and how is primary research data stored in your institution?
Digital repositories: a UK view in 2006
• Institutional repository trends, D-Lib Magazine, Sept 2005
– Statistics: UK 31 (Germany 103, Sweden 25)
– Policy: UK RCUK draft (Germany YES)
– National programmes: UK YES (Germany, Sweden, Netherlands)
• Pioneering work: eprints.org, ePrints UK, eBank UK…
• University of Southampton has a Self-Archiving Policy: a mandate rather than a recommendation
• OpenDOAR Directory of Open Access Repositories: Univ Nottingham and Lund
• JISC £4M Digital Repository Programme + support: use cases, reference models, standards, deposit APIs, DigiRep wiki
Federated repository architectures
• Global
• Data, eprints, images…
• Inter-disciplinary
• e-Framework: JISC & DEST
• Cross-sectoral
• Defining common services + domain-specific services + repository services
• Multiple format types
From Andy Powell: http://www.ukoln.ac.uk/distributed-systems/jisc-ie/arch/presentations/jiie-jcs-2005/
[Diagram: repositories (heterogeneous metadata formats, content formats, identifiers, packaging standards) feed a fusion layer, the ‘repository federator’, which presents homogeneous metadata formats, content formats, identifiers and packaging standards to portals]
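The fusion-layer idea can be illustrated with a small sketch. It assumes two hypothetical repositories ("eprints_repo" and "data_repo") that name their metadata fields differently, and a hypothetical common record shape for portals; none of these names or fields come from the slides or from any real repository API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CommonRecord:
    """Homogeneous record shape handed to portals (hypothetical schema)."""
    identifier: str
    title: str
    content_type: str
    source: str


# Hypothetical field mappings: each repository uses its own field names.
FIELD_MAPS = {
    "eprints_repo": {"identifier": "eprint_id", "title": "title", "content_type": "type"},
    "data_repo": {"identifier": "dataset_doi", "title": "name", "content_type": "format"},
}


def federate(source: str, raw: dict) -> CommonRecord:
    """Map one heterogeneous repository record onto the common shape."""
    mapping = FIELD_MAPS[source]
    return CommonRecord(
        identifier=str(raw[mapping["identifier"]]),
        title=raw[mapping["title"]].strip(),
        content_type=raw[mapping["content_type"]].lower(),
        source=source,
    )


def federate_all(batches: dict) -> list:
    """Merge records from every repository into one homogeneous list."""
    return [
        federate(source, raw)
        for source, records in batches.items()
        for raw in records
    ]
```

A portal would then only need to understand CommonRecord, whatever the native format of each repository. A real federator would also have to reconcile identifiers and packaging standards, which this sketch leaves out.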
Trusted digital repositories
• Audit Checklist for Certification, Draft August 2005
• Research Libraries Group RLG-NARA Taskforce
• Defined criteria under 4 categories
– Organisation
– Functions, processes & procedures
– Designated community & usability
– Technologies & technical infrastructure
• UK Digital Curation Centre
– Providing advice, tools and support services
– 2nd DCC International Conference, Glasgow, November 21-22
http://www.dcc.ac.uk/
Open access driver?
The scholarly knowledge cycle. Liz Lyon, Ariadne, July 2003.
© Liz Lyon (UKOLN, University of Bath), 2005
[Diagram labels]
• Research & e-Science workflows
• Learning & Teaching workflows
• Data creation / capture / gathering: laboratory experiments, Grids, fieldwork, surveys, media
• Data analysis, transformation, mining, modelling
• Learning object creation, re-use
• Repositories: institutional, e-prints, subject, data, learning objects
• Aggregator services: national, commercial
• Presentation services: subject, media-specific, data, commercial portals
• Institutional presentation services: portals, Learning Management Systems, u/g, p/g courses, modules
• Peer-reviewed publications: journals, conference proceedings
• Quality assurance bodies
• Flows between components: deposit / self-archiving; harvesting metadata; searching, harvesting, embedding; resource discovery, linking, embedding; validation; publication
eBank UK Project
http://www.ukoln.ac.uk/projects/ebank-uk/
• Two key themes:
– Open access to datasets
– Linking research data to publications and to learning
• UKOLN (lead), University of Southampton, University of Manchester
• Hybrid team: scientists, computer scientists and digital library specialists
• e-Science application ‘Combechem’: Grid-enabled combinatorial chemistry + National Crystallography Service
A data repository entry
ecrystals.chem.soton.ac.uk
Access to the underlying data:
complex objects
Library issues 2: data descriptions
• Validation, publication & discovery of data models & schema
• Complex objects metadata packaging standards
– METS
– MPEG 21 DIDL
• Semantic descriptions
– Formal controlled vocabularies
– High-level and domain ontologies
– Inter-disciplinary discovery
• Informal / social approaches: Web 2.0 “folksonomies”
• eBank Application Profile publication
• What data models and metadata schema are in place?
• Have librarians been involved in their development?
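As an illustration of complex-object packaging, the sketch below builds a minimal METS document with Python's standard library: one descriptive metadata section holding a Dublin Core title, a file section, and a structural map pointing at the files. It is not the eBank Application Profile. The function name build_mets, the IDs and the example URLs in the usage note are assumptions made for this sketch, and a production package would carry far more metadata.

```python
import xml.etree.ElementTree as ET

METS_NS = "http://www.loc.gov/METS/"
XLINK_NS = "http://www.w3.org/1999/xlink"
DC_NS = "http://purl.org/dc/elements/1.1/"

# Register prefixes so serialised output uses mets:, xlink: and dc:.
ET.register_namespace("mets", METS_NS)
ET.register_namespace("xlink", XLINK_NS)
ET.register_namespace("dc", DC_NS)


def build_mets(title: str, file_urls: list) -> ET.Element:
    """Build a minimal METS package (hypothetical helper for illustration)."""
    mets = ET.Element(f"{{{METS_NS}}}mets")

    # Descriptive metadata: a Dublin Core title wrapped in mdWrap/xmlData.
    dmd = ET.SubElement(mets, f"{{{METS_NS}}}dmdSec", ID="dmd1")
    wrap = ET.SubElement(dmd, f"{{{METS_NS}}}mdWrap", MDTYPE="DC")
    xml_data = ET.SubElement(wrap, f"{{{METS_NS}}}xmlData")
    ET.SubElement(xml_data, f"{{{DC_NS}}}title").text = title

    # File section: one file element per component of the complex object.
    file_sec = ET.SubElement(mets, f"{{{METS_NS}}}fileSec")
    group = ET.SubElement(file_sec, f"{{{METS_NS}}}fileGrp")

    # Structural map: a single div that points at every file.
    struct = ET.SubElement(mets, f"{{{METS_NS}}}structMap")
    div = ET.SubElement(struct, f"{{{METS_NS}}}div", DMDID="dmd1")

    for n, url in enumerate(file_urls, start=1):
        file_id = f"file{n}"
        file_el = ET.SubElement(group, f"{{{METS_NS}}}file", ID=file_id)
        ET.SubElement(
            file_el,
            f"{{{METS_NS}}}FLocat",
            {"LOCTYPE": "URL", f"{{{XLINK_NS}}}href": url},
        )
        ET.SubElement(div, f"{{{METS_NS}}}fptr", FILEID=file_id)

    return mets
```

Calling ET.tostring(build_mets("Example dataset", ["http://example.org/a.cif"]), encoding="unicode") returns the package as an XML string; the title and URL here are placeholders.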
Discovering data:
• Domain identifier: International Chemical Identifier (INChI) code
• Google molecule using INChI
Slide from Simon Coles
Coles, S.J., Day, N.E., Murray-Rust, P., Rzepa, H.S., Zhang, Y., Org. Biomol. Chem., 2005, (10), 1832-1834. DOI: 10.1039/b502828k
Library issues 3: Persistent identifiers for data citation
• How will they be used? We need use cases: depositor, author, service provider, reader, publisher?
• Schemes: DOI, Handle, ARK, PURL
• Publication & citation of scientific primary data project: National Library for Science & Technology (TIB), University of Hanover, Germany. STD-DOI Project http://www.std-doi.de
– DOI registry for datasets
• eBank is working with TIB to assign DOIs to crystal structure data
• What persistent identifiers have been assigned to your data?
• Was the Library involved in the process?
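To make the identifier schemes concrete, here is a minimal sketch of turning an identifier into an actionable link. It assumes the public resolvers doi.org, hdl.handle.net and n2t.net; these are today's common resolver hosts, not something stated in the slides. The function name resolver_url is hypothetical.

```python
# Assumed public resolver prefixes for each scheme (not from the slides).
RESOLVERS = {
    "doi": "https://doi.org/",
    "handle": "https://hdl.handle.net/",
    "ark": "https://n2t.net/",
}


def resolver_url(scheme: str, identifier: str) -> str:
    """Return a resolvable URL for a persistent identifier."""
    scheme = scheme.lower()
    if scheme == "purl":
        # A PURL is already an HTTP URL, so it needs no resolver prefix.
        return identifier.strip()
    if scheme not in RESOLVERS:
        raise ValueError(f"unknown identifier scheme: {scheme}")
    return RESOLVERS[scheme] + identifier.strip()
```

For example, resolver_url("doi", "10.1039/b502828k") builds a link for the article DOI cited on the "Discovering data" slide. The same pattern would apply to a dataset DOI once one has been assigned.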
Adding value: eBank linking data to publications
Linking research to learning - embedding eBank aggregator service in a science portal for student learners
Integration into the curriculum and e-Learning workflows
• MChem course
• Assess role in Undergraduate Chemical Informatics courses
• Pedagogic evaluation
• February – May 2006
• Report & workshop to follow.
Library issues 4: Adding value and repository services
• Adding value
- Linking, annotation, visualisation
• Repository services for knowledge extraction
- Mining (data, text, structures)
- Modelling (economic, climate, mathematical, biological)
- Analysis (statistical, lexical, pattern matching, gene)
• How is your data being used and re-used?
Library issues 5: workforce development and capacity building
• NSF Draft Report 2005: Long-lived digital data collections
• “Data scientist” - hybrid skills
• Facilitate collaboration: researchers, data centres, digital libraries & archives communities
• How does your Library shape up?
• SWOT analysis
STRENGTHS
• Scholarly communications role
• Links with academic community
• Content / collection management / stewardship practice
• Cataloguing, classification & metadata expertise
• (e)-Service delivery function
WEAKNESSES
• Historic “document tradition”
• Synergies between physical & digital worlds are still evolving
• Shortage of technical skills
• Cautious approach to innovation
• Vision? (“it's not our problem…”)
OPPORTUNITIES
• Build on ePrints work & eLearning experience
• Exploit links with researchers: they need your skills
• Seek funding to engage in innovative projects & services
• Develop local, regional, national, global partnerships
THREATS
• Paradigm shift in research will outpace change in libraries
• Researchers will (only?) use on-demand e-Services
• Libraries may lose their role in scholarly communications and eResearch workflows
Libraries: Facing the future?
• Develop leadership & vision for eResearch engagement
• Review organisational structures
– Extend & re-profile the Faculty/Subject/Reference Librarian role?
– Closer collaboration with Computing Services?
• Provide eServices for data
– We “do” eLearning so why not eResearch?
– Include in institutional digital asset management
• Promote professional development of staff
– Awareness-raising activities, new skills
– Greater engagement, hybrid roles and hybrid teams
• Build new partnerships, new business models
• Facilitate Transformational Change in Libraries
Thank you.
Questions?…..
More information: UKOLN http://www.ukoln.ac.uk/
UKOLN receives core funding from the Joint Information Systems
Committee (JISC) and the Museums, Libraries & Archives Council
(MLA) and is based at the University of Bath, UK.