Digital Libraries and e-Research: new horizons, new challenges?
Dr Liz Lyon, Director UKOLN, University of Bath, UK
8th International Bielefeld Conference, February 2006
This work is licensed under a Creative Commons Licence Attribution-ShareAlike 2.0.
UKOLN: a centre of expertise in digital information management (www.ukoln.ac.uk), University of Bath (www.bath.ac.uk)

Overview
1. Data-intensive science - contextual drivers
• Scientific: e-Research process
• Socio-political: open access to data-sets
• Technical: data curation and repository infrastructure
2. An update and exemplars from the UK
3. Some issues for libraries
• Engagement and advocacy
• Skills and expertise
• Strategic position and profile

(Very simple) e-Research Cycle and Data Curation
• Formulate hypothesis / ideas, test, experiment, observe: data creation, collection & capture
• Data management, storage & validation: description, deposit, self-archiving, preservation, certification
• Scholarly communications: data disclosure, publication, citation, discovery, re-use
• Adding value: data linking, annotation, visualisation, simulation
• (New) knowledge extraction: data mining, modelling, analysis, synthesis
Linking the stages: data processing
Cross-cutting labels: e-Infrastructure, open access, collaboration

Engineering Product Information
EPSRC Grand Challenge Project, Prof Chris McMahon, University of Bath

– Access Grid
– Collaborative telematic art
– Modify spaces for performers
– Interplay: Hallucinations

Library issues 1: Data capture & integration into research workflows
• R4L Repository for the Laboratory Project (JISC-funded): automated data capture from instrumentation, deposit of results (chemistry)
• SMART TEA: electronic laboratory notebook + annotations
• How is primary research data captured in faculty and academic departments?
• Where and how is primary research data stored in your institution?

Digital repositories: a UK view in 2006
• Institutional repository trends, D-Lib Magazine, Sept 2005
– Statistics: UK 31 (Germany 103, Sweden 25)
– Policy: UK RCUK draft (Germany YES)
– National programmes: UK YES (Germany, Sweden, Netherlands)
• Pioneering work: eprints.org, ePrints UK, eBank UK……
• University of Southampton has a Self-Archiving Policy and a mandate rather than a recommendation
• OpenDOAR Directory of Open Access Repositories: Univ Nottingham and Lund
• JISC £4M Digital Repository Programme + support: use cases, reference models, standards, deposit APIs, DigiRep wiki

Federated repository architectures
• Global
• Data, eprints, images…….
• Inter-disciplinary
• e-Framework: JISC & DEST
• Cross-sectoral
• Defining common services + domain-specific services + repository services
• Multiple format types
Diagram (from Andy Powell: http://www.ukoln.ac.uk/distributed-systems/jisc-ie/arch/presentations/jiie-jcs-2005/), labels:
– repositories
– heterogeneous: metadata formats, content formats, identifiers, packaging standards
– fusion layer ‘repository federator’
– homogeneous: metadata formats, content formats, identifiers, packaging standards
– portals

Trusted digital repositories
• Audit Checklist for Certification, Draft August 2005
• Research Libraries Group RLG-NARA Taskforce
• Defined criteria under 4 categories
– Organisation
– Functions, processes & procedures
– Designated community & usability
– Technologies & technical infrastructure
• UK Digital Curation Centre
– Providing advice, tools and support services
– 2nd DCC International Conference, Glasgow, November 21-22
http://www.dcc.ac.uk/

Open access driver?
The scholarly knowledge cycle. Liz Lyon, Ariadne, July 2003. © Liz Lyon (UKOLN, University of Bath), 2005
Diagram labels:
• Research & e-Science workflows; Learning & Teaching workflows
• Data creation / capture / gathering: laboratory experiments, Grids, fieldwork, surveys, media
• Data analysis, transformation, mining, modelling
• Learning object creation, re-use
• Repositories: institutional, e-prints, subject, data, learning objects
• Aggregator services: national, commercial
• Presentation services: subject, media-specific, data, commercial portals
• Institutional presentation services: portals, Learning Management Systems, u/g, p/g courses, modules
• Peer-reviewed publications: journals, conference proceedings
• Quality assurance bodies
• Flows: deposit / self-archiving; validation; publication; harvesting metadata; searching, harvesting, embedding; resource discovery, linking, embedding

eBank UK Project
http://www.ukoln.ac.uk/projects/ebank-uk/
• Two key themes:
– Open access to datasets
– Linking research data to publications and to learning
• UKOLN (lead), University of Southampton, University of Manchester
• Hybrid team: scientists, computer scientists and digital library specialists
• e-Science application ‘Combechem’: Grid-enabled combinatorial chemistry + National Crystallography Service

A data repository entry
ecrystals.chem.soton.ac.uk

Access to the underlying data: complex objects

Library issues 2: data descriptions
• Validation, publication & discovery of data models & schema
• Complex objects: metadata packaging standards
– METS
– MPEG 21 DIDL
• Semantic descriptions
– Formal controlled vocabularies
– High-level and domain ontologies
– Inter-disciplinary discovery
• Informal / social approaches: Web 2.0 “folksonomies”
• eBank Application Profile publication
• What data models and metadata schema are in place?
• Have librarians been involved in their development?

Discovering data:
• Domain identifier: International Chemical Identifier (INChI) code
• Google molecule using INChI
Slide from Simon Coles. Coles, S.J., Day, N.E., Murray-Rust, P., Rzepa, H.S., Zhang, Y., Org. Biomol. Chem., 2005, (10), 1832-1834. DOI: 10.1039/b502828k

Library issues 3: Persistent identifiers for data citation
• How will they be used? We need use cases: depositor, author, service provider, reader, publisher?
• Schemes: DOI, Handle, ARK, PURL
• Publication & citation of scientific primary data project: National Library for Science & Technology (TIB), University of Hanover, Germany
– STD-DOI Project http://www.std-doi.de
– DOI registry for datasets
• eBank is working with TIB to assign DOIs to crystal structure data
• What persistent identifiers have been assigned to your data?
• Was the Library involved in the process?
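As a rough illustration of how the identifier schemes listed on this slide can be turned into actionable links for data citation, here is a minimal Python sketch. It assumes the public proxy resolvers doi.org (DOI), hdl.handle.net (Handle) and n2t.net (ARK); the function names, the citation layout and the Handle/ARK/PURL example values are hypothetical and are not taken from the eBank or STD-DOI projects. Only the DOI 10.1039/b502828k comes from the slides.

```python
from urllib.parse import quote

# Public resolver prefixes (assumption: these proxies are used;
# an institution may run or prefer its own resolver).
RESOLVERS = {
    "doi": "https://doi.org/",
    "handle": "https://hdl.handle.net/",
    "ark": "https://n2t.net/",
}


def resolver_url(scheme: str, identifier: str) -> str:
    """Return a clickable URL for a persistent identifier."""
    scheme = scheme.lower()
    identifier = identifier.strip()
    if scheme == "purl":
        # A PURL is already a URL, so it is returned unchanged.
        return identifier
    if scheme not in RESOLVERS:
        raise ValueError(f"unknown identifier scheme: {scheme}")
    # Percent-encode unusual characters but keep '/' and ':' readable.
    return RESOLVERS[scheme] + quote(identifier, safe="/:")


def format_data_citation(creators: str, year: int, title: str,
                         publisher: str, doi: str) -> str:
    """Hypothetical citation layout ending in the resolvable DOI link."""
    return f"{creators} ({year}). {title}. {publisher}. {resolver_url('doi', doi)}"


if __name__ == "__main__":
    # DOI quoted on the "Discovering data" slide.
    print(resolver_url("doi", "10.1039/b502828k"))
    # Placeholder values, for illustration only.
    print(resolver_url("handle", "12345/example"))
    print(resolver_url("ark", "ark:/12345/x54xz321"))
    print(resolver_url("purl", "http://purl.org/example/dataset"))
```

Walking through calls like these for each role (depositor, author, service provider, reader, publisher) is one simple way to start collecting the use cases the slide asks for.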
Adding value: eBank linking data to publications

Linking research to learning - embedding eBank aggregator service in a science portal for student learners

Integration into the curriculum and e-Learning workflows
• MChem course
• Assess role in Undergraduate Chemical Informatics courses
• Pedagogic evaluation
• February – May 2006
• Report & workshop to follow.
Library issues 4: Adding value and repository services
• Adding value
– Linking, annotation, visualisation
• Repository services for knowledge extraction
– Mining (data, text, structures)
– Modelling (economic, climate, mathematical, biological)
– Analysis (statistical, lexical, pattern matching, gene)
• How is your data being used and re-used?

Library issues 5: workforce development and capacity building
• NSF Draft Report 2005: Long-lived digital data collections
• “Data scientist” - hybrid skills
• Facilitate collaboration: researchers, data centres, digital libraries & archives communities
• How does your Library shape up?
• SWOT analysis

STRENGTHS
• Scholarly communications role
• Links with academic community
• Content / collection management / stewardship practice
• Cataloguing, classification & metadata expertise
• (e)-Service delivery function

WEAKNESSES
• Historic “document tradition”
• Synergies between physical & digital worlds are still evolving
• Shortage of technical skills
• Cautious approach to innovation
• Vision? (“it's not our problem….”)

OPPORTUNITIES
• Build on ePrints work & eLearning experience
• Exploit links with researchers: they need your skills
• Seek funding to engage in innovative projects & services
• Develop local, regional, national, global partnerships

THREATS
• Paradigm shift in research will outpace change in libraries
• Researchers will (only?) use on-demand e-Services
• Libraries may lose their role in scholarly communications and eResearch workflows

Libraries: Facing the future?
• Develop leadership & vision for eResearch engagement
• Review organisational structures
– Extend & re-profile the Faculty/Subject/Reference Librarian role?
– Closer collaboration with Computing Services?
• Provide eServices for data
– We “do” eLearning so why not eResearch?
– Include in institutional digital asset management
• Promote professional development of staff
– Awareness-raising activities, new skills
– Greater engagement, hybrid roles and hybrid teams
• Build new partnerships, new business models
• Facilitate Transformational Change in Libraries

Thank you. Questions?…..
More information: UKOLN http://www.ukoln.ac.uk/
UKOLN receives core funding from the Joint Information Systems Committee (JISC) and the Museums, Libraries & Archives Council (MLA) and is based at the University of Bath, UK.