The VCoE: what it offers

Download Report

Transcript The VCoE: what it offers

Co-funded by the European Union under FP7-ICT-2009-6

Alliance Permanent Access to the Records of Science in Europe Network

The VCoE: what it offers

David Giaretta, [email protected]

APA APARSEN webinar, November 2014 Co-ordinated by aparsen.eu

#APARSEN

Co-funded by the European Union under FP7-ICT-2009-6

Solutions to problems related to digital preservation

• • • Based on experience and expertise of the “pioneers” of digital preservation All kinds of digital objects – But   Audio visual – perhaps better done by PrestoCentre Documents and images – perhaps better done by OPF Need to: – – – Define the solution : Using : Your people then need : CONSULTANCY TOOLS & SERVICES TRAINING The VCoE: what it offers David Giaretta, APA Webinar, November 2014 aparsen.eu

#APARSEN

Co-funded by the European Union under FP7-ICT-2009-6

CHALLENGES

• • • • • Different types of digital objects e.g.

– – – – Rendered – simple images, sound, video, documents Data – needs meaning of numbers etc Software Time dependent Many different tools and services – many lists Many sources – APARSEN members and others Many terminologies, many glossaries NEED AN INTEGRATED VIEW The VCoE: what it offers David Giaretta, APA Webinar, November 2014 aparsen.eu

#APARSEN

Co-funded by the European Union under FP7-ICT-2009-6

Many models – why another?

See http://blogs.loc.gov/digitalpreservation/2012/02/life -cycle-models-for-digital-stewardship / The VCoE: what it offers David Giaretta, APA Webinar, November 2014 aparsen.eu

#APARSEN

…and more

Co-funded by the European Union under FP7-ICT-2009-6 Data Lifecycle Models and Concepts by CEOS, 2012, see http://www.ceos.org/images/DSIG/Data% 20Lifecycle%20Models%20and%20Conc epts%20v13.docx The VCoE: what it offers David Giaretta, APA Webinar, November 2014 aparsen.eu

#APARSEN

Co-funded by the European Union under FP7-ICT-2009-6

Integrated vision

• • • • • • If the payback is not immediate then the resources need to be justified Since the resources have to be found somehow, the question “who pays and why?” if often heard To justify the resources needed for preservation one needs to identify the potential value.

To maintain, for even increase, the likely value, the techniques chosen for preservation plays a key role.

http://www.alliancepermanentaccess.org/index.php/commu nity/common-vision/ Clickable image to showing related research The VCoE: what it offers David Giaretta, APA Webinar, November 2014 aparsen.eu

#APARSEN

Co-funded by the European Union under FP7-ICT-2009-6 The VCoE: what it offers David Giaretta, APA Webinar, November 2014 aparsen.eu

#APARSEN

Training

• Consistency through the “Integrated View” Co-funded by the European Union under FP7-ICT-2009-6 The VCoE: what it offers David Giaretta, APA Webinar, November 2014 aparsen.eu

#APARSEN

Click on PRESERVATION:

Co-funded by the European Union under FP7-ICT-2009-6 The VCoE: what it offers David Giaretta, APA Webinar, November 2014 aparsen.eu

#APARSEN

Co-funded by the European Union under FP7-ICT-2009-6

Basic preservation activities

Libraries say: • Can repeat what has been done before BUT • Cannot use new applications • “Emulate or migrate” • Convert to format which new software can use BUT • What if there are many software systems?

– Works well with data only in special cases  Can repeat what was done before instead of new things – Does not help with building cross-disciplinary communities The VCoE: what it offers David Giaretta, APA Webinar, November 2014 aparsen.eu

#APARSEN

Preservation techniques

For each technique look for evidence

– what evidence?

must at least make sure we consider different types of data

– rendered vs non-rendered – composite vs simple – dynamic vs static – active vs passive •

must look at all types of threats

The VCoE: what it offers David Giaretta, APA Webinar, November 2014 Co-funded by the European Union under FP7-ICT-2009-6 aparsen.eu

#APARSEN

Co-funded by the European Union under FP7-ICT-2009-6

Evidence

• APA/APARSEN list of tools: http://www.alliancepermanentaccess.org/index.php/tools/tool s-for-preservation/ –

details of preservation related software

,

examples of data

and the

evidence of preservation

linking software to types of data . Some of this evidence comes from specific

testbeds

much comes from

user scenarios

.

but The VCoE: what it offers David Giaretta, APA Webinar, November 2014 aparsen.eu

#APARSEN

Co-funded by the European Union under FP7-ICT-2009-6

Tools • evidence based selection • linked to SCIDIP-ES toolkits

The VCoE: what it offers David Giaretta, APA Webinar, November 2014 aparsen.eu

#APARSEN

Co-funded by the European Union under FP7-ICT-2009-6 The VCoE: what it offers David Giaretta, APA Webinar, November 2014 aparsen.eu

#APARSEN

Co-funded by the European Union under FP7-ICT-2009-6

Other evidence and tools

• • • • CASPAR project large amount collected evidence about the effectiveness of tools and services for many different types of data: – – – Scientific Cultural heritage Contemporary performing arts Prototyped the tools and services These have been developed in SCIDIP-ES http://www.scidip-es.eu

and http://int platform.digitalpreserve.info

Also massive collection of information about views of thousands of researchers, data managers and publishers across disciplines and around the world – PARSE.Insight

The VCoE: what it offers David Giaretta, APA Webinar, November 2014 aparsen.eu

#APARSEN

Co-funded by the European Union under FP7-ICT-2009-6

When things change

• We need to: – Know something has changed – Identify the implications of that change – Decide on the best course of action for preservation – What RepInfo we need to fill the gaps  Created by someone else or creating a new one – If transformed: how to maintain data authenticity – Alternatively: hand it over to another repository – Make sure data continues to be usable

Orchestration Service Gap Identification Service Preservation Strategy Tk RepInfo Registry Service Authenticity Toolkit Storage Service RepInf o Toolkit Data Virtualisa tion Toolkit Process Virtualisa tion Toolkit

The VCoE: what it offers David Giaretta, APA Webinar, November 2014 aparsen.eu

#APARSEN

Threat Requirement for solution

Co-funded by the European Union under FP7-ICT-2009-6 Users may be unable to understand or use the data e.g. the semantics, format, processes or algorithms involved In addition the

Orchestration Manager

and

Knowledge Gap Manager

help to ensure that the RepInfo is adequate .

Non-maintainability of essential hardware, software or support environment may make the information inaccessible The chain of evidence may be lost and there may be lack of certainty of provenance or authenticity Access and use restrictions may make it difficult to reuse data, or alternatively may not be respected in future Loss of ability to identify the location of data Ability to share information about the availability of hardware and software and their replacements/substitutes changes.

The Representation Information will include such things as software source code and emulators.

Ability to bring together evidence from diverse sources about

Authenticity toolkit

will allow one to capture evidence from many the Authenticity of a digital object sources which may be used to judge Authenticity.

Ability to deal with Digital Rights correctly in a changing and located over time.

access rights policy into AIP The current custodian of the data, whether an organisation or project, may cease to exist at some point in Brokering of organisations to hold data and the ability to

Orchestration Manager

will, amongst other things, allow the package together the information needed to transfer information between organisations ready for long term the future David Giaretta, APA The ones we trust to look after the digital holdings may let us down Certification process so that one can have confidence about for ISO 16363 Audit and Certification #APARSEN

Co-funded by the European Union under FP7-ICT-2009-6

APARSEN test audit findings

• • • • Lack of definition of Designated Community – SCIDIP-ES Gap Identification Services helps Lack of adequate Representation Information – SCIDIP-ES Registry or RepInfo, Preservation Strategy and RepInfo Toolkit help to create/share RepInfo – Orchestration service and Gap Identification Services help repository manager Inadequate Archival Information Packages – SCIDIP-ES Packaging tools help create AIPs – several flavours Lack of hand-over plans – Orchestration Services helps find partners The VCoE: what it offers David Giaretta, APA Webinar, November 2014 aparsen.eu

#APARSEN

Co-funded by the European Union under FP7-ICT-2009-6

Terminologies – APARSEN DP Glossary

• Why another?

Provides relationships between different terms from the various glossaries OAIS, APARSEN, DPC, ANZ, SNIA, INTERPARES, TDR (ISO 16363) – Uses SKOS to organise the terms – tells us whether a term is   Broader / narrower / related to another term See http://www.alliancepermanentaccess.org/index.php/consultancy/d pglossary/  Each term has a URI e.g. Representation Information: http://www.alliancepermanentaccess.org/index.php/consultancy/d pglossary/#Representation_Information The VCoE: what it offers David Giaretta, APA Webinar, November 2014 aparsen.eu

#APARSEN

Definition:

Co-funded by the European Union under FP7-ICT-2009-6

Representation_Information

In scheme:

OAIS The information that maps a Data Object into more meaningful concepts. An example of Representation Information for a bit sequence which is a FITS file might consist of the FITS standard which defines the format plus a dictionary which defines the meaning in the file of keywords which are not part of the standard. Another example is JPEG software which is used to render a JPEG file; rendering the JPEG file as bits is not very meaningful to humans but the software, which embodies an understanding of the JPEG standard, maps the bits into pixels which can then be rendered as an image for human viewing.

Pref Label: Narrower term: Narrower term: Narrower term: Narrower term: Narrower term: Narrower term: Narrower term: Narrower term: Narrower term: Broader term: Related term: Related term: Related term:

0

Format Representation_Network Semantic_Information Structure_Information Data_Type Documentation Format_ANZ Logical_format Packaging_Information Metadata Data_format

The VCoE: what it offers

Form

David Giaretta, APA

Registry_Repository-of-Representation_Information

aparsen.eu

#APARSEN

Co-funded by the European Union under FP7-ICT-2009-6 The VCoE: what it offers David Giaretta, APA Webinar, November 2014 aparsen.eu

#APARSEN

Standards support

• • OAIS ISO 14721:2012 Audit and Certification ISO 16363:2014 Co-funded by the European Union under FP7-ICT-2009-6 The VCoE: what it offers David Giaretta, APA Webinar, November 2014 aparsen.eu

#APARSEN

Co-funded by the European Union under FP7-ICT-2009-6

Solution providers

• • • • Many possible solution providers APA members and others e.g.

– – http://www.giaretta.org

http://www.iso16363.org

Links to – – Audit and Certification Developing new standards following on from OAIS on the whole data lifecycle Links to RDA e.g. – Active Data Management Plans: https://rd alliance.org/groups/active-data-management-plans.html

– Preservation infrastructures: https://rd alliance.org/groups/preservation-e-infrastructure-ig.html

The VCoE: what it offers David Giaretta, APA Webinar, November 2014 aparsen.eu

#APARSEN

• • • • •

Summary: Solutions – consultancy, tools & services, training - based on

Experience and expertise of digital pioneers Evidence based tools and services – All types of digital objects Training materials – on-line and face-to-face Consistency provided by – – – Integrated View Terminology brought together by SKOS Glossary Standards database Consistent with (and contributed to) the digital preservation fundamental standards ISO 14721 (OAIS) and ISO 16363 (Audit and Certification) The VCoE: what it offers David Giaretta, APA Webinar, November 2014 aparsen.eu

#APARSEN

Co-funded by the European Union under FP7-ICT-2009-6

Resources All resources are available on the website: http://www.alliancepermanentaccess.org

Contact: [email protected]

Or [email protected]

The VCoE: what it offers David Giaretta, APA Webinar, November 2014 aparsen.eu

#APARSEN

Network of Excellence

aparsen.eu

#APARSEN