Transcript Slide 1
Seamless Sharing:
NYU, HathiTrust, ReCAP and the
Cloud Library
KAT HAGEDORN
HATHITRUST SPECIAL PROJECTS COORDINATOR
UNIVERSITY OF MICHIGAN LIBRARIES
OCTOBER 9, 2009
With thanks to Constance Malpas at OCLC and John Wilkin at University of Michigan for their considerable contributions
Overview
The cloud library and this pilot project
Brief overview of HathiTrust
Findings
Expectations
Cloud Library, not cloud computing
Similar but vastly different
Necessity/desire to share resources
Multiple digital and print repositories
Repositories can now move into a “cloud” that will
become a shared network resource
What infrastructure needed?
Loans
Borrowing
System
Digitized
Library Collections
Off-Site Collections
Shared
Collections
ReCAP
Transfers
Retrievals
Aggregate holdings and joint
commitments constitute a
Disclose
Local
Collections
Holdings
Withdrawals
Registry
shared asset
enabling collaborative
management strategies
Assets
Infrastructure
Policies
Procedures
Perceived need
Already good support of other “virtual” shared
services, e.g., ILL, doc delivery
What exists in off-site storage and digital
repositories that isn’t currently accessible?
Collection development mechanisms need to
discover accessibility and preservation statuses
How should we build such a service for consumers?
Demand for services
Multiple, sometimes overlapping, reasons
institutions will be interested in being part of a cloud
library
preserving titles that are rare and/or special in some
manner
remove titles that are duplicated across many institutions
added value of shared materials in digital repository
(discovery, search)
contributing to a public good
Partners in pilot
NYU – model customer
Acute space pressures; major library renovation
Limited mandate to build local collection of record
ReCAP – model supplier
Large-scale shared academic storage collection
HathiTrust – model supplier
Large-scale shared digital repository
OCLC Research and CLIR – consultants & convener
A bit about HathiTrust
To contribute to the common good by collecting,
organizing, preserving, communicating, and sharing
the record of human knowledge
materials converted from print
improve access …to meet the needs of the co-owning
institutions
reliable and accessible electronic representations
coordinate shared storage strategies
“public good” …sustaining the historical record
simultaneously …centralized …open
Growth of HathiTrust
Includes ingest of materials not from Google (GBS)
Intersections
Material that NYU
can obtain through
HT dependent on
copyright status –
enhance ‘local’
collection
opportunities for institutional cooperation
shared policy frameworks
joint service agreements
increased operational efficiencies
Material that NYU can
relegate with a high
degree of confidence
HathiTrust
Material that
NYU can
already source
through existing
ILL – enhance
local collection
N=3.8M
N=2.3M
N=7.6M
ReCAP
that NYU
may choose to
relegate based on
copyright/ availability
ReCAP
Material that NYU
may choose to
relegate with
appropriate service
level agreement
The Cloud Library
Increased reliance on a network of collections and
services with a robust underpinning of shared policy
and service infrastructures that are jointly owned by
participating libraries
Naturally, as number of participants grows, value of
partnership increases
Goal of pilot study:
service expectations for both digital and print
repositories
cost/benefit analyses for sharing resources
processes for discovery of shareable titles
Process for discovery of overlap
Ingestion on a monthly basis
Checking of OCLC numbers (without can’t be
processed)– use of xID to derive more
New data structure…
Harvest
Hathi
metadata
Overlap
analysis
report
Process,
index,
analyze
Normalize
rights
values
Monthly data harvest
2 weeks per cycle
to process
Join Hathi
and
WorldCat
data
Rights
anomalies
report
Extract
OCLC
numbers
Derive add’l
OCLC
numbers
via xID
Extract
WorldCat
data
OCLCnum
report
HathiTrust: Looking forward
Ingesting from 4 institutions (UC, Indiana,
Wisconsin, Michigan), more to come
Moving from off-site storage scanning to main
libraries
Result: slight changes in number of PD volumes
Change in membership …broader base of institutions
for cost-sharing
Future contracts will mostly be picklists
Internet Archive ingest starts this winter/late fall
Completion of TRAC certification
Requirements and benefits
Service expectations for both HathiTrust and ReCAP
turnaround time
continuity of operations
access privileges
With HathiTrust, all are par for the course
As partners in the cloud library…
preservation of texts and metadata
longevity and perptuity
trust and reliability
access to titles not held by library (comprehensive)
opportunity for voice in HathiTrust development
Questions?
Constance Malpas (OCLC): [email protected]
John Wilkin (HathiTrust): [email protected]
Kat Hagedorn (HathiTrust): [email protected]
http://hathitrust.org/
[email protected]