The Hindi word for ‘elephant’
ITC
Friday, January 22, 2010
In the Beginning, or maybe it was in 2000
• Interest in making historical documents widely available on the open Internet
• UWDCC start up in 2000
• Growth to about 2M local images today
– 08/09: 7M visitors
– 09/10 (6 months): 7M visitors
University of Wisconsin-Madison
Libraries
Discovery and Access
• Discussions began in late 2005
• Signed agreement in October 2006
• Not less than 500K over 6 years
• About 10K per month
• No $$ transferred to or from Google
• About 11M books in GB now
• About 15% is PD
• Very successful collaboration
2006 Agreement
• Free search in GB
• Find in library/bookstore feature
• Snippet view for in-copyright material
• Full text access to PD material and pre-1923 out-of-copyright material
Google Settlement Agreement
• Authors Guild/Association of American Publishers/DOJ
– Copyright infringement: U.S. and international
– Monopoly: orphan works
– Antitrust/pricing concerns: products
– What we can and can’t do with the udc
• Judge NY Feb, 2010
2009 Amendment
• Free search in GB
• Find in library/bookstore feature
• Snippet view is eliminated, replaced by an expanded 30% preview of in-copyright material
• Ability for us to license access to the entire digitized corpus for the entire UWS
2009 Amendment continued
• Free subscription to the entire corpus for each public library/academic library in the U.S.
• A consumer model that will enable a person to purchase access to the in-copyright, out-of-print material (orphans)
• Full text access to PD and the pre-1923 out-of-copyright material
Remember the elephant?
• Insurance policy
• Michigan/IU lead/host/curate
• CIC/UC/CDL/UVA/CUL ...
• Collaborative: annual fees
• Governance: sustainability
HathiTrust continued
• Long-term preservation and curation
• 5M books, about 15% PD
• These are our library files
• More than just Google content
• UC is adding local content now
• Collaborative collection management, digital and print
Preservation *AND* Scholarly Tools
• Expand accessibility to restricted files to users with print disabilities
• Non-consumptive research
– Text analysis using a very large corpus
– MIT work
• Collaborative collection management