New Life to Old Serials: Digitizing Back Volumes

Download Report

Transcript New Life to Old Serials: Digitizing Back Volumes

New Life to Old Serials:
Digitizing Back Volumes
Wendy C. Robertson
The University of Iowa Libraries
http://ir.uiowa.edu/lib_pubs/78/
Introduction
2
NASIG • St. Louis, MO
6/4/2011
Are your institutions
digitizing text?
3
NASIG • St. Louis, MO
6/4/2011
What do you read on a screen?
4
NASIG • St. Louis, MO
6/4/2011
Why?
5
NASIG • St. Louis, MO
6/4/2011
As the traditional collectors and
preservers of content, libraries
should ensure their content
remains accessible to a wide
audience.
6
NASIG • St. Louis, MO
6/4/2011
Selection
7
NASIG • St. Louis, MO
6/4/2011
Do you have the rights
to digitize the item?
8
NASIG • St. Louis, MO
6/4/2011
http://copyright.cornell.edu/resources/publicdomain.cfm
9
NASIG • St. Louis, MO
6/4/2011
http://www.copyright.gov/records/
10
NASIG • St. Louis, MO
6/4/2011
http://collections.stanford.edu/copyrightrenewals/
11
NASIG • St. Louis, MO
6/4/2011
http://www.hathitrust.org/bib_rights_determination
12
NASIG • St. Louis, MO
6/4/2011
http://www.hathitrust.org/rights_database
13
NASIG • St. Louis, MO
6/4/2011
A serial may be partially in
the public domain.
14
NASIG • St. Louis, MO
6/4/2011
http://onlinebooks.library.upenn.edu/cce/firstperiod.html
http://onlinebooks.library.upenn.edu/cce/
15
NASIG • St. Louis, MO
6/4/2011
Has the title already been digitized?
16
NASIG • St. Louis, MO
6/4/2011
http://www.oclc.org/digitalregistry/
17
NASIG • St. Louis, MO
6/4/2011
Does a digitized title have gaps?
18
NASIG • St. Louis, MO
6/4/2011
Assess your priorities to focus
digitization time appropriately.
19
NASIG • St. Louis, MO
6/4/2011
Seek partners so that an entire run
can be digitized.
20
NASIG • St. Louis, MO
6/4/2011
Scanning prioritized by book
condition can yield a motley
assortment of volumes.
21
NASIG • St. Louis, MO
6/4/2011
Scanning
22
NASIG • St. Louis, MO
6/4/2011
Digitization might be done for access
or preservation.
23
NASIG • St. Louis, MO
6/4/2011
Standards and best practices are
widely available.
24
NASIG • St. Louis, MO
6/4/2011
U Michigan naming
U Iowa naming
25
NASIG • St. Louis, MO
6/4/2011
Presentation
26
NASIG • St. Louis, MO
6/4/2011
Think about how content
will be used.
27
NASIG • St. Louis, MO
6/4/2011
Content is often presented as a
bound object, not as a logical unit
related to other materials.
28
NASIG • St. Louis, MO
6/4/2011
Creating PDFs
29
NASIG • St. Louis, MO
6/4/2011
30
NASIG • St. Louis, MO
6/4/2011
Do the best OCR (Optical Character
Recognition) that you can afford.
31
NASIG • St. Louis, MO
6/4/2011
Consider accessibility and mobile
access when create PDFs.
32
NASIG • St. Louis, MO
6/4/2011
Not clearscan – text less crisp
Letters stretch when reflow chosen
Default OCR option
33
NASIG • St. Louis, MO
6/4/2011
Clearscan
34
NASIG • St. Louis, MO
6/4/2011
several "thousand dollars. [When]. Mother announced to her . her intentions of marrying father
after she came of age .'. . the stepmother skipped out with all the funds, simply vanished, and
mother was left penniless.,, 1 This itiformation was most welcome, for I had been able to say
very little, in my edition of Mattie's letters some years earlier, about her life before she married
into the Whitman family, and could only speculate on what was here confirmed: that she had
been an orphan whose connections with kin had been largely if not entirely severed. 2 The
interview in which this small but helpful revelation comes.is the most interesting part of the
little-noticed Fansler' Collection of Whitman materials at Northwestern University.3 The fortyeight page ha􀂍dwritten 'transcript, supplemented by a number of Miss Jessie's letters to the
Adding tags & soft hyphens
35
NASIG • St. Louis, MO
6/4/2011
http://trove.nla.gov.au/
36
NASIG • St. Louis, MO
6/4/2011
PDF reflow & soft-hyphens with Goodreader
37
NASIG • St. Louis, MO
6/4/2011
Examples
38
NASIG • St. Louis, MO
6/4/2011
Random selection of
volumes shown as
related (v.3, 17 & 38)
Google books
39
NASIG • St. Louis, MO
6/4/2011
Random items
not necessarily
in same series
40
NASIG • St. Louis, MO
Contents may list articles
but has OCR problems
6/4/2011
Search may display more information
than book page
41
NASIG • St. Louis, MO
6/4/2011
42
NASIG • St. Louis, MO
6/4/2011
43
NASIG • St. Louis, MO
6/4/2011
HathiTrust - http://catalog.hathitrust.org/Record/008162447
44
NASIG • St. Louis, MO
6/4/2011
http://catalog.hathitrust.org/Record/000055609
45
NASIG • St. Louis, MO
6/4/2011
Internet Archive
46
NASIG • St. Louis, MO
6/4/2011
47
NASIG • St. Louis, MO
6/4/2011
48
NASIG • St. Louis, MO
6/4/2011
http://illinoisharvest.grainger.uiuc.edu/collections.asp?ctype=digibk
49
NASIG • St. Louis, MO
6/4/2011
http://welshjournals.llgc.org.uk/browse/
50
NASIG • St. Louis, MO
6/4/2011
CONTENTdm
51
NASIG • St. Louis, MO
6/4/2011
52
NASIG • St. Louis, MO
6/4/2011
http://content.lib.utah.edu/u?/dialogue,45
53
NASIG • St. Louis, MO
6/4/2011
54
NASIG • St. Louis, MO
6/4/2011
Digital Commons
55
NASIG • St. Louis, MO
6/4/2011
56
NASIG • St. Louis, MO
6/4/2011
Open Journal Systems
57
NASIG • St. Louis, MO
6/4/2011
D-Space
58
NASIG • St. Louis, MO
6/4/2011
59
NASIG • St. Louis, MO
6/4/2011
Unstructured web page
60
NASIG • St. Louis, MO
6/4/2011
A Few More Things…
61
NASIG • St. Louis, MO
6/4/2011
Split content into articles if feasible,
especially if an article is
the reading unit.
62
NASIG • St. Louis, MO
6/4/2011
Ensure the PDF can be cited in
isolation.
63
NASIG • St. Louis, MO
6/4/2011
Request an ISSN.
64
NASIG • St. Louis, MO
6/4/2011
Pay attention to title changes.
65
NASIG • St. Louis, MO
6/4/2011
Conclusion
66
NASIG • St. Louis, MO
6/4/2011
Be involved with your local
digitization to bring a serials
perspective.
67
NASIG • St. Louis, MO
6/4/2011
Resources
68
NASIG • St. Louis, MO
6/4/2011
• Copyright Term and the Public Domain in the
United States http://copyright.cornell.edu/resources/publicd
omain.cfm
• U.S. Copyright Office http://www.copyright.gov/records/
• Stanford's Copyright Renewal Database http://collections.stanford.edu/copyrightrene
wals/bin/page?forward=home
• Automated Bibliographic Rights
Determination http://www.hathitrust.org/bib_rights_determi
nation
69
NASIG • St. Louis, MO
6/4/2011
• HathiTrust Rights Database http://www.hathitrust.org/rights_database
• Smith, Kevin. Copyright Risk and Reward in
Mass Digitization. Presented at ARL annual
meeting, May 2011.
http://www.arl.org/bm~doc/mm11sp-smith.pdf
• Ockerbloom, John Mark. The Next Mother
Lode for Large-scale Digitization? Historic
Serials, Copyrights, and Shared Knowledge.
Presneted at Digital Library Federation Spring
Forum. Apr. 2006.
http://works.bepress.com/john_mark_ockerbl
oom/5/
70
NASIG • St. Louis, MO
6/4/2011
• First copyright renewals for periodicals
http://onlinebooks.library.upenn.edu/cce/first
period.html
• Information about the Catalog of Copyright
Entries
http://onlinebooks.library.upenn.edu/cce/
• DLF/OCLC Registry of Digital Masters http://www.oclc.org/digitalregistry/
• Northeast Document Conservation Center.
Reformatting. Preservation and Selection for
Digitization http://www.nedcc.org/resources/leaflets/6Ref
ormatting/06PreservationAndSelection.php
71
NASIG • St. Louis, MO
6/4/2011
• PREMIS (Preservation Metadata Maintenance
Activity)
http://www.loc.gov/standards/premis/
• NARA’s Technical Guidelines for Digitizing
Archival Materials for Electronic Access http://www.archives.gov/preservation/technic
al/guidelines.pdf
• DLF’s Benchmark for Faithful Digital
Reproductions of Monographs and Serials http://old.diglib.org/standards/bmarkfin.htm
• University of Michigan DLPS Digitization
specifications http://www.hathitrust.org/documents/UMDigi
tizationSpecs20100827.pdf
72
NASIG • St. Louis, MO
6/4/2011
• ABBYY FineReader
http://finereader.abbyy.com/
• Omnipage http://www.nuance.com/forbusiness/byproduct/omnipage/index.htm
• OCRopus
http://code.google.com/p/ocropus/
73
NASIG • St. Louis, MO
6/4/2011
Thank you!
Questions?
[email protected]
http://ir.uiowa.edu/lib_pubs/78/
74
NASIG • St. Louis, MO
6/4/2011