The EAD Cookbook and EBind: - University of Illinois at

Download Report

Transcript The EAD Cookbook and EBind: - University of Illinois at

E-Content on a Shoestring:
Using the EAD Cookbook and
Ebind XML in the Archives
CIL Conference: March 16, 2001
Christopher J. Prom, Assistant University Archivist
University of Illinois Archives
http://www.library.uiuc.edu/ahx/ead
XML/EAD Implementation
Issues
 Cost
 Data sharing
 Technical requirements for digitization
– xml “foreign” to many library programmers, an
emerging technology
 Why Choose XML?
– Simplification and cost control
– Demand for best-practice system
Reston Finding Aid Project
 Provide finding aid (box and folder listing)
 Integrate digital copies of documents to
finding aid
James B. Reston
 New York Times
 Reporter, Chief
Correspondent,
Editor, Vice
President
 Reputation as a
journalistic insider,
from the 50s to the
80s
Confidential
Memorandum
 Document off-the-
record background
conversations
Diaries
 Moscow trip, 1943
 China, interview
of premier Chou
En-lai, 1971.
Interviews
 Henry Kissinger
 Ted Kennedy
 Jimmy Carter
Correspondence
 Some public
 Much internal NY
Times
 Copyright unclear in
many cases
Reader Mail
 Illustrates the
challenge posed by
copyright law in
constructing online research
collections
Jack Valenti
MPAA
Our Mandate
 Provide as much on-line as possible
 at the lowest cost
 using archival/manuscript descriptive
standards
Project Issues (1)
 Archival/policy
– compliance with archival practices of
arrangement and description (context)
– integrate images directly into finding aid
– Not involve time-consuming cataloging or
design
 Cost
– Can you rely on open source software?
Project Issues (2)
 Technical
– Markup
• efficiency of markup
• staffing turnover and concerns
– Manuscript display. Needed system to:
•
•
•
•
handle thousands of images in varied in formats
provide a printable image
be easily navigable
and quick and dirty, without depending on expensive
equipment or software
Tools and Techniques (1)
 Open-source is the solution
 EAD (Encoded Archival Description)
– Archival finding aid markup (XML)
– allows nesting of “levels” in finding aid
– <dao> tag
Tools and Techniques
 EAD Cookbook (Michael Fox)
– predefined tag library
– XSLT stylesheets to generate static HTML
– Stylesheets easy to modify
– implementation instructions
 WordPerfect’s XML editor for markup
Tools and Techniques
 Image display/turner
– EBind
• Berkely, http://sunsite.berkeley.edu/Ebind/
– Perl script generates html from SGML files
– book oriented
• UI Archives modifications
–
–
–
–
optimized for Windows server
built in limitations removed
added fields for physical description and genre
enhanced user interface for integration to EAD Cookbook
and use with manuscripts
 Notetab Light (Clips)
 Clip Language: Easy to program (really!)
 CompuPic Pro
(only software
purchased)
 DOS batch file
integrated into clip
automates site update
Tools and Techniques
 Workflow
– 1. Scan images to uncompressed .tif
– 2. Perform batch conversion routine (simple,
but needs “one click” automation)
– 3. Complete query box in Notetab
– 4. Click “create worksheet” (editing rare)
– 5. Click “move”
• Entering descriptive data
• Tagging automatically inserted and file saved
• worksheet created and automatically saved
Corresponds to
relative path
Structure may be edited
for complex documents
File sequence
• XML created automatically from worksheet
Bibliographic data
Image sequencing
• Updating the site (takes place for each document)
1. Transforms EAD to HTML
2. Creates EBind SGML file
3. Moves everything to web server in
consistent directory structure
4. Archives the uncompressed .tif to disk
in parallel structure
Interface: Opening Page
Navigable TOC
in frame
Dublin Core in Frameset
Interface: Series Level
<c01> level linked
from TOC
Interface: DAO Links
Interface: Ebind TOC
Interface: Ebind Thumbnails
Interface: Ebind Page View
Hi-res prints on
single sheet of
paper
Navigation bar
repeated at bottom
Costs
 EAD markup, proofreading and display
– Reston: $248.63 or $3.65 per printed page
– Bardeen: $110 or $1.96 per printed page
 Ebind
– For recent student workweek
• 142 images in 9.5 hours ( includes 1 hour selecting documents):
• $118.37 or .83 cents per page scanned and added
EAD Cookbook/Ebind/Notetab:
Strengths
–
–
–
–
Open source approach helps control costs
Standards compliant, syntactically sound
Will allow for data sharing and exchange
Easy design for staff, easy workflow for
students
EAD Cookbook/Ebind/NoteTab:
Weaknesses
– Low accuracy of description; does not fully
comply to rules of archival description
• Journalism students, supervision difficult
– Load times. Need to rewrite stylesheets
– Not currently searchable.
Resources
 EAD Cookbook
– http://jefferson.village.virginia.edu/ead/
 Ebind (Berkeley)
– http://sunsite.berkeley.edu/Ebind/
 EBind (University of Illinois)
– http://www.library.uiuc.edu/ahx/ead/tech/
 Notetab Light
– http://www.notetab.com