Evolving MARC 21 for the future Rebecca Guenther CCS Forum, ALA Annual July 10, 2009

Download Report

Transcript Evolving MARC 21 for the future Rebecca Guenther CCS Forum, ALA Annual July 10, 2009

Evolving MARC 21 for the future
Rebecca Guenther
CCS Forum, ALA Annual
July 10, 2009
Overview of presentation
The current MARC environment
 Successes of MARC
 Moving to XML
 RDA and FRBR experimentation
 Linked data
 Issues to resolve

Credits
Sally McCallum for some slide
content
 John Espley for FRBR
implementation
 Clay Redding for Linked Data
content

What is MARC 21?

A syntax defined by an international standard for
communications with 2 expressions:






Classic MARC (MARC 2709)
MARCXML
A data element set defined by content
designation and semantics
Institutions do not store “MARC 21”, as it is a
communications format
Many data elements are defined by external
content rules; a common misperception is that it
is tied to AACR2
A set of 5 formats for different purposes:
Bibliographic, Authority, Holdings, Classification,
Community Information
The current MARC
environment






Billions of rich descriptive records in MARC
systems
Many national formats have been harmonized
with MARC 21
Integrated library systems that support MARC
bibliographic, authority and holdings format for
different functions
Wide sharing of records for 30+ years
OCLC is a major source of records
MARC records are being reused (sometimes
converted) and repackaged
MARC successes

Can carry data formulated by different
cataloging rules and conventions





Multiple descriptive rules, different principles and
models
Different subject thesauri
Multiple languages and scripts
Cooperation in record exchange has resulted in
widespread use and cost savings
Richness of MARC records supports multifaceted
retrieval


Coded data
Parsed data
Problems with MARC







MARC 2709 syntax problems
Limitation of available fields, subfields,
indicator values, etc.
Redundant data (fixed vs. variable fields)
Controlled values embedded in the
standard
Ability to link
Limits to extensibility
Lack of explicit hierarchical levels
Some progress…
MARCXML
 Definition of $u, $w and $0 for
linking
 Definition of $2 to specify source of
controlled values
 Exploration into use of URIs
 MODS

Streamlining MARC 21
into the future

Take advantage of XML



Develop simpler (but compatible)
alternatives


MODS
Allow for interoperability with different XML
metadata schemas


Increasingly use MARC 21 in an XML structure
Take advantage of freely available XML tools
Assemble coordinated set of tools
Provide continuity with current data

Provide flexible transition options
MARCXML






MARCXML uses the MARC data element set in
an XML syntax
Lossless roundtrip conversions
Simple flexible XML schema, no need to
change when MARC 21 changes
Interoperability with different XML schemes by
taking advantage of free XML tools
Collaborative use of metadata for access
(e.g.OAI)
Continuity with current data and flexible
transition options
Example: http://lccn.loc.gov/2004012412
MARC 21 evolution to XML
More use of MARCXML
Transition to exchange in MARCXML
instead of MARC 2709
 Take advantage of namespaces to
allow for extensions (e.g. technical
metadata)
 Potential to extend with new
attributes or subfield markers

Other related XML
schemas: MODS








Eliminates some of the problems with MARC
(e.g. lack of tags/subfield codes)
More user-friendly (uses language tags)
Repackages redundant data elements into one
Can carry hierarchical data
Less tied to cataloging rules
Highly compatible with MARC but simpler,
although retaining some richness
Widely implemented especially for digital
projects
Governed by Editorial Committee
Example: http://lccn.loc.gov/2004012412/mods
Other related XML
schemas: METS


METS
 A container/information package
 Wrapper for MARCXML and MODS descriptions
 Allows for additional technical and
preservation metadata
 Enables tracking of actions on the metadata
itself
LC is exploring a native XML database for search
and retrieval bringing together records from
many internal systems using
MODS/MARCXML/METS
Experimentation with RDA,
FRBR and RDF



MARC changes to accommodate RDA
MODS changes to accommodate RDA
FRBR experimentation



VTLS has implemented a FRBR based system using
MARC
Other systems are looking at migrating to reflect entityrelationship models
Semantic web experiments



Modeling MODS and MADS as RDF
Modeling MARC as RDF
See Martha Yee’s article in Information Technoclogy and
Libraries (v. 28, no. 2, June 2009)
Example of FRBR
implementation of MARC: VTLS
Single database can have FRBR and non-FRBR
records
System is “aware” of FRBR record level and
changes displays as needed
Display of FRBR records are in tree structure
Local level fields have values of W, E, and M to
indicate type of FRBR record
001 and 004 used as linking tags
Experimentation with
“Linked data”





Library of Congress Authorities & Vocabularies service:
http://id.loc.gov
Allows both human-oriented and programmatic access to
LC-promulgated authorities and vocabularies
Actionable URIs associated with concepts
First offering is Library of Congress Subject Headings, but
more to come: e.g. Thesaurus of Graphic Materials, ISO
639-2, MARC code lists, etc
Advantages




Facilitate development and maintenance process for
vocabularies
Expose vocabularies to wider communities
Offer bulk downloads
Example:
http://id.loc.gov/authorities/sh85049843
Issues to resolve for
bibliographic formats









Actionable vs. descriptive
Parsed vs. text
Controlled/access vs. transcribed
Codes vs. words
Library vs. non-library traditions
My model vs. your model
Stability vs. change
Basic retrieval vs. scholar retrieval
Cost of change
How do we move forward?



Continue RDA/FRBR implementation changes
Transition to XML for exchange (syntax)
Evolve MARC 21 (data element set)






Analyze successes that need to be carried over into a
streamlined MARC
Consider MODS features to be incorporated
Consider use of MARC 21 for different cataloging
rules, models and traditions, not just RDA
Consider cost implications
Assure continuity with current data
Evolution will take some time and will be in
phases
Comments?

Tell us what you think:
[email protected]
or
 [email protected]