Beyond the Record : OCLC & the Future of MARC

CCS Forum
ALA - Chicago
July 11, 2009
Ted Fons
Director WorldCat Global
Metadata Network
Beyond the Record: OCLC and the Future of MARC
• The OCLC Context
• OCLC’s Role in RDA
• Beyond MARC
• Beyond the Record
The OCLC Context
• A membership organization
• Diverse membership
The OCLC Cooperative
69,826 libraries in 112 countries
Member libraries by region (map; region labels not shown): 1,355; 5,639; 55,284; 1,080; 4,253; 882; 1,015; 320
OCLC’s Role with RDA
OCLC & RDA
• Committee Contribution:
• ex-officio membership in the ALA Committee on
Cataloging: Description and Access
• MARC Advisory Committee
• Staff Participation:
• Joint Steering Committee's two RDA Examples Groups
• RDA/MARC Working Group
• Representation on: ALA ALCTS RDA Implementation Task Force
• Various program sessions
OCLC & RDA
• OCLC Internal Activities:
• Discussions with the three U.S. national libraries to plan
for the testing/evaluation period (late 2009)
• Planning for MARC21 format changes to support the
testing/evaluation period
• OCLC Contract Services staff have been selected to
participate in the testing/evaluation period.
Beyond MARC21
With thanks to Jean Godby of OCLC Research
The Crosswalk Web Service at OCLC
• Enables OCLC to translate from one metadata format to
another.
• A “metadata format” is a triple that consists of a metadata schema,
a structural encoding, and a character encoding.
• Supported standards are bibliographic, but the software can handle
other types of data.
• Can be called from any product or service that processes
metadata.
• A version with a slightly different interface resides on the
OCLC Enterprise Bus.
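The "triple" definition of a metadata format above can be sketched in a few lines of Python. This is only an illustration, not the Crosswalk Web Service's own data model: the class name, field names, and example values (including "MARC-8") are assumptions chosen for the sketch.

```python
from typing import NamedTuple


class MetadataFormat(NamedTuple):
    """Hypothetical model of a metadata format as a three-part value."""
    schema: str               # the element set, e.g. "MARC 21"
    structural_encoding: str  # the container syntax, e.g. "ISO 2709" or "XML"
    character_encoding: str   # the character set, e.g. "MARC-8" or "UTF-8"


# Illustrative values only: the same schema in two different encodings.
marc_binary = MetadataFormat("MARC 21", "ISO 2709", "MARC-8")
marc_xml = MetadataFormat("MARC 21", "XML", "UTF-8")


def same_schema(a: MetadataFormat, b: MetadataFormat) -> bool:
    """Two formats share a schema even when their encodings differ."""
    return a.schema == b.schema
```

Treating the three parts separately is what lets a translation between two encodings of one schema be handled as a structural conversion, with no element mapping needed.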
Inputs and outputs
Formats shown in the diagram:
• MARC 21 (ISO 2709)
• MARC XML
• OCLC MARC
• OCLC CDF
• MODS
• ONIX Books
• ONIX Serials
• DC XML
• OAI-DC XML
• OAI-PMH XML
• DC-Qualified
Data flow for a single translation
Example: MARC21 to Dublin Core via CDF

MARC input

  ISO 2709:
    522 $a northwest

  or MARC XML:
    <record>
      …
      <datafield tag='522'>
        <subfield code='a'>northwest</subfield>
      </datafield>
      …
    </record>

Convert to input structure

    <record>
      <header>
        <schema name='marc21' namespace='uri:marc:21'/>
      </header>
      <field name='522'>
        <field name='a'>
          <value>northwest</value>
        </field>
      </field>
    </record>

Translate to DC Terms

    <record>
      <header>
        <schema name='DC-Terms' namespace='uri:DC-Terms'/>
      </header>
      <field name='spatial'>
        <value>northwest</value>
      </field>
    </record>

Convert to output structure

DC Terms output

    <?xml version="1.0" encoding="UTF-8"?>
    <qualifieddc xmlns:dcterms='http://purl.org/dc/terms/'>
      <dctermsset>
        <dcterms:spatial>northwest</dcterms:spatial>
      </dctermsset>
    </qualifieddc>
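The stages of the data flow above (read the input structure, translate element names, write the output structure) can be sketched in Python with the standard library. This is a minimal sketch under stated assumptions, not the Crosswalk Web Service or its Seel scripts: the function names and the one-entry mapping table are hypothetical, the intermediate form is a plain list of tuples instead of CDF, and the MARC XML input omits the namespace that real MARC XML records carry.

```python
import xml.etree.ElementTree as ET

DCTERMS_NS = "http://purl.org/dc/terms/"

# Hypothetical mapping table: (MARC tag, subfield code) -> DC Terms element.
MARC_TO_DCTERMS = {("522", "a"): "spatial"}


def read_marc_xml(xml_text):
    """Convert MARC XML into a neutral list of (tag, code, value) tuples."""
    root = ET.fromstring(xml_text)
    fields = []
    for datafield in root.iter("datafield"):
        for subfield in datafield.iter("subfield"):
            fields.append(
                (datafield.get("tag"), subfield.get("code"), subfield.text or "")
            )
    return fields


def translate(fields):
    """Rename mapped fields to DC Terms elements; unmapped fields are dropped."""
    return [
        (MARC_TO_DCTERMS[(tag, code)], value)
        for tag, code, value in fields
        if (tag, code) in MARC_TO_DCTERMS
    ]


def write_dcterms(fields):
    """Serialize translated (element, value) pairs as qualified DC XML."""
    ET.register_namespace("dcterms", DCTERMS_NS)
    root = ET.Element("qualifieddc")
    container = ET.SubElement(root, "dctermsset")
    for name, value in fields:
        element = ET.SubElement(container, f"{{{DCTERMS_NS}}}{name}")
        element.text = value
    return ET.tostring(root, encoding="unicode")


marc_input = (
    "<record><datafield tag='522'>"
    "<subfield code='a'>northwest</subfield>"
    "</datafield></record>"
)
dc_output = write_dcterms(translate(read_marc_xml(marc_input)))
```

Because only `translate` knows about element names, a new input or output syntax would need only a new reader or writer in this sketch, which mirrors the separation of structure conversion from translation shown on the slide.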
In sum…
• The Crosswalk Web service is engineered for
reusability.
• It is abstract enough to handle any kind of
metadata markup.
• It keeps a close connection between human-generated translation logic and executable code.
• It is flexible enough to handle many use cases.
Adoptions
The Crosswalk Web Service has been incorporated into:
• Connexion Client 2.0
• ContentDM Ingest
• Data Load Enhancement
• eSerials, eSweep
• NetLibrary
• Next Generation Cataloging
Adoption is being studied for components of:
• Digital Collection Gateway
• WorldCat Cataloging Partners NCIP (NISO Circulation Interchange Protocol)
It is being used in research projects:
• Art and natural history museum metadata (with RLG partners)
• ISO 8459 bibliographic message exchange (with Janifer Gatenby)
Future priorities
1. Develop a user interface that accepts translation
logic and automatically generates Seel scripts.
2. Streamline and enhance some of the Seel language
features.
3. Investigate ways to interoperate with the
crosswalking software developed at OCLC Leiden.
4. Develop translations for non-bibliographic
metadata.
For more information
1. Metadata translation at OCLC, pre-CWS
• A Survey of Metadata Translation Activity at OCLC
2. CWS documentation
• The Crosswalk Web Service Users’ Guide
• The Seel tutorial: Introduction; Seel in a Nutshell
3. Research reports
• Encoding Application Profiles in a Computational Model of the Crosswalk
• Toward element-level interoperability in bibliographic metadata
• A Repository of Metadata Crosswalks
• Two Paths to Interoperable Metadata
Beyond the Record
With thanks to Diane Vizine-Goetz of OCLC Research
WorldCat Identities
FRBR Entity Levels Revisited
• Work: The novel; The movie
• Expression: Original Text; Translation; Critical Edition; Original Version
• Manifestation: Paper; PDF; HTML
• Item: Copy 1 (Autographed); Copy 2
Based on a graphic in Tillett, Barbara, "AACR2's Strategic Plan and IFLA Work towards an International
Cataloguing Code" (2002)
OCLC FRBR Work-set Algorithm
Provides a FRBR-based view of the data
1. Records clustered into works using author and
title fields from bibliographic and authority
records
2. Author names and titles normalized to construct a
work key
3. All records with the same key are grouped
together in a work set or cluster
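The three steps above can be illustrated with a small Python sketch. It is a simplification under stated assumptions and not OCLC's algorithm: the normalization rules, the key layout, and the sample records are invented for illustration, and the sketch uses only bibliographic author and title strings, with no authority records.

```python
import re
import unicodedata
from collections import defaultdict


def normalize(text):
    """Lowercase, strip diacritics, and collapse punctuation to single spaces."""
    decomposed = unicodedata.normalize("NFKD", text)
    no_marks = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return re.sub(r"[^a-z0-9]+", " ", no_marks.lower()).strip()


def work_key(author, title):
    """Build a work key from the normalized author and title (assumed layout)."""
    return f"{normalize(author)}/{normalize(title)}"


def cluster(records):
    """Group record ids that share the same work key into work sets."""
    work_sets = defaultdict(list)
    for record in records:
        key = work_key(record["author"], record["title"])
        work_sets[key].append(record["id"])
    return dict(work_sets)


# Invented sample records; ids 1 and 2 differ only in case and punctuation.
records = [
    {"id": 1, "author": "Austen, Jane", "title": "Pride and prejudice"},
    {"id": 2, "author": "AUSTEN, JANE.", "title": "Pride and Prejudice /"},
    {"id": 3, "author": "Austen, Jane", "title": "Emma"},
]
work_sets = cluster(records)
```

In this sketch records 1 and 2 fall into one work set and record 3 into another, since grouping depends only on the normalized key.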
Share data elements across a FRBR Work Set
• Work: The novel
• Expression: Original Text; Translation; Critical Edition
• Manifestation
• Shared data elements: Summary; Cover Art; Subject Terms
Work pages beta
Provides a rich context from cataloging data