FAST AT CORNELL
Collaborating with OCLC Research for Enhanced Discovery
FAST Interest Group, ALA Annual Conference 2014
Steven Folsom, [email protected]
Chew Chiat Naun, [email protected]

THE FAST TEAM
OCLC Research: Rick Bennett, Eric Childress, Kerre Kammerer, Ed O’Neill, Diane Vizine-Goetz
Cornell University: Gary Branch, Chew Chiat Naun, Steven Folsom, Sarah Ross, Ardeen White
Cornell catalogers: Yen Bui, Roswitha Clark, Sung Ok Kim, Yelena Kurbanova, Apikanya McCarty, Teresa Mei
Cornell Discovery and Access Team: (Too many to list)

BLACKLIGHT CATALOG
• Open Source
• Greater control over Indexing Decisions
• Greater control over User Experience
• Fits within our larger Discovery and Access Initiative
• Co-indexing local digital collection metadata
• Co-indexing third party content, e.g. HathiTrust, DOAB

CORNELL PIPELINE FOR BLACKLIGHT
• MARC21 (from ILS and elsewhere)
• Non-MARC Data Sources
Integration Layer
• XML
• RDF
• Mapping to Solr Fields for Blacklight Indexing

Q: WHY THE TRIPLE STORE?
A: Inferences

FOR EXAMPLE…
ULAN TO LCNAF
LCNAF TO FAST

CURRENT SUBJECT MAPPING IN SOLR

CURRENT SUBJECT FACETING IN BLACKLIGHT
SUBJECT/GENRE

PARTNERING WITH OCLC
• December 11-12, 2013: ALCTS eforum, OCLC offers to partner with a library to convert their LCSH. Cornell requests service.
• September 9th, 2013: Cornell contacts OCLC about 2001 publication
• September 2013-: OCLC announces project to enhance WorldCat db with FAST headings
• Spring 2014: OCLC and Cornell discuss feedback on problematic mappings
• Spring 2014: OCLC and Cornell test batch conversions of Cornell records
• Near future: Merge FAST headings into Cornell MARC records
• Spring 2014: OCLC and Cornell strategize methods for extending FAST

CATALOGING WITH FAST
• Pilot began February 2014
• 7 catalogers currently participating
• For minimal-level cataloging only (25% of our original output)
• Replaces use of uncontrolled 653
• Currently using http://fast.oclc.org/searchfast/

SEARCHFAST

CATALOGING WITH FAST
• FAST vs LCSH
• Introductory presentation
• Characteristics of FAST
• Strengths and weaknesses
• Cheat sheet
• FAST “mindset”

KEY TRAINING POINTS
• Use what you find (all headings established)
• Subjects do not cross facets
• Observe distinction between topical and genre/form facets
• Fewer application rules
  • No constraints on combinations of topical and geographical terms
  • Order of headings is not significant
• Dates can be whatever you need to assign.

DATES

VOCABULARY ISSUES
• Unestablished headings
• Problematical mappings
• Missing references (e.g. vernacular)
• Scope notes
• Event headings

WHAT HAPPENS IF A HEADING IS NOT IN FAST?
Extensions
Not in LCNAF:
600 17 |a Folsom, Steven |2 fast/NIC
In LCNAF but not in FAST:
600 17 |a Childress, Eric |2 fast/naf |0 (DLC)no2005043559
Legal but not explicitly established in LCSH:
650 _7 |a Elephants |x Weight |2 fast/lcsh
Headings in our conversion project that failed to validate against FAST will be given similar treatment.
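
The extension coding above can be sketched in a few lines of Python. This is only an illustration, not the conversion code used by Cornell or OCLC: the `HeadingStatus` labels, the `build_heading` helper, and the idea that a heading's status has already been determined by some earlier lookup are all assumptions. Only the $2 values and the three sample headings come from the slide.

```python
from enum import Enum


class HeadingStatus(Enum):
    """Hypothetical labels for where a heading was (or was not) found."""
    IN_FAST = "in FAST"
    NAME_NOT_IN_LCNAF = "name not in LCNAF"
    NAME_IN_LCNAF_ONLY = "name in LCNAF but not in FAST"
    LEGAL_LCSH_PATTERN = "legal in LCSH but not explicitly established"


# $2 values as shown on the slide; plain "fast" is for established headings.
SOURCE_CODES = {
    HeadingStatus.IN_FAST: "fast",
    HeadingStatus.NAME_NOT_IN_LCNAF: "fast/NIC",
    HeadingStatus.NAME_IN_LCNAF_ONLY: "fast/naf",
    HeadingStatus.LEGAL_LCSH_PATTERN: "fast/lcsh",
}


def build_heading(tag, indicators, subfields, status, authority_id=None):
    """Render a heading in the slide's display style: tag, indicators, |x value ..."""
    parts = [f"{tag} {indicators}"]
    parts += [f"|{code} {value}" for code, value in subfields]
    parts.append(f"|2 {SOURCE_CODES[status]}")
    if authority_id:
        # |0 carries the identifier of the source authority record, when one exists.
        parts.append(f"|0 {authority_id}")
    return " ".join(parts)


print(build_heading("600", "17", [("a", "Folsom, Steven")],
                    HeadingStatus.NAME_NOT_IN_LCNAF))
print(build_heading("600", "17", [("a", "Childress, Eric")],
                    HeadingStatus.NAME_IN_LCNAF_ONLY, "(DLC)no2005043559"))
print(build_heading("650", "_7", [("a", "Elephants"), ("x", "Weight")],
                    HeadingStatus.LEGAL_LCSH_PATTERN))
```

Keeping the status-to-$2 mapping in one table means a heading that fails validation in the batch conversion could be routed through the same function as a heading typed by a cataloger.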
PROBLEMATICAL MAPPINGS

“RELATIONS” – OCLC SOLUTION
Relations [countries] -> International relations
Relations [religions] -> Interfaith relations
Foreign relations -> Diplomatic relations
Foreign economic relations -> International economic relations
Military relations -> Military relations

EVENT HEADINGS
FAST conversion output
611 7 $a Trenton, Battle of (New Jersey : 1776) $2 fast
648 7 $a 1776 $2 fast
651 7 $a New Jersey $z Trenton $2 fast
FAST event authority
034 __ $d W0744435 $e W0744435 $f N0401301 $g N0401301 $2 geonames
043 __ $a n-us-nj
046 __ $s 1776
053 _0 $a E241 .T7
111 0_ $a Trenton, Battle of (New Jersey : 1776)
511 __ $a American Revolution (1775-1783) $0 (OCoLC)fst01351668 $w g
551 __ $a New Jersey $z Trenton $0 (OCoLC)fst01207908

STATE OF PLAY: CATALOGING
• About 450 records in first 3 months
• Unit time about 50% higher than minimal-level cataloging
• Well-accepted by catalogers
Further steps
• Maintenance
• Improvements to editing environment (maybe)

The Elephant in the Room

FINALIZING BATCH PROCESSING AND MERGE
• Add valid FAST headings and local extensions to records
• For names not in LCNAF, add $2 fast/NIC
• Recode $x subdivisions to $v if they match a current $v subdivision, in LC cataloged records created pre-2000 and all non-LC cataloged records. (Hoping this to be a little more nuanced.)
• Retain original LCSH headings in the records

TENTATIVE SOLR MAPPING
Vocabularies for faceting
• Most likely to only use |2 fast, |2 fast/NIC, and |2 fast/naf
• Could conditionally use $2 naf if there are no FAST headings since the literals would converge
Conditional logic for item record display
• Likely to suppress FAST from item view if LC authorities are available.

PROJECTING PROGRESS: FACETS
Before (Now)
Subject/Genre
  Gambling 12
  Casinos 4
  Minority business enterprises 2
  United States. Small Business Administration 2
  Bildungsromans 1
Subject: Region
  Las Vegas (Nev.) 17
  Nevada Las Vegas 11
  United States 2
  Arizona Black Mesa (Navajo County and Apache County) 1
  Black Mesa (Navajo County and Apache County, Ariz.) 1
Subject Era
  20th century 4
After
Subject/Genre
  Gambling 12
  Casinos 4
  Minority business enterprises 2
  United States. Small Business Administration 2
  Bildungsromans 1
Subject: Region
  Nevada -- Las Vegas 28
  United States 2
  Arizona -- Black Mesa (Navajo County and Apache County) 2
Subject Era
  1970-1979 3
  1978 1
  20th century 1

PROJECTING PROGRESS: ITEM VIEW
Before (Now)
After

OUTSTANDING ISSUES
Assessment
• Workflow… Is it fast enough?
• Do mappings need tweaking?
• Is genre data good enough to parse out from topical?
Digital collection use of FAST
• Legacy metadata… Convert or infer?
• New Digital Collections… use FAST as vocabulary or map?

MAINTENANCE
Still working out the details, but:
• OCLC will supply report of changed headings
• IDs should greatly facilitate automation
• OCLC master record updates
• Not restricted to ILS data

FAST CHANGE REPORT

RESOURCES
• O’Neill and Chan’s FAST: Faceted Application of Subject Terminology (OCLC #624025531)
• OCLC FAST web site: http://oclc.org/research/activities/fast.html
• ALCTS eforum summary: http://lists.ala.org/wws/arc/alcts-eforum/2013-12/msg00131.html
• Cornell cheat sheet: http://lts.library.cornell.edu/lts/pp/cat/127FAST
• Cornell FAST introductory presentation: http://ecommons.cornell.edu/bitstream/1813/34435/2/FAST_C%26M.pptx
• Other local Cornell documents available (tips, problems, etc.)

QUESTIONS?
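
NOTE ON THE TENTATIVE SOLR MAPPING
The faceting rule under TENTATIVE SOLR MAPPING (prefer headings coded fast, fast/NIC, or fast/naf, and fall back to naf only when a record has no FAST headings) might look roughly like this. It is a sketch under stated assumptions, not the actual Blacklight indexing code: each heading is assumed to have been reduced already to a (literal, $2 source) pair, and the names `FAST_SOURCES`, `FALLBACK_SOURCE`, and `facet_values` are hypothetical.

```python
# $2 values the slide says are most likely to feed the facets.
FAST_SOURCES = {"fast", "fast/NIC", "fast/naf"}
# $2 value the slide says could be used conditionally.
FALLBACK_SOURCE = "naf"


def facet_values(headings):
    """Pick facet literals for one record.

    headings: list of (literal, source) pairs, where source is the $2 value.
    Returns the FAST literals if any exist, otherwise the naf literals.
    """
    fast_values = [value for value, source in headings if source in FAST_SOURCES]
    if fast_values:
        return fast_values
    # No FAST headings on this record: fall back to naf, since the
    # literals are expected to converge with their FAST equivalents.
    return [value for value, source in headings if source == FALLBACK_SOURCE]


record_with_fast = [("Gambling", "fast"), ("Gambling", "lcsh"),
                    ("Folsom, Steven", "fast/NIC")]
record_without_fast = [("Childress, Eric", "naf"), ("Casinos", "lcsh")]

print(facet_values(record_with_fast))
print(facet_values(record_without_fast))
```

Retained LCSH headings are ignored by this rule, which matches the plan to keep them in the records without faceting on them.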