What is a digital library?

Download Report

Transcript What is a digital library?

Digital Libraries and Music

Jon Dunn SLIS L631 Music Librarianship Seminar April 7, 2003

Outline

 Digital Libraries  Music content  Variations  Variations2  Special topics:  Music information retrieval  Open Archives Initiative

What is a digital library?

  DL as collection/information system  

“a collection of information that is both digitized and organized” -- Mike Lesk, National Science Foundation “networked collections of digital text, documents, images, sounds, scientific data, and software” - President’s Information Technology Advisory Council report

DL as organization: 

“an organization that provides the resources, including the specialized staff, to select, structure, offer intellectual access to, interpret, distribute, preserve the integrity of, and ensure the persistence over time of collections of digital works so that they are readily and economically available for use by a defined community or set of communities” -- Digital Library Federation

Applications of music DLs

   Education   Electronic reserves Online instructional tools Research   Better access to special collections New capabilities for analysis, searching Commercial   Professionals  E.g. music/film/video production Consumers  Online music catalogs, digital distribution

What is a music digital library?

       What does it contain?

How is this content acquired?

How is this content accessed?

How can the content be used once located?

What is the purpose?

Who are the users?

How is content protected?

Music DL features

     Content  Selection, digitization, storage, delivery Metadata (cataloging) Search capabilities  for content and metadata Interfaces  User interfaces, programmatic interfaces Access control

Basic Representations of Music

Digital Audio Audio (e.g., CD, MP3): like speech MIDI file): like unformatted text with complex formatting

Content Formats

 Audio  MIDI  Scores  Images  Structured file format  (Video)

Digital audio

 Sampling  Sample rate, sample size, number of channels  Compression  Perceptual audio coding  File formats  Standards

Digital audio file formats

 Uncompressed –all basically the same  WAV - Microsoft/IBM   AIFF - SGI/Apple AU/SND - NeXT/Sun  Compressed   MPEG-1 layers 1-3, MPEG-2 AAC RealAudio, Windows Media, QuickTime  Each supports various compression options

Digital audio file sizes

 Uncompressed audio  44.1 kHz, 16 bit, stereo (CD quality)   650 MB for one hour 1.4 Megabits/second  Compressed    MP3: 58 MB for one hour, 128 Kilobits/second AAC: 29 MB for one hour, 64 Kilobits/second RealAudio, Windows Media Audio, QuickTime Qdesign Music: down to 20 Kilobits/second or less

Digital Audio

audio sampling quantization noise Barlow,

Multimedia Systems

, p. 77.

CD Audio

    Sample rate:  44.1 kHz (44,100 samples/second) Sample size:  16 bits Number of channels:  2 (stereo) Bitrate  44100 samples/second * 16 bits/sample * 2 channels = 1.4112 megabits/second (plus file format/network overhead)

Masking Effect

Barlow,

Multimedia Systems

, p. 73.

From

Research and Creative Activity

, September 1999

Digital audio delivery

 Delivery options  Download  Streaming  e.g. RealAudio, Windows Media, QuickTime Streaming  Encrypted download  e.g. LiquidAudio, a2bmusic, Windows Media

Scores

 Score image  File format: TIFF, JPEG, GIF, PDF, …  Resolution  Grayscale vs. bitonal (black and white)  Score notation  Many proprietary formats  No common standard

Attributes of notated musical information

 Pitch  Duration  Tempo  Dynamic level  Articulation  Part (sometimes implying timbral definition) Selfridge-Field,

Beyond MIDI

, p. 9.

Difficulties in representing CMN

 Grammar of CMN is open-ended  Which is more critical: graphical appearance or semantic meaning?

 Much left open to interpretation  Style differences, e.g. interpretation of rhythms

Music Notation File Formats

 www.music-notation.info

lists over 50 different music notation formats, most for CMN

MIDI

   M usical I nstrument D igital I nterface  Originally a hardware interface spec  Communication of real-time events between musical devices Standard MIDI File (SMF)   Stores time-stamped MIDI event information  e.g note on/off, key pressure, aftertouch, pitch bend, control change, program change.

Each event accompanied by parameters  e.g. note on includes pitch, duration, dynamic range Spec maintained by industry group  MIDI Manufacturers’ Assocation

Limitations of MIDI

 MIDI does not represent many musical attributes  Graphical notation elements  Rests, stem direction, enharmonic distinctions, staff systems, page layout, etc.

 Sound elements  Timbre, full stylistic expression  Extensions exist but not widely used

Creating Notation Content

 Transcription  Music notation editor  ASCII data entry  Recognition  OMR: Optical Music Recognition

Music notation editor example: Finale

OMR: Optical Music Recognition

 Commercial packages  Musitek MidiScan/SmartScore  Version included with Finale  Neuratron PhotoScore  Version included with Sibelius

OMR: A long way from OCR

Here's the original: Scanned into Finale: Only 5 easy edits needed.

Taken from http://www.codamusic.com/finale/scanning.asp

OMR: Optical Music Recognition

 Research projects  CANTOR  University of Waikato, New Zealand  Adaptive OMR  Johns Hopkins University, USA  Example: http://mambo.peabody.jhu.edu/omr/demo/  others:  http://www.cs.waikato.ac.nz/~davidb/omr/

Score Images

 Scanned images still useful  Historical editions, manuscripts  Preservation, improving access  Impracticality of large-scale OMR  Music presents challenges for scanning

Variations

 Digital library of music sound recordings and scores  Online since 1996  Accessible in Music Library and other select locations - copyright  Used daily by large student population

Original Concept

 Burroughs and Fenske, 1990  VARIATIONS name 

Theme and Variations

 Variety of information formats for music  Networked access for the music student or scholar to sound recordings, scores, textual materials, video recordings

Focus on Audio

 High demand portion of collection  Fragile formats  Lack of previous work; uniqueness

Focus on Audio Reserves

 Half of sound recording use from reserves  Problems with existing practices  Cassette tape dubs, analog distribution systems  Concentrated use of a few items at any given time

Variations System

 Digitization  Storage  Access

Design and Development

     Developed by Music Library with assistance from UITS and Library Information Technology Integrate rather than develop from scratch Partnership with IBM Funding: School of Music, Libraries, UITS, IBM Online in April 1996

Digitization

 Formats  Analog: LP, cassette tape, reel-to-reel tape  Digital: CD, DAT  Capture at CD quality   44.1 kHz, 16 bit, stereo, 700MB for one hour Compress to MPEG 1 layer 2 (“MP2”)   200 MB for one hour Create “track file”

Digitization Hardware and Software

 Windows PCs  Sound capture card  Microtest Disc-to-Disk CD capture software  Sonic Foundry Sound Forge XP audio editor

Music Library Digitizing Lab

Storage

 Tivoli Storage Manager software  IBM RS/6000 AIX server  IBM Tape Library Dataserver  Contains three tape drives  10 terabyte (10,000 gigabyte) capacity

IBM 3494 Tape Library

IBM 3590E tapes: 20GB each

Access

 Discovery  How does the user find the desired recording?

 Playback  How is audio delivered to the user?

 How does the user navigate within a given recording?

Collection

 Currently: 6900 titles, 8000 hours of audio  5.6 TB uncompressed  1.6 TB compressed  Opera, songs, instrumental music, jazz, rock, world music

Discovery

 Varies based on purpose of access  Reserves  Course reserve lists  Faculty-created course home pages (incl. Oncourse)  General use  Links from IUCAT library catalog (856 fields in MARC bib records)

Playback

 Streaming server  IBM RS/6000, 150 GB disk  IBM VideoCharger server software  Software to connect VideoCharger with TSM (locally written)  Client  IBM VideoCharger client software  Variations Player (locally written)  Navigation via track files

Network

 Originally ATM  25/100/155 megabits per second  Now switched Ethernet  10/100/1000 megabits per second  Variations audio stream requires 384 kilobits/second  Up to 150 streams

Variations Demonstration

Variations2

 Four-year project  Started October 1, 2000      Funding from NSF and NEH through Digital Libraries Phase 2 (DLI2) program Large interdisciplinary team of investigators Faculty: Music, Information Science, Law, Computer Science Librarians and technologists: Libraries, University Information Technology Services Bloomington and Indianapolis campuses

Project goals

   Establish a digital music library testbed system supporting multiple formats: audio, video, score images, score notation Develop multiple interfaces for specific user applications in the music library and the classroom Conduct research in metadata, usability, copyright, and networking

Partners: “Satellite Sites”

    United States    University of Illinois at Urbana-Champaign University of Massachusetts at Amherst Northwestern University United Kingdom   Kings College - London Loughborough University  University of Oxford Japan  Waseda University Evaluation…potential for co-development

The Variations2 System

     Integrated access to music in all formats     Digital audio recordings Score images Score notation Video Delivery to wide range of users    Faculty: teaching, course design, research Students: coursework, independent study Music librarians, other library users Extensible Multiple user interfaces Staged development

Variations2 Version 1.0 Features

    Infrastructure  Data/metadata repositories, authentication, logging Search and retrieval interface  Based on new data model Presentation/navigation of audio and scanned scores Bookmarking

Variations2 Version 1.0 Technical Environment

      Client and server developed in Java Windows and Mac OS X client platforms, Unix (AIX/Linux) server Audio streaming: QuickTime for Java, Darwin Streaming Server Database: IBM DB2, DB2 Text Information Extender Image compresssion: DjVu from AT&T Labs and Lizardtech XML/MARC/Z39.50 tools: Saxon, Xerces, Jafer, James

Variations2 1.0

System Architecture: Layer View

Metadata Audio Video Score images Score notation Repositories Search Image retrieval Notation retrieval Sound playback and others...

Access Components Search tool Sound player Score viewer Digital time liner Cataloging tools and others...

User Interface Components Applications General purpose library application Multimedia Music Theory Teaching application Oncourse Cataloging/ adminsistration application General user Theory student or instructor Non-major music student or instructor Cataloger or digitization technician

Variations2 1.0

Communications

Apache HTTP Server Darwin Streaming Server DB2 Database JDBC Variations2 Kerberos Server Kerberos IU Kerberos Server Variations2 Library Server RTSP/RTP HTTP Java RMI Variations2 Client

Demonstration

Usability

   Usability = ease of use +

usefulness

Established baseline    Usability test of existing Variations system Satisfaction study of Variations users Contextual inquiry Evaluation of usability of Variations2    Prototype interviews Usability tests of preliminary versions Pilot studies  Data gathering through satisfaction survey and automated usage logging

Future Versions

  Continuing 6-month development cycle    Version 2: Spring 2003 Version 3: Fall 2003 etc.

Features to be added include:   Support for music notation Support for additional image and audio formats   Support for new and emerging streaming technologies Support for video

Future Versions

 Features to be added (continued):          Support for supplemental recording materials (e.g., liner notes, booklets) Improved browsing interface User interface support for synchronized navigation and playback Instructional authoring, classroom presentation, and instructional delivery interfaces Structure diagramming/visualization tools (e.g., Digital Timeliner) Web browser interface OnCourse integration Access control based on intellectual property requirements Improved cataloging/administrative interface

Variations2 Version 2 Demo

Music Information Retrieval

 Areas of research:  Indexing and search of music content  Audio, MIDI, notation  Feature detection  Genre, style, form, instrumentation, …

Music IR: Inherent Difficulties in Music

   No analogue to ‘words’  No easy units on which to index or do synonym lookup, etc.

Problems of representation  Graphical vs. logical aspects of music Polyphony    Multiple voices, chords Cross-voice matching Music is not linear

Polyphony

Polyphony

 Almost all music IR work to date focused on pitch matching in

monophonic

music

Music IR: More problems

    Query specification   What would a musical query look like?

“Query by humming” Music perception  People do not always perceive pitch correctly What type of matching?

    Exact pitch or intervals Melodic contour Exact rhythm “Rhythmic contour” What to index?

  Entire works Themes

Music IR: Yet more problems

    Variety of users, probably with very different needs, including:   General public looking for pop music Music students, scholars However, no formal assessment of user needs No standard query sets, relevance judgements, or test collections Problems of copyright in building test collections

More information on Music IR

  ISMIR: International Symposium/Conference on Music IR     2000: Plymouth, Massachusetts, USA 2001: Bloomington, Indiana, USA 2002: Paris, France October 26-30, 2003: Baltimore, MD, USA http://www.ismir.net/

OAI: Open Archives Initiative

 Original problem: searching across e print archives  Distributed searching hard  e.g. Z39.50

 Varying search semantics, capabilities  Network, server problems  Solution: metadata harvesting

Metadata harvesting

  Extract metadata from various sources Build services on local copies of metadata all searching, browsing, etc. performed on the metadata here Individual repositories can still support direct user interaction

metadata harvested offline metadata harvested offline

user search for “Mozart” Service provider

metadata harvested offline

local copy of metadata

metadata harvested offline

. . .

Data providers

OAI roles

  Data Providers   Repositories of digital content and metadata Support harvesting of metadata via the OAI protocol Service Providers  Harvest metadata from data providers using the OAI protocol   Implement user interface to data  Usually for searching, but other services also possible Can be selective

OAI protocol

       Originally developed in 1999 (“Santa Fe Convention”) Original focus on E-prints Has grown into general metadata harvesting protocol 

OAI-PMH: OAI Protocol for Metadata Harvesting

Version 1.0: January 2001 Version 1.1: June 2001  Conform to XML Schema 1.0

Version 2.0: June 2002  Transition period through December 2002 Currently 53

registered

OAI data providers

OAI protocol

 Carried over HTTP  Requests: HTTP GET or POST  Responses encoded in XML  Format defined via XML schema  Metadata in simple (unqualified) Dublin Core (and potentially other formats)

Dublin Core elements

 Coverage  Description  Type  Relation  Source  Subject  Title  Contributor  Creator  Publisher  Rights  Date  Format  Identifier  Language

OAI verbs

Verb

Identify ListMetadataFormats ListSets ListIdentifiers ListRecords GetRecord

Function

description of archive metadata formats supported by archive sets defined by archive OAI unique ids contained in archive listing of N records listing of a single record

OAI resources

 Web site, mailing lists  Repository explorer  Data/service provider toolkits www.openarchives.org

Becoming an OAI data provider

    Make digital content available on web Translate metadata into Dublin Core   Crosswalks exist for MARC Can also make other formats available, e.g. MARC XML Choose a unique identifier system Set up OAI data provider server software   See tools list at www.openarchives.org

Depending on tool, uses its own database or operates over existing database

Other technical concerns for scores

   Areas for standardization/agreement    Within-score navigation  User interface, supporting metadata Image file format MARC-DC metadata crosswalk Not essential to OAI model, but enables more consistent user experience Packaging scores for exchange between libraries, e.g. for e-reserves, cooperative preservation  Can METS play a role?

Examples of OAI service providers

 UIUC Cultural Heritage Repository  http://dlc.grainger.uiuc.edu/  UMich OAIster  http://www.oaister.org/  RLG Cultural Materials  http://www.rlg.org/culturalres/  UCLA/JHU/IU Sheet Music Harvester  http://digital.library.ucla.edu/sheetmusic/