[TITLE] - National Library of Estonia (Eesti Rahvusraamatukogu)


Sirje Nilbe
Tiiu Tarkpea








Estonian Subject Thesaurus (EMS)
Thesaurus and the online catalogues of
libraries
Databases of articles
The Estonian National Bibliography Database
Digital Archive DIGAR
University of Tartu Digital Archive on DSpace
Other potential users
Conclusions




A controlled vocabulary of universal coverage
for information retrieval and indexing
Introduced under this name in May
2009
Predecessors: the Thesaurus of the University
of Tartu Library (1994) and the Estonian
Universal Thesaurus (1999)
Merging project carried out 2007–2009 =>
Estonian Subject Thesaurus






Co-managed by ELNET Consortium, the National
Library of Estonia and The University of Tartu
Library
EMS is developed by a manager, a programmer and
9 authorised editors, all of whom contribute
part-time
Free for Estonian libraries and other institutions
Contains over 54 000 terms (36 500 preferred,
17 500 non-preferred)
The user interface is in Estonian and in English
EMS mostly serves as a tool for post-coordinated
indexing




to search by each term in the online catalogue
ESTER, in the database of Estonian articles ISE, in
the database of the Estonian National Bibliography and
in Google
to subscribe to a current awareness service for new,
changed and deleted subject terms
to export subject term records in MARC21 format
for use in other systems
to use a word retrieval web service for machine-to-machine
interaction, which returns machine-readable MARC21,
human-readable MARC21 or MARC XML
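As an illustration of how another system might consume the MARC XML output, here is a minimal Python sketch. It assumes the records follow conventional MARC21 authority tagging (150 for the preferred topical term, 450 for non-preferred terms, 550 for related terms); the slides do not confirm the exact field usage in EMS, and the sample record and its terms are invented for the example.

```python
import xml.etree.ElementTree as ET

# Namespace of the MARC XML ("MARC21 slim") schema.
MARC_NS = {"marc": "http://www.loc.gov/MARC21/slim"}

# Hypothetical sample record; the terms are illustrative, not taken from EMS.
SAMPLE = """<record xmlns="http://www.loc.gov/MARC21/slim">
  <datafield tag="150" ind1=" " ind2=" ">
    <subfield code="a">raamatukogud</subfield>
  </datafield>
  <datafield tag="450" ind1=" " ind2=" ">
    <subfield code="a">biblioteegid</subfield>
  </datafield>
</record>"""


def subfield_a(record, tag):
    """Return all $a values of the datafields with the given tag."""
    path = f"marc:datafield[@tag='{tag}']/marc:subfield[@code='a']"
    return [sf.text for sf in record.findall(path, MARC_NS)]


def parse_term(xml_text):
    """Extract preferred, non-preferred and related terms from one record."""
    record = ET.fromstring(xml_text)
    preferred = subfield_a(record, "150")
    return {
        "preferred": preferred[0] if preferred else None,
        "non_preferred": subfield_a(record, "450"),
        "related": subfield_a(record, "550"),
    }


term = parse_term(SAMPLE)
print(term)
```

Only the standard library is used here; a production client would more likely rely on a dedicated MARC library.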








Online catalogue ESTER is the shared catalogue of the ELNET
Consortium member libraries
13 libraries, 3 million bibliographic records
Software: integrated library system Millennium from the US company
Innovative Interfaces Inc.
Facilities for development of authority files in the library system are
poor
Since May 2009 the subject authority records are no longer compiled
manually
The authority data are updated twice a month on the basis of
extensions and corrections in the EMS database
A MARC21 file is exported from the thesaurus system and loaded
into the library system
In the subject index of the catalogue, cross references from
authority records are arranged between data indexed from
bibliographic records.
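The interfiling of cross references with indexed headings can be sketched as follows. This is a simplified illustration, not how Millennium builds its index: the function name, the data shapes, the terms and the record counts are all assumptions made for the example, and plain string sorting is used instead of proper Estonian collation.

```python
def build_subject_index(bib_headings, see_refs):
    """Merge headings from bibliographic records with see-references.

    bib_headings: {heading: number of records} (hypothetical shape)
    see_refs:     {non-preferred term: preferred term} from authority records
    """
    entries = []
    for heading, count in bib_headings.items():
        entries.append((heading.casefold(), f"{heading} ({count})"))
    for variant, preferred in see_refs.items():
        # Show a reference only if its target is actually used in the catalogue.
        if preferred in bib_headings:
            entries.append((variant.casefold(), f"{variant} -> see {preferred}"))
    # Sort on the case-folded key so references land between the headings.
    return [label for _, label in sorted(entries)]


index = build_subject_index(
    {"raamatukogud": 120, "arhiivid": 45},
    {"biblioteegid": "raamatukogud"},
)
for line in index:
    print(line)
```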




Compiled mostly by copy cataloguing from
ESTER
Do not support authority control
Special, public and school libraries often enrich
their records with additional subject terms
The records sometimes contain synonymous
descriptors and non-preferred terms to
increase the number of access points




A result of cooperation between 12 libraries in the
ELNET Consortium (2009)
Maintained in the additional module Reference
Database of the library system Millennium, so its
functionality and appearance resemble those of the
e-catalogue ESTER
Contains articles from newspapers, magazines and
journals, serial publications and anthologies
Full texts are accessible in free digital archives and Web
publications




Present standard for subject indexing is the Estonian
Subject Thesaurus
The authority data are updated twice a month
The old separate databases had their own local
vocabularies; harmonising their records with EMS
will take several years
EMS doesn't always meet the needs of subject
indexing of periodicals (local history in local
newspapers for example)
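One step of such harmonisation could look like the sketch below: local descriptors that match a non-preferred EMS term are replaced by the preferred term, and anything unknown to EMS is set aside for manual review. The function name, the data structures and the sample terms are hypothetical; the slides do not describe the actual procedure.

```python
def harmonise(descriptors, preferred, use_for):
    """Map local descriptors onto EMS preferred terms.

    preferred: set of preferred terms (lower case)
    use_for:   {non-preferred term: preferred term} (lower case)
    Returns (harmonised terms, descriptors needing manual review).
    """
    kept, unresolved = [], []
    for descriptor in descriptors:
        key = descriptor.strip().lower()
        if key in preferred:
            target = key
        elif key in use_for:
            target = use_for[key]
        else:
            # Typical for local-history terms that EMS does not cover.
            unresolved.append(descriptor)
            continue
        if target not in kept:  # avoid duplicate access points
            kept.append(target)
    return kept, unresolved


kept, unresolved = harmonise(
    ["Biblioteegid", "raamatukogud", "kohalik laat"],
    preferred={"raamatukogud"},
    use_for={"biblioteegid": "raamatukogud"},
)
print(kept, unresolved)
```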







ERB is compiled and managed by the National
Library of Estonia
Software: developed locally on the basis of MySQL
Items are catalogued, classified and indexed in
Millennium, exported and loaded into ERB
No changes can be made in the ERB database
Subject search can be carried out by subject terms,
UDC numbers and keywords (title and subject
fields)
ERB does not contain authority data
A user can click on the EMS logo to find terms, or start
in EMS and find ERB records from there


DIGAR is the digital archive of the National Library of Estonia
The archive preserves:
◦ Estonian online publications issued on the Internet
◦ digital copies of Estonian electronic publications issued on physical
carriers
◦ digital copies of publications on analog carriers
◦ print files of Estonian publications




Objects stored in DIGAR have been described in ESTER or ISE
Records in these databases and full texts in DIGAR are linked
The metadata of DIGAR is mostly imported from ESTER, incl.
subject descriptors
The metadata is supplemented by the English-language
equivalents of the descriptors from EMS
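Supplementing imported metadata with English equivalents amounts to a lookup against the thesaurus. A minimal sketch, assuming the metadata is a dictionary with a hypothetical `subjects_et` field and that the Estonian-to-English mapping has already been extracted from EMS (neither detail comes from the slides):

```python
def add_english_equivalents(record, equivalents):
    """Return a copy of the record with English subject terms added.

    record:      metadata dict with Estonian descriptors in "subjects_et"
    equivalents: {Estonian descriptor: English equivalent}
    """
    enriched = dict(record)
    enriched["subjects_en"] = [
        equivalents[term]
        for term in record.get("subjects_et", [])
        if term in equivalents  # descriptors without an equivalent are skipped
    ]
    return enriched


enriched = add_english_equivalents(
    {"title": "Näidis", "subjects_et": ["raamatukogud", "arhiivid"]},
    {"raamatukogud": "libraries"},
)
print(enriched["subjects_en"])
```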



The aim of the repository is to collect, preserve and make
available the digital information of the university faculties,
departments and institutes
Publicly available via Internet search engines and accessible
through OAI data exchange protocols
The repository preserves:
◦ print files of doctoral theses published by the University of Tartu
◦ original files created in the university, e.g. conference papers, articles,
reports, textbooks, personal archives etc.
◦ digital copies of analog carriers
◦ online publications issued on the Internet
◦ digital copies of electronic publications issued on physical carriers




Records in DSpace are searchable via the online
catalogue ESTER, which links to the full text in the
repository
Additionally, English equivalents of subject terms
are added manually
If so desired, one can add subject terms or
keywords in any language (preferably in the
language of original paper)
It is possible to define subject term patterns that are
added by default to a collection





Other knowledge institutions (museums and
archives) are starting to develop their
information systems
They need controlled vocabularies to describe
their specific records and artefacts
EMS seems to be too extensive for them, but
lacking many specific terms at the same time
EMS as a language and terminology resource
EMS and EuroTermBank









EMS tries to consider the needs of all interest groups
Includes a large number of terms for specific and exhaustive
indexing
Seems to be too extensive for archives and museums
EMS is available as a web database and as authority records in
MARC21 format
Authority control is supported by the online catalogue ESTER
and article database ISE
The other systems do not support authority records, thus EMS
loses much of its advantage as a thesaurus
In the digital collections, metadata often include English-language
equivalents
Further steps: conformity to semantic web standards like
SKOS and Linked Open Data
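To give an idea of what SKOS conformity could involve, the sketch below serialises one thesaurus term as a `skos:Concept` in Turtle. The SKOS namespace and properties are standard; the base URI, the term identifier, the function name and the sample labels are assumptions for illustration only and do not describe a planned EMS design.

```python
PREFIXES = "@prefix skos: <http://www.w3.org/2004/02/skos/core#> .\n\n"


def _literal(text, lang):
    """Format a language-tagged Turtle literal, escaping backslashes and quotes."""
    escaped = text.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"@{lang}'


def to_skos_turtle(term_id, pref_et, pref_en=None, alt_et=(),
                   base="http://example.org/ems/"):  # hypothetical base URI
    """Serialise one term: preferred labels plus non-preferred terms as altLabel."""
    statements = ["a skos:Concept", f"skos:prefLabel {_literal(pref_et, 'et')}"]
    if pref_en:
        statements.append(f"skos:prefLabel {_literal(pref_en, 'en')}")
    for alt in alt_et:
        statements.append(f"skos:altLabel {_literal(alt, 'et')}")
    body = " ;\n    ".join(statements)
    return f"{PREFIXES}<{base}{term_id}>\n    {body} .\n"


ttl = to_skos_turtle("t0001", "raamatukogud", "libraries", ["biblioteegid"])
print(ttl)
```

Preferred terms map naturally to `skos:prefLabel` (one per language) and non-preferred terms to `skos:altLabel`; hierarchical and associative relations would use `skos:broader`, `skos:narrower` and `skos:related`.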