CrossAsia at the Staatsbibliothek zu Berlin an approach to organise access to research material in the field of Asian studies.

Download Report

Transcript CrossAsia at the Staatsbibliothek zu Berlin an approach to organise access to research material in the field of Asian studies.

CrossAsia at the
Staatsbibliothek zu Berlin
an approach to organise access to
research material in the field of Asian
studies
The Berlin State Library its East Asia
Department and CrossAsia
 Background – some facts on the library
and its responsibilities
 What is CrossAsia?

Special subject information guide

Collecting, cataloguing, archiving

Who can benefit of this work?
 Who needs CrossAsia? - Evaluation &
needs
Background
 Germany is historically and politically a decentralized
country with 16 federal states
 Material is non-German languages is collected on a
national level by some 20 major libraries in Germany
supported by „German Research Foundation“
 Within this framework Berlin State Library is in responsible
for:
law, slavic studies and literature, foreign newspapers,
topographic maps, cartography, East- and Southeast Asia
Matthias Kaun | CrossAsia | CEAL - Chicago 2009
Berlin State Library
 10 million books
 4.400 incunabula
 18.350 Occidental manuscripts
 40.000 Oriental manuscripts
 250.000 autographs
 67.000 music autographs
 1.400 estates and personal archives
 1 million maps and atlases
 32.000 subscription periodicals and monograph series
 180.000 early newspapers and 350 subscription newspapers
 Diverse electronic databases and full-texts
 2.3 million microfiches / microfilms
 13.5. million images in the picture archive
Matthias Kaun | CrossAsia | CEAL - Chicago 2009
East and Southeast Asia Collection
 Partly funded and supervised by German Research
Foundation
 A kind of „national collection“
 Responsible for collection building, processing,
acquisition, e-resources, digitising projects, technical
solutions
 Special inter-library-loan service „Blauer Leihverkehr“ open
to German and European institutions/users
Matthias Kaun | CrossAsia | CEAL - Chicago 2009
CrossAsia
 Co-operative project (start 2005 – end of funding 2010/11)
 Organising access to relevant material in each language and
form (print/digital)
 CrossAsia is the technical platform to offer and distribute
“our” content (our printed collection and more than 40
databases)
 Giving access to all kinds of metadata
Matthias Kaun | CrossAsia | CEAL - Chicago 2009
CrossAsia Partners
 German Institute of Global and Area Studies, Hamburg
 the State and University of Library Lower Saxony in
Göttingen
 the Institute for Chinese Studies at Heidelberg University,
Heidelberg
 the Department of Chinese and Korean Studies at Tübingen
University, Tübingen,
 the Department of Japanese Studies at Tübingen University,
Tübingen
 the Internet Guide for Chinese Studies, Leiden, the
Netherlands
 and CrossAsia in conjunction with the Asian Studies WWW
Virtual Library and the Nordic Institute of Asian Studies
Matthias Kaun | CrossAsia | CEAL - Chicago 2009
CrossAsia
Matthias Kaun | CrossAsia | CEAL - Chicago 2009
CrossAsia – E-Resources
Matthias Kaun | CrossAsia | CEAL - Chicago 2009
Online Guide East Asia – special subject information
Matthias Kaun | CrossAsia | CEAL - Chicago 2009
Online Guide East Asia - Workflow System
Matthias Kaun | CrossAsia | CEAL - Chicago 2009
CrossAsia Digital Archive as part of OGEA
Input OGEA Data
has to be archived ?
NO
1. set archive flag
2. set interval
3. make archive URL
YES
1. pass data to Heritrix
2. activation per CRON
3. saving *.arc.gz
4. indexing by NutchWAX
generating index
search and present with NutchWAX
and Wayback-Machine
Matthias Kaun | CrossAsia | CEAL - Chicago 2009
eGovernment-Archiv CrossAsia
 Simple and advanced search
Matthias Kaun | CrossAsia | CEAL - Chicago 2009
eGovernment-Archiv CrossAsia - Result
Matthias Kaun | CrossAsia | CEAL - Chicago 2009
Building search applications with lucene and nutch
1. We inject a set of URLs into
the web database
(crawldb).
inject
2. We fetch the web pages:
1.
To fetch, we first
generate a fetch-list
from the database.
This generates a fetchlist in a new segment.
2.
Now we run the fetcher
on this segment.
3.
We have to update the
database with the
results of the fetch.
3. Now we can build a new
index of resources
catalogued in the Online
Guide East Asia.
Matthias Kaun | CrossAsia | CEAL - Chicago 2009
fetching
1.
generate
2.
fetch
3.
update
indexing
OGEA and Search Engine technology
Matthias Kaun | CrossAsia | CEAL - Chicago 2009
CrossAsia Questions
 Do we need to or did we already reinvent the wheel? We
all know the Asian Studies WWW VL, the Internet Guide
for Chinese Studies or the Portal to Asian Internet
Resources etc.?
 Will we be able to continue our work to create new records
into the central system or do we have to stop when the
funding has been expired?
 Is it a good idea to archive non-permanent material and
how can we solve the copyright issue?
 Is this work in google/rss/Twitter/del.icio.us etc-times
really necessary?
Matthias Kaun | CrossAsia | CEAL - Chicago 2009
Outlook
 Printed monographs, series, and newspapers with access to
it via interlibrary-loan services 
 Access to our own digitised collection which will start in
summer 2009 (visible in 2010 with 2,5 Million pages in 2012)
 Electronic databases like full-text databases, fact
databases, E-journals, EBooks, full-text statistics, which are
subject to licence with access for German users 
 Information of other subject collections and projects like
resources from the National Library of China and National
Central in Taibei, National Diet Library in Japan, or activities
in for example Laos National Library  (to be continued)
 Special Subject Information  (OGEA to be continued)
 Archiving web-resources (started )
Matthias Kaun | CrossAsia | CEAL - Chicago 2009
Organising Access
central system
& central index
1.
2.
3.
4.
5.
6.
EACAT
Main Libray Catalogue
OGEA
E-ressource 1 (CAJ)
E-ressource 2 (Apabi)
…
Search
interface /
OPAC
for users
free arrangement
of metadata
sets /collections
digital collection
workflow system
cataloguing system
NACSIS/NII
E-ressource 1 (CAJ – journal level &
article level
Online Guide East Asia (incl. Digital
Archive CrossAsia)
E-ressource 2 (Apabi – different
collections)
other resources …
Matthias Kaun | CrossAsia | CEAL - Chicago 2009
Thank you very much!
Matthias Kaun
Berlin State Library
East Asia Department
http://ead.staatsbibliothek-berlin.de
http://crossasia.org
Matthias Kaun | CrossAsia | CEAL - Chicago 2009