DDC on the Semantic Web: Publishing the German DDC 22 and

Download Report

Transcript DDC on the Semantic Web: Publishing the German DDC 22 and

Lars G. Svensson
DDC on the Semantic Web:
Publishing the German DDC 22 and the SWD as Linked Open
Data in MelvilSearch
1 | 28
| DDC on the Semantic Web | April 28, 2009
Library databases contain much
information on how to organise resources
DDC
Colon
PACS
GHBS
MeSH
RVK
STW
SWD
SAB
3 | 28
| DDC on the Semantic Web | April 28, 2009
LCSH
MSC
In Melvil we publish the German DDC 22
and the Schlagwortnormdatei (SWD)
DDC+SWD
=
true
4 | 28
| DDC on the Semantic Web | April 28, 2009
The problem is that the information in
Melvil is not computer understandable
5 | 28
| DDC on the Semantic Web | April 28, 2009
We want the information to be useful for
computers and people alike
partOf
6 | 28
| DDC on the Semantic Web | April 28, 2009
relatedTo
We can encode machine-understandable
information in the MelvilSearch pages
7 | 28
| DDC on the Semantic Web | April 28, 2009
With RDF we can describe the data in a
machine-understandable way
“With RDF we can describe […]”
this slide
“application/vnd.ms-powerpoint“
my presentation
me
“DDC on the Semantic Web: […]”
“Lars G. Svensson”
9 | 28
| DDC on the Semantic Web | April 28, 2009
RDF is about linking data together and give
the link a meaning
“RDF is about linking data […]”
this slide
“application/vnd.ms-powerpoint“
my presentation
me
“DDC on the Semantic Web: […]”
“Lars G. Svensson”
10 | 28
| DDC on the Semantic Web | April 28, 2009
http://www.w3.org/2007/Talks/0223-Bangalore-IH/files/SKOS_simpleThesaurus.png
SKOS is a vocabulary for describing how
thesauri and classifications are linked
internally
11 | 28
| DDC on the Semantic Web | April 28, 2009
We can encode RDF data in a web page
with RDFa
12 | 28
<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML+RDFa 1.0//EN"
http://www.w3.org/MarkUp/DTD/xhtml-rdfa-1.dtd>
<html xmlns=“http://www.w3.org/1999/xhtml”
xmlns:foaf=“http://xmlns.com/foaf/0.1/”
xmlns:dc=“http://purl.org/dc/elements/1.1/”
xml:lang="en">
<head>
<title>John's Home Page</title>
<base href="http://example.org/john-d/" />
<meta property="dc:creator" content="Jonathan Doe" />
</head>
<body>
<h1>John's Home Page</h1>
<p>My name is <span property="foaf:nick">John D</span> and I like
<a href="http://www.neubauten.org/" rel="foaf:interest"
xml:lang="de">Einstürzende Neubauten</a>.
</p>
<p>
My <span rel="foaf:interest" resource="urn:ISBN:0752820907">favourite
book</span> is the inspiring <span about="urn:ISBN:0752820907"><cite
property="dc:title">Weaving the Web</cite> by <span
property="dc:creator">Tim Berners-Lee</span></span>.
</p>
</body>
| DDC on the Semantic Web | April 28, 2009
</html>
Through MelvilSearch and the SWD browser
we can publish RDF data on the web
14 | 28
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN"
"http://www.w3.org/TR/html4/strict.dtd"> <html lang="de"> <head> <title>MelvilSearch:
Deutsche Nationalbibliothek</title> <!-- <meta http-equiv="content-type" content="text/html;
charset=UTF-8">--> <!-- we hide the stylesheet from NS4 and others --> <link rel="stylesheet"
href="/melvilsearch.css" media="all"> </head> <body> <div id="Header"> <img class="banner"
src="/pics/kossuth-default/MelvilSearch-banner-large.gif" height="60" width="468"
alt="MelvilSearch: Your gateway to classified information"> <div class="username"><p>Suche mit
der Dewey-Dezimalklassifikation</p> <p>Deutsche Nationalbibliothek</p> </div> <ul> <li
id="current"><span>Browsing</span></li> <li> <a href="/melvilsearch/impressum ?bs=dnbportal&amp;id=590733 ">Impressum</a> </li> <li> <a href="/melvilsearch/copyright ?bs=dnbportal&amp;id=590733 " lang="en">Copyright</a> </li> <li> <a href="/melvilsearch/help
?bs=dnb-portal&amp;id=590733 ">Hilfe</a> </li> </ul> </div> <div id="Content"> <form
action="melvilsearch" method="get" accept-charset="UTF-8"> <p> <label for="searchfield"
accesskey="S"> <span class="accesskey">S</span>uchbegriff oder <abbr title="Dewey Decimal
Classification">DDC</abbr>-Notation </label> <input id="searchfield" name="ri" type="text">
<input name="bs" type="hidden" value="dnb-portal"> <input type="submit" value="Suchen">
Titel erst ab 2006 </p> </form> <hr> <table id="MelvilSnippet"> <tr> <th
id="MelvilSnippetCaption">Thema</th> <th id="MelvilSnippetHits1">Treffer in dieser Klasse</th>
<th id="MelvilSnippetHits2">Treffer in dieser Klasse und ihren Unterklassen</th> </tr> <tr
class="melvilSnippetOdd"> <td class="melvilSnippetLevel0"><a href="melvilsearch?bs=dnbportal&amp;id=1409024">DDC-&#220;bersicht</a> </td> <td> 0 Titel </td> <td> 0 Titel
</td> </tr> <tr> <td class="melvilSnippetLevel1"><a href="melvilsearch?bs=dnbportal&amp;id=589829">Informatik, Informationswissenschaft, allgemeine Werke</a> </td>
<td> 0 Titel </td> <td> <a href="https://portal.dnb.de/opac.htm?method=simpleSearch&amp;query=ddc%3D0*"> 12244 Titel</a> </td> </tr>
<tr class="melvilSnippetOdd"> <td class="melvilSnippetLevel2"><a href="melvilsearch?bs=dnbportal&amp;id=590728">Verb&#228;nde, Organisationen, Museen</a> </td> <td> 0 Titel </td>
<td> <a href="https://portal.dnb.de/opac.htm?method=simpleSearch&amp;query=ddc%3D06*"> 218 Titel</a> </td> </tr>
<tr id="MelvilSnippetSelected" > <td class="melvilSnippetLevel3">Allgemeine Organisationen und
Museumswissenschaft <ul> <li>Für fächerübergreifende Werke über zwischenstaatliche
Organisationen siehe <a href="melvilsearch?bs=dnb-portal&amp;id=1409300">Die internationale
Gemeinschaft</a> .</li> </ul> </td> <td> <a href="https://portal.dnb.de/opac.htm?method=simpleSearch&amp;query=ddc%3D060"> 2 Titel</a> </td> <td> <a
href="https://portal.d-nb.de/opac.htm?method=simpleSearch&amp;query=ddc%3D060*"> 24
Titel</a> </td> </tr> <tr class="melvilSnippetOdd"> <td class="melvilSnippetLevel4"><a
href="melvilsearch?bs=dnb-portal&amp;id=590731">Spezielle Themen zu allgemeinen
Organisationen</a></td> <td> 0 Titel </td> <td> 0 Titel </td> </tr> <tr> <td
class="melvilSnippetLevel4"><a href="melvilsearch?bs=dnb-portal&amp;id=590734">Historische
und personenbezogene Behandlung</a></td> <td> <a href="https://portal.dnb.de/opac.htm?method=simpleSearch&amp;query=ddc%3D060.9"> 1 Titel</a> </td> <td> <a
href="https://portal.d-nb.de/opac.htm?method=simpleSearch&amp;query=ddc%3D060.9*"> 10
| DDC on the Semantic Web | April 28, 2009
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"> <html> <head>
rel="stylesheet" type="text/css" href="/swd.css"> <title>SWD: |s|Nonprofit-Organisatio
<link rel="stylesheet" type="text/css" href="/explorertree.css"> <script type="text/java
src="/events.js"></script> <script type="text/javascript" src="/cssutil.js"></script> <sc
type="text/javascript" src="/explorertree.js"></script> </head> <body> <div id="Head
action="/swd-search"> <label for="term">Ansetzungsform: </label><input type="text"
name="term" id="term"><input type="submit" value="Start"> </form> <hr> </div> <
id="Tree"> <ul> <li class="sibling"><span class="selected">|s|Nonprofit-Organisation<
class="explorertree"> <li> UB1&nbsp;<a href="/swd/040627144">|s|Verein</a>&nbsp
src="/pics/BT.gif" alt="Mehrere Oberbegriffe" title="Mehrere Oberbegriffe"><img src="/p
alt="Verwandte Begriffe vorhanden" title="Verwandte Begriffe vorhanden"> <ul> <li> UB
href="/swd/041420004">|s|Alpenverein</a>&nbsp; <ul> <li>UB3&nbsp;<a
href="/swd/043119867">|s|Bergsteigerverein</a>&nbsp;<img src="/pics/BT.gif" alt="M
Oberbegriffe" title="Mehrere Oberbegriffe"></li> </ul> <li> UB2&nbsp;<a
href="/swd/952909901">|s|Arbeiterinnenverein</a>&nbsp; <ul> <li>UB3&nbsp;<a
href="/swd/952910020">|s|Katholischer Arbeiterinnenverein</a>&nbsp;<img src="/pics
alt="Mehrere Oberbegriffe" title="Mehrere Oberbegriffe"><img src="/pics/RT.gif" alt="Ve
Begriffe vorhanden" title="Verwandte Begriffe vorhanden"></li> </ul> <li> UB2&nbsp;<
href="/swd/041428684">|s|Arbeiterverein</a>&nbsp; <ul> <li>UB3&nbsp;<a
href="/swd/042030072">|s|Evangelischer Arbeiterverein</a>&nbsp;<img src="/pics/BT
alt="Mehrere Oberbegriffe" title="Mehrere Oberbegriffe"><img src="/pics/RT.gif" alt="Ve
Begriffe vorhanden" title="Verwandte Begriffe vorhanden"></li> <li>UB3&nbsp;<a
href="/swd/952909979">|s|Katholischer Arbeiterverein</a>&nbsp;<img src="/pics/BT.g
alt="Mehrere Oberbegriffe" title="Mehrere Oberbegriffe"><img src="/pics/RT.gif" alt="Ve
Begriffe vorhanden" title="Verwandte Begriffe vorhanden"></li> </ul> <li> UB2&nbsp;<
href="/swd/042652804">|s|Auswanderungsverein</a>&nbsp; <li> UB2&nbsp;<a
href="/swd/960595597">|s|Behindertenverein</a>&nbsp; <li> UB2&nbsp;<a
href="/swd/940029960">|s|Betreuungsverein</a>&nbsp; <li> UB2&nbsp;<a
href="/swd/956162630">|s|Bildungsverein</a>&nbsp; <li> UB2&nbsp;<a
href="/swd/965091554">|s|Brieftaubenverein</a>&nbsp; <li> UB2&nbsp;<a
href="/swd/961160454">|s|Bürgerverein</a>&nbsp; <ul> <li>UB3&nbsp;<a
href="/swd/957127707">|s|Contrada</a>&nbsp;<img src="/pics/NT.gif" title="Unterbe
vorhanden" alt="Unterbegriffe vorhanden"></li> </ul> <li> UB2&nbsp;<a
href="/swd/04151291X">|s|Eingetragener Verein</a>&nbsp; <ul> <li>UB3&nbsp;<a
href="/swd/041378792">|s|Versicherungsverein auf Gegenseitigkeit</a>&nbsp;<img
src="/pics/NT.gif" title="Unterbegriffe vorhanden" alt="Unterbegriffe vorhanden"></li> <
UB2&nbsp;<a href="/swd/958440883">|s|Eisenbahnverein</a>&nbsp; <li> UB2&nbsp;<
href="/swd/041520556">|s|Elternverein</a>&nbsp;<img src="/pics/BT.gif" alt="Mehrer
Oberbegriffe" title="Mehrere Oberbegriffe"> <li> UB2&nbsp;<a
href="/swd/955631807">|s|Fastnachtsverein</a>&nbsp; <li> UB2&nbsp;<a
href="/swd/952082098">|s|Folkloregruppe</a>&nbsp;<img src="/pics/BT.gif" alt="Meh
Oberbegriffe" title="Mehrere Oberbegriffe"> <ul> <li>UB3&nbsp;<a
href="/swd/043110940">|s|Trachtenverein</a>&nbsp;</li> <li>UB3&nbsp;<a
We encode the DDC information in
MelvilSearch
<tr class="melvilSnippetOdd“ about=“http://dewey.info/06” skos:broader=“http://dewey.info/0”>
<td class="melvilSnippetLevel2">
<a href="melvilsearch?bs=dnb-portal&amp;id=590728“ property=“skos:altLabel”>
Verb&#228;nde, Organisationen, Museen</a>
</td>
<td>0 Titel</td>
<td>
<a href="https://portal.d-nb.de/opac.htm?method=simpleSearch&amp;query=ddc%3D06*">218 Titel</a
</td>
</tr>
<tr id="MelvilSnippetSelected" about=“http://dewey.info/060”
skos:broader=“http://dewey.info/06” skos:closeMatch=“http://d-nb.info/gnd/4293729-2”>
<td class="melvilSnippetLevel3“ property=“skos:altLabel”>
Allgemeine Organisationen und Museumswissenschaft
<ul>
<li>Für fächerübergreifende Werke über zwischenstaatliche Organisationen siehe
<a href="melvilsearch?bs=dnb-portal&amp;id=1409300">Die internationale Gemeinschaft</a> .
</li>
</ul>
</td>
<td>
<a href="https://portal.d-nb.de/opac.htm?method=simpleSearch&amp;query=ddc%3D060">2 Titel</a>
</td>
<td>
<a href="https://portal.d-nb.de/opac.htm?method=simpleSearch&amp;query=ddc%3D060*">24 Titel</a
</td>
15 | 28
| DDC on the Semantic Web | April 28, 2009
</tr>
We encode the SWD data in the SWD
browser
<ul>
<li class="sibling“ about=“http://d-nb.info/gnd/4293729-2” skos:closeMatch=“http://dewey.info/060”>
|s|<span class="selected“ property=“skos:prefLabel”>Nonprofit-Organisation</span>
<ul class="explorertree">
<li about=“http://d-nb.info/gnd/4062714-7”
skos:broader=“http://d-nb.info/gnd/4293729-2”>
UB1&nbsp;|s|<a href="/swd/040627144“ property=“skos:prefLabel”>Verein</a>
<ul>
<li about=“http://d-nb.info/gnd/4142000-7” skos:broader=“http://d-nb.info/gnd/4062714-7”>
UB2&nbsp;|s|<a href="/swd/041420004">Alpenverein</a>&nbsp;
<ul>
<li>UB3&nbsp;|s|<a href="/swd/043119867">Bergsteigerverein</a></li>
</ul>
[…]
16 | 28
| DDC on the Semantic Web | April 28, 2009
An RDFa-aware agent can collect the
semantic data
skos:altLabel
Verbände, Organisationen,
Museen
http://dewey.info/06
Nonprofit-Organisation
skos:prefLabel
http://d-nb.info/gnd/4293729-2
skos:broader
skos:closeMatch
Verein
skos:broader
http://dewey.info/060
skos:prefLabel
http://d-nb.info/gnd/4062714-7
skos:altLabel
Allgemeine Organisationen
und Museumswissenschaft
skos:broader
Alpenverein
skos:prefLabel
http://d-nb.info/gnd/4142000-7
17 | 28
| DDC on the Semantic Web | April 28, 2009
We can link the Dewey data to other data
19 | 28
| DDC on the Semantic Web | April 28, 2009
Linked Open Data is about connecting
datasets
– Use HTTP URIs so that people can look up those names.
– When someone looks up a URI, provide useful
information.
– Include links to other URIs. so that they can discover
more things.
20 | 28
| DDC on the Semantic Web | April 28, 2009
http://www.w3.org/DesignIssues/LinkedData.html
– Use URIs as names for things
You can navigate from one information
point to the next ones
DDC
LCSH
SWD
RAMEAU
??
LoC NAF
21 | 28
| DDC on the Semantic Web | April 28, 2009
PND
Machines can harvest the information of
interest by examining the relations
Let‘s see if I
can find some
pubs in Vienna
today…
22 | 28
| DDC on the Semantic Web | April 28, 2009
So is it useful to serve thesauri and
classifications on the Semantic Web?
23 | 28
| DDC on the Semantic Web | April 28, 2009
Thesauri and classifications must be useful
for humans and machines, but currently the
data isn’t machine-understandable
partOf
25 | 28
| DDC on the Semantic Web | April 28, 2009
relatedTo
We can encode machine-understandable
information in the MelvilSearch pages
26 | 28
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN" "http://www.w3.org/TR/html4/strict.dtd"> <html lang="de"> <head> <title>MelvilSearch: Deutsche
Nationalbibliothek</title> <!-- <meta http-equiv="content-type" content="text/html; charset=UTF-8">--> <!-- we hide the stylesheet from NS4 and others --> <link
rel="stylesheet" href="/melvilsearch.css" media="all"> </head> <body> <div id="Header"> <img class="banner" src="/pics/kossuth-default/MelvilSearch-banner-large.gif"
height="60" width="468" alt="MelvilSearch: Your gateway to classified information"> <div class="username"><p>Suche mit der Dewey-Dezimalklassifikation</p> <p>Deutsche
Nationalbibliothek</p> </div> <ul> <li id="current"><span>Browsing</span></li> <li> <a href="/melvilsearch/impressum ?bs=dnb-portal&amp;id=590733
">Impressum</a> </li> <li> <a href="/melvilsearch/copyright ?bs=dnb-portal&amp;id=590733 " lang="en">Copyright</a> </li> <li> <a href="/melvilsearch/help ?bs=dnbportal&amp;id=590733 ">Hilfe</a> </li> </ul> </div> <div id="Content"> <form action="melvilsearch" method="get" accept-charset="UTF-8"> <p> <label for="searchfield"
accesskey="S"> <span class="accesskey">S</span>uchbegriff oder <abbr title="Dewey Decimal Classification">DDC</abbr>-Notation </label> <input id="searchfield"
name="ri" type="text"> <input name="bs" type="hidden" value="dnb-portal"> <input type="submit" value="Suchen"> Titel erst ab 2006 </p> </form> <hr> <table
id="MelvilSnippet"> <tr> <th id="MelvilSnippetCaption">Thema</th> <th id="MelvilSnippetHits1">Treffer in dieser Klasse</th> <th id="MelvilSnippetHits2">Treffer in dieser
Klasse und ihren Unterklassen</th> </tr> <tr class="melvilSnippetOdd"> <td class="melvilSnippetLevel0"><a href="melvilsearch?bs=dnb-portal&amp;id=1409024">DDC&#220;bersicht</a> </td> <td> 0 Titel </td> <td> 0 Titel </td> </tr> <tr> <td class="melvilSnippetLevel1"><a href="melvilsearch?bs=dnbportal&amp;id=589829">Informatik, Informationswissenschaft, allgemeine Werke</a> </td> <td> 0 Titel </td> <td> <a href="https://portal.dnb.de/opac.htm?method=simpleSearch&amp;query=ddc%3D0*"> 12244 Titel</a> </td> </tr> <tr class="melvilSnippetOdd"> <td class="melvilSnippetLevel2"><a
href="melvilsearch?bs=dnb-portal&amp;id=590728">Verb&#228;nde, Organisationen, Museen</a> </td> <td> 0 Titel </td> <td> <a href="https://portal.dnb.de/opac.htm?method=simpleSearch&amp;query=ddc%3D06*"> 218 Titel</a> </td> </tr> <tr id="MelvilSnippetSelected" > <td class="melvilSnippetLevel3">Allgemeine
Organisationen und Museumswissenschaft <ul> <li>Für fächerübergreifende Werke über zwischenstaatliche Organisationen siehe <a href="melvilsearch?bs=dnbportal&amp;id=1409300">Die internationale Gemeinschaft</a> .</li> </ul> </td> <td> <a href="https://portal.dnb.de/opac.htm?method=simpleSearch&amp;query=ddc%3D060"> 2 Titel</a> </td> <td> <a href="https://portal.dnb.de/opac.htm?method=simpleSearch&amp;query=ddc%3D060*"> 24 Titel</a> </td> </tr> <tr class="melvilSnippetOdd"> <td class="melvilSnippetLevel4"><a
href="melvilsearch?bs=dnb-portal&amp;id=590731">Spezielle Themen zu allgemeinen Organisationen</a></td> <td> 0 Titel </td> <td> 0 Titel </td> </tr> <tr> <td
class="melvilSnippetLevel4"><a href="melvilsearch?bs=dnb-portal&amp;id=590734">Historische und personenbezogene Behandlung</a></td> <td> <a href="https://portal.dnb.de/opac.htm?method=simpleSearch&amp;query=ddc%3D060.9"> 1 Titel</a> </td> <td> <a href="https://portal.dnb.de/opac.htm?method=simpleSearch&amp;query=ddc%3D060.9*"> 10 Titel</a> </td> </tr> </table> <hr> <div id="pageinfo"> <p> <!-- <a href="#top"><img
src="pics/top.gif" height="25" width="25" alt="Seitenanfang"></a>--> <a href="mailto:[email protected]"><img src="pics/email.gif" height="25" width="25" alt="Email an
Lars G. Svensson"></a><a href="mailto:[email protected]">[email protected]</a> </p> <p><a href="http://www.ddc-deutsch.de/project/team.html#svensson">Lars G.
Svensson</a> / 3.3.2004</p> </div> </div> <div id="Footer"> <hr> <a href="http://validator.w3.org/check/referer"><img src="pics/valid-html401.gif" alt="Valid HTML
4.01!" height="31" width="88"></a> <a href="http://jigsaw.w3.org/css-validator/check/referer"><img src="pics/vcss.gif" alt="Valid CSS!" height="31" width="88"></a> <a
| DDC on the Semantic Web | April 28, 2009
Semantically interlinked vocabularies are
helpful both for information retrieval and
for automated indexing
instanceOf
instanceOf
Weimarer Republik
TimeSpan:1919-1933
instanceOf
Blandine Ebinger
Dreigroschenoper
instanceOf
Composer: Kurt Weill
Friedrich Hollaender
Marlene Dietrich
Lieder
eines armen Mädchens (Chansonzyklus)
27
| 28
| DDC on the Semantic Web | April 28, 2009
Time:1922 / isRelatedTo: Tim Fischer
Bertolt Brecht
http://upload.wikimedia.org/wikipedia/commons/4/4d/Tim_Berners-Lee_CP_2_head_crop.jpg
The Giant Global Graph: Where Dewey
meets Berners-Lee
28 | 28
| DDC on the Semantic Web | April 28, 2009