

Linked Data and Semantic Web Technologies for Aid Transparency
Aman Shakya, PhD (National Institute of Informatics, Japan)
Asst. Professor (Institute of Engineering, Nepal)
IT Consultant
http://nepalaid.yipl.com.np/

Outline
• Introduction
  ◦ Semantic Web
  ◦ Linked Open Data
• Aid Transparency Open Data
  ◦ IATI standard XML data
  ◦ Applications
• IATI as Linked Data and Applications
• Tools and Platforms
2

Introduction 3

WWW - World Wide Web page URL: http://nepalaid.yipl.com.np

?

4

WWW – Web of Documents
[Diagram: web pages identified by URLs (http://www.aidtransparency.net/, http://nepalaid.yipl.com.np) and connected by hyperlinks]
5

Semantic Web
• Sir Tim Berners-Lee (inventor of the WWW):
  “... an extension of the current web in which information is given well-defined meaning, better enabling computers and people to work in cooperation ...” Scientific American (2001)
• Allows data to be shared and reused across application, enterprise, and community boundaries
• Collaborative effort led by W3C with participation from a large number of researchers and industrial partners

6

The Semantic Web
1. Web of Structured Data
2. Machine-understandable semantics (Semantics = Meaning)
3. Data modeling and knowledge representation
4. Interoperable data standards
7

Web of Data: GGG (Giant Global Graph)
Meaningful data nodes and data links
Source: http://www.w3.org/2006/Talks/0123-sb-W3C-ThingsWeb/
8

URI – Uniform Resource Identifier
• Yangtze (URI: http://www.china.org/geography/rivers#Yangtze)
  ◦ is a: River
  ◦ has length: 6300 km
  ◦ located in: China (URI: http://dbpedia.org/resource/China)
• China
  ◦ has capital: Beijing (URI: http://dbpedia.org/resource/Beijing)
9

RDF - Resource Description Framework
• Triples: (subject, predicate, object) or (resource, property, value)
• Graph data model: Subject → Predicate → Object
10
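As a rough illustration of the triple model, the statements from the Yangtze example can be held as plain Python tuples. The `ex:` property names (`ex:hasLength`, `ex:locatedIn`, `ex:hasCapital`) and the `ex:River` class are made up for this sketch and do not come from any published vocabulary.

```python
# A graph is just a set of (subject, predicate, object) triples.
YANGTZE = "http://www.china.org/geography/rivers#Yangtze"
CHINA = "http://dbpedia.org/resource/China"
BEIJING = "http://dbpedia.org/resource/Beijing"

# Property and class names prefixed with "ex:" are hypothetical.
triples = {
    (YANGTZE, "rdf:type", "ex:River"),
    (YANGTZE, "ex:hasLength", "6300 km"),
    (YANGTZE, "ex:locatedIn", CHINA),
    (CHINA, "ex:hasCapital", BEIJING),
}


def objects(graph, subject, predicate):
    """Return every object linked from `subject` by `predicate`."""
    return {o for s, p, o in graph if s == subject and p == predicate}


# Follow two links: the capital of the country the river is located in.
country = next(iter(objects(triples, YANGTZE, "ex:locatedIn")))
capital = objects(triples, country, "ex:hasCapital")
print(capital)
```

Because the object of one triple (China) is the subject of another, following links across triples is what turns a list of statements into a graph.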

Syntax (computer format) for RDF
• RDF/XML
• N-Triples, Turtle, etc.
• Embedded formats
  ◦ RDFa, microformats, HTML5
• JSON (lightweight structured format)
• GRDDL
11

RDF/XML Example
  <rdf:Description rdf:about="http://www.china.org/geography/rivers#Yangtze"
                   xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
                   xmlns="http://www.geodesy.org/river#">
    <length>6300 kilometers</length>
    <startingLocation>western China's Qinghai-Tibet Plateau</startingLocation>
    <endingLocation>East China Sea</endingLocation>
  </rdf:Description>
12
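A minimal sketch of how such a description could be read back into triples, using only Python's standard XML parser. The property element names (`length`, `startingLocation`, `endingLocation`) are assumptions made for this example, and this is not a general RDF/XML parser; a dedicated RDF library would normally be used for that.

```python
import xml.etree.ElementTree as ET

RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"

# Element names below (length, startingLocation, endingLocation) are assumed.
doc = """<rdf:Description
    rdf:about="http://www.china.org/geography/rivers#Yangtze"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns="http://www.geodesy.org/river#">
  <length>6300 kilometers</length>
  <startingLocation>western China's Qinghai-Tibet Plateau</startingLocation>
  <endingLocation>East China Sea</endingLocation>
</rdf:Description>"""

root = ET.fromstring(doc)
subject = root.get("{%s}about" % RDF_NS)

# ElementTree spells a qualified name as "{namespace}local"; joining the two
# parts gives the property URI.
triples = []
for child in root:
    namespace, local = child.tag[1:].split("}")
    triples.append((subject, namespace + local, child.text))

for triple in triples:
    print(triple)
```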

RDFa Example file 13

Semantic Web (explained)
• Machine-understandable semantics by
  ◦ Data Modeling & Knowledge Representation
• Ontology
  “... an explicit specification of a conceptualization” Gruber (1993)
  ◦ Modeling of concepts and relationships that exist in the area of interest
14

Example – Activities Ontology http://intelleo.eu/ontologies/activities/spec/ 15

Travel Ontology 16

Ontology Languages
• RDF Schema
  ◦ A vocabulary description language
• OWL - Web Ontology Language
  ◦ A richer ontology language
  ◦ Based on Description Logic
17

Ontologies
• Domain ontology
  ◦ Open Biomedical Ontologies (OBO)
  ◦ CIDOC CRM - an ontology for cultural heritage information
  … etc.
• Lightweight ontology
  ◦ Vocabularies
  ◦ Taxonomy
  ◦ Thesaurus, etc.
18

SKOS
• Simple Knowledge Organization System
  ◦ Used for taxonomies and thesauri
19

Semantic Web Layer Cake
[Figure: the Semantic Web layer cake]
20

Linked Data (Web of Data on the Web) 21

Linked Data
Publishing and interlinking structured data on the Semantic Web
4 basic principles of Linked Data:
1. Use URIs to identify things.
2. Use HTTP URIs so that we can locate and look up (dereference) these things.
3. Provide useful information when a URI is dereferenced.
4. Include links to related URIs to improve information discovery on the Web.
22
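The "look up (dereference)" step can be sketched with the Python standard library: an HTTP GET on the URI with an `Accept` header asking for RDF instead of an HTML page. This assumes the server supports content negotiation for `application/rdf+xml`; `build_lookup` and `dereference` are names invented for this sketch, and `dereference` needs network access, so it is defined but not called here.

```python
import urllib.request


def build_lookup(uri, media_type="application/rdf+xml"):
    """Prepare an HTTP GET that asks for RDF rather than an HTML page."""
    return urllib.request.Request(uri, headers={"Accept": media_type})


def dereference(uri):
    """Send the request and return (content type, body). Needs network access."""
    with urllib.request.urlopen(build_lookup(uri), timeout=10) as response:
        return response.headers.get("Content-Type"), response.read()


# Only build the request here; nothing is sent over the network.
request = build_lookup("http://dbpedia.org/resource/Beijing")
print(request.get_method(), request.full_url, request.get_header("Accept"))
```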

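Linked Open Data Sources on the Web
[Figure: Linking Open Data cloud diagram, 2007]
23

[Figure: Linking Open Data cloud diagram, 2009]
24

[Figure: Linking Open Data cloud diagram, 2010]
Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/
25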

Querying Semantic Web Data
• SPARQL: RDF query language
• SPARQL endpoint: query interface
• Example SPARQL query:
    PREFIX foaf: <http://xmlns.com/foaf/0.1/>
    SELECT ?name ?email
    WHERE {
      ?person a foaf:Person .
      ?person foaf:name ?name .
      ?person foaf:mbox ?email .
    }
26
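To show what the example query asks for, here is a small Python sketch that evaluates the same pattern over a handful of in-memory triples. The sample data (`ex:alice`, `ex:bob`) is invented, and the nested loops are only a stand-in for a real SPARQL processor.

```python
FOAF = "http://xmlns.com/foaf/0.1/"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"

# Made-up sample data: one complete person, one without a mailbox.
graph = [
    ("ex:alice", RDF_TYPE, FOAF + "Person"),
    ("ex:alice", FOAF + "name", "Alice"),
    ("ex:alice", FOAF + "mbox", "mailto:alice@example.org"),
    ("ex:bob", RDF_TYPE, FOAF + "Person"),
    ("ex:bob", FOAF + "name", "Bob"),
]


def values(subject, predicate):
    """All objects for a given subject and predicate."""
    return [o for s, p, o in graph if s == subject and p == predicate]


def select_name_email():
    """Mimic the query: every foaf:Person with both a name and a mailbox."""
    rows = []
    for person, predicate, obj in graph:
        if predicate == RDF_TYPE and obj == FOAF + "Person":
            for name in values(person, FOAF + "name"):
                for email in values(person, FOAF + "mbox"):
                    rows.append((name, email))
    return rows


print(select_name_email())
```

Bob is left out of the result because all three triple patterns must match for the same `?person`.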

Beautiful Applications over Raw Data Source: Berners-Lee, TED 2009 27

http://www.data.gov

28

APPS SHOWCASE 29

Employment Market Explorer 30

Linked Open Data Rating (Berners-Lee, 2010)
★ Available on the web (whatever format), but with an open licence
★★ Available as machine-readable structured data (e.g. Excel instead of an image scan of a table)
★★★ Non-proprietary format (e.g. CSV instead of Excel)
★★★★ All the above, plus: use open standards from W3C (RDF and SPARQL) to identify things
★★★★★ All the above, plus: link your data to other people’s data
31
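The cumulative nature of the rating can be sketched as a small function: each star counts only if every lower level is also met. The function and its parameter names are made up for illustration.

```python
def star_rating(on_web_open_licence, machine_readable, open_format,
                uses_rdf_uris, linked_to_other_data):
    """Count stars; each level only counts if every lower level is met."""
    stars = 0
    for criterion in (on_web_open_licence, machine_readable, open_format,
                      uses_rdf_uris, linked_to_other_data):
        if not criterion:
            break
        stars += 1
    return stars


# Openly licensed XML on the web: structured and non-proprietary, no RDF yet.
print(star_rating(True, True, True, False, False))  # prints 3
```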

Aid Transparency Open Data 32

Aid Transparency Open Data
• The IATI Standard
  ◦ Activity Standard
  ◦ Organization Standard
  ◦ IATI Codelists
• Available as IATI XML schemas
33

IATI Activity Standard
• Basic activity information
• Participating organizations
• Geopolitical information
• Classifications
• Financial details
• Related documents, activities
• Performance
34
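A minimal sketch of reading activity data with Python's standard XML parser. The sample record is invented and simplified, loosely modelled on the kinds of fields listed above; it has not been validated against the IATI schema, and `summarize` is a name chosen for this example.

```python
import xml.etree.ElementTree as ET

# Simplified, made-up activity record; not validated against the IATI schema.
SAMPLE = """<iati-activities>
  <iati-activity>
    <iati-identifier>XX-EXAMPLE-001</iati-identifier>
    <title>Rural road improvement</title>
    <participating-org role="Funding">Example Donor</participating-org>
    <recipient-country code="NP">Nepal</recipient-country>
    <transaction><value currency="USD">25000</value></transaction>
    <transaction><value currency="USD">15000</value></transaction>
  </iati-activity>
</iati-activities>"""


def summarize(xml_text):
    """Return one dict per activity with its id, title, country and total value."""
    rows = []
    for activity in ET.fromstring(xml_text).iter("iati-activity"):
        rows.append({
            "id": activity.findtext("iati-identifier"),
            "title": activity.findtext("title"),
            "country": activity.find("recipient-country").get("code"),
            "total": sum(float(v.text) for v in activity.iter("value")),
        })
    return rows


print(summarize(SAMPLE))
```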

IATI Organization Standard 35

IATI Codelists 36

IATI Data Publishing
• Organisations publish IATI standard files on their own website
• Submit a link to the central IATI Registry
• Use IATI Registry to find, use and stay updated on data from different publishers
• Whenever an organisation updates/adds IATI files, they let the registry know
37

IATI Registry 38

IATI Data of Nepal 39

• Making aid more transparent
  ◦ High-quality, timely information on aid flow
• Actively involved in supporting the IATI
40

41

Map of Aid Activities 42

IATI Aid Activity Tables 43

• Data Explorer
• CSV Transformations
• Query Builder
• XPath Endpoint
• XSLT Transformations
• Lists service
• XQuery service
44

45

Data Explorer – Table View Faceted Search / Browsing 46

Data Explorer – Timeline View 47

Developed using … SIMILE project (MIT) http://simile.mit.edu/ 48

http://iatidata.heroku.com/ 49

50

IATI Data as Linked Data 51

IATI Open Data Rating
★★★ Machine-readable data in XML format
★★★★ Towards RDF Linked Data
52

IATI Data as Linked Data
• What is happening already?
  ◦ Code lists as Linked Data
    • API access to the code lists
    • Returns RDF linked data, SKOS concepts
  ◦ Sketching Linked Data vocabularies
  ◦ Support for linked data in the IATI standard
ref: http://www.aidinfolabs.org/archives/384 (Tim Davies)
53
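As a hedged sketch of what "code lists as SKOS concepts" could look like, the function below writes one code-list entry as N-Triples. The `http://example.org/iati/codelist/` namespace is hypothetical and is not the URI scheme used by any actual IATI service; the SKOS terms (`Concept`, `inScheme`, `notation`, `prefLabel`) are from the SKOS vocabulary. Labels are not escaped, so this only works for simple text.

```python
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
SKOS = "http://www.w3.org/2004/02/skos/core#"
BASE = "http://example.org/iati/codelist/"  # hypothetical namespace


def concept_to_ntriples(codelist, code, label):
    """Describe one code-list entry as a SKOS concept in N-Triples."""
    concept = "<%s%s/%s>" % (BASE, codelist, code)
    scheme = "<%s%s>" % (BASE, codelist)
    return [
        "%s <%s> <%sConcept> ." % (concept, RDF_TYPE, SKOS),
        "%s <%sinScheme> %s ." % (concept, SKOS, scheme),
        '%s <%snotation> "%s" .' % (concept, SKOS, code),
        '%s <%sprefLabel> "%s"@en .' % (concept, SKOS, label),
    ]


for line in concept_to_ntriples("Country", "NP", "Nepal"):
    print(line)
```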

Tim Davies Blog: http://www.aidinfolabs.org/archives/582 http://beta.kasabi.com/dataset/iati 54

XSLT - Extensible Stylesheet Language Transformations Source: http://www.w3.org/Consortium/Offices/Presentations/XSLT_XPATH/ 55

Further Possibilities 56

Further possibilities
• Mix-and-match existing vocabularies
• Perhaps
  ◦ Payment Ontology
  ◦ Business Ontologies, Semantic XBRL
  ◦ Academic Research Project Funding Ontology (ARPFO)
  ◦ DOAP (Description of a Project)
  ◦ …….
• Generalized ‘Resource Flow’ ontology
57

Payments Ontology

http://data.gov.uk/resources/payments 58

XML to RDF Linked Data
• XML Semantics Reuse Methodology (Roberto García)
  ◦ XML Schema to OWL
  ◦ XML to RDF
• ReDeFer
  ◦ Convert XML to RDF/OWL
59
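The general idea of an XML-to-RDF mapping can be sketched in a few lines of Python. This is a hand-written stand-in, not the XML Semantics Reuse Methodology or ReDeFer: each child element of an activity simply becomes one triple. The sample record, the vocabulary namespace and the activity URI pattern are all hypothetical.

```python
import xml.etree.ElementTree as ET

VOCAB = "http://example.org/iati/vocab#"        # hypothetical vocabulary
ACTIVITY = "http://example.org/iati/activity/"  # hypothetical URI pattern

# Simplified, made-up record; not validated against the IATI schema.
SAMPLE = """<iati-activity>
  <iati-identifier>XX-EXAMPLE-001</iati-identifier>
  <title>Rural road improvement</title>
  <recipient-country code="NP">Nepal</recipient-country>
</iati-activity>"""


def activity_to_ntriples(xml_text):
    """Map each child element to one N-Triples line about the activity."""
    activity = ET.fromstring(xml_text)
    subject = "<%s%s>" % (ACTIVITY, activity.findtext("iati-identifier"))
    lines = []
    for child in activity:
        if child.tag == "iati-identifier":
            continue  # already used to mint the subject URI
        value = child.get("code") or child.text
        lines.append('%s <%s%s> "%s" .' % (subject, VOCAB, child.tag, value))
    return lines


print("\n".join(activity_to_ntriples(SAMPLE)))
```

Minting the subject URI from the activity identifier is one possible design choice; it gives every activity a stable name that other datasets can link to.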

GRDDL
• Gleaning Resource Descriptions from Dialects of Languages
• Extract RDF data from XML formats via transformations identified by URIs and typically expressed in XSLT
60

Linking IATI to existing Linked Data
• Government Data
  ◦ data.gov, data.gov.uk, etc.
• Semantic XBRL
  ◦ Financial Reports
• Research/Project Data (funded by the donor)
• DBpedia (Wikipedia data)
• Geo Linked Data
• ……… and so on
61
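One way such links might be expressed is a triple connecting a country code in the aid data to the matching DBpedia resource. The code-list namespace and the hand-written lookup table below are assumptions for illustration; `owl:sameAs` is used for simplicity, although a looser property such as `skos:exactMatch` may be more appropriate for code-list concepts.

```python
OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"
CODELIST = "http://example.org/iati/codelist/Country/"  # hypothetical namespace

# Hand-written lookup for illustration; a real mapping would need curation.
DBPEDIA = {
    "NP": "http://dbpedia.org/resource/Nepal",
    "CN": "http://dbpedia.org/resource/China",
}


def link_countries(codes):
    """Emit one owl:sameAs triple per country code that has a known match."""
    return [(CODELIST + c, OWL_SAME_AS, DBPEDIA[c]) for c in codes if c in DBPEDIA]


# "ZZ" has no entry in the lookup table, so no link is produced for it.
print(link_countries(["NP", "ZZ"]))
```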

62

http://linked4.org/lsd/explore.html

Linked data from data.gov.uk

63

64

Open Knowledge Foundation http://wheredoesmymoneygo.org/ 65

66

67

http://data.gov.uk/apps 68

For the Developers 69

Semantic Web Platforms and Tools
• Semantic Web Frameworks
  ◦ Triple stores
  ◦ Parsers and APIs
  ◦ SPARQL query processor
• Examples
  ◦ Java – Jena, Sesame
  ◦ Jena .NET
  ◦ PHP – ARC, Redland, RAP
  ◦ Ruby – ActiveRDF
  ◦ Virtuoso, Redland, 3store, Talis platform, RDF Pro, etc.
• Ontology editors
  ◦ Protégé, OntoStudio, TopBraid Composer, etc.

70

Linked Data Platforms and Tools
• Linked Data Publishing Platforms
  ◦ D2R server
  ◦ Triplify
  ◦ Talis platform
  ◦ Kasabi
  ◦ Pubby
  ◦ Paget … etc.
• Linked Data/RDF Editors and Validators
  ◦ Hyena
  ◦ Vapour
71

Semantic Web Platforms and Tools
• Semantic Web CMS and frameworks
  ◦ Semantic Drupal
  ◦ Paggr
  ◦ ConStruct
  ◦ Structured Dynamics
• PoolParty
  ◦ Thesaurus management and SKOS editor
• Rhizomer
  ◦ Publishing, querying, browsing, editing and interacting with semantic data
72

Social Semantic Web Tools
• Semantic Wikis
  ◦ Semantic MediaWiki
  ◦ OntoWiki
  ◦ KiWi: Knowledge in a Wiki … etc.
• Semantic blogging, tagging … etc.

73

Linked Data Visualization Tools
• SIMILE project (MIT) http://simile.mit.edu/
  ◦ Exhibit
  ◦ Timeline
  ◦ Longwell (faceted browser)
• Many Eyes
  ◦ Open platform for visualization
74

http://www-958.ibm.com/software/data/cognos/manyeyes/ 75

Conclusion
• Semantic Web
  ◦ A global Linked Open Data commons
• IATI standard data
• IATI as Linked Open Data
  ◦ Ongoing work
  ◦ Further possibilities
• Applications and Tools
76

Thank you!

 Semantic Questions?

Contact: [email protected]

77