Semantic Web - ????????

Download Report

Transcript Semantic Web - ????????

Semantic Web

Dr. Mohsen Kahani [email protected]

http://web.um.ac.ir/~kahani/

SW Scenario

 “At the doctor's office, Lucy instructed her Semantic Web agent through her handheld Web browser. The agent promptly retrieved information about Mom's prescribed treatment from the doctor's agent, looked up several lists of providers , and checked for the ones insurance within a 20- mile radius in-plan for Mom's of her home and with a rating of excellent or very good on trusted rating services. It then began trying to find a match between available appointment times (supplied by the agents of individual providers through their Web sites) and Pete's and Lucy's busy schedules.”  (The emphasized keywords indicate terms whose semantics, or meaning, were defined for the agent through the Semantic Web.) Scientific American: The Semantic Web: May 2001 < http://www.sciam.com/2001/0501issue/0501berners-lee.html>

Doctor’s appointment

“The Semantic Web”,

Scientific American, May 2001 Insurance Co.

Rating Provider sites Mom Physician’s Agent

required treatment in-plan?

close-by?

Specialist?

Schedule appointment Driving schedule

Lucy’s Agent Pete’ Agent

“Most of the Web's content today is designed for humans to read, not for computer programs to manipulate meaningfully.”

Berners Lee, T, Hendler, J & Lassila, O ‘The semantic web’,

Scientific American

, May 2001

Where we are Today: the Syntactic Web

Current Web

500 million user more than 8 billion pages

Static

WWW

URI, HTML, HTTP

The vision

 The World Wide Web is a big and impressive success story, both in terms  of the amount of available information and  the growth rate of human users  It starts to penetrate most areas of our daily life and business.

 This success is based on its simplicity the restrictivenss of HTTP and HTML allowed software developers, information providers and users to make easy access of the new media helping it to reach a critical mass

The Vision

However this simplicity may hamper the further development of the Web What we see currently is the very first version of the web and the next version will probably even more bigger and much powerful compared to what we have now.

Semantic Web

Serious Problems in information

finding

extracting

representing

interpreting

and maintaining

Static

WWW

URI, HTML, HTTP

Semantic Web

RDF, RDF(S), OWL

Semantic Web Technology

Tim Berners-Lee has a vision of a Semantic Web which

has machine-understandable semantics of information, and

millions of small specialized reasoning services that provide support in automated task achievement based on the accessible information

The Syntactic Web is…

   

A hypermedia, a digital library

 A library of documents called (web pages) interconnected by a hypermedia of links

A database, an application platform

 A common portal to applications accessible through web pages, and presenting their results as web pages

A platform for multimedia

 BBC Radio 4 anywhere in the world! Terminator 3 trailers !

A naming scheme

 Unique identity for those documents

A place where computers do the presentation (easy) and people do the linking and interpreting (hard). Why not get computers to do more of the hard work?

What is the Problem?

  Markup consists of:  rendering information (e.g., font size and colour)  Hyper-links to related content Semantic content is accessible to humans but not (easily) to computers…

What information can we see…

WWW2002 The eleventh international world wide web conference Sheraton waikiki hotel Honolulu, hawaii, USA 7-11 may 2002 1 location 5 days learn interact Registered participants coming from australia, canada, chile denmark, france, germany, ghana, hong kong, india, ireland, italy, japan, malta, new zealand, the netherlands, norway, singapore, switzerland, the united kingdom, the united states, vietnam, zaire Register now On the 7 th May Honolulu will provide the backdrop of the eleventh international world wide web conference. This prestigious event … Speakers confirmed Tim berners-lee Tim is the well known inventor of the Web, … Ian Foster Ian is the pioneer of the Grid, the next generation internet …

What information can a machine see…

WWW2002 The eleventh international world wide web conference Sheraton waikiki hotel Honolulu, hawaii, USA 7-11 may 2002 1 location 5 days learn interact Registered participants coming from australia, canada, chile denmark, france, germany, ghana, hong kong, india, ireland, italy, japan, malta, new zealand, the netherlands, norway, singapore, switzerland, the united kingdom, the united states, vietnam, zaire Register now On the 7 th May Honolulu will provide the backdrop of the eleventh international world wide web conference This prestigious event  Speakers confirmed Tim berners-lee Tim is the well known inventor of the Web,  Ian Foster Ian is the pioneer of the Grid, the next generation internet 

Solution: XML markup with “meaningful” tags?

WWW2002 The eeventh internationa word wide webcon Sheraton waikiki hote Honouu, hawaii, USA 7-11 may 2002 1 ocation 5 days earn interact Registered participants coming from austraia, canada, chie denmark, france, germany, ghana, hong kong, india, ireand, itay, japan, mata, new zeaand, the netherands, norway, singapore, switzerand, the united kingdom, the united states, vietnam, zaire Register now On the 7 th May Honouu wi provide the backdrop of the eeventh internationa word wide web conference. This prestigious event  Speakers confirmed Tim berners-ee Tim is the we known inventor of the Web, … 

That is, structure and semantics of documents are interwoven

Agree upon the meaning of tags through the use of namespaces

XML Is Not Enough!

 XML:   Should be used Is extensible (DC qualifiers) Brian Kelly [email protected]  But:   XML describes the syntax Does not provide semantics (what does DC.Creator

mean

?)   The meaning may be agreed & understood within DC applications – but this does not allow for

extensibility

Similar applications may be described using different XML DTDs: e.g. is the same as or <Доклады>

Scenario – Buying A Car

User

You live in London and want to buy a car locally. You can afford up to £500. The car must be red.

Honest EuroJoe’s Used Car Web Site

Joe uses: Brixton €400 maroon Old banger Ford Escort

Result

• Car not found – even though structured information is provided. • A human would know that this was a valid match, because it understands the meanings and relationships.

• The Semantic Web aims to solve this problem.

We Need Extensibility!

 We can see a progression from Web sites which are:  •

Joe’s Used Cars

Ford Escort

This maroon car costs €400 Joe’s Used Cars Ford Escort maroon €400 We need a mechanism which allows equivalent resources to be identified, without programming this knowledge into software

Buying Car On The Semantic Web

Motor Trade schema database model Scottish Motor Trade schema Mapping service Joe’s Used Car Web Site Ford dealers schema vehicle-type Wordnet  Joe is part of the motor trade association, which has defined its own schema for selling cars.

The Scots use a different schema, as do the car manufacturers (which mainly sell new cars).

A mapping service provides a mapping between these machine understandable schemata.

Wordnet maps relationships between words (e.g. red and maroon)

The Semantic Web

A Vision Of Possibilities

“The Semantic Web is an extension of the current web in which information is given well-defined meaning, better enabling computers and people to work in cooperation.

-- Tim Berners-Lee, James Hendler and Ora Lassila, The Semantic Web, Scientific American, May 2001

Scenario – Buying A Car (2)

 We’ve seen how this query can be answered:

Find me a red car in London for < £500.

How about this maroon Escort in Brixton for €400?

 The Semantic Web will be extensible enabling interactions with other services which may use different XML DTDs:

Give me the AA’s report on this type of car.

OK here it is

Check the DVLA details for the reg. no.

OK – the car is registered correctly

Model For Buying A Car

Motor Trade schema database Scottish Motor Trade schema Ford dealers schema AA Web site Mapping service AA vehicle schema Joe’s Used Car Web Site Value-added services DVLA schema With machine-understandable data it becomes easier to extend services

Need to Add “Semantics”

External agreement on meaning of annotations

 E.g

.,

Dublin Core

• • • • • Agree on the meaning of a set of annotation tags

Title

: A name given to the resource.

Creator

: An entity primarily responsible for making the

Subject

: The topic of the content of the resource.

Description

: An account of the content of the resource . ….

Problems with this approach

• Inflexible • Limited number of things can be expressed

Need to Add “Semantics” (Cont.)

Motivation toward RDF(S), DAML+OIL, OWL,…

Establish a widely accepted standard for representing expressive declarative knowledge on the Web.

    Knowledge Representation Knowledge sharing Communication Language for query

Representation Language (RL)

Representation means that we encode the description in a way that enables someone to use it

Some representational languages are:

- UML - RDF(S) - Topic Maps - DAML+OIL - OWL …..

Levels of Representation Needed for Models

Syntax

- An XML document is syntactically correct or not 

Structure

Validating an XML document with respect to DTD or XML Schema language 

Semantic

- Mapping between some structured subset of data and a model of some set of objects in the domain 

Pragmatics

- Has to do with the intent of the semantics and actual semantic usage

• • •

RDF (

Resource Description Framework

)

RDF provides a way of describing resources via metadata (data about data) It restricts the description of resources to

triplets (subject,predicate,object)

It provides interoperability between applications that exchange machine understandable information on the Web.

The original broad goal of RDF was to define a mechanism for describing resources that makes no assumptions about a particular application domain, nor defines (a priori) the semantics of any application domain. • • Uses XML as the interchange syntax. Provides a

lightweight

ontology system.

The formal specification of RDF is available at: http://www.w3.org/TR/REC-rdf-syntax/

RDF (cont

d)

An example of RDF using the Dublin Core language

The Future of Metadata Jacky Crystal 1998-01-01 Metadata, RDF, Dublin Core

RDF Syntax

Subject, Predicate and Object Triplets (Tuples) • Subject: The resource being described.

• Predicate: A property of the resource • Object: The value of the property A combination of them is said to be a Statement (or a rule) John Doe http://foo.bar.org/index.html

A web page being described Author A property of the web page (author) [Subject] [Predicate] The value of the predicate (here the author) [Object]

RDF Example

Namespace for the RDF spec Namespace ‘s’, a custom namespace John Doe Subject Author (property of the subject) The above statement says : The Author of http://foo.bar.org/index.html

is “John Doe” In this way, we can have different objects (resources) pointing to other objects (resources) You can also make statements about statements – reification Ex: ‘xyz’ says that ‘ The Author of http://foo.bar.org/index.html

is John Doe’

RDF Schema

• A schema defines the terms that will be used in the RDF statements and gives specific meanings to them.

http://www.w3.org/TR/rdf-schema/

Example: RDF Schema Namespace An “ID” attribute actually defines a new resource “ Resource” is the top level class PassengerVehicle is a

subclass

of MotorVehicle

Example (cont..)

Multiple Inheritance

Domain

of a property

Range

of a property

Ontology

Ontology

is a term borrowed from philosophy that refers to the science of describing the kinds of entities in the world and how they are related  An

OWL ontology

may include descriptions of

classes

,

properties

and their instances.

What is Ontology…...

an ontology is a formal, explicit specification of a shared conceptualization – Gruber

Conceptualization

’ refers to an abstract model of phenomena in the world by having identified the relevant concepts of those phenomena .

   ‘

Explicit

’ means that the type of concepts used, and the constraints on their use are explicitly defined.

Formal

’ refers to the fact that the ontology should be machine readable.

Shared

’ reflects that ontology should capture consensual knowledge accepted by the communities

Example: Cars Ontology with RDF Schema Semantics

} @cars { xyz:MotorVehicle[rdfs:subClassOf -> rdfs:Resource].

xyz:PassengerVehicle[rdfs:subClassOf -> xyz:MotorVehicle].

xyz:Truck[rdfs:subClassOf -> xyz:MotorVehicle]. xyz:Van[rdfs:subClassOf -> xyz:MotorVehicle].

xyz:MiniVan[ rdfs:subClassOf -> xyz:Van; rdfs:subClassOf -> xyz:PassengerVehicle]. FORALL X < X[rdfs:subClassOf -> xyz:MotorVehicle]@cars.

FORALL X < X[rdfs:subClassOf -> xyz:MotorVehicle]@rdfschema(cars).

xyz:Van xyz:MotorVehicl e xyz:Truc k xyz:PassengerVehicl e xyz:MiniVan

X = xyz:Van X = xyz:Truck X = xyz:PassengerVehicle X = xyz:Van X = xyz:Truck X = xyz:PassengerVehicle X = xyz:MiniVan

OIL

 OIL adds a simple Description Logic to RDF Schema  It allows to define axioms that logically describe classes, properties and their hierarchies  OIL enables to define necessary and sufficient conditions that define class membership of instances

O

IL

 OIL  Developed in the Ontoknowledge project www.ontoknowledge.org

 Core language contains consensus primitives, extensions add aditional expresiveness  Layered architecture: • Applications are not forced to work with a language more complex and expresive than required • Applications that can only process a low level of complexity are able to catch the aspects of the ontology • Applications aware of higher level of complexity can still understand a simpler ontology language

Description Logic

: Formal Semantics & Reasoning support

OIL Web languages

: XML- and RDF-based syntax

Frame-based systems

: Epistemological Modeling Primitives

DAML+OIL

DAML+OIL

 Semantic markup language  Joint efford of the American and European communities  DAML+OIL = DAM-ONT + OIL  Designed to describe the structure of a domain in terms of classes and objects  Supports full range of datatypes in XML

An Example (from www.daml.org)

Start of an

An example ontology

ontology (about = “” implies ‘this’ document)

Animal

The label is not used for logical interpretation

This class of animals is illustrative of a number of ontological idioms.

Can explicitly specify the set of Females to be disjoin The Person class is defined later To be read t with the set of Males conjunctively ‘Person’ and a ‘Male’ . A man is a sub-class of

Example (contd..)

An objectProperty relates objects to objects Describes the element which encloses this Property Describes the value of the Property Note: Contrary to RDF, DAML takes the ‘intersection’ of the domains/ranges if multiple domains/ranges are specified A datatype property relates an object to a primitive datatype value The XML Schema datatype is referenced here The Restriction defines an anonymous class of all things that satisfy the restriction.

Restrictions on the property hasParent (only for the Person class – Local scope, as opposed to rdfs:range) A person can have only another Person as it’s parent A Person can have only 1 Father

Example (contd..)

Addition to the Animal Class without modifying it -- “about” Restrictions on the property hasParent An animal can have exactly 2 parents Restrictions on the property hasSpouse A person can have only 1 spouse Further constructs that the example doesn’t use : Properties: TransitiveProperty (hasAncestor), UniqueProperty (hasMother), inverseOf(hasChild -> hasParent), etc.

Classes: intersectionOf (a daml:collection), unionOf (a daml:collection), sameClassAs, complementOf, etc.

OWL

 OWL     Under development of W3C Web Ontology Working Group DAML+OIL based Offers: • More accurate web searches • Intelligent agents • Knowledge management Abstract syntax that provides: • Higher level way of writing ontologies • Clear statement of semantics • Compound axioms resembling frames  OWL-lite  OWL sublanguage    OWL-lite = RDFs + 0/1 cardinality Suites well to express light weight ontologies Limited expresiveness power • Some fields require a full-fledged semantic web modeling language

OWL Example

unionOf

rdf:parseType="Collection"> <

adjacentRegion

rdf:resource="

#SonomaRegion

" />

Event:title Event:WebPage < > rdf:type photo:Photograph, Photo:File http://…/images#image1, Photo:topic :event1#event:speaker. Event1 a Event:event; date “May 7 speaker http://…#timbl.html

Title “WWW 2002…” TimBL rdf:type w3c-ont:person; name “Tim Berners … 11”, Lee”

g/ontologies/swrc-onto-2000-09 10.daml#Event"/>

Web of Trust

 Claims can be verified if there is supporting evidence from another (trusted) source  We only believe that someone is a professor at a university if the university also claims that person is a professor, and the university is on a list I trust.

believe(c1) :- claims(x, c1) ^ predicate(c1, professorAt) ^ arg1(c1, x) ^ arg2(c1, y) ^ claims(c2, y) ^ predicate(c2, professorAt) ^ arg1(c2, x) ^ arg2(c2, y) ^ AccreditedUniversity(y) AcknowledgedUniversity(u) :- link-from( “ http://www.cs.umd.edu/university-list” , u)

Notice this one

Query Languages

A QL defines primitives for queries and answers

- A query can be a set of sentences containing some variables - Adding the ability of representing variables into different RLs different QLs; produces - QL is usually related to a RL; 

Some query languages are:

SiLRI (Simple Logic-based RDF Interpreter) - RDQL (RDF Query Language) - TRIPLE - DQL (DAML Query Language)

RDQL Examples

finds resources from given URI having age more than 24 SELECT

?resource

WHERE

(?resource info:age ?age)

AND

?age >= 24

USING

info

FOR

finds the family name and given name from any vcards with formatted name (FN) "John Smith".

SELECT

?family , ?given

WHERE

(?vcard vcard:FN "John Smith") (?vcard vcard:N ?name) (?name vcard:Family ?family) (?name vcard:Given ?given)

USING

vcard

FOR

“http://www.w3.org/2001/vcard-rdf/3.0#”

How it may work

User browser query agent Index data base web page rules Knowledge representation scheme

Semantic Web - Language tower

Applications

Web Query

 RSS

(RDF Site Summary)

Knowledge Management

Electronic Commerce

Pervasive Computing

Web queries - for real!!

“How many train lines are there in Tokyo?”

* Query processed:

73 answers found     

Google document search finds 235,312 possible page hits.

Http://www…jp/JRTrains.html

claims the answer is 27 A database entitled “ JRTLDB” can be queried for the answer, but you will need “Japanese Railroad Access Code.” A computer program that can compute that number is offered by the Tokyo Digital City information analysis, program.

click here to run ...

RSS

 RDF Site Summary (RSS) is a lightweight multipurpose extensible metadata description and syndication format.  RSS is an XML application  An RSS summary, at a minimum, is a document describing a "channel" consisting of URL-retrievable items .  Each item consists of a title, link , and brief description .

RSS History

"Rich Site Summary“

Version .91

"RDF Site Summary“

Version 1.0

"Really Simple Syndication"

Version 2.0

RSS Example

Meerkat http://meerkat.oreillynet.com Meerkat: An Open Wire Service The O'Reilly Network Rael Dornfest (mailto:[email protected]) Copyright © 2000 O'Reilly & Associates, Inc. 2000-01-01T12:00+00:00 hourly 2 2000-01-01T12:00+00:00

Common uses for

RSS

 Blogs – summaries of daily blog posts  Newsletters – synopses of newsletters alerting users that a new newsletter is available  Weather Alerts – notification of severe weather  Press Announcements – new product announcements  Specials or Discounts – weekly deals or discount offers for customers  Calendars – listings of upcoming events, deadlines or holidays

Knowledge Management

SWAP

Semantic Web Application Platform

On-to Knowledge

British Telecom Call Center

(ontoShare)  Swiss Life Application 

EnerSearch Applications

Electronic Commerce

Mechanized support is needed in

finding and comparing

vendors and their offers.

Mechanized support is needed in dealing with

numerous and heterogeneous data formats

.

Mechanized support is needed in dealing with

numerous and heterogeneous business logics

.

Services off the desktop

Or perhaps on different desktops

The Semantic Web: What is it?

Many things to many people..

Thank you for the attention!

Questions?