下載/瀏覽Download

Download Report

Transcript 下載/瀏覽Download

多媒體網路安全實驗室
Combining ontological profiles with context
in information retrieval
Date:2012.03.06
Reporter : Hong Ji Wei
Auther : Geir Solskinnsbakk and Jon Atle Gulla
出處: Data & Knowledge Engineering,vol 69, no 3, 2010, pp 251–260
多媒體網路安全實驗室
Outline
1
Introduction
2
Related Work
3
Ontological profiles
4
Construction of ontological profiles
35
Experiment
46
Context
37
Conclusions
2
多媒體網路安全實驗室
Introduction
An ontology is a formal conceptualization of a
domain,specifying the concepts of the domain
and the relations between them.
大體而言,本體論的兩大重點:
1.以已經定義的字彙名詞(vocabulary of terms)
來描述以存在的實體(entity)
2.以一定規格(specification)表示出這些實體間
的關係(relationship)與存在的意義(meaning)
3
多媒體網路安全實驗室
Early semantic search engines tried to use
ontology concepts and structures as controlled
search vocabularies,but this was unpractical
both functionally and usability perspective.
Ontological profile=Ontology and concept
characterizations.
4
多媒體網路安全實驗室
Related Work
Su describes in a method for ontology mapping,
based on an extension of the ontology.
Tomassen describes an ontology driven
information retrieval system based on extending
ontologies.
Sieg et al describe an approach for
representing user context for search.
5
多媒體網路安全實驗室
Ontological profiles
An ontological profile is an extension of a
domain ontology.
The ontology is extended with semantically
related terms that are added as vectors for
each of the concepts of the ontology.
The concept vector can be viewed as an
extended semantic characterization of the
concept, reflecting the semantics of the concept
in the document collection.
6
多媒體網路安全實驗室
therefore argue that the ontological profile may
be better suited:
(i) a specific document collection
(ii) the vocabulary in the documents
(iii)the use of the ontology concepts in the
documents.
EX:
7
多媒體網路安全實驗室
8
多媒體網路安全實驗室
Construction of ontological profiles
Construction of the ontological profile is based
on three important aspects:
1. Statistics: Apply statistical techniques to the
document collection by counting the frequency of
the terms in the documents
2. Linguistics:the terms of the documents, namely
stemming, to collapse certain semantically similar
terms into a single form.
3. Proximity:Apply a proximity measure to the cooccurrence of concept names and terms
9
多媒體網路安全實驗室
10
多媒體網路安全實驗室
Experiment
The process consists of two steps:
(i) query interpretation
1. Simple query interpretation
•
This is the most basic strategy for query interpretation
2. Best match query interpretation
•
•
In contrast to the simple query interpretation
During interpretation of the query we try to recognize
the connection between the query terms by mapping
the terms collectively to a single concept
11
多媒體網路安全實驗室
3. Cosine similarity query interpretation
• we are not able to directly recognize the relation from the
query terms.
• Try to recognize relations between the concepts that the
query terms map to via the ontological profile.
4. Ontology structure query interpretation
• The measure we use is based on the distance between
the concepts in terms of relations.
• by generating a graph representation of the ontology, in
which nodes represent concepts and edges represent
relations.(to find the distance for each pair of concepts)
12
多媒體網路安全實驗室
(ii)Query expansion
 The query expansion process reformulates the
original query by adding semantically related terms
to the query.
 The weight of the original query terms are boosted
to reflect their importance.
 The motivation behind adding these terms is to
remove some of the noise introduced by the
increased set of query terms.
13
多媒體網路安全實驗室
Experimental data and results
14
多媒體網路安全實驗室
Context
Context is the set of suitable environmental
states and settings concerning a user.
Attributes are context if they are non-essential
(for tasks).
We can thus conclude that only non-essential
information used to enhance the applications
computations can be considered context.
15
多媒體網路安全實驗室
1. Search context
 Definition. Search context is the context of a user
posting queries to a search application.
We can partition the systems knowledge about
the user into two separate parts:
 The first is of course the query
 The second part is additional knowledge which can
characterize the situation of the user(i.e., the
context.)
16
多媒體網路安全實驗室
2. Contextualized ontological search
Ontology Profiles



FQV   UQn   CQun (    1)
17
多媒體網路安全實驗室
Conclusions
The ontological profile describes each concept
as a vector of terms with weights describing the
strength of the relation between them.
We defined search context and discussed how
contextual search may be incorporated into our
semantic search approach
18
多媒體網路安全實驗室