10:30-12:00 How to Build an Ontology 1-2pm Best Practices and Lessons Learned 2-3pm BIRN Ontologies: An Overview.

Download Report

Transcript 10:30-12:00 How to Build an Ontology 1-2pm Best Practices and Lessons Learned 2-3pm BIRN Ontologies: An Overview.

10:30-12:00 How to Build an
Ontology
1-2pm Best Practices and Lessons
Learned
2-3pm BIRN Ontologies: An
Overview
How to Build an Ontology
http://ontology.buffalo.edu/smith
2
General trend on the part of NIH, FDA and
other bodies to consolidate ontologybased standards for the communication
and processing of biomedical data.
NCIT / caBIG / NECTAR / BIRN / OBO ...
High quality shared ontologies
build communities
http://ontology.buffalo.edu/smith
3
TWO STRATEGIES:
Ad hoc creation of new database
schemas for each research group /
research hypothesis
vs.
Pre-established interoperable
stable reference ontologies in
terms of which all database
schemas need to be defined
http://ontology.buffalo.edu/smith
4
How to create the conditions for a
step-by-step evolution towards gold
standard reference ontologies in the
biomedical domain
... and why we need to create
these conditions
OBO Core project
http://ontology.buffalo.edu/smith
5
Ontology =def
A representation of the types of entities
existing in a given domain of reality, and
of the relations between these types
http://ontology.buffalo.edu/smith
6
Types have instances
Ontologies are like science texts: they are
about types
(Diaries, databases, clinical records are
about instances)
http://ontology.buffalo.edu/smith
7
The need
strong general-purpose classification
hierarchies created by domain specialists
clear, rigorous definitions
thoroughly tested in real cases
ontologies teach us about the instances in
reality by supporting cross-disciplinary
(cross-ontology) reasoning about types
http://ontology.buffalo.edu/smith
8
The actuality (too often)
myriad special purpose ‘light’ ontologies,
prepared by ontology engineers and
deposited in internet ‘repositories’ or
‘registries’
http://ontology.buffalo.edu/smith
9
these light ontologies often do not
generalize …
repeat work already done by others
are not interoperable
reproduce the very problems of communication
which ontology was designed to solve
contain incoherent definitions
and incoherent documentation
http://ontology.buffalo.edu/smith
10
BIRN Ontology Experiences
In the short-term, users will probably download the
data or analyses and extract the results using their
preferred methods.
In the long term, however, that will become infeasible
– the databases will have to be made interoperable with
standard datamining software.
This is where the neuroanatomy ontologies come in.
– We will need to know what the ROI is and which naming
scheme it came from (e.g., a Brodmann’s area, or a
sulcal/gyral area, etc.). We’ll need to know how it was
defined (Talairach atlas? MNI atlas? LONI atlas? Or
subject-specific regions?) and what the statistic is.
http://ontology.buffalo.edu/smith
11
BIRN Ontology Experiences
In the short-term, users will probably download the
data or analyses and extract the results using their
preferred methods.
In the long term that will become infeasible
http://ontology.buffalo.edu/smith
12
The long term begins here
http://ontology.buffalo.edu/smith
13
A methodology for qualityassurance of ontologies
tested thus far in the biomedical domain on:
– FMA
– GO + other OBO Ontologies
– FuGO
– SNOMED
– UMLS Semantic Network
– NCI Thesaurus
– ICF (International Classification of Functioning,
Disability and Health)
– ISO Terminology Standards
– HL7-RIM
http://ontology.buffalo.edu/smith
14
A methodology for qualityassurance of ontologies
accepted need for application of this methodology:
– FMA
– GO + other OBO Ontologies
– FuGO
– SNOMED
– UMLS Semantic Network
– NCI Thesaurus
– ICF (International Classification of Functioning,
Disability and Health)
– ISO Terminology Standards
– HL7-RIM
http://ontology.buffalo.edu/smith
15
A methodology for qualityassurance of ontologies
signs of hope:
– FMA
– GO + other OBO Ontologies
– FuGO
– SNOMED
– UMLS Semantic Network
– NCI Thesaurus
– ICF (International Classification of
Functioning, Disability and Health)
– ISO Terminology Standards
– HL7-RIM
http://ontology.buffalo.edu/smith
16
We know that high-quality
ontologies built according to this
methodology can help in creating
high-quality mappings between
human and model organism
phenotypes
http://ontology.buffalo.edu/smith
17
“Alignment of Multiple Ontologies of
Anatomy: Deriving Indirect Mappings from
Direct Mappings to a Reference Ontology”
Songmao Zhang
Olivier Bodenreider
AMIA 2005
http://ontology.buffalo.edu/smith
18
We also know that OWL is not enough
to ensure high-quality ontologies
and that the use of a common syntax and
logical machinery and the careful
separating out of ontologies into
namespaces does not solve the problem
of ontology integration
http://ontology.buffalo.edu/smith
19
A basic distinction
type vs. instance
science text vs. clinical document
man vs. Musen
http://ontology.buffalo.edu/smith
20
Instances are not represented in
an ontology
It is the generalizations that are important
(but instances must still be taken into
account)
http://ontology.buffalo.edu/smith
21
A
B
C
515287
521683
521682
DC3300 Dust Collector Fan
Gilmer Belt
Motor Drive Belt
http://ontology.buffalo.edu/smith
22
Ontology
Types Instances
http://ontology.buffalo.edu/smith
23
Ontology = A Representation of Types
http://ontology.buffalo.edu/smith
24
Each node of an ontology
consists of:
• preferred term (aka term)
• term identifier (TUI, aka CUI)
• synonyms
• definition, glosses, comments
Ontology = A Representation of Types
http://ontology.buffalo.edu/smith
25
Nodes in an ontology are
connected by relations:
primarily: is_a (= is subtype of)
and part_of
designed to support search,
reasoning and annotation
Ontology = A Representation of Types
http://ontology.buffalo.edu/smith
26
types
substance
organism
animal
mammal
cat
leaf class
siamese
frog
instances
http://ontology.buffalo.edu/smith
27
Rules for formating terms
• Terms should be in the singular
• Terms should be lower case
• Avoid abbreviations even when it is clear in
context what they mean (‘breast’ for ‘breast
tumor’)
• Avoid acronyms
• Avoid mass terms (‘tissue’, ‘brain mapping’,
‘clinical research’ ...)
• Each term ‘A’ in an ontology is shorthand for a
term of the form ‘the type A’
http://ontology.buffalo.edu/smith
28
Motivation: to capture reality
Inferences and decisions we make are
based upon what we know of reality.
An ontology is a computable representation
of the underlying biological reality.
Designed to enable a computer to reason
over the data we derive from this reality in
(some of) the ways that we do.
http://ontology.buffalo.edu/smith
29
Concepts
Biomedical ontology integration will never be
achieved through integration of meanings
or concepts
The problem is precisely that different user
communities use different concepts
Concepts are in your head and will change
as your understanding changes
http://ontology.buffalo.edu/smith
30
Concepts
Ontologies represent types: not concepts,
meanings, ideas ...
Types exist, with their instances, in objective
reality
– including types of image, of imaging
process, of brain region, of clinical
procedure, etc.
http://ontology.buffalo.edu/smith
31
Rules on types
Don’t confuse types with words
Don’t confuse types with concepts
Don’t confuse types with ways of getting to
know types
Don’t confuse types with ways of talking
about types
Don’t confuses types with data about types
http://ontology.buffalo.edu/smith
32
Some other simple rules for high
quality ontologies
http://ontology.buffalo.edu/smith
33
Univocity
Terms should have the same meanings on
every occasion of use.
They should refer to the same kinds of
entities in reality
Basic ontological relations such as is_a and
part_of should be used in the same way
by all ontologies
http://ontology.buffalo.edu/smith
34
Positivity
Complements of types are not themselves types.
Hence terms such as
non-mammal
non-membrane
other metalworker in New Zealand
do not designate types in reality
There are also no conjunctive and disjunctive
types:
protoplasmic astrocyte and Schwann cell
Purkinje neuron or dendritic shaft
http://ontology.buffalo.edu/smith
35
Objectivity
Which types exist is not a function of our
knowledge.
Terms such as ‘unknown’ or ‘unclassified’ or
‘unlocalized’ do not designate types in
reality.
http://ontology.buffalo.edu/smith
36
Single Inheritance
No kind in a classificatory hierarchy
should have more than one is_a
parent on the immediate higher
level
http://ontology.buffalo.edu/smith
37
Multiple Inheritance
thing
blue thing
car
is_a1
is_a2
blue car
http://ontology.buffalo.edu/smith
38
is_a Overloading
serves as obstacle to integration with
neighboring ontologies
The success of ontology alignment demands
that ontological relations (is_a, part_of, ...)
have the same meanings in the different
ontologies to be aligned.
See “Relations in Biomedical Ontologies”,
Genome Biology May 2005.
 DISEASE MAPS
http://ontology.buffalo.edu/smith
39
General Rule
Formulate universal statements first
Move to A may be B in such and such a
context later
http://ontology.buffalo.edu/smith
40
Intelligibility of Definitions
The terms used in a definition should be
simpler (more intelligible) than the term to be
defined; otherwise the definition provides no
assistance
– to human understanding
– to machine processing
http://ontology.buffalo.edu/smith
41
Definitions should be intelligible to
both machines and humans
Machines can cope with the full formal
representation
Humans need clarity and modularity
http://ontology.buffalo.edu/smith
42
But
Some terms are primitive (cannot be defined)
AVOID CIRCULAR DEFINITIONS
Avoid definitions of the forms:
An A is an A which is B (person = person with
identity documents)
An A is the B of an A (heptolysis = the causes of
heptolysis)
http://ontology.buffalo.edu/smith
43
Case Study: The National Cancer
Institute Thesaurus (NCIT)
does not (yet) satisfy these and other simple
principles
http://ontology.buffalo.edu/smith
44
The NCIT reflects a recognition of
the need
for high quality shared ontologies and
terminologies the use of which by clinical
researchers in large communities can
ensure re-usability of data collected by
different research groups
http://ontology.buffalo.edu/smith
45
NCIT
“a biomedical vocabulary that provides
consistent, unambiguous codes and
definitions for concepts used in cancer
research”
“exhibits ontology-like properties in its
construction and use”.
http://ontology.buffalo.edu/smith
46
Goals
to make use of current terminology “best practices”
to relate relevant concepts to one another in a
formal structure, so that computers as well as
humans can use the Thesaurus for a variety of
purposes, including the support of automatic
reasoning;
to speed the introduction of new concepts and
new relationships in response to the emerging
needs of basic researchers, clinical trials,
information services and other users.
http://ontology.buffalo.edu/smith
47
Formal Definitions
of 37,261 nodes, 33,720 were stipulated to
be primitive in the DL sense
Thus only a small portion of the NCIT
ontology can be used for purposes of
automatic classification and error-checking
by using OWL.
http://ontology.buffalo.edu/smith
48
Verbal Definitions
About half the NCIT terms are assigned
verbal definitions
Unfortunately some are assigned more than
one
http://ontology.buffalo.edu/smith
49
Disease Progression
Definition1
Cancer that continues to grow or spread.
Definition2
Increase in the size of a tumor or spread of
cancer in the body.
Definition3
The worsening of a disease over time. This
concept is most often used for chronic and
incurable diseases where the stage of the
disease is an important determinant of therapy
and prognosis.
http://ontology.buffalo.edu/smith
50
To make matters worse Disease
Progression has as subclass:
Cancer Progression
Definition:
The worsening of a cancer over time. This
concept is most often used for incurable
cancers where the stage of the cancer is
an important determinant of therapy and
prognosis.
http://ontology.buffalo.edu/smith
51
Cancer
a process (of getting better or worse)
an object (which can grow and spread)
http://ontology.buffalo.edu/smith
52
Confuses definitions with
descriptions
Tuberculosis
Definition
A chronic, recurrent infection caused by the bacterium
Mycobacterium tuberculosis. Tuberculosis (TB) may affect almost
any tissue or organ of the body with the lungs being the most
common site of infection. The clinical stages of TB are primary or
initial infection, latent or dormant infection, and recrudescent or
adult-type TB. Ninety to 95% of primary TB infections may go
unrecognized. Histopathologically, tissue lesions consist of
granulomas which usually undergo central caseation necrosis. Local
symptoms of TB vary according to the part affected; acute
symptoms include hectic fever, sweats, and emaciation; serious
complications include granulomatous erosion of pulmonary bronchi
associated with hemoptysis. If untreated, progressive TB may be
associated with a high degree of mortality. This infection is
frequently observed in immunocompromised individuals with AIDS
or a history of illicit IV drug use.
http://ontology.buffalo.edu/smith
53
Confuses definitions with
descriptions
Tuberculosis
Definition
A chronic, recurrent infection caused by the bacterium
Mycobacterium tuberculosis. Tuberculosis (TB) may affect almost
any tissue or organ of the body with the lungs being the most
common site of infection. The clinical stages of TB are primary or
initial infection, latent or dormant infection, and recrudescent or
adult-type TB. Ninety to 95% of primary TB infections may go
unrecognized. Histopathologically, tissue lesions consist of
granulomas which usually undergo central caseation necrosis. Local
symptoms of TB vary according to the part affected; acute
symptoms include hectic fever, sweats, and emaciation; serious
complications include granulomatous erosion of pulmonary bronchi
associated with hemoptysis. If untreated, progressive TB may be
associated with a high degree of mortality. This infection is
frequently observed in immunocompromised individuals with AIDS
or a history of illicit IV drug use.
http://ontology.buffalo.edu/smith
54
A better definition
Tuberculosis
Definition:
A chronic, recurrent infection caused by the
bacterium Mycobacterium tuberculosis.
http://ontology.buffalo.edu/smith
55
NCIT inherits this ontological and terminological
incoherence from source vocabularies in UMLS
Conceptual Entities =def
An organizational header for concepts
representing mostly abstract entities.
Includes as subtypes:
action, change, color, death, event, fluid,
injection, temperature
http://ontology.buffalo.edu/smith
56
Conceptual Entities =def
An organizational header for concepts
representing mostly abstract entities.
Confuses use and mention (swimming is
healthy and has eight letters)
http://ontology.buffalo.edu/smith
57
Duratec, Lactobutyrin, Stilbene
Aldehyde
are classified by the NCIT as Unclassified
Drugs and Chemicals
http://ontology.buffalo.edu/smith
58
and problematic synonyms
Anatomic Structure, System, or Substance ~ Anatomic
Structures and Systems
Does ‘anatomic’ apply only to structure or also to system
and substance?
Biological Function ~ Biological Process
some biological processes are the exercises of biological
functions
others (e.g. pathological processes, side effects) not
Genetic Abnormality ~ Molecular Abnormality (with
subtype: Molecular Genetic Abnormality) (definitions
not supplied)
http://ontology.buffalo.edu/smith
59
Problematic synonyms
Diseases and Disorders ~ Disease ~ Disorder
Definition1 for Disease:
A disease is any abnormal condition of the body or mind
that causes discomfort, dysfunction, or distress to the
person affected or those in contact with the person. ...
Definition2 for Disease
A definite pathologic process with a characteristic set of
signs and symptoms. ...
Condition  Process
Definition2 contradicts NCIT’s own classification hierarchy
http://ontology.buffalo.edu/smith
60
Three disjoint classes of plants
Vascular Plant
Non-vascular Plant
Other Plant
http://ontology.buffalo.edu/smith
61
Three kinds of cells
Abnormal Cell is a top-level class (thus not
subsumed by Cell
Normal Cell is a subclass of Microanatomy.
Cell is a subclass of Other Anatomic Concept
(so that cells themselves are concepts)
http://ontology.buffalo.edu/smith
62
NCIT as now constituted will block
automatic reasoning
Neither Normal Cells nor Abnormal Cells are
Cells within the context of the NCIT
http://ontology.buffalo.edu/smith
63
Some consolations
NCIT is open source
NCIT has broad coverage
NCIT has some formal structure (OWL-DL)
NCIT is much, much better than (for
example) the HL7-RIM
NCIT has realized the errors of its ways
http://ontology.buffalo.edu/smith
64
The road ahead
http://www.cbdnet.com/index.php/search/show/938464
= “Review of NCI Thesaurus and
Development of Plan to Achieve OBO
Compliance”
and welcome to the Pre-NCIT:
http://nciterms.nci.nih.gov/NCIBrowser/Dic
tionary.do
http://ontology.buffalo.edu/smith
65
Fragment of Pre-NCIT Hierarchy
Murine Tissue Type
Body Fluids and Substances (MMHCC)
Cardiovascular System (MMHCC)
Blood Vessel (MMHCC)
Heart (MMHCC)
Digestive System (MMHCC)
http://ontology.buffalo.edu/smith
66
First step
Alignment of OBO ontologies through a
common system of formally defined
relations in the OBO-RO (OBO Relation
Ontology)
see “Relations in Biomedical Ontologies”,
Genome Biology Apr. 2005
http://ontology.buffalo.edu/smith
67
is_a (sensu UMLS)
A is_a B =def
‘A’ is narrower in meaning than ‘B’
grows out of the heritage of dictionaries
(which ignore the basic distinction between
types and instances)
http://ontology.buffalo.edu/smith
68
To build a high quality shared
ontology requires hard work and
staying power
You cannot cheat by borrowing
from UMLS
UMLS (= the UMLS
Metathesaurus) is not an ontology
http://ontology.buffalo.edu/smith
69
Concepts, Concept Names, and
their Identifiers in the UMLS
The Metathesaurus is organized by concept. One of its
primary purposes is to connect different names for the
same concept from many different vocabularies.
A concept is a meaning. A meaning can have many
different names. A key goal of Metathesaurus
construction is to understand the intended meaning of
each name in each source vocabulary and to link all the
names from all of the source vocabularies that mean the
same thing (the synonyms). This is not an exact science.
... Metathesaurus editors decide what view of synonymy
to represent in the Metathesaurus concept structure.
Please note that each source vocabulary’s view of
synonymy is also present in the Metathesaurus,
irrespective of whether it agrees or disagrees with the
Metathesaurus view.
http://ontology.buffalo.edu/smith
70
This strange mapping
between names as they appear in different
source vocabularies created for widely
different purposes can still be very useful
but the source vocabularies themselves are
of variable quality
(not all mappings are created equal)
and the sorts of search which the UMLS
supports reflects an already outmoded
technology
http://ontology.buffalo.edu/smith
71
is_a
congenital absent nipple is_a nipple
surgical procedure not carried out because of
patient’s decision is_a surgical procedure
cancer documentation is_a cancer
disease prevention is_a disease
living subject is_a information object representing
an animal or complex organism
individual allele is_a act of observation
limb is_a tissue
http://ontology.buffalo.edu/smith
72
is_a (sensu UMLS)
both testes is_a testis
plant leaves is_a plant
smoking is_a individual behavior
walking is_a social behavior
http://ontology.buffalo.edu/smith
73
is_a
A is_a B =def
For all x, if x instance_of A then x
instance_of B
cell division is_a biological process
adult is_a child ???
http://ontology.buffalo.edu/smith
74
Two kinds of entities
occurrents (processes, events, happenings)
cell division, ovulation, death
continuants (objects, qualities, ...)
cell, ovum, organism, temperature of
organism, ...
http://ontology.buffalo.edu/smith
75
is_a (for occurrents)
A is_a B =def
For all x, if x instance_of A then x
instance_of B
cell division is_a biological process
http://ontology.buffalo.edu/smith
76
is_a (for continuants)
A is_a B =def
For all x, t if x instance_of A at t then x
instance_of B at t
abnormal cell is_a cell
adult human is_a human
but not: adult is_a child
http://ontology.buffalo.edu/smith
77
part_of
Composes, with one or more other physical units,
some larger whole.
(UMLS Semantic Network)
what does this relation relate?
A is_a B =def A is narrower in meaning than B
http://ontology.buffalo.edu/smith
78
Part_of as a relation between
types is more problematic than
is standardly supposed
heart part_of human being ?
human heart part_of human being ?
human being has_part human testis ?
testis part_of human being ?
http://ontology.buffalo.edu/smith
79
Definition of part_of as a relation
between types
A part_of B =Def all instances of A are
instance-level parts of some instance of B
human testis part_of adult human being
http://ontology.buffalo.edu/smith
80
two kinds of parthood
1. between instances:
Mary’s heart part_of Mary
this nucleus part_of this cell
2. between types
human heart part_of human
cell nucleus part_of cell
http://ontology.buffalo.edu/smith
81
part_of (for occurrents)
A part_of B =def.
For all x, if x instance_of A then there is
some y, y instance_of B and x part_of y
where ‘part_of’ is the instance-level part
relation
EVERY A IS PART OF SOME B
http://ontology.buffalo.edu/smith
82
part_of (for continuants)
A part_of B =def.
For all x, t if x instance_of A at t then there
is some y, y instance_of B at t and x
part_of y
where ‘part_of’ is the instance-level part
relation
NOTE THE ALL-SOME STRUCTURE
http://ontology.buffalo.edu/smith
83
A part_of B, B part_of C ...
The all-some structure of such definitions
allows
cascading of inferences
(i) within ontologies
(ii) between ontologies
(iii) between ontologies and EHR
repositories of instance-data
http://ontology.buffalo.edu/smith
84
Cascading inferences
Whichever A you choose, the instance of B
of which it is a part will be included in
some C, which will include as part also the
A with which you began
The same principle applies to the other
relations in the OBO-RO:
located_at, transformation_of, derived_from,
adjacent_to, etc.
http://ontology.buffalo.edu/smith
85
is_a and part_of never cross
categorial divides
(cf. tripartite organization of GO)
if A is_a B
then A is an object type iff B is an object
type
then A is a process type iff B is a process
type
then A is a characteristic type iff B is a
characteristic type
http://ontology.buffalo.edu/smith
86
Kinds of relations
Between types:
– is_a, part_of, ...
Between an instance and a type
– this explosion instance_of the type explosion
Between instances:
– Mary’s heart part_of Mary
http://ontology.buffalo.edu/smith
87
Continuity
instance a continuous_with instance b
is always symmetric
But consider the types lymph node and
lymphatic vessel:
Each lymph node is continuous with some
lymphatic vessel, but there are lymphatic
vessels (e.g. lymphs and lymphatic trunks)
which are not continuous with any lymph
nodes
Continuity on the type level is not symmetric.
http://ontology.buffalo.edu/smith
88
Adjacency as a relation
between universals is not
symmetric
Consider
seminal vesicle adjacent_to urinary
bladder
Not: urinary bladder adjacent_to
seminal vesicle
http://ontology.buffalo.edu/smith
89
Instance level
this nucleus is adjacent to this cytoplasm
implies:
this cytoplasm is adjacent to this nucleus
Type level
nucleus adjacent_to cytoplasm
Not: cytoplasm adjacent_to nucleus
http://ontology.buffalo.edu/smith
90
Applications
Expectations of symmetry e.g. for proteinprotein interactions hmay hold only at the
instance level
if A interacts with B, it does not follow that B
interacts with A
if A is expressed simultaneously with B, it
does not follow that B is expressed
simultaneously with A
http://ontology.buffalo.edu/smith
91
Definitions of the all-some form
allow cascading inferences
If A R1 B and B R2 C, then we know that
every A stands in R1 to some B, but we
know also that, whichever B this is, it can
be plugged into the R2 relation
http://ontology.buffalo.edu/smith
92
GALEN: Vomitus contains carrot
All portions of vomit contain all portions of
carrot
All portions of vomit contain some portion of
carrot
Some portions of vomit contain some portion
of carrot
Some portions of vomit contain all portions
of carrot
http://ontology.buffalo.edu/smith
93
transformation_of
same instance
C
c at t
pre-RNA
child
http://ontology.buffalo.edu/smith
C1
c at t1
time
mature RNA
adult
94
transformation_of
A transformation_of B =Def.
Every instance of A was at some earlier time an
instance of B
adult transformation_of child
http://ontology.buffalo.edu/smith
95
embryological development
C
c at t
http://ontology.buffalo.edu/smith
C1
c at t1
96
tumor development
C
c at t
http://ontology.buffalo.edu/smith
C1
c at t1
97
derives_from
C
C1
c at t
c1 at t1
time
C'
c' at t
instances
ovum
zygote derives_from
sperm
http://ontology.buffalo.edu/smith
98
Request from Bill Bug
How best to effectively bring together:
- spatial/morphological ontologies;
- neuroscience terminologies (e.g.,
NeuroNames) and;
- data-centric neuroanatomical indexing
systems (voxel-based 3D atlases);
to promote optimal integration of
neuroscience data sets that include a
spatial component, however coarse.
http://ontology.buffalo.edu/smith
99
A suite of defined relations between
universals
Foundational is_a
part_of
Spatial
Temporal
Participation
located_in
contained_in
adjacent_to
transformation_of
derives_from
preceded_by
has_participant
has_agent
http://ontology.buffalo.edu/smith
100
Logical Theory of Spatial Relations
RCC: Region-Connection Calculus
(Leeds Qualitative Spatial Reasoning
Group)
Cf. Dameron et al. Modeling
dependencies between relations to
ensure consistency of a cerebral
cortex anatomy knowledge base
http://ontology.buffalo.edu/smith
101
Principles
1 anatomical structure  1 region
has_location
Define the relationships of adjacency,
connectedness etc. using RCC-8 and its
extensions
PO
DC
EC
http://ontology.buffalo.edu/smith
TPP
NTPP
EQ
102
Example 1
Reasoning with part and location at the
instance level:
Frontal Lobe
Inferior Frontal Gyrus
http://ontology.buffalo.edu/smith
Operc. Pars of
Inferior Frontal Gyrus
103
Example 2
Reasoning with location, continuity and
external connection
Frontal Lobe
PreCentral Gyrus
http://ontology.buffalo.edu/smith
PostCentral Gyrus
104
Extension to the 3-D case
B
x
y
time
SNAP-ti.
SPAN
substances x, y participate in process B
http://ontology.buffalo.edu/smith
slice of
x’s life
105
Most ontologies are execrable
But some good ontologies do already
exist
• as far as possible don’t reinvent
• use the power of combination and collaboration
• ontologies are like telephones: they are valuable
only to the degree that they are used and
networked with other ontologies
• but choose working telephones
• most UMLS telephones were broken from the
start
http://ontology.buffalo.edu/smith
106
Why do we need rules/standards
for good ontology?
Ontologies must be intelligible both to humans (for
annotation) and to machines (for reasoning and
error-checking): unintuitive rules for
classification lead to errors
Intuitive rule facilitate training of curators and
annotators
Common rules allow alignment with other
ontologies
Logically coherent rules enhance harvesting of
content through automatic reasoning systems
http://ontology.buffalo.edu/smith
107
To the degree that basic rules
of good ontology are not
satisfied, error checking and
ontology alignment will be
achievable, at best, only
– with human intervention
– via force majeure
– with unstable results
http://ontology.buffalo.edu/smith
108
Current practice in the domain of
clinical research
Results of clinical trials are organized too
tightly around specific diagnostic criteria
imposed by specific, local, hypotheses
A change in criteria forces a costly reexamination and re-coding of all existing
records to make them usable in future
hypothesis generation and testing.
http://ontology.buffalo.edu/smith
109
How to solve this problem?
Just as clinical hypotheses need to be tied
to basic science, so special-purpose
application ontologies need to be tied to
general-purpose reference ontologies
http://ontology.buffalo.edu/smith
110
How to solve this problem?
We separate
data as interpreted in terms of current criteria
from
the structure of the underlying biomedical reality
and ensure that the first is stored and processed
always by using terms drawn from a shared,
stable representation (a reference ontology) of the
latter.
Diagnostic criteria for a disease can then be
changed but we will still maintain access to the
data relevant to all prior diagnosed cases of the
disease in question.
http://ontology.buffalo.edu/smith
111
Not only data needs to be aligned
through pre-established reference
ontologies, so also does software
Currently, application ontologies are built afresh
for each new application
They commonly introduce new idiosyncrasies of
terminology, format or logic, plus simplifications
or distortions of their subject-matters.
This may do no harm in relation to the specific
application (for example radiology, tissue
classification, cancer staging) – and keeps the
software simple
http://ontology.buffalo.edu/smith
112
But what happens
when other applications want to use the data
annotated in their terms, or when we need to
extend to a larger portion of biomedical reality?
Now the expanded ontology will no longer be
compatible with the software designed for its
original application.
Different groups now need to start working with
different and incompatible versions of an
ontology, engendering a spiralling complexity as
these different versions themselves become
revised and extended for different purposes.
http://ontology.buffalo.edu/smith
113
The solution
The methodology of always developing
application ontologies against the
backgrund of formally robust reference
ontologies can both counteract these
tendencies toward ontology proliferation
and ensure the interoperability of
application ontologies as they become
further developed in the future.
http://ontology.buffalo.edu/smith
114
The methodology of reference
ontologies
can provide locally developed application
ontologies with cross-granular
understanding of the ways processes at
the gene and protein level are linked to
clinically salient processes at coarser
granularity
and it can allow them take advantage of
existing logical tools and methods for
reasoning across large bodies of data.
http://ontology.buffalo.edu/smith
115
An application ontology
is comparable to an engineering artifact
such as a software tool. It is constructed
for a specific practical purpose.
Examples:
NCIT
FuGO Functional Genomics Investigation
Ontology
http://ontology.buffalo.edu/smith
116
A reference ontology
A reference ontology has a unified subject-matter,
which consists of entities existing independently
of the ontology, and it seeks to optimize
descriptive or representational adequacy to this
subject matter.
A reference ontology is analogous to a scientific
theory. Thus it consists of representations of
biological reality which are correct when viewed
in light of our current understanding of reality,
and it must be subjected to updating in light of
scientific advance.
Example: The Foundational Model of Anatomy
http://ontology.buffalo.edu/smith
117
Current Best Practice
http://ontology.buffalo.edu/smith
118
http://ontology.buffalo.edu/smith
119
Anatomical
Structure
Anatomical Space
Organ Cavity
Subdivision
Organ
Cavity
Organ
Serous Sac
Cavity
Subdivision
Serous Sac
Cavity
Serous Sac
Organ
Component
Organ
Subdivision
Pleural Sac
Pleural
Cavity
Parietal
Pleura
Interlobar
recess
Organ Part
Mediastinal
Pleura
http://ontology.buffalo.edu/smith
Tissue
Pleura(Wall
of Sac)
Visceral
Pleura
Mesothelium
of Pleura
120
The Foundational Model of Anatomy
Follows formal rules for ‘Aristotelian’ definitions
When A is_a B, the definition of ‘A’ takes the form:
an A =def. a B which ...
a human being =def. an animal which is rational
http://ontology.buffalo.edu/smith
121
FMA Example
Cell =def. an anatomical structure which
consists of cytoplasm surrounded by a
plasma membrane with or without a cell
nucleus
Plasma membrane =def. a cell part that
surrounds the cytoplasm
http://ontology.buffalo.edu/smith
122
The FMA regimentation
Brings the advantage that each definition
reflects the position in the hierarchy to which
a defined term belongs.
The position of a term within the hierarchy
enriches its own definition by incorporating
automatically the definitions of all the terms
above it.
The entire information content of the FMA’s
term hierarchy can be translated very cleanly
into a computer representation
http://ontology.buffalo.edu/smith
123
GO now adopting structured definitions
which contain both genus and differentiae
Species =def Genus + Differentiae
neuron cell differentiation =def
differentiation by which a cell acquires features of a neuron
http://ontology.buffalo.edu/smith
124
Ontology alignment
One of the current goals of GO is to align:
Cell Types in GO
with
cone cell fate commitment
Cell Types in the Cell Ontology
retinal_cone_cell
keratinocyte differentiation
keratinocyte
adipocyte differentiation
fat_cell
dendritic cell activation
dendritic_cell
lymphocyte proliferation
lymphocyte
T-cell homeostasis
T_lymphocyte
garland cell differentiation
garland_cell
heterocyst cell differentiation
heterocyst
http://ontology.buffalo.edu/smith
125
Alignment of the two ontologies will permit the
generation of consistent and complete definitions
GO
id: CL:0000062
name: osteoblast
def: "A bone-forming cell which secretes an extracellular matrix.
Hydroxyapatite crystals are then deposited into the matrix to form
bone." [MESH:A.11.329.629]
is_a: CL:0000055
relationship: develops_from CL:0000008
relationship: develops_from CL:0000375
+
Cell type
=
Osteoblast differentiation: Processes whereby an
osteoprogenitor cell or a cranial neural crest cell
acquires the specialized features of an osteoblast, a
bone-forming cell which secretes extracellular matrix.
http://ontology.buffalo.edu/smith
New Definition
126
Other Ontologies to be aligned
with GO
Chemical ontologies
– 3,4-dihydroxy-2-butanone-4-phosphate synthase
activity
Anatomy ontologies
– metanephros development
GO itself
– mitochondrial inner membrane peptidase activity
 OBO core
http://ontology.buffalo.edu/smith
127
eventually to comprehend all of
OBO
http://ontology.buffalo.edu/smith
128
is_a
Anatomical
Structure
Anatomical Space
Organ Cavity
Subdivision
Organ
Cavity
Organ
Serous Sac
Cavity
Subdivision
Serous Sac
Cavity
Serous Sac
Organ
Component
Organ
Subdivision
Pleural Sac
Pleural
Cavity
Parietal
Pleura
Interlobar
recess
Organ Part
Mediastinal
Pleura
http://ontology.buffalo.edu/smith
Tissue
Pleura(Wall
of Sac)
Visceral
Pleura
Mesothelium
of Pleura
129
Anatomical Entity
Physical
Anatomical Entity
Conceptual
Anatomical Entity
-is a-
Anatomical
Relationship
Material Physical
Anatomical Entity
Body
Substance
Biological
Macromolecule
Non-material Physical
Anatomical Entity
Anatomical
Space
Anatomical
Structure
Cell
Cell
Organ
Tissue
Part
Part
http://ontology.buffalo.edu/smith
Organ
Organ
System
Body
Part
Human
Body
130
The Anatomy Reference Ontology
is organized in a graph-theoretical structure
involving two sorts of links or edges:
is-a (= is a subtype of )
(pleural sac is-a serous sac)
part-of
(cervical vertebra part-of vertebral column)
http://ontology.buffalo.edu/smith
131
at every level of granularity
http://ontology.buffalo.edu/smith
132
What do the kidneys do?
Modularity
http://ontology.buffalo.edu/smith
133
NEPHRONHow
does a kidney work?
http://ontology.buffalo.edu/smith
134
FUNCTIONAL
SEGMENTS
Nephron Functions
http://ontology.buffalo.edu/smith
135
Top-Level Categories in the FMA
anatomical
entity
physical
anatomical entity
material physical
anatomical entity
anatomical
structure
body
substance
non-physical
anatomical entity
non-material physical
anatomical entity
body
space
http://ontology.buffalo.edu/smith
boundary
anatomical
attribute
anatomical
relationship
136
anatomical structure (cell, lung, nerve,
tooth)
result from the coordinated expression of
structural genes
have their own 3-D shape
http://ontology.buffalo.edu/smith
137
portion of body substance
inherits its shape from container
portion of urine
portion of menstrual fluid
portion of blood
http://ontology.buffalo.edu/smith
138
anatomical space
cavities, conduits
http://ontology.buffalo.edu/smith
139
anatomical attribute
mass
weight
temperature
your temperature
its value now
http://ontology.buffalo.edu/smith
140
anatomical relationship
located_in
contained_in
adjacent_to
connected_to
surrounds
lateral_to (West_of)
anterior_to
http://ontology.buffalo.edu/smith
141
boundary
bona fide / fiat
http://ontology.buffalo.edu/smith
142
www.enel.ucalgary.ca/ People/Mintchev/stomach.htm
Connectedness and
Continuity
The body is a highly connected entity.
Exceptions: cells floating free in blood
continuous_with,
attached_to (muscle to bone)
synapsed_with (nerve to nerve and nerve to
muscle)
Two continuants are continuous on the instance
level if and only if they share a fiat boundary.
http://ontology.buffalo.edu/smith
143
Anatomical
Structure
Anatomical Space
Organ Cavity
Subdivision
Organ
Cavity
Organ
Serous Sac
Cavity
Subdivision
Serous Sac
Cavity
Serous Sac
Organ Part
Organ
Component
Organ
Subdivision
basis for generalization
to other species
Pleural Sac
Pleural
Cavity
Parietal
Pleura
Interlobar
recess
Mediastinal
Pleura
http://ontology.buffalo.edu/smith
Tissue
Pleura(Wall
of Sac)
Visceral
Pleura
Mesothelium
of Pleura
144
Anatomical
Structure
Anatomical Space
Organ Cavity
Subdivision
Organ
Cavity
Organ
Serous Sac
Cavity
Subdivision
Serous Sac
Cavity
Serous Sac
Organ
Component
Organ
Subdivision
Pleural Sac
Pleural
Cavity
Parietal
Pleura
Interlobar
recess
Organ Part
Mediastinal
Pleura
http://ontology.buffalo.edu/smith
Tissue
Pleura(Wall
of Sac)
Visceral
Pleura
Mesothelium
of Pleura
145
Web-Based Representations of Neuroanatomy
http://ontology.buffalo.edu/smith
146
http://ontology.buffalo.edu/smith
147
includes Neuronames
http://ontology.buffalo.edu/smith
148
http://ontology.buffalo.edu/smith
149
with thanks to Christine
Fennema-Notestine and
Jessica Turner
Human Morphometry and
Function BIRN Testbeds
CBiO/BIRN Workshop 2006
BIRN Ontology Needs
GOAL: User will employ BIRN interface and Mediator
to perform scientific queries on data from
•
•
•
•
structural and functional MRI experiments
clinical assessments
psychiatric interviews
and/or behavioral experiments
BIRN needs for common vocabularies
– Mediator needs to talk across databases to find
relevant/similar information; this requires linking of
concepts to table columns and values
– Query interface needs semantic network to find related
information
http://ontology.buffalo.edu/smith
151
Example queries:
– Find all datasets of schizophrenics with
structural and functional imaging data related
to working memory
– Find the correlation between hippocampal
volume and working memory performance in
AD subjects
http://ontology.buffalo.edu/smith
152
MBIRN priorities
“To relate clinical assessments, cognitive
function, and neuroanatomy within mBIRN’s
multi-site AD sample, with future branching into
neuropsychiatric measures”
– Only a high quality reference ontology of
neuro(patho)anatomy from the macroscopic to
the subcellular levels of granularity can give you
this
http://ontology.buffalo.edu/smith
153
Existing neuroanatomical ontology
Brain
…
Cerebellum
Cerebrum
Cerebral white matter
Frontal cortex
…
…
Need to create
related “function”based ontology
Cerebral cortex
CVLT
Temporal cortex
Memory
Superior temporal
…
…
Mesial temporal
Amygdala
http://ontology.buffalo.edu/smith
Hippocampus
154
‘Need to create related “function”based ontology’
UMLS: mental process is_a organism function
Function vs. functioning
Many entities have functions which they never
realise
A has function B = A can realise B (under
which circumstances?)
http://ontology.buffalo.edu/smith
155
‘Need to create related “function”based ontology’
A function is a disposition of an independent
continuant to engage in corresponding
processes.
To what extent are the various functions identified
by BIRN are in fact complex processes with
many less complex processes as their parts.
How are functions different from disfunctions /
malfunctions ?
Are all function such that their execution is good
for the organism?
http://ontology.buffalo.edu/smith
156
‘Need to create related “function”based ontology’
“You cannot classify parts of the brain on the basis
of which parts can think, remember, effect
movement or perceive various kinds of
sensations, just as you cannot sort anatomical
entities on the basis of which can pump, digest,
secrete, fertilize or stabilize.”
“It is impossible to create an inheritance class
subsumption hierarchy of neuroanatomical
entities at any meaningful depth on the basis of
function.”
http://ontology.buffalo.edu/smith
157
Assessment
Brain
Neuropsychology
Cerebrum
Amnesia
Cognition
Cerebral cortex
Frontal
Temporal
Memory
Learning
Cognitive
impairment
Mesial temporal
CVLT
Hippocampus
http://ontology.buffalo.edu/smith
Task and score description158
Can we reason on the basis of a graph of this sort?
Behavioral
Paradigm
Assessment
SCID-Patient
CVLT
SIRP
Working memory
Attention
Breathhold
Long Term memory
Memory
Cognitive
Process
http://ontology.buffalo.edu/smith
Action
159
Bonfire Relations
relation:
the type of relation between the concept to the left and the concept to the ri
PAR = Parent
CHD = Child
SIB = Sibling
RB = Broader Relationship
RN = Narrower Relationship
RO = Other Relationship
http://ontology.buffalo.edu/smith
160
BIRN Relations
UMLS (PAR, CHD, RN, RO, RB, SY).
RB: has a broader relationship
RN: has a narrower relationship
RO: has relationship other than
synonymous, narrower, or broader
CHD: has child relationship in a
Metathesaurus
SIB: has sibling relationship in a
Metathesaurus source vocabulary
http://ontology.buffalo.edu/smith
161
“Circular Hierarchical Relationships in the UMLS:
Etiology, Diagnosis, Treatment, Complications and Prevention”
Olivier Bodenreider
Topographic regions: General terms
Physical anatomical entity
Anatomical spatial entity
Anatomical surface
Body regions
Topographic regions
http://ontology.buffalo.edu/smith
162
MeSH
MeSH Descriptors
Index Medicus Descriptor
Anthropology, Education, Sociology and
Social Phenomena (MeSH Category)
Social Sciences
Political Systems
National Socialism
National Socialism is_a Political Systems
National Socialism is_a Anthropology ...
http://ontology.buffalo.edu/smith
163
MeSH
National Socialism is_a MeSH Descriptor
Cf. NeuroNames:
Ontology =def a codification of the
relationships between words and concepts
http://ontology.buffalo.edu/smith
164
Human BIRN data includes:
Participant demographics such as age, gender, …
Clinical and psychiatric information
– Assessments used, data type
– Diagnostic information
Behavioral data during fMRI tasks
– Need to know how to interpret that (“is a button 1 response
a yes or a no?”)
Raw structural and functional images
– Need information about data collection and preprocessing
methods
Single-subject and group level analyses and results
– Need information about analytic methods used
http://ontology.buffalo.edu/smith
165
Areas where application ontologies will be needed
Participant demographics such as age, gender, …
Clinical and psychiatric information
– Assessments used, data type
– Diagnostic information
Behavioral data during fMRI tasks
– Need to know how to interpret that (“is a button 1 response
a yes or a no?”)
Raw structural and functional images
– Need information about data collection and preprocessing
methods
Single-subject and group level analyses and results
– Need information about analytic methods used
http://ontology.buffalo.edu/smith
166
Bottom-up search:
User’s dataset contains the CVLT – what does it measure?
• Search for CVLT
• Related to PARENT concepts like “Neuropsychological
tests” or “Assessment Scales” or SIBLING concepts of
other tests
• What is the CVLT? This doesn’t answer the user’s
question.
• Need relationship links to function: memory and
learning
• Need relationship links to structure: anatomical
regions reflected in change of performance on this
measure  hippocampus
http://ontology.buffalo.edu/smith
167
Top-down search:
User interested in studying the relationship
between hippocampal volume and memory
performance in Alzheimer’s disease.
• Search for measures of memory
• Would like to see memory linked to CVLT
• Would like to see memory linked to hippocampus at a
very basic level
• Would like to see links to potential disorders assessed,
e.g., amnesia or AD
http://ontology.buffalo.edu/smith
168
Ontology/Terminology Infrastructure
GOAL: to allow database mediation and
scientific queries for multi-site clinical
neuroimaging studies. This requires the
relationship of database tables to
concepts and to relate brain structure
and function through neuroanatomical
regions, neuropsychological and
cognitive terms, and clinical
assessments.
http://ontology.buffalo.edu/smith
169
Ontology/Terminology Infrastructure
– To do this, the Mediator relies in part on
defined terms/concepts to define relationships
between similar terms from different
databases.
– If a user is interested in data related to “long
delay free recall," it is important to also
include information related to “memory." This
type of relational knowledge is critical to find
other values in other databases that have
similar information.
http://ontology.buffalo.edu/smith
170
Ontology/Terminology Infrastructure
In addition, the ontology will provide a semantic
network; for a user searching for “memory"
information, related information would include
– Cognitive terms, e.g., recall, recognition, short and
long term memory
– Assessment terms, e.g., California Verbal Learning
Test
– “Disorders of” terms, e.g., Alzheimer’s disease is a
disorder of memory
How block information overload?
http://ontology.buffalo.edu/smith
171
Bottom-up search:
User’s resultant dataset contains the MMSE – the user
asks what does it measure?
• Search for MMSE concept
• Related to PARENT concepts like Neuropsychological tests”
or “Assessment Scales” or SIBLING concepts of other tests
• What is the MMSE? This doesn’t answer the user’s question.
• Need relationship links to function: general cognitive ability,
cognitive impairment, dementia severity, brain damage …
• Need relationship links to structure: anatomical regions
reflected in change of performance on this measure, although
a relatively non-specific measure
http://ontology.buffalo.edu/smith
172
Top-down search:
What variables exist that would provide a measure
of general cognitive function and dementia
severity?
• Search for measures of (general) cognitive function
• Would like to see general cognitive ability, cognitive
impairment, dementia severity linked to MMSE
• Would like to see general cognitive ability, cognitive
impairment, dementia severity linked to
neuroanatomical regions, simply brain in this case
• Would like to see links to potential disorders
measured, e.g., AD
http://ontology.buffalo.edu/smith
173
NeuroNames (with thanks to Onard Mejino)
has a limited scope.
It deals with neuroanatomical structures only at the
gross level. No cellular, subcellular or
macromolecular entities are represented.
The peripheral nervous system and the spinal cord
are not included.
It represents structures from different species
(human, macaque and rodent) in the same
hierarchy.
http://ontology.buffalo.edu/smith
174
NN’s main hierarchy
•
•
•
•
is a partonomy based on mutually exclusive and exhaustive
volumetric partitions, the equivalent of regional partition in the
FMA.
The partonomy supports only ONE partition view and therefore
does not accommodate
other recognized regional partitions like Brodman areas (treated
as “ancillary structures”)
constitutional parts like the internal pyramidal layer of neocortex
and the vasculature of neuraxis (entities that have important
clinical significance)
new partitions advanced by new technology like gene
expression mappings or radiologic imaging techniques
partitions determined by formal spatial region-based ontologies
like RCC
http://ontology.buffalo.edu/smith
175
The Neuronames partonomy
will serve at best as an application ontology for
annotating segmented images of the brain.
But it will still be very difficult to link the
annotated image data to all the other types of
data which will BIRN will need to describe
 a reference ontology of neuroanatomy is a first
priority.
http://ontology.buffalo.edu/smith
176
Neuronames
• Since univocity is not enforced in the
literature of neuroanatomy, e.g. the term
‘Basal ganglia’ represents different
structures when used in association with
anatomic, functional and clinical views.
• How will NN resolve or clarify this?
http://ontology.buffalo.edu/smith
177
Neuronames
• entities are primarily identified on the basis of stains that
distinguish gray matter from white matter
• thus not on principles or rules that define the type of the
entity in question, thereby yielding a partition not in
accord with the standards commonly accepted for
representing the rest of the body.
• gray matter and white matter are viewed as tissues. But
tissue is usually defined as an aggregate of similarly
specialized cells and intercellular matrix.
• yet gray matter consists not of cells but of cell bodies,
white matter not of cells but of neurites
http://ontology.buffalo.edu/smith
178
Neuronames
• gives no explicit definitions, and the representations it
gives (e.g. of the Fourth Ventricle*) are often at odds with
consensual usage
• hence scalability, extendability, combinability with other
ontologies is limited – how then can it be used to bridge
research efforts at the genomic / proteomic level with
those at the clinical level?
• Information unique to neuroanatomical entities such as
axonal input/output relationships, connectivity, neuron
type, neurotransmitter and receptor types are
indispensable in establishing and understanding both
structural and physiological relationships among
neuroanatomical entities and their relationship with the
rest of the body.
http://ontology.buffalo.edu/smith
179
BIRNLex
does provide definitions, normally taken
over from UMLS
http://ontology.buffalo.edu/smith
180
Rules for definitions
‘A’ = child term
‘B’ = parent term
an A =def a B which Cs
If a definition is correct it should always make
sense to substitute ‘a B which Cs’ for ‘an A’
“A human being is subject to processes of aging”
“A rational animal is subject to processes of aging”
http://ontology.buffalo.edu/smith
181
BIRNLex
The eye =def.
The eyeball and its constituent parts, e.g. retina
mouse =def.
common name for the species mus musculus
http://ontology.buffalo.edu/smith
182
BIRNLex
http://ontology.buffalo.edu/smith
183
BIRNLex
http://ontology.buffalo.edu/smith
184
BIRNLex
http://ontology.buffalo.edu/smith
185
BIRNLex
bear in mind always that
your ontology needs
to be interoperable with
other ontologies
http://ontology.buffalo.edu/smith
186
BIRNLex
bear in mind always that
your ontology needs
to be interoperable with
other ontologies
http://ontology.buffalo.edu/smith
187
BIRNLex
surface =def 3D segmentation obtained by
fitting a polygonal mesh around the boundary
of an object of interest, creating a 3D surface
Concept =def Generic ideas or categories
derived from common properties of objects,
events, or qualities, usually represented by
words or symbols
http://ontology.buffalo.edu/smith
188
BIRNLex
brain imaging =def none; synonymous with
positrocephalogram, nos
CA1 =def CA1 cytoarchitectonic field of
hippocampus
cognitive process = def. conceptual function
or thinking in all its forms
http://ontology.buffalo.edu/smith
189
BIRNLex and UMLS-SN
Rest =SN Daily or Recreational Activity
Principal Investigator =SN Professional or
Occupational Group
Left handedness =SN Organism Attribute
Ambidextrous =SN Finding
Brain Imaging =SN Diagnostic Procedure
Brain Mapping =SN Diagnostic Procedure & Research
Activity
Healthy Adult =SN Finding
http://ontology.buffalo.edu/smith
190
BIRNLex
http://ontology.buffalo.edu/smith
191
Mouse BIRN: Ontologies
Mouse
BIRN:
Maryann Martone
and Ontologies
Maryann Martone and Bill Bug
Bill Bug
2005 All Hands Meeting
Use of Ontologies in BIRN
•Databases
•Enforces semantic consistency within a database
•Data Sharing
•Establishes semantic relationship among concepts contained in
distributed databases
•Data integration
•Bridging across multiscale and multimodal data
•Concept-based queries:
•Ontologies can be used to alter semantic context to present a view of the
conceptual aspects of a data set or meta-analysis result most relevant to a
particular neuroscientist
http://ontology.buffalo.edu/smith
193
Objectives of Working
Group
Educate BIRN participants on the use of ontologies and
associated tools for data integration
– Tuesday (PM) and Wednesday (AM)
Develop a set of ontology resources for BIRN
participants, based on existing ontologies where
possible
Identify areas that are not well covered by existing
ontologies for possible development.
***Develop a clear set of policies and procedures for
working with ontologies
– Including curation, addition of core ontologies, extension of
ontologies, mapping of databases to ontologies
http://ontology.buffalo.edu/smith
194
Goals of OTF
•Provide a dynamic knowledge infrastructure to support integration and
analysis of BIRN federated data sets, one which is conducive to
accepting novel data from researchers to include in this analysis.
•Identify and assess existing ontologies and terminologies for
summarizing, comparing, merging, and mining datasets. Relevant
subject domains include clinical assessments, demographics, cognitive
task descriptions, imaging parameters/data provenance in general, and
derived (fMRI) data.
•Identify the resources needed to achieve the ontological objectives of
individual test-beds and of the BIRN overall. May include finding other
funding sources, making connections with industry and other consortia
facing similar issues, and planning a strategy to acquire the necessary
resources.
http://ontology.buffalo.edu/smith
195
BIRN Ontology Resources
Bonfire Ontology
Browser and
Extension Tool
Mouse BIRN Ontology Resource Page
http://ontology.buffalo.edu/smith
http://nbirn.net/Resources/Users/Ontologies/
196
Current Ontology Development
by Mouse BIRN Participants
Developmental Ontology
• Seth Ruffins, Cal Tech
Subcellular Anatomy
• Maryann Martone and Lisa Fong, UCSD
http://ontology.buffalo.edu/smith
197
Ontology for Subcellular Anatomy of Nervous System
http://ontology.buffalo.edu/smith
198
CCDB Dictionary
Term
Ontology
ConceptID
Semantic Type
Definition
Cerebellum
UMLS
C0007765
Body Part, Organ,
or Organ
Component
Part of the metencephalon that
lies in the posterior cranial fossa
behind the brain stem. It is
concerned with the coordination
of movement. (MSH)
Glial Fibrillary
Acidic Protein
UMLS
C0017626
Amino Acid, Peptide,
or Protein,
Biologically Active
Substance
An intermediate filament protein
found only in glial cells or cells of
glial origin. MW 51,000. (MSH)
Medium Spiny
Neuron
Bonfire
BID000012
Cell
Small (10-15 µm in diameter)
projection neurons found in
neostriatum, possessing a
rougly spherical dendritic tree
composed of 3-5 primary
dendrites. Dendrites are
covered with dendritic spines.
Purkinje cell
UMLS
C0034143
Cell
large branching neurons of the
middle layer of cerebellar cortex,
characterized by vast arrays of
dendrites; the output neurons of
the cerebellar cortex.
http://ontology.buffalo.edu/smith
199
Some Areas of Interest to
BIRN
Navigating through Multi-resolution
information
Linking animal and human imaging
data
brain
Entopeduncular
nucleus
Globus pallidus,
internal segment
Animal Model
Disease Process
cerebellum
cerebellar cortex
Purkinje cell
•***Map between Human and Animal models
dendritic spine
•Functional assessment
http://ontology.buffalo.edu/smith
200
Anatomical Knowledge Sources
Foundational model of anatomy
Neuronames (Brain Info)***
BAMS***
Adult Mouse Anatomical Dictionary
(Edinburgh/GO)
“Although BIRN is an open, diverse and fluid environment, the use of
ontologies for enhanced interoperability will be pointless if we allow
random use of ontologies. The OTF recommends that there be a set of
ontologies that are approved for use and a set of policies and procedures
for adding or creating additional knowledge sources. Current knowledge
sources that are currently in use include UMLS, GO, LOINC, SNOMED,
NEURONAMES.”
http://ontology.buffalo.edu/smith-OTF report to BEC 8/05
201
Other Resources Likely of Use
Mouse Phenome Project:
a collection of phenotypic and genotypic
data for the laboratory mouse
anatomy
behavior
biological factors
blood
cancer
diet effects
drug effects, toxicity
genotype
heart, lung
intake, metabolism
musculoskeletal
neurosensory
reproduction
http://ontology.buffalo.edu/smith
202
Neuronames-UMLS-Smart Atlas
•Mapping of rodent nomenclature onto UMLS
•Neuronames has now included many of the terms
•Using concepts in Neuronames and Paxinos to create new
hierarchy
http://ontology.buffalo.edu/smith
203
What do we need to do in the
next year
Identify areas of mouse BIRN not covered
– Do ontologies exist?
– If not, do we develop them
What known ontologies should be added to BIRN
ontology resources
– Who will handle semantic concordance
– How do we represent these in BIRN?
Mapping databases to ontologies
– Time frame
– What should be mapped?
– Who will do this at each site
http://ontology.buffalo.edu/smith
204
Mouse BIRN Global Conceptual
Schema
Project
Experiments
Molecular
Distributions
Atlas
Subject
Experimental
Data
Microarray
Results
Region of
Interest
Animal Type
Anatomical
Properties
Images
Worked with Data Integration group to define global schema
http://ontology.buffalo.edu/smith
205