OBO-Edit tutorial

Download Report

Transcript OBO-Edit tutorial

OBO-Edit tutorial
David Osumi-Sutherland
FlyBase / Virtual Fly Brain / OBO-Edit
Working Group (OEWG)
Resources
• Slides and test tutorial can be downloaded
from the bottom of the Tutorial wiki page:
– http://www.bioontology.org/wiki/index.php/OBOEdit
• The latest version of OE2 (beta release), is
here:
– http://oboedit.org/2.1betarelease/
Your backgrounds
• Who here has never used an ontology editing
tool?
• Who here is an OBO-Edit user?
– If you are, have you used Cross-Products?
• Who here is a Protégé user?
• Protégé 3
• Protégé 4
– With reasoner?
• For those of you who speak OWL(ish), I'll try to provide translations to
OWL Manchester syntax.
OBO 101
•
An ontology contains terms (e.g. hand) and
relationships (e.g. hand has_part finger)
•
Terms refer to types (classes).
•
Types are classifications of things
(instances) in the real world, based on
some set of criteria.
– My left hand is an instance of the type hand
•
The criteria for membership of a class
(type) is recorded using textual definitions,
relationships, or some combinations of the
two
– name: hand
– def: “An anatomical structure that has four
fingers and a thumb and is attached to the
end of an arm.” [reference: DOS]
– relationship: hand has_part finger
– relationship: hand has_part thumb
– relationship: part_of arm
Image from Gray’s Anatomy (copyright
expired)
OBO basics- instance/type distinction
• If OBO terms refer to types, why worry about
instances?
– instances are central to defining OBO type-level
relations
• OBO <-> OWL:
– OWL class = OBO type
– OWL individual = OBO instance
OBO relations
• Instance level relations (by convention written in bold)
– 'my left little finger' part_of 'my left hand'
• Note – quantifiers – all, some – are not needed.
• Type level relations (by convention, written in italic).
– Type level relations are defined using instance level relations + quantifiers.
• In OBO, the quantifiers (all, some) are buried in the definition of the
type-level relation.
– e.g.- X part_of Y is defined as:
» for all x,t if Xxt then there is some xy such that Yyt and x
part_of y at t
• OBO <-> OWL:
– OBO instance level relation = OWL property
– In OWL, quantifiers are used explicitly in ontology construction when relating
classes
• OBO: finger part_of hand can be written in OWL using the OBO instance
level relation with added quantifiers: all finger part_of some hand *
– OBO is_a = OWL SubClassOf
*Ignoring the time component of the OBO definitions, as this cannot be expressed in OWL
Quantifiers
• It is critical to be aware of the direction of quantifiers when using OBO
relations:
True: All breasts are part of some human
False: All humans have breasts
True: All vertebrate motor neurons release neurotransmitter acetylcholine
False: All neurons that release acetylcholine are vertebrate motor neurons.
Be especially careful in cases where the instance level relation is symmetric:
True: All lion’s mane connected_to some lion’s neck
False: All lion’s neck connected_to some lion’s mane
Relationships and definitions
• You should think of the relationships a term
has as a formalised part of their definition.
• A regular relationship in OBO (SubClassOf in
OWL) specifies necessary conditions for
membership of a class.
– e.g. finger part_of hand (all finger part_of some
hand) states that a necessary condition of being in
the class finger is to be part of some hand.
Transitivity
• If a relation is transitive:
– A rel B rel C rel D therefore A rel D
• U1 neuron is_a U neuron is_a motor neuron
– therefore U1 neuron is_a motor neuron
• ellipsoid body part_of central complex part_of adult
brain
– therefore ellipsoid body part_of adult brain
Transitivity and redundancy
• ellipsoid body part_of central complex part_of
adult brain
– therefore ellipsoid body part_of adult brain
• therefore asserting the relationship "ellipsoid body
part_of adult brain" in your ontology would be
redundant.
Some simple reasoning
• U neuron part_of larval antennal segment
– (All U neuron part_of some larval antennal segment)
• U1 neuron is_a U neuron
• therefore U1 neuron part_of larval antennal
segment
– (All U neuron part_of some larval antennal segment;
U1 neuron SubClassOf U neuron therefore…)
• So, adding 'U1 neuron' part_of 'larval antennal
segment’ would be redundant.
Demo – OBO basics
Basic OBO-Edit2 editing setup
•
•
•
•
•
•
- 2 x Ontology Tree Editor (OTE)
- One parent editor
- One text editor
- One search panel
- One reasoner manager
- One graph viewer (Note - currently only
works with Link Pile Reasoner on)
Browsing
Browsing - Trees
The ontology tree editor is a good way to browse down the
ontology graph, but not all parents visible in one view
Click to
expand or
contract
branch
help
Preferences
save a picture
Quick Filtering
Browsing - parents
The parent editor provides a quick way to check all parental
relationships – usually these are not all these are visible in a single
tree view
help
Preferences
save a picture
help
Preferences
save a picture
Graph viewer setup
For an uncluttered view:
Browsing - graphs
A good way to view and browse ancestral relationships, graphs of ancestors via
transitive relations answer questions
e.g.- what is X? What is X part of? What does X develop from?
Note – currently requires link pile reasoner to be turned on.
Browsing – The Text Editor
Basic Searching - single leg
Basic searching – multi-leg
AND/OR
Add new leg
nesting (parentheses)
Remove leg
All searches can also be filters or
renders
Rendering options
Editing
Drag and drop editing in the ontology
tree editor (OTE)
• Left clicking a term choose it
• Right clicking displays a menu of editing
options
• Dragging and dropping single or multiple
terms allows terms to be copied, moved or
merged.
Global vs local selection modes
local mode
-selection in other
components doesn’t
affect selection here
global mode
- 2 way auto
sync with other
components
Drag and drop term move
Drag and drop term move
Drag and drop term move
Drag and drop term
merge
Making new terms
• First, make sure your ID generator is set up
correctly:
Edit profile
ID prefix
ID length
start from
end at
Create new child
Committing
Check this box to commit text edits automatically.
Note, committing will not change your ontology file
Deletion, obsoletion, destruction
OTE – right click menu:
… deletes the relationship between the selected term and its immediate
parent in the ontology tree editor. When the selected term has only one
parent, this option switches to:
… changes the status of term to obsolete. The OBO file retains
the ID for future reference and to prevent re-use. To indicate
replacement terms, drag suitable terms to the obsoleted term
=>
Within the context of a single editing session, or a pre-release file, you may
wish instead to destroy the term completely (BUT BE CAREFUL!)
Parent Editor
Delete parent relationship
Does what it says. But be careful it adds is_a
parents by default. For other relations, switch
after adding
Graph Editor
Graph Editor
Hide parent terms
Hide child terms
show parent terms
show child terms
hide term
• Right click provides editing options and
hide-all
• Choosing quick filtering => manageable
view
Making new relationships in the graph
editor – demo only
Time savers
• Cloning
– Right click clone option makes clone of chosen
term – identical in every way except for ID and
‘CLONE OF’ appended to term name
• Create multiple children
– Right click ‘create multiple children’ option. Does
what is says on the box. Pops up interface were a
bunch of new terms can be named at once.
Managing multiple inheritance
Disjunction
• X disjoint_from Y
– Nothing that is an instance of X is also an instance of Y.
• e.g. anatomical structure disjoint_from biological process
– Nothing exists that is an instance of both of these types.
– Extremely useful for error checking
– also speeds up some reasoners
• OBO <-> OWL
– 0BO disjoint_from ≅ OWL DisjointWith
Using the reasoner to check for
disjoint violations - demo
• In the test ontology, make one of the children of
biological process an is_a child of sensillum.
• Now run the link pile reasoner
• Undo
• Making disjoints - demo
– by hand
– automatically making all children disjoint (use
sparingly!)
Ways to classify neurons
• sensory
– Sensory modality
– Sense organ
•
•
motor
interneuron
– local
– relay
• neurotransmitter
– serotonergic,
– dopaminergic…
• Location
– cell body
– fasciculation pattern (e.g.- labial nerve)
– innervation pattern (antennal lobe glomerulus DL1)
all siblings in the same color are (probably) disjoint_from each other.
Multiple inheritance needed
• Name: ORN ab1a
• Def: A cholinergic olfactory neuron whose
dendrite innervates an ab1 basiconic sensillum on
the 3rd segment of the antenna. Like other
antennal olfactory neurons, it sends an axon
through the antennal nerve that innervates a
single antennal lobe glomerulus DL1
–
–
–
–
olfactory neuron
cholinergic neuron
antennal sensory neuron
DL1 innervating neuron
Multiple inheritance is very hard to manage by
hand
It is difficult to keep track of multiple
classification chains to:
ensure completeness;
avoid redundancy;
avoid introducing error due to inheritance
of classification criteria from a distant ancestor
Cross Products
• In OBO, cross products, also known as
intersections, provide a way to record
necessary and sufficient definitions.
• necessary and sufficient definitions allow
classification to be automated using reasoners
• OBO <-> OWL
– intersection_of ≅ EquivalentTO
Genus and Differentia
• Anatomy of a cross product:
– name: antennal sensillum
– genus: sensillum
– differentium: (part_of antenna)
• In OBO format
– intersection_of: sensillum
– intersection_of: part_of antenna
• In OWL:
– EquivalentTO: sensillum and part_of some antenna
OBO-Edit reasoners
• OBO-Edit2 currently has 3 reasoners
• In the near future, 2 of these will be retired,
leaving just the ‘rule based reasoner’ (RBR)
• Currently the link pile reasoner is needed for
flagging disjointness violations and for running
the graph viewer. For everything else you
should use the RBR
Cross Product Demo
• Choose the term 'antennal sensillum'
– Check out the text editor cross product tab
• Turn the reasoner off.
– You should see antennal sensillum at the root
• Run the rule-based-reasoner
– check out how the term is integrated nonredundantly into the classification hierarchy.
– view the parent editor
Cross Product Demo
• Using terms from other ontologies
– cholinergic neuron
– olfactory sensillum
Making new cross product terms
• Add a new root class:
Cross Product demo
• Making new cross products
– glutamatergic neuron
– taste sensillum
– chemosensory sensillum
• Re-run the rule-based-reasoner
• Check the sensillum hierarchy
– Explain how olfactory sensillum got its parent
Useful renders for working with crossproducts
• Cross-Product detector:
• Multiple asserted inheritance detector:
• Unclassified term detector:
• Incompletely classified term detector:
Combining Cross-Products with
relationships => hidden assertions
• How can we record the generalization – All neurons with the
function ‘smell’ are cholinergic?
–
–
–
–
–
name: olfactory receptor neuron
intersection_of: neuron
intersection_of: has_function smell
relationship: releases_neurotransmitter acetylcholine
(OWL: EquivalentTo( neuron AND has_function some smell ), SubClassOf(
releases_neurotransmitter some acetylcholine )
• Why is this potentially dangerous?
–
Future editors using ‘has_function smell’ to record the function of a neuron class
may not be aware that they are adding the assertion that the class is cholinergic
• Safeguards:
– if neurons classifed by neurotransmitter are disjoint, the reasoner can flag
contradictions.
– Keep a record of all examples (perhaps as a standardised comment)
– State this assertion in the term definition with a link to a supporting reference.
Hidden assertion demo
Detecting redundancy with the
reasoner
Use the Rule Base Reasoner*
Redundant relationship
*Link pile reasoner over-flags redundancy when cross-product terms are present
If the reasoner is flagging relationships
you don’t think are redundant
• Are you sure your intended meaning for a
relationship is transitive?
– e.g.- develops_from (transitive) is sometimes used
as if it means ‘directly develops from’ (nontransitive)
– In that case – request a new relation.
– If you are interested in how to link relations such as develops_from and
develops_directly_from so that they can be used for reasoning, please ask me
or Chris after the tutorial
Advanced Searching
• OE2 can combine string searching and logical
querying.
• Logical querying requires the reasoner to be
turned on.
Logical queries
** Note – reasoner required **
Find all sensilla that are part of some head:
The first leg of the search finds all subtypes of sensillum. The second leg finds all parts of
the head. The ‘matches all’ radio button ensures the two legs are combined by a
boolean AND.
OWL-DL: sensillum and part_of some head
Logical queries
** Note – reasoner required **
How is sensillum classified?
What does adPN DL1 develop_from ?
Note – this query has no OWL equivalent
Importing foreign ‘helper’ terms
• Formalising definitions is likely to require
terms from other ontologies.
• If you want to reason using the classification
from another ontology, you need to import
the full classification of each term.
• In order to keep up-to-date, you need a
mechanism to re-import the foreign terms you
use.
Creating and maintaining a helper
term filter
1.
2.
Open foreign ontology and run the rule based reasoner
In the search tool, add the following two filter legs for each helper term
First leg finds helper term
2nd leg finds terms that classify
helper term
3
Load / Save
Saving helper terms
Saving helper terms
Check filter terms
Avoid importing foreign ID rules
Make sure “allow
dangling parents” is
NOT checked
Record version of foreign ontology here
Saving helper terms
Load helper term filter
Refreshing helper terms
• The examples I’ve shown here use term
names.
• This make the filter easy to read and edit, but
names often change.
• Therefore more sustainable to use IDs instead.
Instantiating inferred is_a for release
• If your users are not using your ontology with
a reasoner, you made need to pre-reason for
them.
• To do this, you can use the ‘save implied links’
function in the save interface:
Acknowledgements – OBO-Edit
developers
• Current developers
– Amina Abdulla
– Chris Mungall
– Jennifer Deegan
• Former developers
– John Day-Richter
– Nomi Harris
Acknowledgements - OEWG
•
•
•
•
•
•
•
•
•
•
•
•
•
Amina Abdulla, Chris Mungall, Karen Eilbeck, Suzanna Lewis, - BBOP, LBNL, Berkeley, CA, USA
Midori A. Harris, Jennifer Deegan, Amelia Ireland, Jane Lomax - GO-EBI, Hinxton, UK
David Hill, Alexander D. Diehl, Harold Drabkin - MGI, The Jackson Laboratory, Bar Harbor, ME, USA
Karen R. Christie - SGD, Department of Genetics, Stanford University, Stanford, CA, USA
Tanya Berardini - TAIR, Carnegie Institution, Department of Plant Biology, Stanford, CA, USA
Petra Fey - DictyBase, Northwestern University, Chicago, IL, USA
Carol A. Bastiani, Ranjana Kishore - WormBase, California Institute of Technology, Pasadena, CA, USA
Victoria Petri - RGD, Medical College of Wisconsin, Milwaukee, WI, USA
Colin Batchelor- Royal Society of Chemistry, Cambridge UK
Shuly Avraham - Cold Spring Harbor Laboratory, Cold Spring Harbor, NY
Pankaj Jaiswal - Gramene, Department of Plant Breeding, Cornell University, Ithaca, NY, USA
Melissa Haendel - ZFIN, University of Oregon, Eugene, OR, USA
John Osborne - Robert H. Lurie Cancer Center, Northwestern University, Chicago, IL, USA
Acknowledgments
•
•
•
•
Michael Ashburner
FlyBase
Virtual Fly Brain
ICBO