The OBO Foundry Towards Gold Standard Terminology Resources in the Biomedical Domain Thomas Bittner (based on a presentation by Barry Smith)
Download ReportTranscript The OBO Foundry Towards Gold Standard Terminology Resources in the Biomedical Domain Thomas Bittner (based on a presentation by Barry Smith)
The OBO Foundry Towards Gold Standard Terminology Resources in the Biomedical Domain Thomas Bittner (based on a presentation by Barry Smith) 1 how create broad-coverage semantic annotation systems for biomedicine? UMLS, Semantic Web, Moby, wikis, etc. let a million flowers bloom integration relies on post hoc mappings 2 O2 O3 O1 O4 On 3 O2 O3 O1 O4 On 4 for science a new approach create an evolutionary path towards evidence-based terminology 5 O2 O3 O1 O4 On 6 FMA Cell Ontology PaTO TLO Shared Organizatorial, Scientific, and Ontological principles RNAO ChEBI for science a new approach create an evolutionary path towards evidencebased terminology Terminologies/ontologies that • have proven their usefulness through applications • based on/ verified through scientific methodology • structured/formalized according to ontological principles 8 First step (Oct. 11, 2001) a shared portal for (so far) 58 ontologies originally called ‘GOBO’ (for Global Open Biological Ontologies) 9 GOBO Principles All ontologies will be welcome if, and only if, they conform to the following principles of gobo: 1. They are ‘open’ 2. They are in, or can be instantiated in, the GO syntax or extensions of this syntax. 3. They are orthogonal to other ontologies already lodged with gobo. 4. Shared ID space. 5. Terms are defined. 10 Second step (March 2004) Initiation by the Gene Ontology Consortium - logical reforms of the GO and related ontologies 11 e.g. linking GO logically to other OBO ontologies via cross-products GO id: CL:0000062 name: osteoblast def: "A bone-forming cell which secretes an extracellular matrix. Hydroxyapatite crystals are then deposited into the matrix to form bone." is_a: CL:0000055 relationship: develops_from CL:0000008 relationship: develops_from CL:0000375 Osteoblast differentiation: Processes whereby an osteoprogenitor cell or a cranial neural crest cell acquires the specialized features of an osteoblast, a bone-forming cell which secretes extracellular matrix. + Cell type (CL) = New Definition 12 Third step (2005-6) The OBO Foundry further principles added + intense collaborative development 13 The OBO Foundry a family of interoperable gold standard biomedical reference ontologies to serve the annotation of scientific literature model organism databases clinical data à la Ida experimental results etc. 14 A prospective standard designed to guarantee interoperability of ontologies from the very start (contrast to: post hoc mapping à la UMLS) several ontologies now being constructed ab initio 15 Initial set of 10 principles = 5 GOBO principles (with broadened family of allowable types of syntax, including OWLDL syntaxes) plus: 6. Collaborative development 7. Intelligible documentation. 8. Plurality of independent users. 9. Clearly specified content. 10. Uses relations which are formally defined 16 Ontology Scope URL Custodians Cell Ontology (CL) cell types from prokaryotes to mammals obo.sourceforge.net/cgibin/detail.cgi?cell Jonathan Bard, Michael Ashburner, Oliver Hofman Chemical Entities of Biological Interest (ChEBI) molecular entities ebi.ac.uk/chebi Paula Dematos, Rafael Alcantara Common Anatomy Reference Ontology (CARO) anatomical structures in human and model organisms (under development) Melissa Haendel, Terry Hayamizu, Cornelius Rosse, David Sutherland, Foundational Model of Anatomy (FMA) structure of the human body fma.biostr.washington. edu JLV Mejino Jr., Cornelius Rosse Functional Genomics Investigation Ontology (FuGO) design, protocol, data instrumentation, and analysis fugo.sf.net FuGO Working Group Gene Ontology (GO) cellular components, molecular functions, biological processes www.geneontology.org Gene Ontology Consortium Phenotypic Quality Ontology (PaTO) qualities of biomedical entities obo.sourceforge.net/cgi -bin/ detail.cgi? attribute_and_value Michael Ashburner, Suzanna Lewis, Georgios Gkoutos Protein Ontology (PrO) protein types and modifications (under development) Protein Ontology Consortium Relation Ontology (RO) relations obo.sf.net/relationship Barry Smith, Chris Mungall RNA Ontology (RnaO) three-dimensional RNA structures (under development) RNA Ontology Consortium Sequence Ontology (SO) properties and features of nucleic sequences song.sf.net Karen Eilbeck RELATION TO TIME GRANULARITY INDEPENDENT ORGAN AND ORGANISM Organism (NCBI Taxonomy) CELL AND CELLULAR COMPONENT Cell (CL) MOLECULE CONTINUANT DEPENDENT Anatomical Organ Entity Function (FMA, (FMP, CPRO) Phenotypic CARO) Quality (PaTO) Cellular Cellular Component Function (FMA, GO) (GO) Molecule (ChEBI, SO, RnaO, PrO) OCCURRENT Molecular Function (GO) OBO Foundry coverage Organism-Level Process (GO) Cellular Process (GO) Molecular Process (GO) RELATION TO TIME CONTINUANT INDEPENDENT OCCURRENT DEPENDENT GRANULARITY ORGAN AND ORGANISM Organism (NCBI Taxonomy) CELL AND CELLULAR COMPONENT Cell (CL) MOLECULE Anatomical Organ Entity Function (FMA, (FMP, CPRO) Phenotypic CARO) Quality (PaTO) Cellular Cellular Component Function (FMA, GO) (GO) Molecule (ChEBI, SO, RnaO, PrO) Molecular Function (GO) Biological Process (GO) Molecular Process (GO) Building out from the original GO Current state of OBO Foundry 1. Ontologies already mature but still undergoing continuous reform: Cell Ontology (CL) Foundational Model of Anatomy (FMA) Gene Ontology (GO) Zebrafish Anatomy Ontology (ZAO) 20 2. Early versions exist Clinical Trial Ontology (CTO, part of OBI) Common Anatomy Reference Ontology (CARO, DB1 & DB2) Mosquito Anatomy Ontology (MAO) Ontology for Biomedical Investigations (OBI) Phenotypic Quality Ontology (PATO, DB1 & DB2) Protein Ontology (PRO) Relation Ontology (RO) RNA Ontology (RnaO) 21 3. Still in need of thorough review Chemical Entities of Biological Interest (ChEBI) Disease Ontology (DO) Sequence Ontology (SO) various other OBO ontologies (Mammalian Phenotype Ontology ...) 22 4. Still in planning phase Biobank/Biorepository Ontology (BrO, part of OBI) Environment Ontology (EnvO) – inaugural meeting August, 2007 organized by NERC in OERC (Oxford E-Research Center) Fish Multi-Species Anatomy Ontology (funding received; no acronym yet) Infectious Disease Ontology (IDO) Mouse Adult Neurogenesis Ontology (MANGO) Xenopus Anatomy Ontology (XAO) 23 OBO Foundry Listservs (Jan. 2007) 24 Foundry developers have agreed in advance to accept a common set of principles designed to ensure compatibility interoperability formal robustness http://obofoundry.org/ 25 CRITERIA CRITERIA The ontology is open and available to be used by all. The ontology is in, or can be instantiated in, a common formal language. The developers of the ontology agree in advance to collaborate with developers of other OBO Foundry ontology where domains overlap. http://obofoundry.org/ 26 CRITERIA UPDATE: The developers of each ontology commit to its maintenance in light of scientific advance, and to soliciting community feedback for its improvement. ORTHOGONALITY: They commit to ensuring that there is community convergence on a single controlled vocabulary for each domain http://obofoundry.org/ 27 CRITERIA CRITERIA IDENTIFIERS: The ontology possesses a unique identifier space within OBO. VERSIONING: The ontology provider has procedures for identifying distinct successive versions. The ontology includes textual definitions for all terms. http://obofoundry.org/ 28 CRITERIA CLEARLY BOUNDED: The ontology has a clearly specified and clearly delineated content. DOCUMENTATION: The ontology is welldocumented. USERS: The ontology has a plurality of independent users. http://obofoundry.org/ 29 CRITERIA COMMON ARCHITECTURE: The ontology uses relations which are unambiguously defined following the pattern of definitions laid down in the OBO Relation Ontology.* * Smith et al., Genome Biology 2005, 6:R46 http://obofoundry.org/ 30 The Foundational Model of Anatomy (FMA) 31 The Foundational Model of Anatomy (FMA) FMA Uni-Washington Protégé – frame system http://fme.biostr.washi ngton.edu:8089/FM E/index.html Human anatomy 32 Digital Anatomist Foundational Model of Anatomy (FMA) High level scheme FMA= (At, ASA, ATA, Mk) where: At ASA ATA Mk = = = = Anatomy taxonomy Anatomical Structural Abstraction Anatomical Transformation Abstraction Metaknowledge Mejino & Rosse, KR-Med 0433 Anatomy Taxonomy (Mejino & Rosse, KR-Med 04) Anatomical Entity Physical Anatomical Entity Spatial entity (located in space) -is a- Non-physical Anatomical Entity Non-spatial entity (not located in space) 34 Anatomy Taxonomy (Mejino & Rosse, KR-Med 04) Anatomical Entity Physical Anatomical Entity Material Physical Anatomical Entity -is a- Non-physical Anatomical Entity Non-material Physical Anatomical Entity Space Surface Line 35 Anatomical spaces (is-a hierarchy) QuickTime™ and a TIFF (LZW) decompressor are needed to see this picture. 36 Anatomy Taxonomy (Mejino & Rosse, KR-Med 04) Anatomical Entity Physical Anatomical Entity -is a- Material Physical Anatomical Entity Body Substance Non-physical Anatomical Entity Non-material Physical Anatomical Entity Anatomical Structure Space Surface Line 37 Anatomy Taxonomy (Mejino & Rosse, KR-Med 04) Anatomical Entity Physical Anatomical Entity Non-physical Anatomical Entity -is a- Material Physical Anatomical Entity Body Substance Non-material Physical Anatomical Entity Anatomical Structure Space Surface Line Human Biological Macromolecule Body Body Acellular Anatomical Structure Part Cell Part Cell Tissue Organ Part Organ Organ System 38 Universal parthood: The universal human body The universal human head Human anatomy Human heads are parts of human bodies The semantics of universal parthood X PART-OF Y iff For every instance x of X there exists an instance y of Y such that x part-of y AND For every instance y of Y there exists an instance x of X such that x part-of y