Biopath (BMOND) – A NEW APPROACH TO FORMAL DESCRIPTION AND

Download Report

Transcript Biopath (BMOND) – A NEW APPROACH TO FORMAL DESCRIPTION AND

BioUML – new concepts and possibilities Nikita Tolstykh, Sergey Lapukhov, Ilya Kiselev, Aleksey Shadrin and Fedor Kolpakov Institute of Systems Biology, Novosibirsk, Russia; Design Technological Institute of Digital Techniques SB RAS, Novosibirsk, Russia *Contacts: [email protected]

Motivation

Reconstruction of complex biological systems requires integrated software that provides: 1. integration with a wide range of biological databases; 2. integration with omics data; 3. powerful search capabilities; 4. visual modelling approach; 5. client-server architecture for team work.

BioUMLworkbench

BioUML (

http://www.biouml.org

) is an open source integrated Java workbench for systems biology. Its core is a meta-model that provides an abstract layer for comprehensive formal description of wide range of biological and other complex systems as a compartmentalized graph.

New possibilities

1.

EBI data pack

– database modules for main databases supported by EBI: Ensembl, UniProt, ChEBI,Reactome, IntAct, GO, BioModels, SBO. These databases are installed on the special BioUML server and can be easily searched and used by BioUML workbench (client).

2.

Composite database module

– allows a user to specify a set of databases that he would like to use simultaneously as a source of components during creation and editing of diagrams.

3.

Composite diagrams

(hierarchical) – allow a user to join several existing diagrams/models into a bigger one.

4.

Experiment

– allows to specify conditions (initial values, parameters, external stimulus, etc.) for virtual experiments on the diagram.

5.

BioHub

– an approach that allows to arrange relationships between biological objects from different databases formally and use this information for searching. Results of such search can be presented as an editable graph.

6.

Graphic notation editor

– it allows an advanced user to create new graphic notation or extend an existing one. As an example SBGN graphic notation was created.

7.

Microarray data analysis

- allows a user to load his microarray data, identify up/down regulated genes and highligt diagram elements according to microarray data or results of this data analysis.

Virtual experiment concept

To make a virtual experiment it is frequently needed to modify initial model. To skip model duplications for each virtual experiment we introduce “changes” concept. Typical modifications (changes) are: • changing of initial values • changing of model parameters to imitate different conditions or mutations • deleting of some model elements to imitate knock-out mutations • adding events to imitate external influences on the model.

The figure below demonstrates how changes are shown on diagram for pRB knock-out experiment.

Metaphor Levels of biological information

Biological systems reconstruction is like solitaire (patience) game:

desk

– BioUML editor;

solitaire cards

– biological pathway; – biological objects (genes, proteins, lipids, etc.);

pack of cards

– different biological databases.

L evel 3: P roblem s pec ific L evel 2: P athways , models

Cyclonet

- leads - actions - targets

w i k i refers UbiProt

classifications: E1, E2, E3, …

GeneModels refers Ensembl refers UniProt w i k i BMOND refers ChEBI LipidNet

classifications: - lipids - genes

GO refers w i k i

L evel 1: C atalog s E xperimental data, res ults of analys is

OMICS data

B iolog ic al objec ts

Results BioHUB of analysis E xternal databas es :

- K E G G - L ipidMap, L ipidB ank - R eactome, …

MSigDB GeneAtlas, NCI60

Main ideas for data integration and pathway reconstruction: - escape information duplication.

- classify components of biological pathways by levels.

- each next level should refer but do not duplicate information from previous levels.

- use free EBI databases whenever it is possible.

Composite diagram (hierarchical) :

block (EModel)

dx/dt = f1 dy/dt = f2 z = f3

main concepts

x x y y

block 2 (EModel)

dx/dt = f5 dy/dt = f6 + z

k

+z+f4 = 0 x x f(x) forbidden e x Indirect link

subdiagram (EModel)

e

block 3 (EModel)

dx/dt = f5 dy/dt = f6 +

block2.k

k+z+f4 = 0

BioHub concept

BioHUB – an approach link information from different databases.

Main usage: • binding microarray (omics) data to pathway diagrams; • graph search; • DBReferences editor; • microarray (omics) data analysis.

BioHUB is conceptually similar with MIRIAM standard and defines: • references to database objects; • relationships between biological objects (for example: is-a, part-of, form-of ).

s1 s1 R s2 s4 direct participation of subdiagram element in a reaction s1

Block types:

1) block – only mathematical equations.

Used mainly for physiological models; 2) subdiagram – other diagram s2 R s3

Connection types:

1) directed – input  output. Transformation function can be used; 2) undirected – contact. Indicates that 2 nodes in mode is the same entity.

Semantic constraints:

There are semantic constraints, for example: block can have only one input for each variable. Two inputs are forbidden for the same variable.

Flat model:

Before Matlab or Java code generation composite model is transformed into flat model and usual genertions routines are used.

Graphic notation editor BioHub usage: graph search engine BioHub usage: coloring diagrams according to microarray data Conclusion

We believe that new version of BioUML workbench will greatly simplify, speed up and improve the process of formal description and reconstruction of biological pathways.

Acknowledgements

This work was supported by FP6 grant 037590 “Net2Drug”, FP7 grant 090107 "LipidomicNet" and interdisciplinary project 46 of SB RAS.

Each bar corresponds to one value from corresponding microarray series.