Qualitative Simulation of Large and Complex Genetic
Download
Report
Transcript Qualitative Simulation of Large and Complex Genetic
Modeling and Simulation of
Genetic Regulatory Networks
using Ordinary Differential Equations
Hidde de Jong
Projet HELIX
Institut National de Recherche en Informatique et en Automatique
Unité de Recherche Rhône-Alpes
655, avenue de l’Europe
Montbonnot, 38334 Saint Ismier CEDEX
Email: [email protected]
Overview
1. Analysis of genetic regulatory networks
2. Approaches towards modeling and simulation of genetic
regulatory networks
overview
nonlinear differential equations
linear differential equations
piecewise-linear differential equations
4. Discussion: towards virtual cells
2
Genome
Genome is genetic material in chromosomes of organism
DNA in most organisms, RNA in some viruses
Many prokaryotic and eukaryotic genomes have been
sequenced in recent years
E. coli genome: 4300 genes
3
Genes and proteins
Genes code for proteins that are essential for development
and functioning of organism: gene expression
DNA
transcription
RNA
translation
protein
protein and
modifier molecule
post-translational
modification
4
Molecular interactions
Cellular processes involve interactions between proteins,
genes, metabolites, and other molecules:
cell structure
metabolism
membrane
enzyme
metabolite
gene regulation
gene
transcription factor
kinase
signal transduction
phosphorylated
regulatory protein
5
Organism as biochemical system
Organism can be viewed as biochemical system, structured
by network of interactions between its molecular components
6
Systems biology
Challenge of systems biology: understand how global
behavior of organism emerges from local interactions between
its molecular components
"A transition is occurring in biology from the molecular level
to the system level that promises to revolutionize our
understanding of complex biological regulatory systems... "
Kitano (2002), Science, 295(5560):564
Elements of systems biology:
High-throughput experimental techniques
Advanced computational techniques and powerful computers
Integrated application of experimental and computational tools
7
Model-driven analysis of biological systems
Model-driven analysis: integrated application of experimental
and computational tools
biological
system
experimental
conditions
choose
experiments
experimental
simulate
observations
predictions
conditions
perform
experiments
compare
models
biological
knowledge
construct
and revise
models
fit of models
experimental
observations
data
Model composition versus model induction (reverse
engineering)
8
Genetic regulatory networks
Genetic regulatory network is part of biochemical network
consisting (mainly) of genes and their regulatory interactions
9
Experimental tools
Study of large and complex genetic regulatory networks
requires powerful experimental tools
High-throughput, low-cost, reliable, precise
Information obtained from experimental tools in genomics:
DNA sequence (genes) of organism
interactions between proteins and DNA (microarrays)
temporal variation of gene products (microarrays, mass spectometry)
10
Computational tools
Computer support indispensable for dynamical analysis of
genetic regulatory networks: modeling and simulation
precise and unambiguous description of network
systematic derivation of behavior predictions
First models of genetic regulatory networks date back to early
days of molecular biology
Regulation of lac operon (Jacob and Monod)
Goodwin (1963), Temporal Organization in Cells
Variety of modeling formalisms exist…
de Jong (2002), J. Comput. Biol., 9(1): 69-105
Hasty et al. (2001), Nat. Rev. Genet., 2(4):268-279
Smolen et al. (2000), Bull. Math. Biol., 62(2):247-292
11
Hierarchy of modeling formalisms
Graphs
Boolean equations
abstraction
precision
Ordinary differential equations
feasibility
Stochastic master equations
Differential equations are major formalism for modeling
genetic regulatory networks :
nonlinear, linear, piecewise-linear differential equations
12
Nonlinear differential equation models
Cellular concentration of proteins, mRNAs, and other molecules
at time-point t represented by continuous variable xi(t) R0
Regulatory interactions modeled by differential equations
dx
.
x f (x),
dt
where x [x1,…, xn]´and f (x) is nonlinear rate law
No analytical solution for most nonlinear differential equations
Approximation of solution obtained by numerical simulation,
given parameter values and initial conditions x(0) x0
13
Model of cross-inhibition network
gene 1
gene 2
x1 = 1 f (x2) 1 x1
.
x2 = 2 f (x1) 2 x2
.
x1 = concentration protein 1
x2 = concentration protein 2
1, 2 > 0, production rate constants
1, 2 > 0, degradation rate constants
f (x )
1
f (x) =
0
n
+x
n
n
,
> 0 threshold
x
14
Phase-plane analysis
Analysis of steady states in phase plane
.
x1
x1 = 0
.
x1 = 0 : x1 =
.
.
x2 = 0
0
x2 = 0 : x2 =
1
1 f (x2)
2
f (x1)
2
x2
Two stable and one unstable steady state. System will
converge to one of two stable steady states (differentiation)
System displays hysteresis effect: perturbation may cause
irreversible switch to another steady state
15
Construction of cross inhibition network
Construction of cross inhibition network in vivo
Gardner et al. (2000), Nature, 403(6786): 339-342
Differential equation model of network
.
u=
α1
1+vβ
–u
.
v=
α2
1+u
–v
16
Experimental test of model
Experimental test of mathematical model (bistability and
hysteresis)
Gardner et al. (2000), Nature, 403(6786): 339-342
17
Bacteriophage infection of E. coli
Response of E. coli to phage
infection involves decision between
alternative developmental pathways:
lytic cycle and lysogeny
Ptashne, A Genetic Switch, Cell Press,1992
18
Simulation of phage infection
Differential equation model of the regulatory network underlying
decision between lytic cycle and lysogeny
McAdams, Shapiro (1995), Science, 269(5524): 650-656
19
Simulation of phage infection
Numerical simulation of promoter activity and protein
concentrations in (a) lysogenic and (b) lytic pathways
Cell follows one of two pathways for different initial conditions
20
Evaluation nonlinear differential equations
Pro: reasonably accurate description of underlying molecular
interactions
Contra: for more complex networks, difficult to analyze
mathematically, due to nonlinearities
Pro: approximate solution can be obtained through numerical
simulation
Contra: simulation techniques difficult to apply in practice, due
to lack of numerical values for parameters and initial conditions
21
Linear differential equation models
Cellular concentration of proteins, mRNAs, and other molecules
at time-point t represented by continuous variable xi(t) R0
Regulatory interactions modeled by differential equations
dx
.
x f (x) Ax b,
dt
where x [x1,…, xn]´ and f (x) is linear rate law
Analytical solution exists for linear differential equations:
t
x(t) eAt x0 eA(t-τ) dτ b
0
22
Model of cross-inhibition network
gene 1
gene 2
x1 = 1 f (x2) 1 x1
.
x2 = 2 f (x1) 2 x2
.
1
1, 2 > 0, production rate constants
1, 2 > 0, degradation rate constants
f (x) = 1 x / (2 ) ,
f (x )
0
x1 = concentration protein 1
x2 = concentration protein 2
2
x
> 0,
x 2
x1 = 1 x1 11 x2 1
.
x2 = 22 x1 2 x2 2
.
23
Phase-plane analysis
Analysis of steady states in phase plane
.
x1
x1 = 0
.
x1 = 0 : x1 =
.
.
x2 = 0
0
x2 = 0 : x2 =
1
1 f (x2)
2
f (x1)
2
x2
Single unstable steady state.
Linear differential equations too simple to capture dynamic
phenomena of interest: no bistability and no hysteresis
24
Model induction
Linear differential equation models much used for induction of
model of regulatory network from gene expression data
network reconstruction, reverse engineering
Given time-series of gene expression data, find A and b, such
that solution of
.
x Ax b ξ,
with noise term ξ, fits expression data
Powerful techniques for induction of linear model from
experimental data
Ljung (1995), System Identification, Prentice Hall, 1999
25
SOS response in E. coli
SOS response of E. coli regulates cell survival and repair
after DNA damage
Gardner et al. (2003), Science,
301(5629): 102-105
26
Induction of model of SOS network
Reconstruction of subnetwork by inducing linear differential
equation model from gene expression data
Steady-state response of bacterium measured under
genetic and physiological perturbations
Gardner et al. (2003), Science,
301(5629): 102-105
Method robust to measurement noise and upscalable
27
Evaluation of linear differential equations
Pro: analytical solution exists, thus facilitating qualitative
analysis of complex systems
Contra: too simple to capture important dynamical
phenomena of regulatory network, due to neglect of nonlinear
character of interactions
Pro: powerful techniques for induction of model of network
from gene expression data
28
Piecewise-linear differential equation models
Cellular concentration of proteins, mRNAs, and other molecules
at time-point t represented by continuous variable xi(t) R0
Regulatory interactions modeled by differential equations
AD1 x
dx
.
x f (x)
dt
n
b D1 ,
D1 R0
bDm,
Dm R0
ADm x
n
where x [x1,…, xn]´and f (x) is piecewise-linear (PL)
Global solution obtained by piecing together local solutions of
linear differential equations in regions Dj
29
Model of cross-inhibition network
gene 1
gene 2
x1 = 1 f (x2) 1 x1
.
x2 = 2 f (x1) 2 x2
.
x1 = concentration protein 1
x2 = concentration protein 2
1, 2 > 0, production rate constants
1, 2 > 0, degradation rate constants
f (x )
1
0
f (x) = s( x, ) =
1,
x<
0,
x>
x
30
Phase-plane analysis
Analysis of dynamics in phase plane
x1 = 1 s (x2, 2) 1 x1
.
x2 = 2 s (x1, 1) 2 x2
.
x1
.
x1 = 0
.
1
x2 = 0
D1 D 2
0 x. 1 = 0 2
.
x2 = 0
x2
in D21 :
x1 = 11 x11 x1 , x1 = 0 : x1 =c
= 01 1
.
.
,
x
x2 = 2 2 x2
2 = 0 : x2 = 2 2
.
.
In every region Dj , model simplifies to system of piecewiseaffine differential equations
All solutions, while being in Dj , converge towards target steady state
Different regions have different target steady states
31
Phase-plane analysis
Global phase-plane analysis by combining analyses in local
regions of phase plane
Techniques for dealing with discontinuities due to step functions
Gouzé, Sari (2003), Dyn. Syst., 17(4):299-316
.
x1
x1 = 0
1
.
x2 = 0
0
2
x2
Piecewise-linear model good approximation of nonlinear
model, retaining properties of bistability and hysteresis
32
Initiation of sporulation in B. subtilis
B. subtilis can sporulate when environmental conditions
become unfavorable
de Jong et al. (2004), Bull. Math. Biol., 66(2):261-300
33
Network underlying initiation of sporulation
Initiation of sporulation controlled by complex genetic regulatory
network integrating environmental, cell-cyle and metabolic
signals
de Jong et al. (2004), Bull.
Math. Biol., 66(2):261-300
34
Genetic Network Analyzer (GNA)
Qualitative simulation of initiation of sporulation using tool
based on piecewise-linear differential equation models (GNA)
de Jong et al. (2003),
Bioinformatics, 19(3):336-344
35
Qualitative simulation of sporulation
Predictions obtained through
qualitative simulation consistent
with observed behavior of
B. subtilis cells under starvation
Decision between sporulation
and vegetative growth outcome
of competition between positive
and negative feedback loops
de Jong et al. (2004), Bull. Math. Biol., 66(2):261-300
36
Evaluation of PL differential equations
Pro: captures important dynamical phenomena of network, by
suitable approximation of nonlinearities
Pro: qualitative analysis of dynamics of complex systems
possible, due to favorable mathematical properties
Pro: powerful techniques for induction of model of network from
gene expression data
37
Conclusions
Several kinds of mathematical model of genetic regulatory
networks
Nonlinear models give reasonably accurate description of
regulatory interactions, but difficult to apply in practice
Linear models have favorable mathematical and computational
properties, but can only give rough picture of regulatory
structure
Piecewise-linear models are compromise between nonlinear
and linear models, satisfying biological applicability and
computational feasibility
38
Beyond modeling and simulation
Integration of modeling and simulation with other computational
and experimental tools:
Biological knowledge and databases
Selection of discriminatory experiments
Validation of model predictions with experimental data
biological
system
experimental
conditions
choose
experiments
experimental
simulate
observations
predictions
perform
experiments
conditions
compare
models
biological
knowledge
construct
and revise
models
fit of models
observations
39
Beyond genetic regulatory networks
Integration of genetic networks with metabolic and signal
transduction networks
Virtual cell or whole-cell simulation
Tomita et al. (1999), Bioinformatics,
15(1):72-84
40