Boolean dynamics of genetic regulatory networks inferred from
Transcription
Boolean dynamics of genetic regulatory networks inferred from
Boolean dynamics of genetic regulatory networks inferred from microarray time series data Wen Wei Rong Supervisor: Prof. Dr. Heike Siebert Outline • Motivation • Methods • Algorithm • Results • Summary Motivation • 1969 – Modeling genetic regulatory networks • 1998 – Boolean networks to infer genetic regulatory systems from time series gene expression data • 1999 – dynamic Baysian networks • 2002 – probabilistic Boolean networks Motivation • Probabilistic Boolean network • Dynamic Bayesian network Computationally complex Create a single network -> not realistic! • Modified methods from Akutsu et al. (2000) -all possible networks are considered -networks grouped by attractor basins Microarray http://www.fastol.com/~renkwitz/microarray_chips.htm Microarray Dataset Workflow Normalization Discretization Clustering Normalized Dataset A Set of Possible Networks Meta-Genes Expression Profiles Discretized Expression Profiles Algorithm for inferrencing Boolean Dynamics Clustering with k-mean • Reducing problem size (Meta-Gene level) • Analysis of networks tractable Clustering Choosing k http://public.lanl.gov/mewall/kluwer2002.html Singular Value Decomposition Singular Value Decomposition as measure for internal consistency http://smd.stanford.edu/help/svd.shtml Discretization http://www.cs.columbia.edu/4761/notes07/chapter7.3-regulation.pdf Discretization – Step 1 Support Vector Regression http://www.di.ens.fr/~mschmidt/Software/minFunc/minFunc.html Support Vector Regression Kernel Function: Gaussian kernel with width σ=1 http://www.cs.columbia.edu/4761/notes07/chapter7.3-regulation.pdf Gaussian kernel http://www.didaktik.mathematik.uni-wuerzburg.de/history/ausstell/gauss/geldschein.html Support Vector Regression Kernel Function: Gaussian kernel with width σ=1 Width ε: 3/2*average standard deviation http://www.di.ens.fr/~mschmidt/Software/minFunc/minFunc.html Average, Standard Deviation Average: Stantard Deviation: Discretization - Step 2 Thresholding Full set to reduced set • Loss of information • Introduction of error: Choosing k-means from a host of algorithms Choosing k Random starting condition for k means Algorithm • Verify that the expression profiles given as input can be reproduced by these inferred networks • Explore the dynamics beyond the time series that were provided as input • Predict expression profiles under different initial conditions Of particular interest: computing the steady state or equilibrium dynamic of the networks INFER_NETWORKS Input n nodes {v1, v2, ..., vn} c Profiles {P1, P2, ..., Pc} Q ... the biggest number of tuples INFER_NETWORKS Runnning Time: O(nq+1) INFER_NETWORKS Input n nodes {v1, v2, ..., vn} c Profiles {P1, P2, ..., Pc} Q ... the biggest number of tuples INFER_FUNCTION k=1 k=2 RUN_NETWORK ATTRACTOR „The time step t1 is the first time step such that the expression profile of the nodes at time t1 is the same as the expression profile of the nodes at time T.“ Enumeration of All Networks Workflow ENUMERATE_NETWORKS INFER_FUNCTION Expression Profiles Boolean Functions INFER_NETWORK Possible Connections and Boolean Functions ATTRACTOR Attractors Profile P* RUN_NETWORK IL-2-stimulated immune responce experiment Three measurements each at twelve different time points: 39000 transkripts were analyzed http://www.focusonnature.com/SouthAmericaMammalList.htm Results Results Results Results Results Summary „Many networks fall into the same attractor basins, suggesting that many of the networks are equaly valid. They also have many locally consistent substructures.“ General intuition: biological system modular, robust and often function correctly despite extreme change Literatur • Boolean dynamics of genetic regulatory networks inferred from microarray time series data – Martin et al. (2007) • Inferring qualitative relations in genetic networks and metabolic pathways – Akutsu et al. (2000)