Identification of Boolean Network Models From Time Series Data Incorporating Prior Knowledge.

Gespeichert in:
Bibliographische Detailangaben
Titel: Identification of Boolean Network Models From Time Series Data Incorporating Prior Knowledge.
Autoren: Leifeld T; Institute of Automatic Control, Technische Universität Kaiserslautern, Kaiserslautern, Germany., Zhang Z; Institute of Automatic Control, Technische Universität Kaiserslautern, Kaiserslautern, Germany., Zhang P; Institute of Automatic Control, Technische Universität Kaiserslautern, Kaiserslautern, Germany.
Quelle: Frontiers in physiology [Front Physiol] 2018 Jun 08; Vol. 9, pp. 695. Date of Electronic Publication: 2018 Jun 08 (Print Publication: 2018).
Publikationsart: Journal Article
Sprache: English
Info zur Zeitschrift: Publisher: Frontiers Research Foundation Country of Publication: Switzerland NLM ID: 101549006 Publication Model: eCollection Cited Medium: Print ISSN: 1664-042X (Print) Linking ISSN: 1664042X NLM ISO Abbreviation: Front Physiol Subsets: PubMed not MEDLINE
Imprint Name(s): Original Publication: Lausanne : Frontiers Research Foundation
Abstract: Motivation: Mathematical models take an important place in science and engineering. A model can help scientists to explain dynamic behavior of a system and to understand the functionality of system components. Since length of a time series and number of replicates is limited by the cost of experiments, Boolean networks as a structurally simple and parameter-free logical model for gene regulatory networks have attracted interests of many scientists. In order to fit into the biological contexts and to lower the data requirements, biological prior knowledge is taken into consideration during the inference procedure. In the literature, the existing identification approaches can only deal with a subset of possible types of prior knowledge. Results: We propose a new approach to identify Boolean networks from time series data incorporating prior knowledge, such as partial network structure, canalizing property, positive and negative unateness. Using vector form of Boolean variables and applying a generalized matrix multiplication called the semi-tensor product (STP), each Boolean function can be equivalently converted into a matrix expression. Based on this, the identification problem is reformulated as an integer linear programming problem to reveal the system matrix of Boolean model in a computationally efficient way, whose dynamics are consistent with the important dynamics captured in the data. By using prior knowledge the number of candidate functions can be reduced during the inference. Hence, identification incorporating prior knowledge is especially suitable for the case of small size time series data and data without sufficient stimuli. The proposed approach is illustrated with the help of a biological model of the network of oxidative stress response. Conclusions: The combination of efficient reformulation of the identification problem with the possibility to incorporate various types of prior knowledge enables the application of computational model inference to systems with limited amount of time series data. The general applicability of this methodological approach makes it suitable for a variety of biological systems and of general interest for biological and medical research.
References: Bioinformatics. 2015 Apr 1;31(7):1154-9. (PMID: 25619997)
PLoS One. 2013 Jul 26;8(7):e69008. (PMID: 23922675)
Pac Symp Biocomput. 1998;:18-29. (PMID: 9697168)
BMC Bioinformatics. 2006 Mar 20;7 Suppl 1:S7. (PMID: 16723010)
BMC Bioinformatics. 2016 Oct 6;17(1):410. (PMID: 27716031)
BMC Syst Biol. 2007 Feb 02;1:11. (PMID: 17408501)
BMC Syst Biol. 2012 Oct 18;6:133. (PMID: 23079107)
PLoS One. 2015 Jul 24;10(7):e0131832. (PMID: 26207376)
J Theor Biol. 1974 Mar;44(1):167-90. (PMID: 4595774)
Development. 1997 May;124(10):1851-64. (PMID: 9169833)
Pac Symp Biocomput. 1999;:17-28. (PMID: 10380182)
PLoS One. 2013 Jun 21;8(6):e66031. (PMID: 23805196)
Nucleic Acids Res. 2006 Mar 20;34(5):1608-19. (PMID: 16549873)
Proteomics. 2010 Mar;10(6):1202-11. (PMID: 20077407)
BMC Genomics. 2012;13 Suppl 6:S4. (PMID: 23134720)
Nature. 1959 Jun 13;183(4676):1654-5. (PMID: 13666847)
Mol Cancer Ther. 2003 Jul;2(7):679-84. (PMID: 12883041)
J Comput Biol. 2012 Jan;19(1):30-41. (PMID: 22216865)
IEEE Trans Neural Netw Learn Syst. 2017 Feb;28(2):464-469. (PMID: 26829809)
Phys Biol. 2012 Oct;9(5):055001. (PMID: 23011283)
Bioinformatics. 2010 May 1;26(9):1239-45. (PMID: 20305266)
Phys Rev Lett. 2000 Jun 12;84(24):5660-3. (PMID: 10991019)
IEEE Trans Neural Netw. 2011 Apr;22(4):525-36. (PMID: 21342840)
Bioinformatics. 2006 Jul 15;22(14):e124-31. (PMID: 16873462)
Biosystems. 2016 Nov;149:139-153. (PMID: 27484338)
IEEE/ACM Trans Comput Biol Bioinform. 2012;9(2):487-98. (PMID: 21464514)
Exp Cell Res. 2000 Nov 25;261(1):91-103. (PMID: 11082279)
BMC Proc. 2011 May 28;5 Suppl 2:S5. (PMID: 21554763)
Proc Natl Acad Sci U S A. 2003 Dec 9;100(25):14796-9. (PMID: 14657375)
PLoS One. 2008 Feb 27;3(2):e1672. (PMID: 18301750)
EURASIP J Bioinform Syst Biol. 2014;2014(1):10. (PMID: 25093019)
J Theor Biol. 1969 Mar;22(3):437-67. (PMID: 5803332)
Contributed Indexing: Keywords: Boolean networks; identification; network inference; prior knowledge; time series data
Entry Date(s): Date Created: 20180626 Latest Revision: 20240327
Update Code: 20250114
PubMed Central ID: PMC6002699
DOI: 10.3389/fphys.2018.00695
PMID: 29937735
Datenbank: MEDLINE
Beschreibung
Abstract:Motivation: Mathematical models take an important place in science and engineering. A model can help scientists to explain dynamic behavior of a system and to understand the functionality of system components. Since length of a time series and number of replicates is limited by the cost of experiments, Boolean networks as a structurally simple and parameter-free logical model for gene regulatory networks have attracted interests of many scientists. In order to fit into the biological contexts and to lower the data requirements, biological prior knowledge is taken into consideration during the inference procedure. In the literature, the existing identification approaches can only deal with a subset of possible types of prior knowledge. Results: We propose a new approach to identify Boolean networks from time series data incorporating prior knowledge, such as partial network structure, canalizing property, positive and negative unateness. Using vector form of Boolean variables and applying a generalized matrix multiplication called the semi-tensor product (STP), each Boolean function can be equivalently converted into a matrix expression. Based on this, the identification problem is reformulated as an integer linear programming problem to reveal the system matrix of Boolean model in a computationally efficient way, whose dynamics are consistent with the important dynamics captured in the data. By using prior knowledge the number of candidate functions can be reduced during the inference. Hence, identification incorporating prior knowledge is especially suitable for the case of small size time series data and data without sufficient stimuli. The proposed approach is illustrated with the help of a biological model of the network of oxidative stress response. Conclusions: The combination of efficient reformulation of the identification problem with the possibility to incorporate various types of prior knowledge enables the application of computational model inference to systems with limited amount of time series data. The general applicability of this methodological approach makes it suitable for a variety of biological systems and of general interest for biological and medical research.
ISSN:1664-042X
DOI:10.3389/fphys.2018.00695