CADLIVE optimizer: web-based parameter estimation for dynamic models
© Inoue et al.; licensee BioMed Central Ltd. 2012
Received: 25 April 2012
Accepted: 15 August 2012
Published: 28 August 2012
Computer simulation has been an important technique to capture the dynamics of biochemical networks. In most networks, however, few kinetic parameters have been measured in vivo because of experimental complexity. We develop a kinetic parameter estimation system, named the CADLIVE Optimizer, which comprises genetic algorithms-based solvers with a graphical user interface. This optimizer is integrated into the CADLIVE Dynamic Simulator to attain efficient simulation for dynamic models.
KeywordsSimulator Optimization Genetic algorithm Dynamic model
The simulation of dynamic models is a powerful approach that can be used for: (i) checking the consistency of a postulated model with a set of experimental measurements, (ii) answering ‘what if?’ questions and (iii) exploring possible behaviors of a model . Differential equations play a critical role in the dynamic simulation of biochemical network models and the robustness analysis of them.
A major problem for the dynamic modeling is to know the values of kinetic parameters in vivo, but it is very hard to measure the exact values of them due to experimental complexity. Stochastic methods including evolutionary (genetic) algorithms can provide high-quality solutions in less computational cost [2, 3]. At present evolutionary searches are widely used to optimize a dynamic model of biochemical networks (SBML-PET , libSRES , BioNessie , and AMIGO ).
Genetic algorithms (GAs) are known as one of the algorithms that can seek out the global minimum, based on the heuristic assumptions that the best solutions will be found in the regions of the parameter space containing a relatively high proportion of good solutions and that these regions can be explored by the genetic operators of selection, crossover, and mutation. In 2005 we developed the GA-based optimization programs for the CADLIVE Dynamic Simulator . Details of description for GAs are in Additional file 1. However, these programs are provided in the C language form, thus users have to manually edit several functions and parameters necessary for a search by GAs and use the LINUX command lines. This manual operation requires users to learn the skills and knowledge of programming and evolutionary search, which reduces the efficiency of the parameter estimation process. In this report, we develop the GUI application that enables users to intelligibly handle the optimization programs without these annoying procedures.
Parameter and function settings
GA parameter setting file
Users set the parameters necessary for GAs, i.e. encode method, GA type, digenesis, immigration, crossover, and mutation.
Search parameter setting file
It is derived from the parameter files generated by the CADLIVE Dynamic Simulator. Users select search parameters and edit their initial values and search ranges.
User function setting file
It is derived from the usrfunc files generated by the CADLIVE Dynamic Simulator. Users make the objective function necessary for optimization. The usrfunc file has two contents: the mathematical equations and objective function. The mathematical equations are provided by the CADLIVE Dynamic Simulator. In objective function setting, users can select the sum of squared errors (SSE) by default, where users upload experimental or reference time-course data necessary for the calculation of SSE. If necessary, users can arbitrarily edit the objective function according to the grammar of the C language. Simulated data are interpolated using the spline interpolation function.
After setting the input files using the GUI, users start a GA search. When the optimization finishes, users obtain the three functions: "Simulation", "Process of fitness", and "Download", as follows.
Users can simulate the optimized dynamic model with respect to the objective function.
Process of fitness
This function displays the changes in the fitness values with respect to generation. The fitness value shows how much the simulated data agree to the requirement given by the objective function.
Download the result files
Users can download three input files for the GA parameter setting, search parameter setting, and user function setting and the two output files for the GA result and optimized search parameters.
A personal computer (CPU:Intel Xeon E3110 3 GHz, RAM:4GByte) is used as the server machine. The GUI program is written in PHP 4.4.7 on the LINUX Cent OS 5.3 and integrated into the CADLIVE Dynamic Simulator. The C compiler on LINUX processes the three input files: the GA parameter setting file, search parameter setting file, and user function setting file, to execute the search by GAs. The progress of the GA search is supervised by cron in Linux. The mail service is implemented that notifies the completion of optimization. The simulated results with the optimized parameters are drawn by the library of the CADLIVE Dynamic Simulator. JpGraph (http://www.jpgraph.net) is used to display the process of fitness values. The manuals of the CADLIVE Optimizer are provided in Additional file 2.
Results and discussion
We automatically converted three example models: a simple enzyme reaction model, an E. coli heat shock response system , and an E. coli nitrogen assimilation system  (see Additional file 3, Additional file 4) into their associated dynamic models by using the CADLIVE Dynamic Simulator, and subsequently estimated the kinetic parameter values of them by the CADLIVE Optimizer. An enzyme reaction model is the simplest model to let users understand the basic functions of the optimizer. The heat shock response system is used to tell how ordinary biochemical models are optimized, where the SSE between the experimental and simulated time course data are employed as the default objective function. The nitrogen assimilation system shows that an arbitrary objective function can be edited using the experimental data . The proposed GUI application greatly facilitates the feasibility of the optimization programs, by removing complicated procedures such as the edition of the files necessary for setting GAs and objective functions and handling of the LINUX commands.
CADLIVE has originally been developed to perform rational computer-aided design of biochemical networks and used for various studies [12, 13]. In this report, as an extension of CADLIVE, we propose the CADLIVE Optimizer, a powerful tool for optimizing a mathematical model generated by the CADLIVE Dynamic Simulator. The CADLIVE Optimizer attains a critical progress for developing the standard technology for automatic optimization of dynamic models, i.e., the automatic generation of a dynamic model with tuned kinetic parameters without any complicated operations.
This work was supported by Grant-in-Aid for Scientific Research (B) (22300101) from Japan Society for the Promotion of Science and by Grant-in-Aid for Scientific Research on Innovative Areas (23134506) from the Ministry of Education, Culture, Sports, Science and Technology of Japan.
- Mendes P, Kell D: Non-linear optimization of biochemical pathways: applications to metabolic engineering and parameter estimation. Bioinformatics. 1998, 14: 869-883. 10.1093/bioinformatics/14.10.869.View ArticlePubMedGoogle Scholar
- Moles CG, Mendes P, Banga JR: Parameter estimation in biochemical pathways: a comparison of global optimization methods. Genome Res. 2003, 13: 2467-2474. 10.1101/gr.1262503.PubMed CentralView ArticlePubMedGoogle Scholar
- Vilela M, Chou IC, Vinga S, Vasconcelos AT, Voit EO, Almeida JS: Parameter optimization in S-system models. BMC Syst Biol. 2008, 2: 35-10.1186/1752-0509-2-35.PubMed CentralView ArticlePubMedGoogle Scholar
- Zi Z, Klipp E: SBML-PET: a Systems Biology Markup Language-based parameter estimation tool. Bioinformatics. 2006, 22: 2704-2705. 10.1093/bioinformatics/btl443.View ArticlePubMedGoogle Scholar
- Ji X, Xu Y: libSRES: a C library for stochastic ranking evolution strategy for parameter estimation. Bioinformatics. 2006, 22: 124-126. 10.1093/bioinformatics/bti753.View ArticlePubMedGoogle Scholar
- Liu X, Jiang J, Ajayi O, Gu X, Gilbert D, Sinnott R: BioNessie(G) - a Grid enabled biochemical networks simulation environment. Stud Health Technol Inform. 2008, 138: 147-157.PubMedGoogle Scholar
- Balsa-Canto E, Banga JR: AMIGO, a toolbox for advanced model identification in systems biology using global optimization. Bioinformatics. 2011, 27: 2311-2313. 10.1093/bioinformatics/btr370.PubMed CentralView ArticlePubMedGoogle Scholar
- Kurata H, Inoue K, Maeda K, Masaki K, Shimokawa Y, Zhao Q: Extended CADLIVE: a novel graphical notation for design of biochemical network maps and computational pathway analysis. Nucleic Acids Res. 2007, 35: e134-10.1093/nar/gkm769.PubMed CentralView ArticlePubMedGoogle Scholar
- Kurata H, Masaki K, Sumida Y, Iwasaki R: CADLIVE dynamic simulator: direct link of biochemical networks to dynamic models. Genome Res. 2005, 15: 590-600. 10.1101/gr.3463705.PubMed CentralView ArticlePubMedGoogle Scholar
- Kurata H, Matoba N, Shimizu N: CADLIVE for constructing a large-scale biochemical network based on a simulation-directed notation and its application to yeast cell cycle. Nucleic Acids Res. 2003, 31: 4071-4084. 10.1093/nar/gkg461.PubMed CentralView ArticlePubMedGoogle Scholar
- Inoue K, Tomeda S, Tonami S, Shimokawa Y, Ono M, Kurata H: CADLIVE Converter for constructing a biochemical network map. Biochem Eng J. 2011, 54: 200-206. 10.1016/j.bej.2011.02.022.View ArticleGoogle Scholar
- Maeda K, Kurata H: Quasi-multiparameter sensitivity measure for robustness analysis of complex biochemical networks. J Theor Biol. 2011, 272: 174-186. 10.1016/j.jtbi.2010.12.012.View ArticlePubMedGoogle Scholar
- Masaki K, Maeda K, Kurata H: Biological design principles of complex feedback modules in the E. coli ammonia assimilation system. Artificial life. 2012, 18: 53-90.View ArticlePubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.