Skip to main content

2018 | OriginalPaper | Buchkapitel

Scalable Inference of Gene Regulatory Networks with the Spark Distributed Computing Platform

verfasst von : Cristóbal Barba-González, José García-Nieto, Antonio Benítez-Hidalgo, Antonio J. Nebro, José F. Aldana-Montes

Erschienen in: Intelligent Distributed Computing XII

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Inference of Gene Regulatory Networks (GRNs) remains an important open challenge in computational biology. The goal of bio-model inference is to, based on time-series of gene expression data, obtain the sparse topological structure and the parameters that quantitatively understand and reproduce the dynamics of biological system. Nevertheless, the inference of a GRN is a complex optimization problem that involve processing S-System models, which include large amount of gene expression data from hundreds (even thousands) of genes in multiple time-series (essays). This complexity, along with the amount of data managed, make the inference of GRNs to be a computationally expensive task. Therefore, the generation of parallel algorithmic proposals that operate efficiently on distributed processing platforms is a must in current reconstruction of GRNs. In this paper, a parallel multi-objective approach is proposed for the optimal inference of GRNs, since minimizing the Mean Squared Error using S-System model and Topology Regularization value. A flexible and robust multi-objective cellular evolutionary algorithm is adapted to deploy parallel tasks, in form of Spark jobs. The proposed approach has been developed using the framework jMetal, so in order to perform parallel computation, we use Spark on a cluster of distributed nodes to evaluate candidate solutions modeling the interactions of genes in biological networks.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
2
We use either the term processor or core to refer the same processing unit.
 
Literatur
1.
Zurück zum Zitat Akutsu, T., Kuhara, S., Maruyama, O., Miyano, S.: Identification of genetic networks by strategic gene disruptions and gene overexpressions under a boolean model. Theoret. Comput. Sci. 298(1), 235–251 (2003)MathSciNetCrossRef Akutsu, T., Kuhara, S., Maruyama, O., Miyano, S.: Identification of genetic networks by strategic gene disruptions and gene overexpressions under a boolean model. Theoret. Comput. Sci. 298(1), 235–251 (2003)MathSciNetCrossRef
2.
Zurück zum Zitat Angus, T.S., Yaochu, J.: Reconstructing biological gene regulatory networks: where optimization meets big data. Evol. Intell. 7(1), 29–47 (2014)CrossRef Angus, T.S., Yaochu, J.: Reconstructing biological gene regulatory networks: where optimization meets big data. Evol. Intell. 7(1), 29–47 (2014)CrossRef
3.
Zurück zum Zitat Barba-Gonzaléz, C., García-Nieto, J., Nebro, A.J., Aldana-Montes, J.F.: Multi-objective big data optimization with jMetal and spark. In: International Conference on Evolutionary Multi-Criterion Optimization, pp. 16–30. Springer (2017) Barba-Gonzaléz, C., García-Nieto, J., Nebro, A.J., Aldana-Montes, J.F.: Multi-objective big data optimization with jMetal and spark. In: International Conference on Evolutionary Multi-Criterion Optimization, pp. 16–30. Springer (2017)
4.
Zurück zum Zitat Deb, K.: Multi-Objective Optimization Using Evolutionary Algorithms. Wiley, New York (2001)MATH Deb, K.: Multi-Objective Optimization Using Evolutionary Algorithms. Wiley, New York (2001)MATH
5.
Zurück zum Zitat Durillo, J.J., Nebro, A.J.: jMetal: a java framework for multi-objective optimization. Adv. Eng. Softw. 42, 760–771 (2011)CrossRef Durillo, J.J., Nebro, A.J.: jMetal: a java framework for multi-objective optimization. Adv. Eng. Softw. 42, 760–771 (2011)CrossRef
6.
Zurück zum Zitat Friedman, N., Linial, M., Nachman, I.: Using Bayesian networks to analyze expression data. J. Comput. Biol. 7, 3–4 (2004) Friedman, N., Linial, M., Nachman, I.: Using Bayesian networks to analyze expression data. J. Comput. Biol. 7, 3–4 (2004)
7.
Zurück zum Zitat Nebro, A.J., Durillo, J.J., Luna, F., Dorronsoro, B., Alba, E.: Design issues in a multiobjective cellular genetic algorithm, pp. 126–140. Springer, Heidelberg (2007) Nebro, A.J., Durillo, J.J., Luna, F., Dorronsoro, B., Alba, E.: Design issues in a multiobjective cellular genetic algorithm, pp. 126–140. Springer, Heidelberg (2007)
8.
Zurück zum Zitat Noman, N., Iba, H.: Inferring gene regulatory networks using differential evolution with local search heuristics. TCBB 4(4), 634–647 (2007) Noman, N., Iba, H.: Inferring gene regulatory networks using differential evolution with local search heuristics. TCBB 4(4), 634–647 (2007)
9.
Zurück zum Zitat Palafox, L., Noman, N., Iba, H.: Reverse engineering of gene regulatory networks using dissipative particle swarm optimization. IEEE Trans. Evol. Comput. 17(4), 577–587 (2013)CrossRef Palafox, L., Noman, N., Iba, H.: Reverse engineering of gene regulatory networks using dissipative particle swarm optimization. IEEE Trans. Evol. Comput. 17(4), 577–587 (2013)CrossRef
10.
Zurück zum Zitat Prill, R.J., Marbach, D., Saez-Rodriguez, J., Sorger, P.K., Alexopoulos, L.G., Xue, X., Clarke, N.D., Altan-Bonnet, G., Stolovitzky, G.: Towards a rigorous assessment of systems biology models: the DREAM3 challenges. PLoS ONE 5(2), 1–18 (2010)CrossRef Prill, R.J., Marbach, D., Saez-Rodriguez, J., Sorger, P.K., Alexopoulos, L.G., Xue, X., Clarke, N.D., Altan-Bonnet, G., Stolovitzky, G.: Towards a rigorous assessment of systems biology models: the DREAM3 challenges. PLoS ONE 5(2), 1–18 (2010)CrossRef
11.
Zurück zum Zitat Savageau, M.: Biochemical Systems Analysis: A Study of Function and Design in Molecular Biology. Addison-Wesley Educational Publishers Inc., Reading (2010) Savageau, M.: Biochemical Systems Analysis: A Study of Function and Design in Molecular Biology. Addison-Wesley Educational Publishers Inc., Reading (2010)
12.
Zurück zum Zitat Sirbu, A., Ruskin, H.J., Crane, M.: Comparison of evolutionary algorithms in gene regulatory network model inference. BMC Bioinfor. 11(1), 59 (2010)CrossRef Sirbu, A., Ruskin, H.J., Crane, M.: Comparison of evolutionary algorithms in gene regulatory network model inference. BMC Bioinfor. 11(1), 59 (2010)CrossRef
13.
Zurück zum Zitat Voit, E.O.: Computational Analysis of Biochemical Systems. A Practical Guide for Biochemists and Molecular Biologists. Cambridge University Press, New York (2000) Voit, E.O.: Computational Analysis of Biochemical Systems. A Practical Guide for Biochemists and Molecular Biologists. Cambridge University Press, New York (2000)
14.
Zurück zum Zitat Zaharia, M., Chowdhury, M., Franklin, M.J., Shenker, S., Stoica, I.: Spark: cluster computing with working sets. In: Proceedings of the 2nd USENIX Conference on Hot Topics in Cloud Computing, HotCloud 2010, Berkeley, CA, USA, p. 10. USENIX Association (2010) Zaharia, M., Chowdhury, M., Franklin, M.J., Shenker, S., Stoica, I.: Spark: cluster computing with working sets. In: Proceedings of the 2nd USENIX Conference on Hot Topics in Cloud Computing, HotCloud 2010, Berkeley, CA, USA, p. 10. USENIX Association (2010)
Metadaten
Titel
Scalable Inference of Gene Regulatory Networks with the Spark Distributed Computing Platform
verfasst von
Cristóbal Barba-González
José García-Nieto
Antonio Benítez-Hidalgo
Antonio J. Nebro
José F. Aldana-Montes
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-319-99626-4_6