Skip to main content
Top

2018 | OriginalPaper | Chapter

Structural Learning of Probabilistic Graphical Models of Cumulative Phenomena

Authors : Daniele Ramazzotti, Marco S. Nobile, Marco Antoniotti, Alex Graudenzi

Published in: Computational Science – ICCS 2018

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

One of the critical issues when adopting Bayesian networks (BNs) to model dependencies among random variables is to “learn” their structure. This is a well-known NP-hard problem in its most general and classical formulation, which is furthermore complicated by known pitfalls such as the issue of I-equivalence among different structures. In this work we restrict the investigation to a specific class of networks, i.e., those representing the dynamics of phenomena characterized by the monotonic accumulation of events. Such phenomena allow to set specific structural constraints based on Suppes’ theory of probabilistic causation and, accordingly, to define constrained BNs, named Suppes-Bayes Causal Networks (SBCNs). Within this framework, we study the structure learning of SBCNs via extensive simulations with various state-of-the-art search strategies, such as canonical local search techniques and Genetic Algorithms. This investigation is intended to be an extension and an in-depth clarification of our previous works on SBCN structure learning. Among the main results, we show that Suppes’ constraints do simplify the learning task, by reducing the solution search space and providing a temporal ordering on the variables, which simplifies the complications derived by I-equivalent structures. Finally, we report on tradeoffs among different optimization techniques that can be used to learn SBCNs.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Footnotes
1
The events accumulate over time and when later events occur, earlier events are observed as well.
 
2
We are aware of special formulations of the problem that are solvable in polynomial time. Their existence points to interesting questions regarding the “barrier” between NP problems and polynomial ones; however, these are questions beyond the scope of the present paper.
 
3
Since BNs are DAGs, the representation can be reduced by not encoding the elements on the diagonal, which are always equal to zero. In such case, the strings representing the individual have length \(K \times K - K\).
 
4
Here we stick with the efficient search scheme of CAPRI and, for this reason, we do not consider exclusive disjunction (\(\oplus \)) parent sets.
 
5
Further experiments on multi-objective optimization techniques, such as Non-dominated Sorting Genetic Algorithm (NSGA- II), were performed, but are not shown here because of the worse overall performance, and of the higher computational cost, with respect to canonical GA.
 
Literature
2.
go back to reference Antoniotti, M., Caravagna, G., De Sano, L., Graudenzi, A., Mauri, G., Mishra, B., Ramazzotti, D.: Design of the TRONCO bioconductor package for translational oncology. R J. 8(2), 39–59 (2016) Antoniotti, M., Caravagna, G., De Sano, L., Graudenzi, A., Mauri, G., Mishra, B., Ramazzotti, D.: Design of the TRONCO bioconductor package for translational oncology. R J. 8(2), 39–59 (2016)
3.
go back to reference Back, T.: Selective pressure in evolutionary algorithms: a characterization of selection mechanisms. In: Proceedings of the First IEEE Conference on Evolutionary Computation, 1994. IEEE World Congress on Computational Intelligence, pp. 57–62. IEEE (1994) Back, T.: Selective pressure in evolutionary algorithms: a characterization of selection mechanisms. In: Proceedings of the First IEEE Conference on Evolutionary Computation, 1994. IEEE World Congress on Computational Intelligence, pp. 57–62. IEEE (1994)
4.
go back to reference Bonchi, F., Hajian, S., Mishra, B., Ramazzotti, D.: Exposing the probabilistic causal structure of discrimination. Int. J. Data Sci. Anal. 3(1), 1–21 (2017)CrossRef Bonchi, F., Hajian, S., Mishra, B., Ramazzotti, D.: Exposing the probabilistic causal structure of discrimination. Int. J. Data Sci. Anal. 3(1), 1–21 (2017)CrossRef
5.
go back to reference Buldyrev, S.V., Parshani, R., Paul, G., Stanley, H.E., Havlin, S.: Catastrophic cascade of failures in interdependent networks. Nature 464(7291), 1025–1028 (2010)CrossRef Buldyrev, S.V., Parshani, R., Paul, G., Stanley, H.E., Havlin, S.: Catastrophic cascade of failures in interdependent networks. Nature 464(7291), 1025–1028 (2010)CrossRef
6.
go back to reference Buntine, W.: Theory refinement on Bayesian networks. In: Proceedings of the Seventh Conference on Uncertainty in Artificial Intelligence, pp. 52–60. Morgan Kaufmann Publishers Inc. (1991)CrossRef Buntine, W.: Theory refinement on Bayesian networks. In: Proceedings of the Seventh Conference on Uncertainty in Artificial Intelligence, pp. 52–60. Morgan Kaufmann Publishers Inc. (1991)CrossRef
7.
go back to reference Caravagna, G., Graudenzi, A., Ramazzotti, D., Sanz-Pamplona, R., De Sano, L., Mauri, G., Moreno, V., Antoniotti, M., Mishra, B.: Algorithmic methods to infer the evolutionary trajectories in cancer progression. Proc. Natl. Acad. Sci. 113(28), E4025–E4034 (2016)CrossRef Caravagna, G., Graudenzi, A., Ramazzotti, D., Sanz-Pamplona, R., De Sano, L., Mauri, G., Moreno, V., Antoniotti, M., Mishra, B.: Algorithmic methods to infer the evolutionary trajectories in cancer progression. Proc. Natl. Acad. Sci. 113(28), E4025–E4034 (2016)CrossRef
8.
go back to reference Carvalho, A.M.: Scoring functions for learning Bayesian networks. Inesc-id Technical report (2009) Carvalho, A.M.: Scoring functions for learning Bayesian networks. Inesc-id Technical report (2009)
10.
go back to reference Chickering, D.M., Heckerman, D., Meek, C.: Large-sample learning of Bayesian networks is NP-hard. J. Mach. Learn. Res. 5(Oct), 1287–1330 (2004)MathSciNetMATH Chickering, D.M., Heckerman, D., Meek, C.: Large-sample learning of Bayesian networks is NP-hard. J. Mach. Learn. Res. 5(Oct), 1287–1330 (2004)MathSciNetMATH
11.
go back to reference Cooper, G.F., Herskovits, E.: A Bayesian method for constructing Bayesian belief networks from databases. In: Proceedings of the Seventh Conference on Uncertainty in Artificial Intelligence, pp. 86–94. Morgan Kaufmann Publishers Inc. (1991)CrossRef Cooper, G.F., Herskovits, E.: A Bayesian method for constructing Bayesian belief networks from databases. In: Proceedings of the Seventh Conference on Uncertainty in Artificial Intelligence, pp. 86–94. Morgan Kaufmann Publishers Inc. (1991)CrossRef
12.
go back to reference Cooper, G.F., Herskovits, E.: A Bayesian method for the induction of probabilistic networks from data. Mach. Learn. 9(4), 309–347 (1992)MATH Cooper, G.F., Herskovits, E.: A Bayesian method for the induction of probabilistic networks from data. Mach. Learn. 9(4), 309–347 (1992)MATH
13.
go back to reference De Sano, L., Caravagna, G., Ramazzotti, D., Graudenzi, A., Mauri, G., Mishra, B., Antoniotti, M.: TRONCO: an R package for the inference of cancer progression models from heterogeneous genomic data. Bioinformatics 32(12), 1911–1913 (2016)CrossRef De Sano, L., Caravagna, G., Ramazzotti, D., Graudenzi, A., Mauri, G., Mishra, B., Antoniotti, M.: TRONCO: an R package for the inference of cancer progression models from heterogeneous genomic data. Bioinformatics 32(12), 1911–1913 (2016)CrossRef
14.
go back to reference Farahani, H.S., Lagergren, J.: Learning oncogenetic networks by reducing to mixed integer linear programming. PloS One 8(6), e65773 (2013)CrossRef Farahani, H.S., Lagergren, J.: Learning oncogenetic networks by reducing to mixed integer linear programming. PloS One 8(6), e65773 (2013)CrossRef
15.
go back to reference Gao, G., Mishra, B., Ramazzotti, D.: Efficient simulation of financial stress testing scenarios with Suppes-Bayes causal networks. Procedia Comput. Sci. 108, 272–284 (2017)CrossRef Gao, G., Mishra, B., Ramazzotti, D.: Efficient simulation of financial stress testing scenarios with Suppes-Bayes causal networks. Procedia Comput. Sci. 108, 272–284 (2017)CrossRef
17.
go back to reference Good, I.J., Mittal, Y., et al.: The amalgamation and geometry of two-by-two contingency tables. Ann. Stat. 15(2), 694–711 (1987)MathSciNetCrossRef Good, I.J., Mittal, Y., et al.: The amalgamation and geometry of two-by-two contingency tables. Ann. Stat. 15(2), 694–711 (1987)MathSciNetCrossRef
18.
go back to reference Heckerman, D., Geiger, D., Chickering, D.M.: Learning bayesian networks: the combination of knowledge and statistical data. Mach. Learn. 20(3), 197–243 (1995)MATH Heckerman, D., Geiger, D., Chickering, D.M.: Learning bayesian networks: the combination of knowledge and statistical data. Mach. Learn. 20(3), 197–243 (1995)MATH
19.
go back to reference Hitchcock, C.: Probabilistic causation. Stanford encyclopedia of philosophy (2010) Hitchcock, C.: Probabilistic causation. Stanford encyclopedia of philosophy (2010)
20.
go back to reference Holland, J.H.: Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence. U Michigan Press (1975) Holland, J.H.: Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence. U Michigan Press (1975)
21.
go back to reference Koller, D., Friedman, N.: Probabilistic Graphical Models: Principles and Techniques. MIT Press, Cambridge (2009)MATH Koller, D., Friedman, N.: Probabilistic Graphical Models: Principles and Techniques. MIT Press, Cambridge (2009)MATH
22.
go back to reference Korsunsky, I., Ramazzotti, D., Caravagna, G., Mishra, B.: Inference of cancer progression models with biological noise. arXiv preprint arXiv:1408.6032 (2014) Korsunsky, I., Ramazzotti, D., Caravagna, G., Mishra, B.: Inference of cancer progression models with biological noise. arXiv preprint arXiv:​1408.​6032 (2014)
23.
go back to reference Larranaga, P., Poza, M., Yurramendi, Y., Murga, R.H., Kuijpers, C.M.: Structure learning of Bayesian networks by genetic algorithms. IEEE Trans. Pattern Anal. Mach. Intell. 18(9), 912–926 (1996)CrossRef Larranaga, P., Poza, M., Yurramendi, Y., Murga, R.H., Kuijpers, C.M.: Structure learning of Bayesian networks by genetic algorithms. IEEE Trans. Pattern Anal. Mach. Intell. 18(9), 912–926 (1996)CrossRef
24.
go back to reference Loohuis, L.O., Caravagna, G., Graudenzi, A., Ramazzotti, D., Mauri, G., Antoniotti, M., Mishra, B.: Inferring tree causal models of cancer progression with probability raising. PloS One 9(10), e108358 (2014)CrossRef Loohuis, L.O., Caravagna, G., Graudenzi, A., Ramazzotti, D., Mauri, G., Antoniotti, M., Mishra, B.: Inferring tree causal models of cancer progression with probability raising. PloS One 9(10), e108358 (2014)CrossRef
25.
go back to reference Oliphant, T.: A Guide to Numpy, vol. 1. Trelgol Publishing, Spanish Fork (2006) Oliphant, T.: A Guide to Numpy, vol. 1. Trelgol Publishing, Spanish Fork (2006)
26.
27.
go back to reference Pearson, K.: Mathematical contributions to the theory of evolution on a form of spurious correlation which may arise when indices are used in the measurement of organs. Proc. R. Soc. Lond. 60(359–367), 489–498 (1896)CrossRef Pearson, K.: Mathematical contributions to the theory of evolution on a form of spurious correlation which may arise when indices are used in the measurement of organs. Proc. R. Soc. Lond. 60(359–367), 489–498 (1896)CrossRef
28.
go back to reference Ramazzotti, D., Caravagna, G., Olde Loohuis, L., Graudenzi, A., Korsunsky, I., Mauri, G., Antoniotti, M., Mishra, B.: CAPRI: efficient inference of cancer progression models from cross-sectional data. Bioinformatics 31(18), 3016–3026 (2015)CrossRef Ramazzotti, D., Caravagna, G., Olde Loohuis, L., Graudenzi, A., Korsunsky, I., Mauri, G., Antoniotti, M., Mishra, B.: CAPRI: efficient inference of cancer progression models from cross-sectional data. Bioinformatics 31(18), 3016–3026 (2015)CrossRef
29.
go back to reference Ramazzotti, D., Graudenzi, A., Caravagna, G., Antoniotti, M.: Modeling cumulative biological phenomena with Suppes-Bayes causal networks. arXiv preprint arXiv:1602.07857 (2016) Ramazzotti, D., Graudenzi, A., Caravagna, G., Antoniotti, M.: Modeling cumulative biological phenomena with Suppes-Bayes causal networks. arXiv preprint arXiv:​1602.​07857 (2016)
30.
go back to reference Ramazzotti, D., Nobile, M.S., Cazzaniga, P., Mauri, G., Antoniotti, M.: Parallel implementation of efficient search schemes for the inference of cancer progression models. In: IEEE International Conference on Computational Intelligence in Bioinformatics and Computational Biology. IEEE (2016) Ramazzotti, D., Nobile, M.S., Cazzaniga, P., Mauri, G., Antoniotti, M.: Parallel implementation of efficient search schemes for the inference of cancer progression models. In: IEEE International Conference on Computational Intelligence in Bioinformatics and Computational Biology. IEEE (2016)
31.
go back to reference Hagberg, A., Swart, P., Chult D.S.: Exploring network structure, dynamics, and function using NetworkX. In: Proceedings of the 7th Python in Science Conferences (SciPy 2008), vol. 2008, pp. 11–16 (2008) Hagberg, A., Swart, P., Chult D.S.: Exploring network structure, dynamics, and function using NetworkX. In: Proceedings of the 7th Python in Science Conferences (SciPy 2008), vol. 2008, pp. 11–16 (2008)
34.
go back to reference Suppes, P.: A Probabilistic Theory of Causality. North-Holland Publishing Company, Amsterdam (1970) Suppes, P.: A Probabilistic Theory of Causality. North-Holland Publishing Company, Amsterdam (1970)
35.
go back to reference Teyssier, M., Koller, D.: Ordering-based search: a simple and effective algorithm for learning Bayesian networks. arXiv preprint arXiv:1207.1429 (2012) Teyssier, M., Koller, D.: Ordering-based search: a simple and effective algorithm for learning Bayesian networks. arXiv preprint arXiv:​1207.​1429 (2012)
Metadata
Title
Structural Learning of Probabilistic Graphical Models of Cumulative Phenomena
Authors
Daniele Ramazzotti
Marco S. Nobile
Marco Antoniotti
Alex Graudenzi
Copyright Year
2018
DOI
https://doi.org/10.1007/978-3-319-93698-7_52

Premium Partner