2017 | Original Paper | Book Chapter

Discovery of Causal Models that Contain Latent Variables Through Bayesian Scoring of Independence Constraints

Written by: Fattaneh Jabbari, Joseph Ramsey, Peter Spirtes, Gregory Cooper

Published in: Machine Learning and Knowledge Discovery in Databases

Publisher: Springer International Publishing

Abstract

Discovering causal structure from observational data in the presence of latent variables remains an active research area. Constraint-based causal discovery algorithms are relatively efficient at discovering such causal models from data using independence tests. Typically, however, they derive and output only one such model. In contrast, Bayesian methods can generate and probabilistically score multiple models, outputting the most probable one; however, they are often computationally infeasible to apply when modeling latent variables. We introduce a hybrid method that derives a Bayesian probability that the set of independence tests associated with a given causal model are jointly correct. Using this constraint-based scoring method, we are able to score multiple causal models, which possibly contain latent variables, and output the most probable one. The structure-discovery performance of the proposed method is compared to an existing constraint-based method (RFCI) using data generated from several previously published Bayesian networks. The structural Hamming distances of the output models improved when using the proposed method compared to RFCI, especially for small sample sizes.
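
To make the scoring idea in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It assumes that a posterior probability of independence is already available for each tested constraint, and it treats the constraints as mutually independent so that the joint probability is a simple product; that independence is a simplifying assumption made here for illustration. The names `joint_log_score`, `most_probable_model`, and the toy posterior values are hypothetical.

```python
import math

# A constraint is keyed by (X, Y, conditioning set).
# posterior[key] is an assumed, precomputed P(X independent of Y given Z | data).

def joint_log_score(implied, posterior, eps=1e-12):
    """Log-probability that all constraints implied by a model are jointly correct.

    implied maps a constraint key to True (the model implies independence) or
    False (the model implies dependence). Constraints are treated as mutually
    independent, which is a simplifying assumption of this sketch.
    """
    total = 0.0
    for key, says_independent in implied.items():
        p_ind = posterior[key]
        p = p_ind if says_independent else 1.0 - p_ind
        total += math.log(max(p, eps))  # clamp to avoid log(0)
    return total

def most_probable_model(candidates, posterior):
    """Return the name of the candidate model with the highest joint score."""
    return max(candidates, key=lambda name: joint_log_score(candidates[name], posterior))

# Toy posteriors (made up for illustration only).
posterior = {
    ("X", "Y", ()): 0.1,      # X and Y look marginally dependent
    ("X", "Y", ("Z",)): 0.9,  # X and Y look independent given Z
}

candidates = {
    # X -> Z -> Y implies marginal dependence and independence given Z.
    "chain": {("X", "Y", ()): False, ("X", "Y", ("Z",)): True},
    # X -> Z <- Y implies marginal independence and dependence given Z.
    "collider": {("X", "Y", ()): True, ("X", "Y", ("Z",)): False},
}

best = most_probable_model(candidates, posterior)
chain_probability = math.exp(joint_log_score(candidates["chain"], posterior))
```

With these toy numbers the chain scores 0.9 × 0.9 and the collider scores 0.1 × 0.1, so the chain is selected. Working in log space keeps the product numerically stable when a model implies many constraints.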

Footnotes
1
The faithfulness assumption states that if X and Y conditional on a set \(\mathbf{Z}\) are d-connected in the structure of the data-generating CBN, then X and Y are dependent given \(\mathbf{Z}\) in the probability distribution defined by the data-generating CBN.
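
The d-connection condition in this footnote can be checked mechanically on a DAG. The sketch below uses the standard ancestral moral graph criterion: restrict the graph to the ancestors of X, Y, and Z, connect co-parents, drop edge directions, and test whether X can still reach Y without passing through Z. The function names and the `parents` dictionary representation are assumptions of this example, and it covers DAGs without latent variables only.

```python
def ancestral_set(parents, nodes):
    """Return the given nodes together with all of their ancestors.

    parents maps each node to the collection of its direct parents.
    """
    found = set(nodes)
    stack = list(nodes)
    while stack:
        node = stack.pop()
        for parent in parents.get(node, ()):
            if parent not in found:
                found.add(parent)
                stack.append(parent)
    return found

def d_connected(parents, x, y, z=()):
    """True if x and y are d-connected given the set z in the DAG."""
    keep = ancestral_set(parents, {x, y} | set(z))

    # Build the moral graph over the ancestral set: undirected parent-child
    # edges plus an edge between every pair of parents sharing a child.
    adjacent = {node: set() for node in keep}
    for child in keep:
        node_parents = list(parents.get(child, ()))
        for parent in node_parents:
            adjacent[child].add(parent)
            adjacent[parent].add(child)
        for i, first in enumerate(node_parents):
            for second in node_parents[i + 1:]:
                adjacent[first].add(second)
                adjacent[second].add(first)

    # Search from x to y while never entering a conditioning node.
    blocked = set(z)
    seen = {x}
    stack = [x]
    while stack:
        node = stack.pop()
        if node == y:
            return True
        for neighbor in adjacent[node]:
            if neighbor not in seen and neighbor not in blocked:
                seen.add(neighbor)
                stack.append(neighbor)
    return False

chain = {"Z": {"X"}, "Y": {"Z"}}   # X -> Z -> Y
collider = {"Z": {"X", "Y"}}       # X -> Z <- Y
```

Under faithfulness, every case where `d_connected` returns `True` corresponds to a dependence in the data-generating distribution, so a chain predicts that X and Y are dependent marginally, while a collider predicts dependence only once Z is conditioned on.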
 
Metadata
Title
Discovery of Causal Models that Contain Latent Variables Through Bayesian Scoring of Independence Constraints
Written by
Fattaneh Jabbari
Joseph Ramsey
Peter Spirtes
Gregory Cooper
Copyright Year
2017
DOI
https://doi.org/10.1007/978-3-319-71246-8_9
