Skip to main content
Top
Published in: Data Mining and Knowledge Discovery 2/2015

01-03-2015

Labeled directed acyclic graphs: a generalization of context-specific independence in directed graphical models

Authors: Johan Pensar, Henrik Nyman, Timo Koski, Jukka Corander

Published in: Data Mining and Knowledge Discovery | Issue 2/2015

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

We introduce a novel class of labeled directed acyclic graph (LDAG) models for finite sets of discrete variables. LDAGs generalize earlier proposals for allowing local structures in the conditional probability distribution of a node, such that unrestricted label sets determine which edges can be deleted from the underlying directed acyclic graph (DAG) for a given context. Several properties of these models are derived, including a generalization of the concept of Markov equivalence classes. Efficient Bayesian learning of LDAGs is enabled by introducing an LDAG-based factorization of the Dirichlet prior for the model parameters, such that the marginal likelihood can be calculated analytically. In addition, we develop a novel prior distribution for the model structures that can appropriately penalize a model for its labeling complexity. A non-reversible Markov chain Monte Carlo algorithm combined with a greedy hill climbing approach is used for illustrating the useful properties of LDAG models for both real and synthetic data sets.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Appendix
Available only for authorised users
Literature
go back to reference Andersson S, Madigan D, Perlman M (1997) A characterization of Markov equivalence classes for acyclic digraphs. Ann Stat 25:505–541CrossRefMATHMathSciNet Andersson S, Madigan D, Perlman M (1997) A characterization of Markov equivalence classes for acyclic digraphs. Ann Stat 25:505–541CrossRefMATHMathSciNet
go back to reference Boutilier C, Friedman N, Goldszmidt M, Koller D (1996) Context-specific independence in Bayesian networks. In: Proceedings of the twelfth annual conference on uncertainty in artificial intelligence, Morgan Kaufmann, San Francisco, pp 115–123 Boutilier C, Friedman N, Goldszmidt M, Koller D (1996) Context-specific independence in Bayesian networks. In: Proceedings of the twelfth annual conference on uncertainty in artificial intelligence, Morgan Kaufmann, San Francisco, pp 115–123
go back to reference Buntine W (1991) Theory refinement on Bayesian networks. In: Proceedings of the seventh conference on uncertainty in artificial intelligence, Morgan Kaufmann, San Francisco, pp 52–60 Buntine W (1991) Theory refinement on Bayesian networks. In: Proceedings of the seventh conference on uncertainty in artificial intelligence, Morgan Kaufmann, San Francisco, pp 52–60
go back to reference Chickering DM, Heckerman D, Meek C (1997) A Bayesian approach to learning Bayesian networks with local structure. In: Proceedings of thirteenth conference on uncertainty in artificial intelligence, Morgan Kaufmann, San Francisco Chickering DM, Heckerman D, Meek C (1997) A Bayesian approach to learning Bayesian networks with local structure. In: Proceedings of thirteenth conference on uncertainty in artificial intelligence, Morgan Kaufmann, San Francisco
go back to reference Cooper G, Herskovitz E (1992) A Bayesian method for the induction of probabilistic networks from data. Mach Learn 9:309–347MATH Cooper G, Herskovitz E (1992) A Bayesian method for the induction of probabilistic networks from data. Mach Learn 9:309–347MATH
go back to reference Corander J, Gyllenberg M, Koski T (2006) Bayesian model learning based on a parallel MCMC strategy. Stat Comput 16:355–362CrossRefMathSciNet Corander J, Gyllenberg M, Koski T (2006) Bayesian model learning based on a parallel MCMC strategy. Stat Comput 16:355–362CrossRefMathSciNet
go back to reference Corander J, Ekdahl M, Koski T (2008) Parallel interacting MCMC for learning of topologies of graphical models. Data Min Knowl Discov 17:431–456CrossRefMathSciNet Corander J, Ekdahl M, Koski T (2008) Parallel interacting MCMC for learning of topologies of graphical models. Data Min Knowl Discov 17:431–456CrossRefMathSciNet
go back to reference Eriksen PS (1999) Context specific interaction models. AAlborg University, Tech. rep. Eriksen PS (1999) Context specific interaction models. AAlborg University, Tech. rep.
go back to reference Friedman N, Goldszmidt M (1996) Learning Bayesian networks with local structure. In: Proceedings of the twelfth annual conference on uncertainty in artificial intelligence, Morgan Kaufmann, San Francisco, pp 252–262 Friedman N, Goldszmidt M (1996) Learning Bayesian networks with local structure. In: Proceedings of the twelfth annual conference on uncertainty in artificial intelligence, Morgan Kaufmann, San Francisco, pp 252–262
go back to reference Geiger D, Heckerman D (1996) Knowledge representation and inference in similarity networks and Bayesian multinets. Artif Intell 82:45–74CrossRefMathSciNet Geiger D, Heckerman D (1996) Knowledge representation and inference in similarity networks and Bayesian multinets. Artif Intell 82:45–74CrossRefMathSciNet
go back to reference Heckerman D (1991) Probabilistic similarity networks. MIT Press, Cambridge Heckerman D (1991) Probabilistic similarity networks. MIT Press, Cambridge
go back to reference Heckerman D, Geiger D, Chickering D (1995) Learning Bayesian networks: the combination of knowledge and statistical data. Mach Learn 20:197–243MATH Heckerman D, Geiger D, Chickering D (1995) Learning Bayesian networks: the combination of knowledge and statistical data. Mach Learn 20:197–243MATH
go back to reference Højsgaard S (2003) Split models for contingency tables. Comput Stat Data Anal 42:621–645CrossRef Højsgaard S (2003) Split models for contingency tables. Comput Stat Data Anal 42:621–645CrossRef
go back to reference Højsgaard S (2004) Statistical inference in context specific interaction models for contingency tables. Scand J Stat 31:143–158CrossRef Højsgaard S (2004) Statistical inference in context specific interaction models for contingency tables. Scand J Stat 31:143–158CrossRef
go back to reference Koller D, Friedman N (2009) Probabilistic graphical models: principles and techniques. MIT Press, Cambridge Koller D, Friedman N (2009) Probabilistic graphical models: principles and techniques. MIT Press, Cambridge
go back to reference Marttinen P, Corander J (2009) Bayesian learning of graphical vector autoregressions with unequal lag-lengths. Mach Learn 75:217–243CrossRef Marttinen P, Corander J (2009) Bayesian learning of graphical vector autoregressions with unequal lag-lengths. Mach Learn 75:217–243CrossRef
go back to reference Pagallo G, Haussler D (1990) Boolean feature discovery in empirical learning. Mach Learn 5:71–99CrossRef Pagallo G, Haussler D (1990) Boolean feature discovery in empirical learning. Mach Learn 5:71–99CrossRef
go back to reference Pearl J (1988) Probabilistic reasoning in intelligent systems. Morgan Kaufmann, San Francisco Pearl J (1988) Probabilistic reasoning in intelligent systems. Morgan Kaufmann, San Francisco
go back to reference Poole D (1997) Probabilistic partial evaluation: Exploiting rule structure in probabilistic inference. In: Proceedings of the 15th international joint conference on artificial intelligence, pp 1284–1291 Poole D (1997) Probabilistic partial evaluation: Exploiting rule structure in probabilistic inference. In: Proceedings of the 15th international joint conference on artificial intelligence, pp 1284–1291
go back to reference Poole D, Zhang N (2003) Exploiting contextual independence in probabilistic inference. J Artif Intell Res 18:263–313MATHMathSciNet Poole D, Zhang N (2003) Exploiting contextual independence in probabilistic inference. J Artif Intell Res 18:263–313MATHMathSciNet
go back to reference Robinson R (1977) Counting unlabelled acyclic digraphs. Springer Lect Notes Math 622:28–43CrossRef Robinson R (1977) Counting unlabelled acyclic digraphs. Springer Lect Notes Math 622:28–43CrossRef
go back to reference Silander T, Kontkanen P, Myllymäki P (2007) On sensitivity of the MAP Bayesian network structure to the equivalent sample size parameter. In: Proceedings of the 23rd conference on uncertainty in artificial intelligence, AUAI Press, Corvallis, pp 360–367 Silander T, Kontkanen P, Myllymäki P (2007) On sensitivity of the MAP Bayesian network structure to the equivalent sample size parameter. In: Proceedings of the 23rd conference on uncertainty in artificial intelligence, AUAI Press, Corvallis, pp 360–367
go back to reference Whittaker J (1990) Graphical models in applied multivariate statistics. Wiley, New YorkMATH Whittaker J (1990) Graphical models in applied multivariate statistics. Wiley, New YorkMATH
go back to reference Zhang N, Poole D (1999) On the role of context-specific independence in probabilistic inference. In: Proceedings of the 16th international joint conference on artificial intelligence, pp 1288–1293 Zhang N, Poole D (1999) On the role of context-specific independence in probabilistic inference. In: Proceedings of the 16th international joint conference on artificial intelligence, pp 1288–1293
Metadata
Title
Labeled directed acyclic graphs: a generalization of context-specific independence in directed graphical models
Authors
Johan Pensar
Henrik Nyman
Timo Koski
Jukka Corander
Publication date
01-03-2015
Publisher
Springer US
Published in
Data Mining and Knowledge Discovery / Issue 2/2015
Print ISSN: 1384-5810
Electronic ISSN: 1573-756X
DOI
https://doi.org/10.1007/s10618-014-0355-0

Other articles of this Issue 2/2015

Data Mining and Knowledge Discovery 2/2015 Go to the issue

Premium Partner