Top

Data Mining and Knowledge Discovery

Published in:

01-03-2015

Labeled directed acyclic graphs: a generalization of context-specific independence in directed graphical models

Authors: Johan Pensar, Henrik Nyman, Timo Koski, Jukka Corander

Published in: Data Mining and Knowledge Discovery | Issue 2/2015

Activate our intelligent search to find suitable subject content or patents.

search-config

AI-assisted search

Off

Abstract

We introduce a novel class of labeled directed acyclic graph (LDAG) models for finite sets of discrete variables. LDAGs generalize earlier proposals for allowing local structures in the conditional probability distribution of a node, such that unrestricted label sets determine which edges can be deleted from the underlying directed acyclic graph (DAG) for a given context. Several properties of these models are derived, including a generalization of the concept of Markov equivalence classes. Efficient Bayesian learning of LDAGs is enabled by introducing an LDAG-based factorization of the Dirichlet prior for the model parameters, such that the marginal likelihood can be calculated analytically. In addition, we develop a novel prior distribution for the model structures that can appropriately penalize a model for its labeling complexity. A non-reversible Markov chain Monte Carlo algorithm combined with a greedy hill climbing approach is used for illustrating the useful properties of LDAG models for both real and synthetic data sets.

previous article A framework for dissimilarity-based partitioning clustering of categorical time series

next article Survey on distance metric learning and dimensionality reduction in data mining

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

Available only for authorised users

Andersson S, Madigan D, Perlman M (1997) A characterization of Markov equivalence classes for acyclic digraphs. Ann Stat 25:505–541CrossRefMATHMathSciNet

Boutilier C, Friedman N, Goldszmidt M, Koller D (1996) Context-specific independence in Bayesian networks. In: Proceedings of the twelfth annual conference on uncertainty in artificial intelligence, Morgan Kaufmann, San Francisco, pp 115–123

Buntine W (1991) Theory refinement on Bayesian networks. In: Proceedings of the seventh conference on uncertainty in artificial intelligence, Morgan Kaufmann, San Francisco, pp 52–60

Chickering DM, Heckerman D, Meek C (1997) A Bayesian approach to learning Bayesian networks with local structure. In: Proceedings of thirteenth conference on uncertainty in artificial intelligence, Morgan Kaufmann, San Francisco

Cooper G, Herskovitz E (1992) A Bayesian method for the induction of probabilistic networks from data. Mach Learn 9:309–347MATH

Corander J (2003) Labelled graphical models. Scand J Stat 30:493–508CrossRefMATHMathSciNet

Corander J, Gyllenberg M, Koski T (2006) Bayesian model learning based on a parallel MCMC strategy. Stat Comput 16:355–362CrossRefMathSciNet

Corander J, Ekdahl M, Koski T (2008) Parallel interacting MCMC for learning of topologies of graphical models. Data Min Knowl Discov 17:431–456CrossRefMathSciNet

Eriksen PS (1999) Context specific interaction models. AAlborg University, Tech. rep.

Friedman N, Goldszmidt M (1996) Learning Bayesian networks with local structure. In: Proceedings of the twelfth annual conference on uncertainty in artificial intelligence, Morgan Kaufmann, San Francisco, pp 252–262

Geiger D, Heckerman D (1996) Knowledge representation and inference in similarity networks and Bayesian multinets. Artif Intell 82:45–74CrossRefMathSciNet

Heckerman D (1991) Probabilistic similarity networks. MIT Press, Cambridge

Heckerman D, Geiger D, Chickering D (1995) Learning Bayesian networks: the combination of knowledge and statistical data. Mach Learn 20:197–243MATH

Højsgaard S (2003) Split models for contingency tables. Comput Stat Data Anal 42:621–645CrossRef

Højsgaard S (2004) Statistical inference in context specific interaction models for contingency tables. Scand J Stat 31:143–158CrossRef

Koller D, Friedman N (2009) Probabilistic graphical models: principles and techniques. MIT Press, Cambridge

Koski T, Noble J (2009) Bayesian networks. An introduction. Wiley, Chippenham, WiltshireCrossRefMATH

Marttinen P, Corander J (2009) Bayesian learning of graphical vector autoregressions with unequal lag-lengths. Mach Learn 75:217–243CrossRef

Pagallo G, Haussler D (1990) Boolean feature discovery in empirical learning. Mach Learn 5:71–99CrossRef

Pearl J (1988) Probabilistic reasoning in intelligent systems. Morgan Kaufmann, San Francisco

Poole D (1997) Probabilistic partial evaluation: Exploiting rule structure in probabilistic inference. In: Proceedings of the 15th international joint conference on artificial intelligence, pp 1284–1291

Poole D, Zhang N (2003) Exploiting contextual independence in probabilistic inference. J Artif Intell Res 18:263–313MATHMathSciNet

Robinson R (1977) Counting unlabelled acyclic digraphs. Springer Lect Notes Math 622:28–43CrossRef

Silander T, Kontkanen P, Myllymäki P (2007) On sensitivity of the MAP Bayesian network structure to the equivalent sample size parameter. In: Proceedings of the 23rd conference on uncertainty in artificial intelligence, AUAI Press, Corvallis, pp 360–367

Whittaker J (1990) Graphical models in applied multivariate statistics. Wiley, New YorkMATH

Zhang N, Poole D (1999) On the role of context-specific independence in probabilistic inference. In: Proceedings of the 16th international joint conference on artificial intelligence, pp 1288–1293

Title: Labeled directed acyclic graphs: a generalization of context-specific independence in directed graphical models
Authors: Johan Pensar
Henrik Nyman
Timo Koski
Jukka Corander
Publication date: 01-03-2015
Publisher: Springer US
Published in: Data Mining and Knowledge Discovery / Issue 2/2015
Print ISSN: 1384-5810
Electronic ISSN: 1573-756X
DOI: https://doi.org/10.1007/s10618-014-0355-0

Springer Professional

Abstract

Please log in to get access to your license.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Other articles of this Issue 2/2015

Inhibiting diffusion of complex contagions in social networks: theoretical and experimental results

Survey on distance metric learning and dimensionality reduction in data mining

Addressing the cold-start problem in location recommendation using geo-social correlations

Exemplar-based low-rank matrix decomposition for data clustering

A framework for dissimilarity-based partitioning clustering of categorical time series

Using the minimum description length to discover the intrinsic cardinality and dimensionality of time series

Premium Partner