Skip to main content
Top
Published in: Data Mining and Knowledge Discovery 5/2017

07-07-2017

MixedTrails: Bayesian hypothesis comparison on heterogeneous sequential data

Authors: Martin Becker, Florian Lemmerich, Philipp Singer, Markus Strohmaier, Andreas Hotho

Published in: Data Mining and Knowledge Discovery | Issue 5/2017

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Sequential traces of user data are frequently observed online and offline, e.g., as sequences of visited websites or as sequences of locations captured by GPS. However, understanding factors explaining the production of sequence data is a challenging task, especially since the data generation is often not homogeneous. For example, navigation behavior might change in different phases of browsing a website or movement behavior may vary between groups of users. In this work, we tackle this task and propose MixedTrails , a Bayesian approach for comparing the plausibility of hypotheses regarding the generative processes of heterogeneous sequence data. Each hypothesis is derived from existing literature, theory, or intuition and represents a belief about transition probabilities between a set of states that can vary between groups of observed transitions. For example, when trying to understand human movement in a city and given some data, a hypothesis assuming tourists to be more likely to move towards points of interests than locals can be shown to be more plausible than a hypothesis assuming the opposite. Our approach incorporates such hypotheses as Bayesian priors in a generative mixed transition Markov chain model, and compares their plausibility utilizing Bayes factors. We discuss analytical and approximate inference methods for calculating the marginal likelihoods for Bayes factors, give guidance on interpreting the results, and illustrate our approach with several experiments on synthetic and empirical data from Wikipedia and Flickr. Thus, this work enables a novel kind of analysis for studying sequential data in many application areas.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Appendix
Available only for authorised users
Footnotes
1
Note that this is a slightly simplified version of the original Trial Roulette method from the HypTrails paper (Singer et al. 2015) hyptrails regarding paper regarding two aspects. First, we do not distribute chips but multiply by a concentration factor which is effectively equivalent and easier to compute. Second, we assume in this paper the same weight in each row of the Markov chain which makes formulating hypotheses and interpreting results easier. However, these simplifications are not required and reverting them is straightforward.
 
3
The scripts for generating the synthetic data are included in the code, the Wikispeedia data set (cf. 4.3) is accessible online and the Flickr data (cf. 4.4) is available via e-mail to Martin Becker.
 
4
The Wikipedia articles are available at schools-wikipedia.orf (version 2007).
 
6
Differing from our approach, West et al. (2009) use the similarity between the clicked article and the target concept cos(it), but report that along the game progress, the similarity of the current and the clicked/next article is qualitatively similar. Thus, we use the latter approach since we can only use information from already visited states, not future states.
 
8
Note that our approach can also be applied to very different settings in a straight-forward manner.
 
Literature
go back to reference Asahara A, Maruyama K, Sato A, Seto K (2011) Pedestrian-movement prediction based on mixed Markov-chain model. In: Proceedings of the 19th ACM SIGSPATIAL international conference on advances in geographic information systems. ACM, pp 25–33 Asahara A, Maruyama K, Sato A, Seto K (2011) Pedestrian-movement prediction based on mixed Markov-chain model. In: Proceedings of the 19th ACM SIGSPATIAL international conference on advances in geographic information systems. ACM, pp 25–33
go back to reference Baccigalupo C, Plaza E (2006) Case-based sequential ordering of songs for playlist recommendation. In: European conference on case-based reasoning. Springer, pp 286–300 Baccigalupo C, Plaza E (2006) Case-based sequential ordering of songs for playlist recommendation. In: European conference on case-based reasoning. Springer, pp 286–300
go back to reference Becker M, Singer P, Lemmerich F, Hotho A, Helic D, Strohmaier M (2015) Photowalking the city: comparing hypotheses about urban photo trails on Flickr. In: Liu TY, Scollon CN, Zhu W (eds) Social informatics. Springer, pp 227–244 Becker M, Singer P, Lemmerich F, Hotho A, Helic D, Strohmaier M (2015) Photowalking the city: comparing hypotheses about urban photo trails on Flickr. In: Liu TY, Scollon CN, Zhu W (eds) Social informatics. Springer, pp 227–244
go back to reference Becker M, Mewes H, Hotho A, Dimitrov D, Lemmerich F, Strohmaier M (2016) Sparktrails: a MapReduce implementation of HypTrails for comparing hypotheses about human trails. In: Bourdeau J, Hendler J, Nkambou R, Horrocks I, Zhao BY (eds) Proceedings of the 25th international conference companion on world wide web. WWW’16 Companion, Canada. International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, Switzerland, pp 17–18 Becker M, Mewes H, Hotho A, Dimitrov D, Lemmerich F, Strohmaier M (2016) Sparktrails: a MapReduce implementation of HypTrails for comparing hypotheses about human trails. In: Bourdeau J, Hendler J, Nkambou R, Horrocks I, Zhao BY (eds) Proceedings of the 25th international conference companion on world wide web. WWW’16 Companion, Canada. International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, Switzerland, pp 17–18
go back to reference Benavoli A, Mangili F, Corani G, Zaffalon M, Ruggeri F (2014) A Bayesian Wilcoxon signed-rank test based on the Dirichlet process. In: Proceedings of the 31st international conference on machine learning, ICML’14, Beijing, China, June 2014. JMLR.org, pp 1026–1034 Benavoli A, Mangili F, Corani G, Zaffalon M, Ruggeri F (2014) A Bayesian Wilcoxon signed-rank test based on the Dirichlet process. In: Proceedings of the 31st international conference on machine learning, ICML’14, Beijing, China, June 2014. JMLR.org, pp 1026–1034
go back to reference Benjamini Y, Hochberg Y (1995) Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Ser B (Methodological) 57:289–300MathSciNetMATH Benjamini Y, Hochberg Y (1995) Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Ser B (Methodological) 57:289–300MathSciNetMATH
go back to reference Blackstone A (2012) Sociological inquiry principles: qualitative and quantitative methods. Flat World Knowledge, Irvington, NY, USA Blackstone A (2012) Sociological inquiry principles: qualitative and quantitative methods. Flat World Knowledge, Irvington, NY, USA
go back to reference Blei DM , Moreno PJ (2001) Topic segmentation with an aspect hidden Markov model. In: Proceedings of the 24th annual international ACM SIGIR conference on research and development in information retrieval. ACM, pp 343–348 Blei DM , Moreno PJ (2001) Topic segmentation with an aspect hidden Markov model. In: Proceedings of the 24th annual international ACM SIGIR conference on research and development in information retrieval. ACM, pp 343–348
go back to reference Brumby DP, Howes A (2004) Good enough but i’ll just check: web-page search as attentional refocusing. In: Lovett MC, Schunn CD, Lebiere C, Munro P (eds) Sixth international conference on cognitive modeling: ICCM - 2004. Psychology Press, pp 46–51 Brumby DP, Howes A (2004) Good enough but i’ll just check: web-page search as attentional refocusing. In: Lovett MC, Schunn CD, Lebiere C, Munro P (eds) Sixth international conference on cognitive modeling: ICCM - 2004. Psychology Press, pp 46–51
go back to reference Catledge LD, Pitkow JE (1995) Characterizing browsing strategies in the world-wide web. Comput Netw ISDN Syst 27(6):1065–1073CrossRef Catledge LD, Pitkow JE (1995) Characterizing browsing strategies in the world-wide web. Comput Netw ISDN Syst 27(6):1065–1073CrossRef
go back to reference Chalmers M, Rodden K, Brodbeck D (1998) The order of things: activity-centred information access. Comput Netw ISDN Syst 30(1):359–367CrossRef Chalmers M, Rodden K, Brodbeck D (1998) The order of things: activity-centred information access. Comput Netw ISDN Syst 30(1):359–367CrossRef
go back to reference Chi EH, Pirolli PLT, Chen K, Pitkow J (2001) Using information scent to model user information needs and actions and the web. In: Conference on human factors in computing systems. ACM, pp 490–497 Chi EH, Pirolli PLT, Chen K, Pitkow J (2001) Using information scent to model user information needs and actions and the web. In: Conference on human factors in computing systems. ACM, pp 490–497
go back to reference De Choudhury M, Feldman M, Amer-Yahia S, Golbandi N, Lempel R, Yu C (2010) Automatic construction of travel itineraries using social breadcrumbs. In: Proceedings of the 21st ACM conference on hypertext and hypermedia, HT’10, Toronto, Ontario, Canada. ACM, New York, NY, USA, pp 35–44 De Choudhury M, Feldman M, Amer-Yahia S, Golbandi N, Lempel R, Yu C (2010) Automatic construction of travel itineraries using social breadcrumbs. In: Proceedings of the 21st ACM conference on hypertext and hypermedia, HT’10, Toronto, Ontario, Canada. ACM, New York, NY, USA, pp 35–44
go back to reference Dimitrov D, Singer P, Lemmerich F, Strohmaier M (2017) What makes a link successful on wikipedia? In: Proceedings of the 26th International Conference on World Wide Web. WWW ’17, Perth, Australia. International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, Switzerland, pp 917–926 Dimitrov D, Singer P, Lemmerich F, Strohmaier M (2017) What makes a link successful on wikipedia? In: Proceedings of the 26th International Conference on World Wide Web. WWW ’17, Perth, Australia. International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, Switzerland, pp 917–926
go back to reference Figueiredo F, Ribeiro B, Almeida JM , Andrade N, Faloutsos C (2016a) Mining online music listening trajectories. In: Proceedings of the 17th ISMIR conference, New York City, USA, August 7–11, 2016 Figueiredo F, Ribeiro B, Almeida JM , Andrade N, Faloutsos C (2016a) Mining online music listening trajectories. In: Proceedings of the 17th ISMIR conference, New York City, USA, August 7–11, 2016
go back to reference Figueiredo F, Ribeiro B, Almeida JM, Faloutsos C (2016b) Tribeflow: mining & predicting user trajectories. In: Proceedings of the 25th international conference on world wide web. WWW ’16, Canada. International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, Switzerland, pp 695–706 Figueiredo F, Ribeiro B, Almeida JM, Faloutsos C (2016b) Tribeflow: mining & predicting user trajectories. In: Proceedings of the 25th international conference on world wide web. WWW ’16, Canada. International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, Switzerland, pp 695–706
go back to reference Fox EB, Sudderth EB, Jordan MI, Willsky AS (2010) Bayesian nonparametric methods for learning Markov switching processes. IEEE Signal Process Mag 27(6):43–54 Fox EB, Sudderth EB, Jordan MI, Willsky AS (2010) Bayesian nonparametric methods for learning Markov switching processes. IEEE Signal Process Mag 27(6):43–54
go back to reference Frühwirth-Schnatter S, Kaufmann S (2008) Model-based clustering of multiple time series. J Bus Econ Stat 26(1):78–89MathSciNetCrossRef Frühwirth-Schnatter S, Kaufmann S (2008) Model-based clustering of multiple time series. J Bus Econ Stat 26(1):78–89MathSciNetCrossRef
go back to reference Gabriel KR, Neumann J (1962) A Markov chain model for daily rainfall occurrence at Tel Aviv. Q J R Meteorol Soc 88(375):90–95CrossRef Gabriel KR, Neumann J (1962) A Markov chain model for daily rainfall occurrence at Tel Aviv. Q J R Meteorol Soc 88(375):90–95CrossRef
go back to reference Gambs S, Killijian M-O, del Prado Cortez MN (2010) Show me how you move and I will tell you who you are. In: Proceedings of the 3rd ACM SIGSPATIAL international workshop on security and privacy in GIS and LBS, SPRINGL ’10, ACM, New York, NY, USA, pp 34–41 Gambs S, Killijian M-O, del Prado Cortez MN (2010) Show me how you move and I will tell you who you are. In: Proceedings of the 3rd ACM SIGSPATIAL international workshop on security and privacy in GIS and LBS, SPRINGL ’10, ACM, New York, NY, USA, pp 34–41
go back to reference Gelman A, Hill J, Yajima M (2012) Why we (usually) don’t have to worry about multiple comparisons. J Res Educ Eff 5(2):189–211 Gelman A, Hill J, Yajima M (2012) Why we (usually) don’t have to worry about multiple comparisons. J Res Educ Eff 5(2):189–211
go back to reference Ghahramani Z, Jordan MI, Smyth P (1997) Factorial hidden Markov models. Mach Learn 29(2–3):245–273CrossRefMATH Ghahramani Z, Jordan MI, Smyth P (1997) Factorial hidden Markov models. Mach Learn 29(2–3):245–273CrossRefMATH
go back to reference Goldwater S, Griffiths T (2007) A fully Bayesian approach to unsupervised part-of-speech tagging. In: Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, June 2007. Association for Computational Linguistics, pp 744–751 Goldwater S, Griffiths T (2007) A fully Bayesian approach to unsupervised part-of-speech tagging. In: Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, June 2007. Association for Computational Linguistics, pp 744–751
go back to reference Gonzalez MC, Hidalgo CA, Barabasi A-L (2008) Understanding individual human mobility patterns. Nature 453(7196):779–782CrossRef Gonzalez MC, Hidalgo CA, Barabasi A-L (2008) Understanding individual human mobility patterns. Nature 453(7196):779–782CrossRef
go back to reference Goodman SN (1998) Multiple comparisons, explained. Am J Epidemiol 147(9):807–812CrossRef Goodman SN (1998) Multiple comparisons, explained. Am J Epidemiol 147(9):807–812CrossRef
go back to reference Gupta R, Kumar R, Vassilvitskii S (2016) On mixtures of Markov chains. In: Lee DD, Sugiyama M, Luxburg UV, Guyon I, Garnett R (eds) Advances in neural information processing systems 29. Curran Associates, Inc., pp 3441–3449 Gupta R, Kumar R, Vassilvitskii S (2016) On mixtures of Markov chains. In: Lee DD, Sugiyama M, Luxburg UV, Guyon I, Garnett R (eds) Advances in neural information processing systems 29. Curran Associates, Inc., pp 3441–3449
go back to reference Hayes B et al (2013) First links in the Markov chain. Am Sci 101(2):92–97CrossRef Hayes B et al (2013) First links in the Markov chain. Am Sci 101(2):92–97CrossRef
go back to reference Herr N (2008) The Sourcebook for Teaching Science, Grades 6-12: Strategies, Activities, and Instructional Resources, Wiley Herr N (2008) The Sourcebook for Teaching Science, Grades 6-12: Strategies, Activities, and Instructional Resources, Wiley
go back to reference Huberman BA, Pirolli PLT, Pitkow JE, Lukose RM (1998) Strong regularities in world wide web surfing. Science 280(5360):95–97CrossRef Huberman BA, Pirolli PLT, Pitkow JE, Lukose RM (1998) Strong regularities in world wide web surfing. Science 280(5360):95–97CrossRef
go back to reference Kemeny JG, Snell JL et al (1960) Finite Markov chains, vol 356. van Nostrand, PrincetonMATH Kemeny JG, Snell JL et al (1960) Finite Markov chains, vol 356. van Nostrand, PrincetonMATH
go back to reference Kruschke JK (2013) Bayesian estimation supersedes the t test. J Exp Psychol Gen 142(2):573CrossRef Kruschke JK (2013) Bayesian estimation supersedes the t test. J Exp Psychol Gen 142(2):573CrossRef
go back to reference Kruschke J (2015) In: Doing Bayesian Data Analysis, 2nd edn. Academic Press, Boston Kruschke J (2015) In: Doing Bayesian Data Analysis, 2nd edn. Academic Press, Boston
go back to reference Laxman S, Tankasali V, White RW (2008) Stream prediction using a generative model based on frequent episodes in event sequences. In: International conference on knowledge discovery and data mining. ACM, pp 453–461 Laxman S, Tankasali V, White RW (2008) Stream prediction using a generative model based on frequent episodes in event sequences. In: International conference on knowledge discovery and data mining. ACM, pp 453–461
go back to reference Lemmerich F, Becker M, Singer P, Helic D, Hotho A, Strohmaier M (2016) Mining subgroups with exceptional transition behavior. In: KDD ’16: proceedings of the 21th ACM SIGKDD international conference on knowledge discovery and data mining. ACM Lemmerich F, Becker M, Singer P, Helic D, Hotho A, Strohmaier M (2016) Mining subgroups with exceptional transition behavior. In: KDD ’16: proceedings of the 21th ACM SIGKDD international conference on knowledge discovery and data mining. ACM
go back to reference Markov AA (2006) An example of statistical investigation of the text Eugene Onegin concerning the connection of samples in chains. Sci Context 19(04):591–600 Originally published in 1913CrossRefMATH Markov AA (2006) An example of statistical investigation of the text Eugene Onegin concerning the connection of samples in chains. Sci Context 19(04):591–600 Originally published in 1913CrossRefMATH
go back to reference Matsubara Y, Sakurai Y, Faloutsos C (2014) Autoplait: automatic mining of co-evolving time sequences. In: Proceedings of the 2014 ACM SIGMOD international conference on Management of data. ACM, pp 193–204 Matsubara Y, Sakurai Y, Faloutsos C (2014) Autoplait: automatic mining of co-evolving time sequences. In: Proceedings of the 2014 ACM SIGMOD international conference on Management of data. ACM, pp 193–204
go back to reference Murphy KP (2002) Dynamic Bayesian networks: representation, inference and learning. PhD thesis, University of California, Berkeley Murphy KP (2002) Dynamic Bayesian networks: representation, inference and learning. PhD thesis, University of California, Berkeley
go back to reference Noulas A, Scellato S, Lambiotte R, Pontil M, Mascolo C (2012) A tale of many cities: universal patterns in human urban mobility. PLoS ONE 7(5):1–10CrossRef Noulas A, Scellato S, Lambiotte R, Pontil M, Mascolo C (2012) A tale of many cities: universal patterns in human urban mobility. PLoS ONE 7(5):1–10CrossRef
go back to reference Noulas A, Scellato S, Lathia N, Mascolo C (2012) Mining user mobility features for next place prediction in location-based services. In: Proceedings of the 2012 IEEE 12th international conference on data mining, ICDM ’12. IEEE Computer Society, Washington, DC, USA, pp 1038–1043 Noulas A, Scellato S, Lathia N, Mascolo C (2012) Mining user mobility features for next place prediction in location-based services. In: Proceedings of the 2012 IEEE 12th international conference on data mining, ICDM ’12. IEEE Computer Society, Washington, DC, USA, pp 1038–1043
go back to reference Page L, Brin S, Motwani R, Winograd T (1999) The pagerank citation ranking: bringing order to the web. Stanford InfoLab Page L, Brin S, Motwani R, Winograd T (1999) The pagerank citation ranking: bringing order to the web. Stanford InfoLab
go back to reference Pirolli PLT, Card SK (1999) Information foraging. Psychol Rev 106(4):643–675CrossRef Pirolli PLT, Card SK (1999) Information foraging. Psychol Rev 106(4):643–675CrossRef
go back to reference Ponte JM, Croft WB (1997) Text segmentation by topic. In: International conference on theory and practice of digital libraries. Springer, pp 113–125 Ponte JM, Croft WB (1997) Text segmentation by topic. In: International conference on theory and practice of digital libraries. Springer, pp 113–125
go back to reference Poulsen CS (1990) Mixed Markov and latent Markov modelling applied to brand choice behaviour. Int J Res Mark 7(1):5–19MathSciNetCrossRef Poulsen CS (1990) Mixed Markov and latent Markov modelling applied to brand choice behaviour. Int J Res Mark 7(1):5–19MathSciNetCrossRef
go back to reference Rabiner LR, Juang B-H (1986) An introduction to hidden Markov models. IEEE ASSP Mag 3(1):4–16CrossRef Rabiner LR, Juang B-H (1986) An introduction to hidden Markov models. IEEE ASSP Mag 3(1):4–16CrossRef
go back to reference Rendle S, Freudenthaler C, Schmidt-Thieme L (2010) Factorizing personalized Markov chains for next-basket recommendation. In: Proceedings of the 19th International Conference on World Wide Web. WWW ’10, Raleigh, North Carolina, USA. ACM, New York, NY, USA, pp 811–820 Rendle S, Freudenthaler C, Schmidt-Thieme L (2010) Factorizing personalized Markov chains for next-basket recommendation. In: Proceedings of the 19th International Conference on World Wide Web. WWW ’10, Raleigh, North Carolina, USA. ACM, New York, NY, USA, pp 811–820
go back to reference Rouder JN, Speckman PL, Sun D, Morey RD, Iverson G (2009) Bayesian t tests for accepting and rejecting the null hypothesis. Psychon Bull Rev 16(2):225–237CrossRef Rouder JN, Speckman PL, Sun D, Morey RD, Iverson G (2009) Bayesian t tests for accepting and rejecting the null hypothesis. Psychon Bull Rev 16(2):225–237CrossRef
go back to reference Singer P, Helic D, Taraghi B, Strohmaier M (2014) Detecting memory and structure in human navigation patterns using Markov chain models of varying order. PLoS ONE 9(7):e102070CrossRef Singer P, Helic D, Taraghi B, Strohmaier M (2014) Detecting memory and structure in human navigation patterns using Markov chain models of varying order. PLoS ONE 9(7):e102070CrossRef
go back to reference Singer P, Helic D, Hotho A, Strohmaier M (2015) Hyptrails: a Bayesian approach for comparing hypotheses about human trails on the web. In: Proceedings of the 24th International Conference on World Wide Web. WWW ’15, Florence, Italy. International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, Switzerland, pp 1003–1013 Singer P, Helic D, Hotho A, Strohmaier M (2015) Hyptrails: a Bayesian approach for comparing hypotheses about human trails on the web. In: Proceedings of the 24th International Conference on World Wide Web. WWW ’15, Florence, Italy. International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, Switzerland, pp 1003–1013
go back to reference Smith LM, Sanders JZ, Kaiser RJ, Hughes P, Dodd C, Connell CR, Heiner C, Kent SB, Hood LE (1985) Fluorescence detection in automated DNA sequence analysis. Nature 321(6071):674–679CrossRef Smith LM, Sanders JZ, Kaiser RJ, Hughes P, Dodd C, Connell CR, Heiner C, Kent SB, Hood LE (1985) Fluorescence detection in automated DNA sequence analysis. Nature 321(6071):674–679CrossRef
go back to reference Strelioff CC, Crutchfield JP, Hübler AW (2007) Inferring Markov chains: Bayesian estimation, model comparison, entropy rate, and out-of-class modeling. Phys Rev E 76(1):011106MathSciNetCrossRef Strelioff CC, Crutchfield JP, Hübler AW (2007) Inferring Markov chains: Bayesian estimation, model comparison, entropy rate, and out-of-class modeling. Phys Rev E 76(1):011106MathSciNetCrossRef
go back to reference Trochim W (2001) Research methods knowledge base, 2nd edn. Atomic Dog Publishing, Cincinnati, OH, USA Trochim W (2001) Research methods knowledge base, 2nd edn. Atomic Dog Publishing, Cincinnati, OH, USA
go back to reference Van Mulbregt P, Carp I, Gillick L, Lowe S, Yamron J (1998) Text segmentation and topic tracking on broadcast news via a hidden Markov model approach. In: ICSLP Van Mulbregt P, Carp I, Gillick L, Lowe S, Yamron J (1998) Text segmentation and topic tracking on broadcast news via a hidden Markov model approach. In: ICSLP
go back to reference Walk S, Singer P, Strohmaier M (2014) Sequential action patterns in collaborative ontology-engineering projects: a case-study in the biomedical domain. In: International conference on conference on information & knowledge management. ACM Walk S, Singer P, Strohmaier M (2014) Sequential action patterns in collaborative ontology-engineering projects: a case-study in the biomedical domain. In: International conference on conference on information & knowledge management. ACM
go back to reference Wallach HM (2006) Topic modeling: beyond bag-of-words. In: Proceedings of the 23rd international conference on machine learning. ACM, pp 977–984 Wallach HM (2006) Topic modeling: beyond bag-of-words. In: Proceedings of the 23rd international conference on machine learning. ACM, pp 977–984
go back to reference West R, Leskovec J (2012) Human wayfinding in information networks. In: Proceedings of the 21st international conference on world wide web. ACM, pp 619–628 West R, Leskovec J (2012) Human wayfinding in information networks. In: Proceedings of the 21st international conference on world wide web. ACM, pp 619–628
go back to reference West R, Pineau J, Precup D (2009) Wikispeedia: an online game for inferring semantic distances between concepts. In: Proceedings of the 21st international joint conference on artificial intelligence, pp 1598–1603 West R, Pineau J, Precup D (2009) Wikispeedia: an online game for inferring semantic distances between concepts. In: Proceedings of the 21st international joint conference on artificial intelligence, pp 1598–1603
go back to reference Wetzels R, Tutschkow D, Dolan C, van der Sluis S, Dutilh G, Wagenmakers E-J (2016) A bayesian test for the hot hand phenomenon. J Math Psychol 72:200–209MathSciNetCrossRefMATH Wetzels R, Tutschkow D, Dolan C, van der Sluis S, Dutilh G, Wagenmakers E-J (2016) A bayesian test for the hot hand phenomenon. J Math Psychol 72:200–209MathSciNetCrossRefMATH
go back to reference White RW, Huang J (2010) Assessing the scenic route: measuring the value of search trails in web logs. In Conference on research and development in information retrieval. ACM, pp 587–594 White RW, Huang J (2010) Assessing the scenic route: measuring the value of search trails in web logs. In Conference on research and development in information retrieval. ACM, pp 587–594
go back to reference Yang J, McAuley J, Leskovec J, LePendu P, Shah N (2014) Finding progression stages in time-evolving event sequences. In: Proceedings of the 23rd international conference on World wide web. ACM, pp 783–794 Yang J, McAuley J, Leskovec J, LePendu P, Shah N (2014) Finding progression stages in time-evolving event sequences. In: Proceedings of the 23rd international conference on World wide web. ACM, pp 783–794
Metadata
Title
MixedTrails: Bayesian hypothesis comparison on heterogeneous sequential data
Authors
Martin Becker
Florian Lemmerich
Philipp Singer
Markus Strohmaier
Andreas Hotho
Publication date
07-07-2017
Publisher
Springer US
Published in
Data Mining and Knowledge Discovery / Issue 5/2017
Print ISSN: 1384-5810
Electronic ISSN: 1573-756X
DOI
https://doi.org/10.1007/s10618-017-0518-x

Other articles of this Issue 5/2017

Data Mining and Knowledge Discovery 5/2017 Go to the issue

Premium Partner