2018 | OriginalPaper | Book chapter

Event Abstraction for Process Mining Using Supervised Learning Techniques

Authors: Niek Tax, Natalia Sidorova, Reinder Haakma, Wil M. P. van der Aalst

Published in: Proceedings of SAI Intelligent Systems Conference (IntelliSys) 2016

Publisher: Springer International Publishing

Abstract

Process mining techniques focus on extracting insight into processes from event logs. In many cases, the events recorded in the event log are too fine-grained, causing process discovery algorithms to discover incomprehensible process models or process models that are not representative of the event log. We show that when process discovery algorithms can only discover an unrepresentative process model from a low-level event log, structure in the process can in some cases still be discovered by first abstracting the event log to a higher level of granularity. This raises the challenge of bridging the gap between an original low-level event log and a desired high-level perspective on this log, such that a more structured or more comprehensible process model can be discovered. We show that supervised learning can be leveraged for the event abstraction task when annotations with high-level interpretations of the low-level events are available for a subset of the sequences (i.e., traces). We present a method to generate feature vector representations of events based on XES extensions, and describe an approach to abstract events in an event log with Conditional Random Fields using these event features. Furthermore, we propose a sequence-focused metric to evaluate supervised event abstraction results that fits closely to the tasks of process discovery and conformance checking. We conclude by demonstrating the usefulness of supervised event abstraction for obtaining more structured and/or more comprehensible process models, using both real-life and synthetic event data.
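
To make the feature-generation idea in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It assumes each low-level event is a dictionary keyed by standard XES attribute names (concept:name, time:timestamp, org:resource, lifecycle:transition). The function names event_features, trace_features and collapse_labels, the particular features chosen, and the rule of merging consecutive identical labels are all illustrative assumptions. The Conditional Random Field that would be trained on these features is deliberately left out.

```python
from datetime import datetime


def event_features(event, prev_event=None):
    """Map one low-level event (dict of XES attributes) to a feature dict.

    The choice of features is an assumption made for illustration only.
    """
    ts = datetime.fromisoformat(event["time:timestamp"])
    features = {
        # Concept extension: name of the low-level event.
        "concept:name": event["concept:name"],
        # Organizational extension: who or what produced the event.
        "org:resource": event.get("org:resource", "unknown"),
        # Lifecycle extension: e.g. "start" or "complete".
        "lifecycle:transition": event.get("lifecycle:transition", "complete"),
        # Time extension: simple calendar features derived from the timestamp.
        "hour_of_day": ts.hour,
        "weekday": ts.weekday(),  # Monday == 0
    }
    if prev_event is not None:
        prev_ts = datetime.fromisoformat(prev_event["time:timestamp"])
        features["seconds_since_prev"] = (ts - prev_ts).total_seconds()
    return features


def trace_features(trace):
    """Turn a trace (list of events) into a list of feature dicts."""
    return [
        event_features(event, trace[i - 1] if i > 0 else None)
        for i, event in enumerate(trace)
    ]


def collapse_labels(labels):
    """Merge runs of identical predicted high-level labels into one event each.

    This is one simple (assumed) way to obtain a high-level trace from
    per-event predictions of a sequence labeller.
    """
    high_level = []
    for label in labels:
        if not high_level or high_level[-1] != label:
            high_level.append(label)
    return high_level
```

A sequence labeller such as a Conditional Random Field would take the output of trace_features for the annotated traces as training input, predict one high-level label per low-level event on the remaining traces, and collapse_labels would then turn those per-event predictions into a high-level trace for process discovery.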

Metadata
Title
Event Abstraction for Process Mining Using Supervised Learning Techniques
Authors
Niek Tax
Natalia Sidorova
Reinder Haakma
Wil M. P. van der Aalst
Copyright year
2018
DOI
https://doi.org/10.1007/978-3-319-56994-9_18