Skip to main content

2017 | OriginalPaper | Buchkapitel

Compound Trace Clustering to Generate Accurate and Simple Sub-Process Models

verfasst von : Yaguang Sun, Bernhard Bauer, Matthias Weidlich

Erschienen in: Service-Oriented Computing

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Business process model discovery targets the construction of conceptual models from event data that has been recorded during the execution of a business process. While a plethora of discovery techniques have been proposed in the literature, most existing techniques fail to cope with complex control-flow patterns as they are observed in event logs of highly flexible processes. In this paper, we follow the idea of splitting-up an event log into sub-logs, before applying process model discovery. This yields a set of sub-process models, one per sub-log, each describing a major variant of the business process. Unlike existing techniques, our clustering approach is guided by the result of model discovery: It first optimises the average complexity of the resulting models, before improving the accuracy of each model in isolation. Our experimental evaluation highlights that our approach yields more accurate sub-process models (that are of comparatively low complexity) than state-of-the-art trace clustering techniques.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat van der Aalst, W.M.P.: Process Mining: Data Science in Action. Springer, Berlin (2016)CrossRef van der Aalst, W.M.P.: Process Mining: Data Science in Action. Springer, Berlin (2016)CrossRef
2.
Zurück zum Zitat Song, M., Günther, C.W., van der Aalst, W.M.P.: Trace clustering in process mining. In: Ardagna, D., Mecella, M., Yang, J. (eds.) BPM 2008. LNBIP, vol. 17, pp. 109–120. Springer, Heidelberg (2009). doi:10.1007/978-3-642-00328-8_11 CrossRef Song, M., Günther, C.W., van der Aalst, W.M.P.: Trace clustering in process mining. In: Ardagna, D., Mecella, M., Yang, J. (eds.) BPM 2008. LNBIP, vol. 17, pp. 109–120. Springer, Heidelberg (2009). doi:10.​1007/​978-3-642-00328-8_​11 CrossRef
3.
Zurück zum Zitat Bose, R.P.J.C., van der Aalst, W.M.P.: Trace clustering based on conserved patterns: towards achieving better process models. In: Rinderle-Ma, S., Sadiq, S., Leymann, F. (eds.) BPM 2009. LNBIP, vol. 43, pp. 170–181. Springer, Heidelberg (2010). doi:10.1007/978-3-642-12186-9_16 CrossRef Bose, R.P.J.C., van der Aalst, W.M.P.: Trace clustering based on conserved patterns: towards achieving better process models. In: Rinderle-Ma, S., Sadiq, S., Leymann, F. (eds.) BPM 2009. LNBIP, vol. 43, pp. 170–181. Springer, Heidelberg (2010). doi:10.​1007/​978-3-642-12186-9_​16 CrossRef
4.
Zurück zum Zitat Bose, R., van der Aalst, W.M.P.: Context aware trace clustering: towards improving process mining results. In: SIAM International Conference on Data Mining, pp. 401–402 (2009) Bose, R., van der Aalst, W.M.P.: Context aware trace clustering: towards improving process mining results. In: SIAM International Conference on Data Mining, pp. 401–402 (2009)
5.
Zurück zum Zitat Ferreira, D., Zacarias, M., Malheiros, M., Ferreira, P.: Approaching process mining with sequence clustering: experiments and findings. In: Alonso, G., Dadam, P., Rosemann, M. (eds.) BPM 2007. LNCS, vol. 4714, pp. 360–374. Springer, Heidelberg (2007). doi:10.1007/978-3-540-75183-0_26 CrossRef Ferreira, D., Zacarias, M., Malheiros, M., Ferreira, P.: Approaching process mining with sequence clustering: experiments and findings. In: Alonso, G., Dadam, P., Rosemann, M. (eds.) BPM 2007. LNCS, vol. 4714, pp. 360–374. Springer, Heidelberg (2007). doi:10.​1007/​978-3-540-75183-0_​26 CrossRef
6.
Zurück zum Zitat Weerdt, J.D., vanden Broucke, S., Vanthienen, J., Baesens, B.: Active trace clustering for improved process discovery. IEEE Trans. Knowl. Data Eng. 25(12), 2708–2720 (2013) Weerdt, J.D., vanden Broucke, S., Vanthienen, J., Baesens, B.: Active trace clustering for improved process discovery. IEEE Trans. Knowl. Data Eng. 25(12), 2708–2720 (2013)
7.
Zurück zum Zitat Leemans, S.J.J., Fahland, D., van der Aalst, W.M.P.: Discovering block-structured process models from event logs - a constructive approach. In: Colom, J.-M., Desel, J. (eds.) PETRI NETS 2013. LNCS, vol. 7927, pp. 311–329. Springer, Heidelberg (2013). doi:10.1007/978-3-642-38697-8_17 CrossRef Leemans, S.J.J., Fahland, D., van der Aalst, W.M.P.: Discovering block-structured process models from event logs - a constructive approach. In: Colom, J.-M., Desel, J. (eds.) PETRI NETS 2013. LNCS, vol. 7927, pp. 311–329. Springer, Heidelberg (2013). doi:10.​1007/​978-3-642-38697-8_​17 CrossRef
8.
Zurück zum Zitat Ekanayake, C.C., Dumas, M., García-Bañuelos, L., La Rosa, M.: Slice, mine and dice: complexity-aware automated discovery of business process models. In: Daniel, F., Wang, J., Weber, B. (eds.) BPM 2013. LNCS, vol. 8094, pp. 49–64. Springer, Heidelberg (2013). doi:10.1007/978-3-642-40176-3_6 CrossRef Ekanayake, C.C., Dumas, M., García-Bañuelos, L., La Rosa, M.: Slice, mine and dice: complexity-aware automated discovery of business process models. In: Daniel, F., Wang, J., Weber, B. (eds.) BPM 2013. LNCS, vol. 8094, pp. 49–64. Springer, Heidelberg (2013). doi:10.​1007/​978-3-642-40176-3_​6 CrossRef
9.
Zurück zum Zitat Garcia, L., Dumas, M., Rosa, M.L., Weerdt, J.D., Ekanayake, C.C.: Controlled automated discovery of collections of business process models. Inf. Syst. 46, 85–101 (2014)CrossRef Garcia, L., Dumas, M., Rosa, M.L., Weerdt, J.D., Ekanayake, C.C.: Controlled automated discovery of collections of business process models. Inf. Syst. 46, 85–101 (2014)CrossRef
10.
Zurück zum Zitat Greco, G., Guzzo, A., Pontieri, L.: Discovering expressive process models by clustering log traces. IEEE Trans. Knowl. Data Eng. 18(8), 1010–1027 (2006)CrossRef Greco, G., Guzzo, A., Pontieri, L.: Discovering expressive process models by clustering log traces. IEEE Trans. Knowl. Data Eng. 18(8), 1010–1027 (2006)CrossRef
11.
Zurück zum Zitat Weijters, A.J.M.M., Ribeiro, J.T.S.: Flexible Heuristics Miner (FHM). BETA Working Paper Series, WP 334. Eindhoven University of Technology, Eindhoven (2010) Weijters, A.J.M.M., Ribeiro, J.T.S.: Flexible Heuristics Miner (FHM). BETA Working Paper Series, WP 334. Eindhoven University of Technology, Eindhoven (2010)
12.
13.
Zurück zum Zitat de Medeiros, A.A.: Genetic process mining. Ph.D. thesis, Eindhoven University of Technology (2006) de Medeiros, A.A.: Genetic process mining. Ph.D. thesis, Eindhoven University of Technology (2006)
14.
Zurück zum Zitat Mendling, J., Strembeck, M.: Influence factors of understanding business process models. In: Abramowicz, W., Fensel, D. (eds.) BIS 2008. LNBIP, vol. 7, pp. 142–153. Springer, Heidelberg (2008). doi:10.1007/978-3-540-79396-0_13 CrossRef Mendling, J., Strembeck, M.: Influence factors of understanding business process models. In: Abramowicz, W., Fensel, D. (eds.) BIS 2008. LNBIP, vol. 7, pp. 142–153. Springer, Heidelberg (2008). doi:10.​1007/​978-3-540-79396-0_​13 CrossRef
15.
Zurück zum Zitat Han, J., Kamber, M.: Data Mining: Concepts and Techniques. Morgan Kaufmann, San Francisco (2000)MATH Han, J., Kamber, M.: Data Mining: Concepts and Techniques. Morgan Kaufmann, San Francisco (2000)MATH
16.
Zurück zum Zitat Shengnan, C., Han, J., David, P.: Parallel mining of closed sequential patterns. In: Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery in Data Mining, KDD 2005, pp. 562–567. ACM, New York (2005) Shengnan, C., Han, J., David, P.: Parallel mining of closed sequential patterns. In: Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery in Data Mining, KDD 2005, pp. 562–567. ACM, New York (2005)
17.
Zurück zum Zitat Sun, Y., Bauer, B.: A novel heuristic method for improving the fitness of mined business process models. In: Sheng, Q.Z., Stroulia, E., Tata, S., Bhiri, S. (eds.) ICSOC 2016. LNCS, vol. 9936, pp. 537–546. Springer, Cham (2016). doi:10.1007/978-3-319-46295-0_33 CrossRef Sun, Y., Bauer, B.: A novel heuristic method for improving the fitness of mined business process models. In: Sheng, Q.Z., Stroulia, E., Tata, S., Bhiri, S. (eds.) ICSOC 2016. LNCS, vol. 9936, pp. 537–546. Springer, Cham (2016). doi:10.​1007/​978-3-319-46295-0_​33 CrossRef
18.
Zurück zum Zitat Conforti, R., Dumas, M., García-Bañuelos, L., La Rosa, M.: Beyond tasks and gateways: discovering BPMN models with subprocesses, boundary events and activity markers. In: Sadiq, S., Soffer, P., Völzer, H. (eds.) BPM 2014. LNCS, vol. 8659, pp. 101–117. Springer, Cham (2014). doi:10.1007/978-3-319-10172-9_7 Conforti, R., Dumas, M., García-Bañuelos, L., La Rosa, M.: Beyond tasks and gateways: discovering BPMN models with subprocesses, boundary events and activity markers. In: Sadiq, S., Soffer, P., Völzer, H. (eds.) BPM 2014. LNCS, vol. 8659, pp. 101–117. Springer, Cham (2014). doi:10.​1007/​978-3-319-10172-9_​7
19.
Zurück zum Zitat Lassen, K.B., van der Aalst, W.M.P.: Complexity metrics for workflow nets. Inf. Softw. Technol. 51(3), 610–626 (2009)CrossRef Lassen, K.B., van der Aalst, W.M.P.: Complexity metrics for workflow nets. Inf. Softw. Technol. 51(3), 610–626 (2009)CrossRef
Metadaten
Titel
Compound Trace Clustering to Generate Accurate and Simple Sub-Process Models
verfasst von
Yaguang Sun
Bernhard Bauer
Matthias Weidlich
Copyright-Jahr
2017
DOI
https://doi.org/10.1007/978-3-319-69035-3_12