Published in: Soft Computing 10/2016

7 June 2015 | Methodologies and Application

Mining unexpected patterns using decision trees and interestingness measures: a case study of endometriosis

Authors: Ming-Yang Chang, Rui-Dong Chiang, Shih-Jung Wu, Chien-Hui Chan

Abstract

Because clinical research is carried out in complex environments, prior domain knowledge, constraints, and expert knowledge can enhance the capabilities and performance of data mining. In this paper we propose an unexpected pattern mining model that uses decision trees to compare recovery rates of two different treatments, and to find patterns that contrast with the prior knowledge of domain users. In the proposed model we define interestingness measures to determine whether the patterns found are interesting to the domain. By applying the concept of domain-driven data mining, we repeatedly utilize decision trees and interestingness measures in a closed-loop, in-depth mining process to find unexpected and interesting patterns. We use retrospective data from transvaginal ultrasound-guided aspirations to show that the proposed model can successfully compare different treatments using a decision tree, which is a new usage of that tool. We believe that unexpected, interesting patterns may provide clinical researchers with different perspectives for future research.
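
The comparison step described in the abstract can be illustrated with a minimal sketch. It is not the authors' model: the abstract does not state the interestingness measures, so the sketch assumes a simple stand-in (the recovery-rate gap between two treatments within one subgroup). It also assumes that patients have already been assigned to decision-tree leaves, and that the domain expert's prior belief is "treatment A recovers at least as well as treatment B". The records, the leaf names, and the functions `recovery_rates` and `unexpected_leaves` are hypothetical and invented for illustration.

```python
from collections import defaultdict

# Toy records, invented for illustration only: (leaf_id, treatment, recovered).
# leaf_id stands in for the leaf a fitted decision tree would assign a patient to.
records = [
    ("leaf_1", "A", 1), ("leaf_1", "A", 1), ("leaf_1", "A", 0),
    ("leaf_1", "B", 1), ("leaf_1", "B", 0), ("leaf_1", "B", 0),
    ("leaf_2", "A", 0), ("leaf_2", "A", 0), ("leaf_2", "A", 1),
    ("leaf_2", "B", 1), ("leaf_2", "B", 1), ("leaf_2", "B", 1),
]


def recovery_rates(records):
    """Return {(leaf, treatment): recovery rate} for every observed pair."""
    counts = defaultdict(lambda: [0, 0])  # [recovered, total]
    for leaf, treatment, recovered in records:
        counts[(leaf, treatment)][0] += recovered
        counts[(leaf, treatment)][1] += 1
    return {key: rec / total for key, (rec, total) in counts.items()}


def unexpected_leaves(records, better="A", worse="B", min_gap=0.2):
    """Flag leaves whose rates contradict the prior belief 'better >= worse'.

    The gap (rate of `worse` minus rate of `better`) is an assumed,
    simplified interestingness measure, not the one defined in the paper.
    """
    rates = recovery_rates(records)
    leaves = sorted({leaf for leaf, _ in rates})
    flagged = []
    for leaf in leaves:
        # Skip leaves where one of the two treatments was never observed.
        if (leaf, better) not in rates or (leaf, worse) not in rates:
            continue
        gap = rates[(leaf, worse)] - rates[(leaf, better)]
        if gap >= min_gap:
            flagged.append((leaf, round(gap, 3)))
    return flagged


print(unexpected_leaves(records))
```

In the closed-loop process the abstract describes, a flagged leaf would presumably be handed back to the domain expert and mined further. That step is left out here because the abstract gives no detail on it.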

Metadata
Title
Mining unexpected patterns using decision trees and interestingness measures: a case study of endometriosis
Authors
Ming-Yang Chang
Rui-Dong Chiang
Shih-Jung Wu
Chien-Hui Chan
Publication date
7 June 2015
Publisher
Springer Berlin Heidelberg
Published in
Soft Computing / Issue 10/2016
Print ISSN: 1432-7643
Electronic ISSN: 1433-7479
DOI
https://doi.org/10.1007/s00500-015-1735-0
