Skip to main content
Erschienen in: Journal of Visualization 1/2019

24.09.2018 | Regular Paper

GBRTVis: online analysis of gradient boosting regression tree

verfasst von: Yifei Huang, Yuhua Liu, Chenhui Li, Changbo Wang

Erschienen in: Journal of Visualization | Ausgabe 1/2019

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Visualizations of machine learning models have developed rapidly during these days, attracting great interests of industry and researchers. However, a pipeline that visualizations are created from logged data is a time-consuming process. In this work, we adopt progressive visual analytics to propose a new pipeline to facilitate the visual analysis progress of gradient boosting regression tree (GBRT). Visualizations such as tree view, instances view, and cluster view are created according to different types of data in real time. Users can explore GBRT with different visualization components interactively through GBRTVis. Case studies demonstrate that our pipeline can improve the efficiency of the training process and understanding. Furthermore, we propose a mixed structure of GBRT to improve itself. Two tests on different datasets show the effectiveness of the improvement.

Graphical Abstract

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Arthur D, Vassilvitskii S (2007) k-means++: the advantages of careful seeding. In: Proceedings of the eighteenth annual ACM-SIAM symposium on discrete algorithms, society for industrial and applied mathematics, pp 1027–1035 Arthur D, Vassilvitskii S (2007) k-means++: the advantages of careful seeding. In: Proceedings of the eighteenth annual ACM-SIAM symposium on discrete algorithms, society for industrial and applied mathematics, pp 1027–1035
Zurück zum Zitat Barlow T, Neville P (2001) Case study: visualization for decision tree analysis in data mining. In: IEEE symposium on information visualization, 2001. INFOVIS 2001, IEEE, pp 149–152 Barlow T, Neville P (2001) Case study: visualization for decision tree analysis in data mining. In: IEEE symposium on information visualization, 2001. INFOVIS 2001, IEEE, pp 149–152
Zurück zum Zitat Bergstra J, Bengio Y (2012) Random search for hyper-parameter optimization. J Mach Learn Res 13(Feb):281–305MathSciNetMATH Bergstra J, Bengio Y (2012) Random search for hyper-parameter optimization. J Mach Learn Res 13(Feb):281–305MathSciNetMATH
Zurück zum Zitat Bostock M, Ogievetsky V, Heer J (2011) \(\text{ D }^3\) data-driven documents. IEEE Trans Vis Comput Graph 17(12):2301–2309CrossRef Bostock M, Ogievetsky V, Heer J (2011) \(\text{ D }^3\) data-driven documents. IEEE Trans Vis Comput Graph 17(12):2301–2309CrossRef
Zurück zum Zitat Do TN (2007) Towards simple, easy to understand, an interactive decision tree algorithm. College Information Technology Can tho University, Can Tho, Vietnam, technology report, pp 06–01 Do TN (2007) Towards simple, easy to understand, an interactive decision tree algorithm. College Information Technology Can tho University, Can Tho, Vietnam, technology report, pp 06–01
Zurück zum Zitat Do TN, Poulet F (2004) Enhancing SVM with visualization. In: International conference on discovery science. Springer, New York, pp 183–194 Do TN, Poulet F (2004) Enhancing SVM with visualization. In: International conference on discovery science. Springer, New York, pp 183–194
Zurück zum Zitat El-Assady M, Sevastjanova R, Sperrle F, Keim D, Collins C (2017) Progressive learning of topic modeling parameters: a visual analytics framework. IEEE Trans Vis Comput Graph 24:382–391CrossRef El-Assady M, Sevastjanova R, Sperrle F, Keim D, Collins C (2017) Progressive learning of topic modeling parameters: a visual analytics framework. IEEE Trans Vis Comput Graph 24:382–391CrossRef
Zurück zum Zitat Guerra-Gómez J, Pack ML, Plaisant C, Shneiderman B (2013) Visualizing change over time using dynamic hierarchies: Treeversity2 and the stemview. IEEE Trans Vis Comput Graph 19(12):2566–2575CrossRef Guerra-Gómez J, Pack ML, Plaisant C, Shneiderman B (2013) Visualizing change over time using dynamic hierarchies: Treeversity2 and the stemview. IEEE Trans Vis Comput Graph 19(12):2566–2575CrossRef
Zurück zum Zitat Holten D, Van Wijk JJ (2008) Visual comparison of hierarchically organized data. Comput Graph Forum 27(3):759–766CrossRef Holten D, Van Wijk JJ (2008) Visual comparison of hierarchically organized data. Comput Graph Forum 27(3):759–766CrossRef
Zurück zum Zitat Holten D, Van Wijk JJ (2009) Force-directed edge bundling for graph visualization. Comput Graph Forum 28(3):983–990CrossRef Holten D, Van Wijk JJ (2009) Force-directed edge bundling for graph visualization. Comput Graph Forum 28(3):983–990CrossRef
Zurück zum Zitat Hyndman RJ, Koehler AB (2006) Another look at measures of forecast accuracy. Int J Forecast 22(4):679–688CrossRef Hyndman RJ, Koehler AB (2006) Another look at measures of forecast accuracy. Int J Forecast 22(4):679–688CrossRef
Zurück zum Zitat Jain N, Mangal P, Mehta D (2015) Angularjs: a modern mvc framework in javascript. J Glob Res Comput Sci 5(12):17–23 Jain N, Mangal P, Mehta D (2015) Angularjs: a modern mvc framework in javascript. J Glob Res Comput Sci 5(12):17–23
Zurück zum Zitat Jia Y, Shelhamer E, Donahue J, Karayev S, Long J, Girshick R, Guadarrama S, Darrell T (2014) Caffe: Convolutional architecture for fast feature embedding. In: Proceedings of the 22nd ACM international conference on multimedia, ACM, pp 675–678 Jia Y, Shelhamer E, Donahue J, Karayev S, Long J, Girshick R, Guadarrama S, Darrell T (2014) Caffe: Convolutional architecture for fast feature embedding. In: Proceedings of the 22nd ACM international conference on multimedia, ACM, pp 675–678
Zurück zum Zitat Kahng M, Andrews PY, Kalro A, Chau DHP (2018) Activis: visual exploration of industry-scale deep neural network models. IEEE Trans Vis Comput Graph 24(1):88–97CrossRef Kahng M, Andrews PY, Kalro A, Chau DHP (2018) Activis: visual exploration of industry-scale deep neural network models. IEEE Trans Vis Comput Graph 24(1):88–97CrossRef
Zurück zum Zitat Korel B (1990) Automated software test data generation. IEEE Trans Softw Eng 16(8):870–879CrossRef Korel B (1990) Automated software test data generation. IEEE Trans Softw Eng 16(8):870–879CrossRef
Zurück zum Zitat Kwon BC, Eysenbach B, Verma J, Ng K, De Filippi C, Stewart WF, Perer A (2018) Clustervision: visual supervision of unsupervised clustering. IEEE Trans Vis Comput Graph 24(1):142–151CrossRef Kwon BC, Eysenbach B, Verma J, Ng K, De Filippi C, Stewart WF, Perer A (2018) Clustervision: visual supervision of unsupervised clustering. IEEE Trans Vis Comput Graph 24(1):142–151CrossRef
Zurück zum Zitat Lamping J, Rao R, Pirolli P (1995) A focus+ context technique based on hyperbolic geometry for visualizing large hierarchies. In: Proceedings of the SIGCHI conference on Human factors in computing systems. ACM Press/Addison-Wesley Publishing Co., pp 401–408 Lamping J, Rao R, Pirolli P (1995) A focus+ context technique based on hyperbolic geometry for visualizing large hierarchies. In: Proceedings of the SIGCHI conference on Human factors in computing systems. ACM Press/Addison-Wesley Publishing Co., pp 401–408
Zurück zum Zitat Liu M, Shi J, Li Z, Li C, Zhu J, Liu S (2017) Towards better analysis of deep convolutional neural networks. IEEE Trans Vis Comput Graph 23(1):91–100CrossRef Liu M, Shi J, Li Z, Li C, Zhu J, Liu S (2017) Towards better analysis of deep convolutional neural networks. IEEE Trans Vis Comput Graph 23(1):91–100CrossRef
Zurück zum Zitat Liu S, Wang X, Liu M, Zhu J (2017) Towards better analysis of machine learning models: a visual analytics perspective. Vis Inform 1(1):48–56CrossRef Liu S, Wang X, Liu M, Zhu J (2017) Towards better analysis of machine learning models: a visual analytics perspective. Vis Inform 1(1):48–56CrossRef
Zurück zum Zitat Liu S, Xiao J, Liu J, Wang X, Wu J, Zhu J (2017) Visual diagnosis of tree boosting methods. IEEE Trans Vis Comput Graph 24:163–173CrossRef Liu S, Xiao J, Liu J, Wang X, Wu J, Zhu J (2017) Visual diagnosis of tree boosting methods. IEEE Trans Vis Comput Graph 24:163–173CrossRef
Zurück zum Zitat Liu Y, Salvendy G (2007) Interactive visual decision tree classification. In: International conference on human-computer interaction, Springer, pp 92–105 Liu Y, Salvendy G (2007) Interactive visual decision tree classification. In: International conference on human-computer interaction, Springer, pp 92–105
Zurück zum Zitat Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3431–3440 Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3431–3440
Zurück zum Zitat Munzner T, Guimbretière F, Tasiran S, Zhang L, Zhou Y (2003) Treejuxtaposer: scalable tree comparison using focus+ context with guaranteed visibility. ACM Trans Graph 22(3):453–462CrossRef Munzner T, Guimbretière F, Tasiran S, Zhang L, Zhou Y (2003) Treejuxtaposer: scalable tree comparison using focus+ context with guaranteed visibility. ACM Trans Graph 22(3):453–462CrossRef
Zurück zum Zitat Musser DR (1997) Introspective sorting and selection algorithms. Softw Pract Exper 27(8):983–993CrossRef Musser DR (1997) Introspective sorting and selection algorithms. Softw Pract Exper 27(8):983–993CrossRef
Zurück zum Zitat Nguyen TD, Ho TB, Shimodaira H (2000) Interactive visualization in mining large decision trees. In: Pacific-Asia conference on knowledge discovery and data mining. Springer, pp 345–348 Nguyen TD, Ho TB, Shimodaira H (2000) Interactive visualization in mining large decision trees. In: Pacific-Asia conference on knowledge discovery and data mining. Springer, pp 345–348
Zurück zum Zitat Paiva JGS, Schwartz WR, Pedrini H, Minghim R (2015) An approach to supporting incremental visual data classification. IEEE Trans Vis Comput Graph 21(1):4–17CrossRef Paiva JGS, Schwartz WR, Pedrini H, Minghim R (2015) An approach to supporting incremental visual data classification. IEEE Trans Vis Comput Graph 21(1):4–17CrossRef
Zurück zum Zitat Palmas G, Bachynskyi M, Oulasvirta A, Seidel HP, Weinkauf T (2014) An edge-bundling layout for interactive parallel coordinates. In: Visualization symposium (PacificVis), 2014 IEEE Pacific, IEEE, pp 57–64 Palmas G, Bachynskyi M, Oulasvirta A, Seidel HP, Weinkauf T (2014) An edge-bundling layout for interactive parallel coordinates. In: Visualization symposium (PacificVis), 2014 IEEE Pacific, IEEE, pp 57–64
Zurück zum Zitat Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V (2011) Scikit-learn: machine learning in python. J Mach Learn Res 12(Oct):2825–2830MathSciNetMATH Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V (2011) Scikit-learn: machine learning in python. J Mach Learn Res 12(Oct):2825–2830MathSciNetMATH
Zurück zum Zitat Pezzotti N, Höllt T, Van Gemert J, Lelieveldt BP, Eisemann E, Vilanova A (2018) Deepeyes: progressive visual analytics for designing deep neural networks. IEEE Trans Vis Comput Graph 24(1):98–108CrossRef Pezzotti N, Höllt T, Van Gemert J, Lelieveldt BP, Eisemann E, Vilanova A (2018) Deepeyes: progressive visual analytics for designing deep neural networks. IEEE Trans Vis Comput Graph 24(1):98–108CrossRef
Zurück zum Zitat Pham NK, Do TN, Poulet F, Morin A (2007) Interactive exploration of decision tree results. In: International symposium on applied stochastic models and data analysis, ASMDA’07, pp 152–160 Pham NK, Do TN, Poulet F, Morin A (2007) Interactive exploration of decision tree results. In: International symposium on applied stochastic models and data analysis, ASMDA’07, pp 152–160
Zurück zum Zitat Quinlan JR (1986) Induction of decision trees. Mach Learn 1(1):81–106 Quinlan JR (1986) Induction of decision trees. Mach Learn 1(1):81–106
Zurück zum Zitat Ren D, Amershi S, Lee B, Suh J, Williams JD (2017) Squares: supporting interactive performance analysis for multiclass classifiers. IEEE Trans Vis Comput Graph 23(1):61–70CrossRef Ren D, Amershi S, Lee B, Suh J, Williams JD (2017) Squares: supporting interactive performance analysis for multiclass classifiers. IEEE Trans Vis Comput Graph 23(1):61–70CrossRef
Zurück zum Zitat Sedlmair M, Heinzl C, Bruckner S, Piringer H, Möller T (2014) Visual parameter space analysis: a conceptual framework. IEEE Trans Vis Comput Graph 20(12):2161–2170CrossRef Sedlmair M, Heinzl C, Bruckner S, Piringer H, Möller T (2014) Visual parameter space analysis: a conceptual framework. IEEE Trans Vis Comput Graph 20(12):2161–2170CrossRef
Zurück zum Zitat Shneiderman B, Plaisant C (1998) Treemaps for space-constrained visualization of hierarchies Shneiderman B, Plaisant C (1998) Treemaps for space-constrained visualization of hierarchies
Zurück zum Zitat Stolper CD, Perer A, Gotz D (2014) Progressive visual analytics: user-driven visual exploration of in-progress analytics. IEEE Trans Vis Comput Graph 20(12):1653–1662CrossRef Stolper CD, Perer A, Gotz D (2014) Progressive visual analytics: user-driven visual exploration of in-progress analytics. IEEE Trans Vis Comput Graph 20(12):1653–1662CrossRef
Zurück zum Zitat Tikir MM, Hollingsworth JK (2002) Efficient instrumentation for code coverage testing. In: ACM SIGSOFT software engineering notes, vol. 27, ACM, pp 86–96 Tikir MM, Hollingsworth JK (2002) Efficient instrumentation for code coverage testing. In: ACM SIGSOFT software engineering notes, vol. 27, ACM, pp 86–96
Zurück zum Zitat Tu Y, Shen HW (2007) Visualizing changes of hierarchical data using treemaps. IEEE Trans Vis Comput Graph 13(6):1286–1293CrossRef Tu Y, Shen HW (2007) Visualizing changes of hierarchical data using treemaps. IEEE Trans Vis Comput Graph 13(6):1286–1293CrossRef
Zurück zum Zitat Van Den Elzen S, van Wijk JJ (2011) Baobabview: interactive construction and analysis of decision trees. In: IEEE conference on visual analytics science and technology (VAST), pp 151–160 Van Den Elzen S, van Wijk JJ (2011) Baobabview: interactive construction and analysis of decision trees. In: IEEE conference on visual analytics science and technology (VAST), pp 151–160
Zurück zum Zitat Ware M, Frank E, Holmes G, Hall M, Witten IH (2001) Interactive machine learning: letting users build classifiers. Int J Human-Comput Stud 55(3):281–292MATHCrossRef Ware M, Frank E, Holmes G, Hall M, Witten IH (2001) Interactive machine learning: letting users build classifiers. Int J Human-Comput Stud 55(3):281–292MATHCrossRef
Zurück zum Zitat Zou H, Hastie T (2005) Regularization and variable selection via the elastic net. J R Stat Soc Ser B Stat Methodol 67(2):301–320MathSciNetMATHCrossRef Zou H, Hastie T (2005) Regularization and variable selection via the elastic net. J R Stat Soc Ser B Stat Methodol 67(2):301–320MathSciNetMATHCrossRef
Metadaten
Titel
GBRTVis: online analysis of gradient boosting regression tree
verfasst von
Yifei Huang
Yuhua Liu
Chenhui Li
Changbo Wang
Publikationsdatum
24.09.2018
Verlag
Springer Berlin Heidelberg
Erschienen in
Journal of Visualization / Ausgabe 1/2019
Print ISSN: 1343-8875
Elektronische ISSN: 1875-8975
DOI
https://doi.org/10.1007/s12650-018-0514-2

Weitere Artikel der Ausgabe 1/2019

Journal of Visualization 1/2019 Zur Ausgabe