Skip to main content
Erschienen in: Journal of Visualization 2/2022

27.09.2021 | Regular Paper

ACMViz: a visual analytics approach to understand DRL-based autonomous control model

verfasst von: Shiyu Cheng, Xiaochen Li, Guihua Shan, Beifang Niu, Yang Wang, MaoKang Luo

Erschienen in: Journal of Visualization | Ausgabe 2/2022

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Deep reinforcement learning (DRL) has been widely used in autonomous control due to its superior performance. DRL-based autonomous control model (ACM) aims to train an agent to achieve self-control and learn optimal policy through pre-defined rewards. Despite the super-human performance, ACM is regarded as a black box, and the interpretation of its internal working mechanism remains a challenge to domain experts. In addition, adjusting the reward settings of ACM is also challenging due to the uncertain relationship between rewards setting and strategies. In this paper, we propose ACMViz, a visual analytics system to explore control strategies at different stages and reveal the relationship between rewards and action patterns. Focusing on controlling a lunar lander, ACMViz investigates different landing trajectories and action sequences to interpret the model and control the training. From our visual analytics of the action patterns, we diagnose and improve reward settings for different control targets. Through our case studies with deep learning experts, we validate the effectiveness of ACMViz.

Graphical abstract

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Alemzadeh S, Niemann U, Ittermann T, Völzke H, Schneider D, Spiliopoulou M, Bühler K, Preim B (2020) Visual analysis of missing values in longitudinal cohort study data. In: Computer graphics forum, vol 39. Wiley Online Library, pp 63–75 Alemzadeh S, Niemann U, Ittermann T, Völzke H, Schneider D, Spiliopoulou M, Bühler K, Preim B (2020) Visual analysis of missing values in longitudinal cohort study data. In: Computer graphics forum, vol 39. Wiley Online Library, pp 63–75
Zurück zum Zitat Arbesser C, Spechtenhauser F, Mühlbacher T, Piringer H (2016) Visplause: visual data quality assessment of many time series using plausibility checks. IEEE Trans Vis Comput Graph 23(1):641–650CrossRef Arbesser C, Spechtenhauser F, Mühlbacher T, Piringer H (2016) Visplause: visual data quality assessment of many time series using plausibility checks. IEEE Trans Vis Comput Graph 23(1):641–650CrossRef
Zurück zum Zitat Berger M, McDonough K, Seversky LM (2016) cite2vec: citation-driven document exploration via word embeddings. IEEE Trans Vis Comput Graph 23(1):691–700CrossRef Berger M, McDonough K, Seversky LM (2016) cite2vec: citation-driven document exploration via word embeddings. IEEE Trans Vis Comput Graph 23(1):691–700CrossRef
Zurück zum Zitat Bezerianos A, Dragicevic P, Fekete JD, Bae J, Watson B (2010) Geneaquilts: a system for exploring large genealogies. IEEE Trans Vis Comput Graph 16(6):1073–1081CrossRef Bezerianos A, Dragicevic P, Fekete JD, Bae J, Watson B (2010) Geneaquilts: a system for exploring large genealogies. IEEE Trans Vis Comput Graph 16(6):1073–1081CrossRef
Zurück zum Zitat Bilal A, Jourabloo A, Ye M, Liu X, Ren L (2017) Do convolutional neural networks learn class hierarchy? IEEE Trans Vis Comput Graph 24(1):152–162CrossRef Bilal A, Jourabloo A, Ye M, Liu X, Ren L (2017) Do convolutional neural networks learn class hierarchy? IEEE Trans Vis Comput Graph 24(1):152–162CrossRef
Zurück zum Zitat Blumenschein M, Behrisch M, Schmid S, Butscher S, Wahl DR, Villinger K, Renner B, Reiterer H, Keim DA (2018) Smartexplore: simplifying high-dimensional data analysis through a table-based visual analytics approach. In: 2018 IEEE conference on visual analytics science and technology (VAST). IEEE, pp 36–47 Blumenschein M, Behrisch M, Schmid S, Butscher S, Wahl DR, Villinger K, Renner B, Reiterer H, Keim DA (2018) Smartexplore: simplifying high-dimensional data analysis through a table-based visual analytics approach. In: 2018 IEEE conference on visual analytics science and technology (VAST). IEEE, pp 36–47
Zurück zum Zitat Bors C, Gschwandtner T, Miksch S (2019) Capturing and visualizing provenance from data wrangling. IEEE Comput Graph Appl 39(6):61–75CrossRef Bors C, Gschwandtner T, Miksch S (2019) Capturing and visualizing provenance from data wrangling. IEEE Comput Graph Appl 39(6):61–75CrossRef
Zurück zum Zitat Chen C, Yuan J, Lu Y, Liu Y, Su H, Yuan S, Liu S (2020) Oodanalyzer: interactive analysis of out-of-distribution samples. IEEE Trans Vis Comput Graph 27(7):3335–3349CrossRef Chen C, Yuan J, Lu Y, Liu Y, Su H, Yuan S, Liu S (2020) Oodanalyzer: interactive analysis of out-of-distribution samples. IEEE Trans Vis Comput Graph 27(7):3335–3349CrossRef
Zurück zum Zitat Dextras-Romagnino K, Munzner T (2019) Segmentifier: interactive refinement of clickstream data. Comput Graph Forum 38(3):623–634CrossRef Dextras-Romagnino K, Munzner T (2019) Segmentifier: interactive refinement of clickstream data. Comput Graph Forum 38(3):623–634CrossRef
Zurück zum Zitat Gotz D, Stavropoulos H (2014) Decisionflow: visual analytics for high-dimensional temporal event sequence data. IEEE Trans Vis Comput Graph 20(12):1783–1792CrossRef Gotz D, Stavropoulos H (2014) Decisionflow: visual analytics for high-dimensional temporal event sequence data. IEEE Trans Vis Comput Graph 20(12):1783–1792CrossRef
Zurück zum Zitat Jaunet T, Vuillemot R, Wolf C (2020) Drlviz: understanding decisions and memory in deep reinforcement learning. Comput Graph Forum 39(3):49–61CrossRef Jaunet T, Vuillemot R, Wolf C (2020) Drlviz: understanding decisions and memory in deep reinforcement learning. Comput Graph Forum 39(3):49–61CrossRef
Zurück zum Zitat Li G, Wang J, Shen HW, Chen K, Shan G, Lu Z (2020) Cnnpruner: pruning convolutional neural networks with visual analytics. IEEE Trans Vis Comput Graph 27(2):1364–1373CrossRef Li G, Wang J, Shen HW, Chen K, Shan G, Lu Z (2020) Cnnpruner: pruning convolutional neural networks with visual analytics. IEEE Trans Vis Comput Graph 27(2):1364–1373CrossRef
Zurück zum Zitat Liu M, Shi J, Cao K, Zhu J, Liu S (2017a) Analyzing the training processes of deep generative models. IEEE Trans Vis Comput Graph 24(1):77–87CrossRef Liu M, Shi J, Cao K, Zhu J, Liu S (2017a) Analyzing the training processes of deep generative models. IEEE Trans Vis Comput Graph 24(1):77–87CrossRef
Zurück zum Zitat Liu M, Shi J, Li Z, Li C, Zhu J, Liu S (2017b) Towards better analysis of deep convolutional neural networks. IEEE Trans Vis Comput Graph 23(1):91–100CrossRef Liu M, Shi J, Li Z, Li C, Zhu J, Liu S (2017b) Towards better analysis of deep convolutional neural networks. IEEE Trans Vis Comput Graph 23(1):91–100CrossRef
Zurück zum Zitat Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. Advances in neural information processing systems, pp 3111–3119 Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. Advances in neural information processing systems, pp 3111–3119
Zurück zum Zitat Ming Y, Cao S, Zhang R, Li Z, Chen Y, Song Y, Qu H (2017) Understanding hidden memories of recurrent neural networks. In: 2017 IEEE conference on visual analytics science and technology (VAST). IEEE, pp 13–24 Ming Y, Cao S, Zhang R, Li Z, Chen Y, Song Y, Qu H (2017) Understanding hidden memories of recurrent neural networks. In: 2017 IEEE conference on visual analytics science and technology (VAST). IEEE, pp 13–24
Zurück zum Zitat Perer A, Sun J (2012) Matrixflow: temporal network visual analytics to track symptom evolution during disease progression. In: AMIA annual symposium proceedings, vol 2012, p 716 Perer A, Sun J (2012) Matrixflow: temporal network visual analytics to track symptom evolution during disease progression. In: AMIA annual symposium proceedings, vol 2012, p 716
Zurück zum Zitat Plaisant C, Milash B, Rose A, Widoff S, Shneiderman B (1996) Lifelines: visualizing personal histories. In: Proceedings of the SIGCHI conference on Human factors in computing systems, pp 221–227 Plaisant C, Milash B, Rose A, Widoff S, Shneiderman B (1996) Lifelines: visualizing personal histories. In: Proceedings of the SIGCHI conference on Human factors in computing systems, pp 221–227
Zurück zum Zitat Wang J, Gou L, Shen HW, Yang H (2018a) Dqnviz: a visual analytics approach to understand deep q-networks. IEEE Trans Vis Comput Graph 25(1):288–298CrossRef Wang J, Gou L, Shen HW, Yang H (2018a) Dqnviz: a visual analytics approach to understand deep q-networks. IEEE Trans Vis Comput Graph 25(1):288–298CrossRef
Zurück zum Zitat Wang J, Gou L, Yang H, Shen HW (2018b) Ganviz: a visual analytics approach to understand the adversarial game. IEEE Trans Vis Comput Graph 24(6):1905–1917CrossRef Wang J, Gou L, Yang H, Shen HW (2018b) Ganviz: a visual analytics approach to understand the adversarial game. IEEE Trans Vis Comput Graph 24(6):1905–1917CrossRef
Zurück zum Zitat Wang J, Zhang W, Yang H, Yeh CCM, Wang L (2021) Visual analytics for RNN-based deep reinforcement learning. IEEE Trans Vis Comput Graph Wang J, Zhang W, Yang H, Yeh CCM, Wang L (2021) Visual analytics for RNN-based deep reinforcement learning. IEEE Trans Vis Comput Graph
Zurück zum Zitat Wongsuphasawat K, Guerra Gómez JA, Plaisant C, Wang TD, Taieb-Maimon M, Shneiderman B (2011) Lifeflow: visualizing an overview of event sequences. In: Proceedings of the SIGCHI conference on human factors in computing systems, pp 1747–1756 Wongsuphasawat K, Guerra Gómez JA, Plaisant C, Wang TD, Taieb-Maimon M, Shneiderman B (2011) Lifeflow: visualizing an overview of event sequences. In: Proceedings of the SIGCHI conference on human factors in computing systems, pp 1747–1756
Zurück zum Zitat Xiang S, Ye X, Xia J, Wu J, Chen Y, Liu S (2019) Interactive correction of mislabeled training data. In: 2019 IEEE conference on visual analytics science and technology (VAST). IEEE, pp 57–68 Xiang S, Ye X, Xia J, Wu J, Chen Y, Liu S (2019) Interactive correction of mislabeled training data. In: 2019 IEEE conference on visual analytics science and technology (VAST). IEEE, pp 57–68
Zurück zum Zitat Zhao Y, Luo F, Chen M, Wang Y, Xia J, Zhou F, Wang Y, Chen Y, Chen W (2018) Evaluating multi-dimensional visualizations for understanding fuzzy clusters. IEEE Trans Vis Comput Graph 25(1):12–21CrossRef Zhao Y, Luo F, Chen M, Wang Y, Xia J, Zhou F, Wang Y, Chen Y, Chen W (2018) Evaluating multi-dimensional visualizations for understanding fuzzy clusters. IEEE Trans Vis Comput Graph 25(1):12–21CrossRef
Zurück zum Zitat Zhao Y, Luo X, Lin X, Wang H, Kui X, Zhou F, Wang J, Chen Y, Chen W (2019) Visual analytics for electromagnetic situation awareness in radio monitoring and management. IEEE Trans Vis Comput Graph 26(1):590–600CrossRef Zhao Y, Luo X, Lin X, Wang H, Kui X, Zhou F, Wang J, Chen Y, Chen W (2019) Visual analytics for electromagnetic situation awareness in radio monitoring and management. IEEE Trans Vis Comput Graph 26(1):590–600CrossRef
Metadaten
Titel
ACMViz: a visual analytics approach to understand DRL-based autonomous control model
verfasst von
Shiyu Cheng
Xiaochen Li
Guihua Shan
Beifang Niu
Yang Wang
MaoKang Luo
Publikationsdatum
27.09.2021
Verlag
Springer Berlin Heidelberg
Erschienen in
Journal of Visualization / Ausgabe 2/2022
Print ISSN: 1343-8875
Elektronische ISSN: 1875-8975
DOI
https://doi.org/10.1007/s12650-021-00793-9

Weitere Artikel der Ausgabe 2/2022

Journal of Visualization 2/2022 Zur Ausgabe