Skip to main content
Top
Published in: Artificial Intelligence Review 4/2021

08-10-2020

Interactive clustering: a scoping review

Authors: Thais Rodrigues Neubauer, Sarajane Marques Peres, Marcelo Fantinato, Xixi Lu, Hajo Alexander Reijers

Published in: Artificial Intelligence Review | Issue 4/2021

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

We present in this paper a scoping review conducted in the interactive clustering area. Interactive clustering has been applied to leverage the strengths of both unsupervised and supervised learning. In interactive clustering, supervised learning is represented by inserting the knowledge of human experts in an originally unsupervised data analysis process. This scoping review aimed to organize the knowledge on (i) the applicability of interactive clustering methods, (ii) clustering algorithms being used to support interactive clustering, (iii) how to model the expert supervision and (iv) the effects brought by the expert supervision in the results produced. A systematic search for related literature was conducted in the Scopus database, resulting in the selection of 50 primary studies published by 2018. The analysis of these studies allowed us to identify trends such as: the application in text/image; use of partitioning and hierarchical algorithms; application of strategies based on split/merge, pairwise constraints, similarity metrics learning and data reassignment; and concern with visualization. In addition, some relevant issues not yet adequately addressed were identified, such as: the evaluation of expert supervision; the evaluation of the expert’s effort; and the conduction of studies effectively involving human experts, instead of computer simulations.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Appendix
Available only for authorised users
Footnotes
1
Imposing restrictions on the clustering algorithm is implemented following the constrained clustering principles (Han et al. 2011; Chapelle et al. 2010). According to Wagstaff (2010), constrained clustering algorithms may enforce every constraint in the solution, or they may use the constraints as guidance rather than hard requirements.
 
2
The term active learning is used when the learner has some role in determining on what data it will be trained (Cohn 2010), which can be done based on expert’s knowledge.
 
4
For 2018, only publications indexed until September were analyzed.
 
5
The reproducibility evaluation disregarded the need for a sample of experts statistically equivalent to that of the study carried out.
 
6
As there are studies that present experiments with different data types, the sum accumulated in the graph is higher than the number of studies analyzed.
 
7
independent and identically distributed (i.i.d.)—from the probability theory and statistics.
 
8
Only acquisition strategies found in at least two primary studies are shown in the figure.
 
9
As there are studies that present experiments with different strategies for acquiring knowledge, the sum accumulated in the graph is higher than the number of studies analyzed.
 
10
Only strategies found in at least two primary studies are shown in the figure.
 
11
As there are studies that present experiments with different strategies for using the acquired knowledge, the sum accumulated in the graph is higher than the number of studies analyzed.
 
12
As there are studies that present experiments using different evaluation strategies, the sum accumulated in the graph is higher than the number of studies analyzed.
 
13
Only validation measures found in at least two primary studies are shown in the figure.
 
14
As there are studies that present experiments using different validation measures, the sum accumulated in the graph is higher than the number of studies analyzed.
 
Literature
go back to reference Achlioptas D, McSherry F (2005) On spectral learning of mixtures of distributions. In: Proceedings of the 18th annual conference on learning theory. Springer-Verlag, Berlin, Germany, COLT’05, pp 458–469 Achlioptas D, McSherry F (2005) On spectral learning of mixtures of distributions. In: Proceedings of the 18th annual conference on learning theory. Springer-Verlag, Berlin, Germany, COLT’05, pp 458–469
go back to reference Alagambigai P, Thangavel K, Visalakshi NK (2008) Interactive clustering in distributed environment. In: First international conference on emerging trends in engineering and technology, pp 1200–1205 Alagambigai P, Thangavel K, Visalakshi NK (2008) Interactive clustering in distributed environment. In: First international conference on emerging trends in engineering and technology, pp 1200–1205
go back to reference Arın Í, Erpam MK, Saygın Y (2018) I-TWEC: interactive clustering tool for twitter. Expert Syst Appl 96:1–13 Arın Í, Erpam MK, Saygın Y (2018) I-TWEC: interactive clustering tool for twitter. Expert Syst Appl 96:1–13
go back to reference Awasthi P, Balcan M, Voevodski K (2017) Local algorithms for interactive clustering. J Mach Learn Res 18:1–35MathSciNetMATH Awasthi P, Balcan M, Voevodski K (2017) Local algorithms for interactive clustering. J Mach Learn Res 18:1–35MathSciNetMATH
go back to reference Balcan MF, Blum A (2008) Clustering with interactive feedback. In: Proceedings of the 19th international conference on algorithmic learning theory (ALT), vol 5254, pp 316–328 Balcan MF, Blum A (2008) Clustering with interactive feedback. In: Proceedings of the 19th international conference on algorithmic learning theory (ALT), vol 5254, pp 316–328
go back to reference Behr AA, Liu KZ, Liu-Fang G, Nakka P, Ramachandran S (2016) Pong: fast analysis and visualization of latent clusters in population genetic data. Bioinformatics 32(18):2817–2823 Behr AA, Liu KZ, Liu-Fang G, Nakka P, Ramachandran S (2016) Pong: fast analysis and visualization of latent clusters in population genetic data. Bioinformatics 32(18):2817–2823
go back to reference Belkin M, Sinha K (2010) Polynomial learning of distribution families. In: Proceedings of the IEEE 51st annual symposium on foundations of computer science, IEEE computer society, Washington, DC, USA, FOCS ’10, pp 103–112 Belkin M, Sinha K (2010) Polynomial learning of distribution families. In: Proceedings of the IEEE 51st annual symposium on foundations of computer science, IEEE computer society, Washington, DC, USA, FOCS ’10, pp 103–112
go back to reference Biolchini J, Mian PG, Natali ACC, Travassos GH (2005) Systematic review in software engineering. Technical Report TR—ES 679 / 05, Systems Engineering and Computer Science Department, Universidade Federal do Rio de Janeiro, Rio de Janeiro Biolchini J, Mian PG, Natali ACC, Travassos GH (2005) Systematic review in software engineering. Technical Report TR—ES 679 / 05, Systems Engineering and Computer Science Department, Universidade Federal do Rio de Janeiro, Rio de Janeiro
go back to reference Borg I, Groenen P (1997) Modern multidimensional scaling. Theory and applications. Springer, GermanyMATH Borg I, Groenen P (1997) Modern multidimensional scaling. Theory and applications. Springer, GermanyMATH
go back to reference Boudjeloud-Assala L, Pinheiro P, Blansché A, Tamisier T, Otjacques B (2016) Interactive and iterative visual clustering. Inf Vis 15(3):181–197 Boudjeloud-Assala L, Pinheiro P, Blansché A, Tamisier T, Otjacques B (2016) Interactive and iterative visual clustering. Inf Vis 15(3):181–197
go back to reference Brubaker SC, Vempala S (2008) Isotropic PCA and affine-invariant clustering. In: Proceedings of the 49th annual IEEE symposium on foundations of computer science, IEEE computer society, Washington, DC, USA, FOCS ’08, pp 551–560 Brubaker SC, Vempala S (2008) Isotropic PCA and affine-invariant clustering. In: Proceedings of the 49th annual IEEE symposium on foundations of computer science, IEEE computer society, Washington, DC, USA, FOCS ’08, pp 551–560
go back to reference Bruneau P, Otjacques B (2013) An interactive, example-based, visual clustering system. In: Proceedings of the international conference on information visualisation Bruneau P, Otjacques B (2013) An interactive, example-based, visual clustering system. In: Proceedings of the international conference on information visualisation
go back to reference Cao N, Gotz D, Sun J, Qu H (2011) Dicon: interactive visual analysis of multidimensional clusters. IEEE Trans Vis Comput Graph 17(12):2581–2590 Cao N, Gotz D, Sun J, Qu H (2011) Dicon: interactive visual analysis of multidimensional clusters. IEEE Trans Vis Comput Graph 17(12):2581–2590
go back to reference Cavallo M, Demiralp Ç (2018) Clustrophile 2: guided visual clustering analysis. IEEE Trans Vis Comput Graph 25:267–276 Cavallo M, Demiralp Ç (2018) Clustrophile 2: guided visual clustering analysis. IEEE Trans Vis Comput Graph 25:267–276
go back to reference Chang S, Dai P, Hong L, Sheng C, Zhang T, Chi EH (2016) AppGrouper: knowledge-based interactive clustering tool for app search results. In: Proceedings of the 21st international conference on intelligent user interfaces, ACM, New York, NY, USA, IUI ’16, pp 348–358 Chang S, Dai P, Hong L, Sheng C, Zhang T, Chi EH (2016) AppGrouper: knowledge-based interactive clustering tool for app search results. In: Proceedings of the 21st international conference on intelligent user interfaces, ACM, New York, NY, USA, IUI ’16, pp 348–358
go back to reference Chapelle O, Schlkopf B, Zien A (2010) Semi-supervised learning, 1st edn. The MIT Press, Cambridge Chapelle O, Schlkopf B, Zien A (2010) Semi-supervised learning, 1st edn. The MIT Press, Cambridge
go back to reference Chaudhuri K, Dasgupta S (2010) Rates of convergence for the cluster tree. In: Lafferty JD, Williams CKI, Shawe-Taylor J, Zemel RS, Culotta A (eds) Advances in neural information processing systems, vol 23. Curran Associates Inc., New York, pp 343–351 Chaudhuri K, Dasgupta S (2010) Rates of convergence for the cluster tree. In: Lafferty JD, Williams CKI, Shawe-Taylor J, Zemel RS, Culotta A (eds) Advances in neural information processing systems, vol 23. Curran Associates Inc., New York, pp 343–351
go back to reference Coden A, Danilevsky M, Gruhl D, Kato L, Nagarajan M (2017) A method to accelerate human in the loop clustering. In: Proceedings of the 17th SIAM international conference on data mining, society for industrial and applied mathematics, Philadelphia, Pennsylvania, United States, pp 237–245 Coden A, Danilevsky M, Gruhl D, Kato L, Nagarajan M (2017) A method to accelerate human in the loop clustering. In: Proceedings of the 17th SIAM international conference on data mining, society for industrial and applied mathematics, Philadelphia, Pennsylvania, United States, pp 237–245
go back to reference Correa G, Marcacini R, Hruschka E, Rezende S (2015) Interactive textual feature selection for consensus clustering. Pattern Recognit Lett 52:25–31 Correa G, Marcacini R, Hruschka E, Rezende S (2015) Interactive textual feature selection for consensus clustering. Pattern Recognit Lett 52:25–31
go back to reference Dasgupta S, Ng V (2010) Which clustering do you want? Inducing your ideal clustering with minimal feedback. J Artif Intell Res 39(1):581–632MathSciNetMATH Dasgupta S, Ng V (2010) Which clustering do you want? Inducing your ideal clustering with minimal feedback. J Artif Intell Res 39(1):581–632MathSciNetMATH
go back to reference desJardins M, MacGlashan J, Ferraioli J (2007) Interactive visual clustering. In: 12th International conference on intelligent user interfaces (IUI), pp 361–364 desJardins M, MacGlashan J, Ferraioli J (2007) Interactive visual clustering. In: 12th International conference on intelligent user interfaces (IUI), pp 361–364
go back to reference Dubey A, Bhattacharya I, Godbole S (2010) A cluster-level semi-supervision model for interactive clustering. In: Lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics) 6321 LNAI (PART 1), pp 409–424 Dubey A, Bhattacharya I, Godbole S (2010) A cluster-level semi-supervision model for interactive clustering. In: Lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics) 6321 LNAI (PART 1), pp 409–424
go back to reference Dudley JJ, Kristensson PO (2018) A review of user interface design for interactive machine learning. ACM Trans Interact Intell Syst 8(2):1–37 Dudley JJ, Kristensson PO (2018) A review of user interface design for interactive machine learning. ACM Trans Interact Intell Syst 8(2):1–37
go back to reference Dyba T, Dingsoyr T, Hanssen GK (2007) Applying systematic reviews to diverse study types: an experience report. In: First international symposium on empirical software engineering and measurement (ESEM 2007), pp 225–234 Dyba T, Dingsoyr T, Hanssen GK (2007) Applying systematic reviews to diverse study types: an experience report. In: First international symposium on empirical software engineering and measurement (ESEM 2007), pp 225–234
go back to reference Emamjomeh-Zadeh E, Kempe D (2017) A general framework for robust interactive learning. In: Conference on neural information processing systems, NIPS, 2017 Emamjomeh-Zadeh E, Kempe D (2017) A general framework for robust interactive learning. In: Conference on neural information processing systems, NIPS, 2017
go back to reference Ester M, Kriegel HP, Sander J, Xu X (1996) A density-based algorithm for discovering clusters a density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of the 2nd international conference on knowledge discovery and data mining, pp 226–231. AAAI Press Ester M, Kriegel HP, Sander J, Xu X (1996) A density-based algorithm for discovering clusters a density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of the 2nd international conference on knowledge discovery and data mining, pp 226–231. AAAI Press
go back to reference Fayyad UM, Piatetsky-Shapiro G, Smyth P, Uthurusamy R (eds) (1996) Advances in knowledge discovery and data mining. American Association for Artificial Intelligence, Menlo Park Fayyad UM, Piatetsky-Shapiro G, Smyth P, Uthurusamy R (eds) (1996) Advances in knowledge discovery and data mining. American Association for Artificial Intelligence, Menlo Park
go back to reference Ferrero G, Primadhanty A, Quattoni A (2017) Intoevents: an interactive toolkit for discovering and building event schemas. In: Proceedings of the software demonstrations of the 15th conference of the European chapter of the association for computational linguistics, pp 104–107. Association for Computational Linguistics Ferrero G, Primadhanty A, Quattoni A (2017) Intoevents: an interactive toolkit for discovering and building event schemas. In: Proceedings of the software demonstrations of the 15th conference of the European chapter of the association for computational linguistics, pp 104–107. Association for Computational Linguistics
go back to reference Fredj SB, Glatard T, Casta C, Clarysse P (2011) An exploration framework for segmentation parameter spaces. IEEE Int Conf Image Process 18:2257–2260 Fredj SB, Glatard T, Casta C, Clarysse P (2011) An exploration framework for segmentation parameter spaces. IEEE Int Conf Image Process 18:2257–2260
go back to reference Geerts F, Ndindi R (2014) Interactive correlation clustering. In: International conference on data science and advanced analytics (DSAA), pp 170–176 Geerts F, Ndindi R (2014) Interactive correlation clustering. In: International conference on data science and advanced analytics (DSAA), pp 170–176
go back to reference Gharehchopogh FS (2010) Approach and review of user oriented interactive data mining. In: Proceedings of the 4th international conference on application of information and communication technologies, pp 1–4 Gharehchopogh FS (2010) Approach and review of user oriented interactive data mining. In: Proceedings of the 4th international conference on application of information and communication technologies, pp 1–4
go back to reference Gieseke F, Pahikkala T, Heskes T (2015) Batch steepest-descent-mildest-ascent for interactive maximum margin clustering. Lect Notes Comput Sci 9385:95–107 Gieseke F, Pahikkala T, Heskes T (2015) Batch steepest-descent-mildest-ascent for interactive maximum margin clustering. Lect Notes Comput Sci 9385:95–107
go back to reference Guo P, Xiao H, Wang Z, Yuan X (2010) Interactive local clustering operations for high dimensional data in parallel coordinates. In: IEEE Pacific visualization symposium (PacificVis), pp 97–104 Guo P, Xiao H, Wang Z, Yuan X (2010) Interactive local clustering operations for high dimensional data in parallel coordinates. In: IEEE Pacific visualization symposium (PacificVis), pp 97–104
go back to reference Halkidi M, Batistakis Y, Vazirgiannis M (2001) On clustering validation techniques. J Intell Inf Syst 17(2):107–145MATH Halkidi M, Batistakis Y, Vazirgiannis M (2001) On clustering validation techniques. J Intell Inf Syst 17(2):107–145MATH
go back to reference Han J, Kamber M, Pei J (2011) Data mining: concepts and techniques, 3rd edn. Morgan Kaufmann Publishers Inc., San FranciscoMATH Han J, Kamber M, Pei J (2011) Data mining: concepts and techniques, 3rd edn. Morgan Kaufmann Publishers Inc., San FranciscoMATH
go back to reference Holzinger A, Dehmer M, Jurisica I (2014) Knowledge discovery and interactive data mining in bioinformatics–state-of-the-art, future challenges and research directions. BMC Bioinform 15(S–6):I1 Holzinger A, Dehmer M, Jurisica I (2014) Knowledge discovery and interactive data mining in bioinformatics–state-of-the-art, future challenges and research directions. BMC Bioinform 15(S–6):I1
go back to reference Hu Y, Milios EE, Blustein J (2011) Interactive feature selection for document clustering. In: Proceedings of the ACM symposium on applied computing, pp 1143–1150 Hu Y, Milios EE, Blustein J (2011) Interactive feature selection for document clustering. In: Proceedings of the ACM symposium on applied computing, pp 1143–1150
go back to reference Hu Y, Milios EE, Blustein J (2014) Interactive document clustering with feature supervision through reweighting. Intell Data Anal 18:561–581 Hu Y, Milios EE, Blustein J (2014) Interactive document clustering with feature supervision through reweighting. Intell Data Anal 18:561–581
go back to reference Iorio F, Miele G, Napolitano F, Raiconi G, Tagliaferri R (2007) An interactive tool for data visualization and clustering. In: Knowledge-based intelligent information and engineering systems: 11th international conference (KES), XVII Italian workshop on neural networks, pp 870–877 Iorio F, Miele G, Napolitano F, Raiconi G, Tagliaferri R (2007) An interactive tool for data visualization and clustering. In: Knowledge-based intelligent information and engineering systems: 11th international conference (KES), XVII Italian workshop on neural networks, pp 870–877
go back to reference Jang S, Elmqvist N, Ramani K (2014) GestureAnalyzer: Visual analytics for pattern analysis of mid-air hand gestures. In: Proceedings of the 2nd ACM symposium on spatial user interaction, ACM, New York, NY, USA, pp 30–39 Jang S, Elmqvist N, Ramani K (2014) GestureAnalyzer: Visual analytics for pattern analysis of mid-air hand gestures. In: Proceedings of the 2nd ACM symposium on spatial user interaction, ACM, New York, NY, USA, pp 30–39
go back to reference Ji D, Wang M, Tian Q, X-SHua (2010) Semi-automatic photo clustering with distance metric learning. In: Proceedings of SPIE—the international society for optical engineering, vol 7744 Ji D, Wang M, Tian Q, X-SHua (2010) Semi-automatic photo clustering with distance metric learning. In: Proceedings of SPIE—the international society for optical engineering, vol 7744
go back to reference Kalai AT, Moitra A, Valiant G (2010) Efficiently learning mixtures of two gaussians. In: Proceedings of the 42nd ACM symposium on theory of computing, ACM, New York, NY, USA, STOC ’10, pp 553–562 Kalai AT, Moitra A, Valiant G (2010) Efficiently learning mixtures of two gaussians. In: Proceedings of the 42nd ACM symposium on theory of computing, ACM, New York, NY, USA, STOC ’10, pp 553–562
go back to reference Khodabandeh M, Vahdat A, Zhou GT, Hajimirsadeghi H, Roshtkhari M, Mori G, Se S (2015) Discovering human interactions in videos with limited data labeling. In: IEEE computer society conference on computer vision and pattern recognition workshops, IEEE, New Jersey, USA, vol 2015-October, pp 9–18 Khodabandeh M, Vahdat A, Zhou GT, Hajimirsadeghi H, Roshtkhari M, Mori G, Se S (2015) Discovering human interactions in videos with limited data labeling. In: IEEE computer society conference on computer vision and pattern recognition workshops, IEEE, New Jersey, USA, vol 2015-October, pp 9–18
go back to reference Kitchenham B, Charters S (2007) Guidelines for performing systematic literature reviews in software engineering. Technical report, School of Computer Science and Mathematics Keele University, Keele and Department of Computer Science, University of Durham, Durham Kitchenham B, Charters S (2007) Guidelines for performing systematic literature reviews in software engineering. Technical report, School of Computer Science and Mathematics Keele University, Keele and Department of Computer Science, University of Durham, Durham
go back to reference Kohonen T (2000) Self organizing maps, 3rd edn. Springer, BerlinMATH Kohonen T (2000) Self organizing maps, 3rd edn. Springer, BerlinMATH
go back to reference Lai HP, Visani M, Boucher A, Ogier J (2014) A new interactive semi-supervised clustering model for large image database indexing. Pattern Recognit Lett 37:94–106 Lai HP, Visani M, Boucher A, Ogier J (2014) A new interactive semi-supervised clustering model for large image database indexing. Pattern Recognit Lett 37:94–106
go back to reference Lee H, Kihm J, Choo J, Stasko J, Park H (2012) iVisClustering: an interactive visual document clustering via topic modeling. Comput Graph Forum 31(3pt3):1155–1164 Lee H, Kihm J, Choo J, Stasko J, Park H (2012) iVisClustering: an interactive visual document clustering via topic modeling. Comput Graph Forum 31(3pt3):1155–1164
go back to reference Lelkes AD, Reyzin L (2015) Interactive clustering of linear classes and cryptographic lower bounds. Algorithmic learning theory. Springer, Cham, pp 165–176 Lelkes AD, Reyzin L (2015) Interactive clustering of linear classes and cryptographic lower bounds. Algorithmic learning theory. Springer, Cham, pp 165–176
go back to reference Mai ST, Amer-Yahia S, Douzal A, Nguyen KT, Nguyen AD (2018) Scalable active constrained clustering for temporal data. In: 23rd International conference on database systems for advanced applications (DASFAA), pp 566–582 Mai ST, Amer-Yahia S, Douzal A, Nguyen KT, Nguyen AD (2018) Scalable active constrained clustering for temporal data. In: 23rd International conference on database systems for advanced applications (DASFAA), pp 566–582
go back to reference Marcacini RM, Correa GN, Rezende SO (2012) An active learning approach to frequent itemset-based text clustering. In: Proceedings of the international conference on pattern recognition, IEEE, USA, pp 3529–3532 Marcacini RM, Correa GN, Rezende SO (2012) An active learning approach to frequent itemset-based text clustering. In: Proceedings of the international conference on pattern recognition, IEEE, USA, pp 3529–3532
go back to reference Marcacini RM, Domingues MA, Rezende SO (2013) Improving consensus clustering of texts using interactive feature selection. In: Proceedings of the 22nd international conference on world wide web, ACM, USA, WWW’13 Companion, pp 237–238 Marcacini RM, Domingues MA, Rezende SO (2013) Improving consensus clustering of texts using interactive feature selection. In: Proceedings of the 22nd international conference on world wide web, ACM, USA, WWW’13 Companion, pp 237–238
go back to reference Marrero M, Urbano J (2018) A semi-automatic and low-cost method to learn patterns for named entity recognition. Nat Lang Eng 24(1):39–75 Marrero M, Urbano J (2018) A semi-automatic and low-cost method to learn patterns for named entity recognition. Nat Lang Eng 24(1):39–75
go back to reference Mauder M, Bobkova Y, Ntoutsi E (2016) GMMbuilder-user-driven discovery of clustering structure for bioarchaeology. In: Lecture notes in computer science 9853 LNCS, pp 8–11 Mauder M, Bobkova Y, Ntoutsi E (2016) GMMbuilder-user-driven discovery of clustering structure for bioarchaeology. In: Lecture notes in computer science 9853 LNCS, pp 8–11
go back to reference Moitra A, Valiant G (2010) Settling the polynomial learnability of mixtures of gaussians. In IEEE 51st annual symposium on foundations of computer science. IEEE, Washington, DC, USA, pp 93–102 Moitra A, Valiant G (2010) Settling the polynomial learnability of mixtures of gaussians. In IEEE 51st annual symposium on foundations of computer science. IEEE, Washington, DC, USA, pp 93–102
go back to reference Momma M, Morinaga S, Komura D (2009) Promoting total efficiency in text clustering via iterative and interactive metric learning. In: 9th IEEE international conference on data mining, pp 878–883 Momma M, Morinaga S, Komura D (2009) Promoting total efficiency in text clustering via iterative and interactive metric learning. In: 9th IEEE international conference on data mining, pp 878–883
go back to reference Mukhopadhyay A (2016) Interactive approach to multiobjective genetic fuzzy clustering for satellite image segmentation. In: IEEE Uttar Pradesh section international conference on electrical, computer and electronics engineering (UPCON), pp 630–634 Mukhopadhyay A (2016) Interactive approach to multiobjective genetic fuzzy clustering for satellite image segmentation. In: IEEE Uttar Pradesh section international conference on electrical, computer and electronics engineering (UPCON), pp 630–634
go back to reference Mukhopadhyay A, Maulik U, Bandyopadhyay S (2013) An interactive approach to multiobjective clustering of gene expression patterns. IEEE Trans Biomed Eng 60(1):35–41 Mukhopadhyay A, Maulik U, Bandyopadhyay S (2013) An interactive approach to multiobjective clustering of gene expression patterns. IEEE Trans Biomed Eng 60(1):35–41
go back to reference Nourashrafeddin S, Sherkat E, Minghim R, Milios EE (2018) A visual approach for interactive Keyterm-based clustering. ACM Trans Interact Intell Syst 8(1):1–35 Nourashrafeddin S, Sherkat E, Minghim R, Milios EE (2018) A visual approach for interactive Keyterm-based clustering. ACM Trans Interact Intell Syst 8(1):1–35
go back to reference Okabe M, Yamada S (2010a) Constrained clustering with interactive similarity learning. In: Joint 5th international conference on soft computing and intelligent systems and 11th international symposium on advanced intelligent systems. Japan society for fuzzy theory and intelligent informatics, Japan, pp 1295–1300 Okabe M, Yamada S (2010a) Constrained clustering with interactive similarity learning. In: Joint 5th international conference on soft computing and intelligent systems and 11th international symposium on advanced intelligent systems. Japan society for fuzzy theory and intelligent informatics, Japan, pp 1295–1300
go back to reference Okabe M, Yamada S (2010b) An interactive tool for constrained clustering with human sampling. In: Proceedings of the IEEE/WIC/ACM international conference on web intelligence and intelligent agent technology, IEEE computer society, Washington, DC, USA, WI-IAT ’10, vol 3, pp 108–111 Okabe M, Yamada S (2010b) An interactive tool for constrained clustering with human sampling. In: Proceedings of the IEEE/WIC/ACM international conference on web intelligence and intelligent agent technology, IEEE computer society, Washington, DC, USA, WI-IAT ’10, vol 3, pp 108–111
go back to reference Okabe M, Yamada S (2011) An interactive tool for human active learning in constrained clustering. J Emerg Technol Web Intell 3:20–27 Okabe M, Yamada S (2011) An interactive tool for human active learning in constrained clustering. J Emerg Technol Web Intell 3:20–27
go back to reference Olsson F (2009) A literature survey of active machine learning in the context of natural language processing. Technical Report T2009:06, Swedish Institute of Computer Science, Kista, Sweden Olsson F (2009) A literature survey of active machine learning in the context of natural language processing. Technical Report T2009:06, Swedish Institute of Computer Science, Kista, Sweden
go back to reference Paré G, Trudel MC, Jaana M, Kitsiou S (2015) Synthesizing information systems knowledge: a typology of literature reviews. Inf Manag 52:183–199 Paré G, Trudel MC, Jaana M, Kitsiou S (2015) Synthesizing information systems knowledge: a typology of literature reviews. Inf Manag 52:183–199
go back to reference Patrick EA, Shen LYL (1971) Interactive use of problem knowledge for clustering and decision making. IEEE Trans Comput C–20(2):216–222MATH Patrick EA, Shen LYL (1971) Interactive use of problem knowledge for clustering and decision making. IEEE Trans Comput C–20(2):216–222MATH
go back to reference Sacha D, Kraus M, Bernard J, Behrisch M, Schreck T, Asano Y, Keim DA (2018) SOMFlow: guided exploratory cluster analysis with self-organizing maps and analytic provenance. IEEE Trans Vis Comput Graph 24:120–130 Sacha D, Kraus M, Bernard J, Behrisch M, Schreck T, Asano Y, Keim DA (2018) SOMFlow: guided exploratory cluster analysis with self-organizing maps and analytic provenance. IEEE Trans Vis Comput Graph 24:120–130
go back to reference Schreck T, Bernard J, Landesberger T, Kohlhammer J (2009) Visual cluster analysis of trajectory data with interactive kohonen maps. Inf Vis 8:14–29 Schreck T, Bernard J, Landesberger T, Kohlhammer J (2009) Visual cluster analysis of trajectory data with interactive kohonen maps. Inf Vis 8:14–29
go back to reference Schwenker F, Trentin E (2014) Pattern classification and clustering: a review of partially supervised learning approaches. Pattern Recognit Lett 37(1):4–14 Schwenker F, Trentin E (2014) Pattern classification and clustering: a review of partially supervised learning approaches. Pattern Recognit Lett 37(1):4–14
go back to reference Senderovich N, Maysuradze A (2015) Interactive coding of responses to open-ended questions in Russian Knowledge Engineering and Semantic Web. Springer, Cham, pp 195–209 Senderovich N, Maysuradze A (2015) Interactive coding of responses to open-ended questions in Russian Knowledge Engineering and Semantic Web. Springer, Cham, pp 195–209
go back to reference Settles B (2009) Active learning literature survey. Technical Report CSTR 1648, University of Wisconsin–Madison, Madison, Wisconsin, USA Settles B (2009) Active learning literature survey. Technical Report CSTR 1648, University of Wisconsin–Madison, Madison, Wisconsin, USA
go back to reference Sherkat E, Nourashrafeddin S, Milios EE, Minghim R (2018) Interactive document clustering revisited: a visual analytics approach. In: 23rd International conference on intelligent user interfaces, ACM, New York, NY, USA, IUI’18, pp 281–292 Sherkat E, Nourashrafeddin S, Milios EE, Minghim R (2018) Interactive document clustering revisited: a visual analytics approach. In: 23rd International conference on intelligent user interfaces, ACM, New York, NY, USA, IUI’18, pp 281–292
go back to reference Sun L, Wang X (2010) A survey on active learning strategy. Proc Int Conf Mach Learn Cybern 1:161–166 Sun L, Wang X (2010) A survey on active learning strategy. Proc Int Conf Mach Learn Cybern 1:161–166
go back to reference Tubío ÓD, Grimán A, Juzgado NJ (2009) Developing search strategies for detecting relevant experiments. Empir Softw Eng 14:513–539 Tubío ÓD, Grimán A, Juzgado NJ (2009) Developing search strategies for detecting relevant experiments. Empir Softw Eng 14:513–539
go back to reference Ultsch A, Siemon HP (1990) Kohonen’s self organizing feature maps for exploratory data analysis. In: Proceedings of the international neural network conference, Dordrecht, Netherlands: Kluwer, Paris, France, pp 305–308 Ultsch A, Siemon HP (1990) Kohonen’s self organizing feature maps for exploratory data analysis. In: Proceedings of the international neural network conference, Dordrecht, Netherlands: Kluwer, Paris, France, pp 305–308
go back to reference van der Maaten L, Hinton G (2008) Visualizing data using t-SNE. J Mach Learn Res 9:2579–2605MATH van der Maaten L, Hinton G (2008) Visualizing data using t-SNE. J Mach Learn Res 9:2579–2605MATH
go back to reference Vikram S, Dasgupta S (2016) Interactive Bayesian hierarchical clustering. In: Proceedings of the 33rd international conference on machine learning, JMLR.org, USA, vol 5, pp 3033–3042 Vikram S, Dasgupta S (2016) Interactive Bayesian hierarchical clustering. In: Proceedings of the 33rd international conference on machine learning, JMLR.org, USA, vol 5, pp 3033–3042
go back to reference Vu V, Lai H, Visani M (2016) Towards an approach using metric learning for interactive semi-supervised clustering of images. In: Proceedings of the 8th international conference on knowledge and systems engineering, IEEE, USA, pp 357–362 Vu V, Lai H, Visani M (2016) Towards an approach using metric learning for interactive semi-supervised clustering of images. In: Proceedings of the 8th international conference on knowledge and systems engineering, IEEE, USA, pp 357–362
go back to reference Wang M, Ji D, Tian Q, Hua XS (2012) Intelligent photo clustering with user interaction and distance metric learning. Pattern Recognit Lett 33(4):462–470 Wang M, Ji D, Tian Q, Hua XS (2012) Intelligent photo clustering with user interaction and distance metric learning. Pattern Recognit Lett 33(4):462–470
go back to reference Wold S, Esbensen K, Geladi P (1987) Principal component analysis. Chemom Intell Lab Syst 2(1):37–52 Wold S, Esbensen K, Geladi P (1987) Principal component analysis. Chemom Intell Lab Syst 2(1):37–52
go back to reference Xiong S, Azimi J, Fern XZ (2014) Active learning of constraints for semi-supervised clustering. IEEE Trans Knowl Data Eng 26(1):43–54 Xiong S, Azimi J, Fern XZ (2014) Active learning of constraints for semi-supervised clustering. IEEE Trans Knowl Data Eng 26(1):43–54
go back to reference Xu P, Cao N, Qu H, Stasko J (2016) Interactive visual co-cluster analysis of bipartite graphs. In: IEEE Pacific visualization symposium (PacificVis), pp 32–39 Xu P, Cao N, Qu H, Stasko J (2016) Interactive visual co-cluster analysis of bipartite graphs. In: IEEE Pacific visualization symposium (PacificVis), pp 32–39
go back to reference Zhang XL, Simpson T, Frecker M, Lesieutre G (2012) Supporting knowledge exploration and discovery in multi-dimensional data with interactive multiscale visualisation. J Eng Des 23(1):23–47 Zhang XL, Simpson T, Frecker M, Lesieutre G (2012) Supporting knowledge exploration and discovery in multi-dimensional data with interactive multiscale visualisation. J Eng Des 23(1):23–47
go back to reference Zhang L, Kalashnikov DV, Mehrotra S (2014) Context-assisted face clustering framework with human-in-the-loop. Int J Multimed Inf Retr 3:69–88 Zhang L, Kalashnikov DV, Mehrotra S (2014) Context-assisted face clustering framework with human-in-the-loop. Int J Multimed Inf Retr 3:69–88
go back to reference Zhang Y, Luo W, Mack EA, Maciejewski R (2016) Visualizing the impact of geographical variations on multivariate clustering. Comput Graph Forum 35:101–110 Zhang Y, Luo W, Mack EA, Maciejewski R (2016) Visualizing the impact of geographical variations on multivariate clustering. Comput Graph Forum 35:101–110
go back to reference Zhu Y, Li H, Miller DJ, Wang Z, Xuan J, Clarke R, Hoffman EP, Wang YJ (2008) caBIG\(^{{\rm TM}}\) VISDA: modeling, visualization, and discovery for cluster analysis of genomic data. BMC Bioinform 9:383–383 Zhu Y, Li H, Miller DJ, Wang Z, Xuan J, Clarke R, Hoffman EP, Wang YJ (2008) caBIG\(^{{\rm TM}}\) VISDA: modeling, visualization, and discovery for cluster analysis of genomic data. BMC Bioinform 9:383–383
Metadata
Title
Interactive clustering: a scoping review
Authors
Thais Rodrigues Neubauer
Sarajane Marques Peres
Marcelo Fantinato
Xixi Lu
Hajo Alexander Reijers
Publication date
08-10-2020
Publisher
Springer Netherlands
Published in
Artificial Intelligence Review / Issue 4/2021
Print ISSN: 0269-2821
Electronic ISSN: 1573-7462
DOI
https://doi.org/10.1007/s10462-020-09913-7

Other articles of this Issue 4/2021

Artificial Intelligence Review 4/2021 Go to the issue

Premium Partner