Skip to main content

2017 | OriginalPaper | Buchkapitel

CHISSL: A Human-Machine Collaboration Space for Unsupervised Learning

verfasst von : Dustin Arendt, Caner Komurlu, Leslie M. Blaha

Erschienen in: Augmented Cognition. Neurocognition and Machine Learning

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

We developed CHISSL, a human-machine interface that utilizes interactive supervision to help the user group unlabeled instances by her own mental model. The user primarily interacts via correction (moving a misplaced instance into its correct group) or confirmation (accepting that an instance is placed in its correct group). Concurrent with the user’s interactions, CHISSL trains a classification model guided by the user’s grouping of the data. It then predicts the group of unlabeled instances and arranges some of these alongside the instances manually organized by the user. We hypothesize that this mode of human and machine collaboration is more effective than Active Learning, wherein the machine decides for itself which instances should be labeled by the user. We found supporting evidence for this hypothesis in a pilot study where we applied CHISSL to organize a collection of handwritten digits.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
CHISSL stems from a concatenation of the acronyms for computer-human interaction (CHI) and semi-supervised learning (SSL).
 
2
Data available from the UCI Machine Learning Repository [19], as “Optical Recognition of Handwritten Digits Data Set”.
 
Literatur
1.
Zurück zum Zitat Amershi, S., Fogarty, J., Weld, D.: Regroup. In: Proceedings of the 2012 ACM Annual Conference on Human Factors in Computing Systems - CHI 2012, p. 21. ACM, New York (2012) Amershi, S., Fogarty, J., Weld, D.: Regroup. In: Proceedings of the 2012 ACM Annual Conference on Human Factors in Computing Systems - CHI 2012, p. 21. ACM, New York (2012)
2.
Zurück zum Zitat Basu, S., Drucker, S.M., Lu, H.: Assisting users with clustering tasks by combining metric learning and classification. In: AAAI, pp. 394–400 (2010) Basu, S., Drucker, S.M., Lu, H.: Assisting users with clustering tasks by combining metric learning and classification. In: AAAI, pp. 394–400 (2010)
3.
Zurück zum Zitat Fuchs, J., et al.: The influence of contour on similarity perception of star glyphs. IEEE Trans. Vis. Comput. Graph. 20(12), 2251–2260 (2014)CrossRef Fuchs, J., et al.: The influence of contour on similarity perception of star glyphs. IEEE Trans. Vis. Comput. Graph. 20(12), 2251–2260 (2014)CrossRef
4.
Zurück zum Zitat Bilenko, M., Basu, S., Mooney, R.J.: Integrating constraints and metric learning in semi-supervised clustering. In: Twenty-First International Conference on Machine Learning - ICML 2004, p. 11 (2004) Bilenko, M., Basu, S., Mooney, R.J.: Integrating constraints and metric learning in semi-supervised clustering. In: Twenty-First International Conference on Machine Learning - ICML 2004, p. 11 (2004)
5.
Zurück zum Zitat Borgo, R., Kehrer, J., Chung, D.H.S., Maguire, E., Laramee, R.S., Hauser, H., Ward, M., Chen, M.: Glyph-based visualization: foundations, design guidelines, techniques and applications. In: Eurographics State of the Art Reports, pp. 39–63 (2013) Borgo, R., Kehrer, J., Chung, D.H.S., Maguire, E., Laramee, R.S., Hauser, H., Ward, M., Chen, M.: Glyph-based visualization: foundations, design guidelines, techniques and applications. In: Eurographics State of the Art Reports, pp. 39–63 (2013)
6.
Zurück zum Zitat Brown, E.T., Liu, J., Brodley, C.E., Chang, R.: Dis-function: learning distance functions interactively. In: 2012 IEEE Conference on Visual Analytics Science and Technology (VAST), pp. 83–92. IEEE (2012) Brown, E.T., Liu, J., Brodley, C.E., Chang, R.: Dis-function: learning distance functions interactively. In: 2012 IEEE Conference on Visual Analytics Science and Technology (VAST), pp. 83–92. IEEE (2012)
7.
Zurück zum Zitat Chi, M.T., Glaser, R., Farr, M.J.: The Nature of Expertise. Psychology Press, Hove (2014)MATH Chi, M.T., Glaser, R., Farr, M.J.: The Nature of Expertise. Psychology Press, Hove (2014)MATH
8.
Zurück zum Zitat Choo, J., Lee, H., Kihm, J., Park, H.: iVisClassifier: an interactive visual analytics system for classification based on supervised dimension reduction. In: 2010 IEEE Symposium on Visual Analytics Science and Technology, pp. 27–34. IEEE (2010) Choo, J., Lee, H., Kihm, J., Park, H.: iVisClassifier: an interactive visual analytics system for classification based on supervised dimension reduction. In: 2010 IEEE Symposium on Visual Analytics Science and Technology, pp. 27–34. IEEE (2010)
9.
Zurück zum Zitat Chuang, J., Ramage, D., Manning, C.D., Heer, J.: Interpretation and trust: designing model-driven visualizations for text analysis. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 443–452 (2005) Chuang, J., Ramage, D., Manning, C.D., Heer, J.: Interpretation and trust: designing model-driven visualizations for text analysis. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 443–452 (2005)
10.
Zurück zum Zitat Cohn, D., Caruana, R.: Semi-supervised clustering: incorporating user feedback to improve cluster utility. In: Proceedings of the Conference on Artificial Intelligence (2000) Cohn, D., Caruana, R.: Semi-supervised clustering: incorporating user feedback to improve cluster utility. In: Proceedings of the Conference on Artificial Intelligence (2000)
11.
Zurück zum Zitat Daszykowski, M., Walczak, B.: Density-based clustering methods. In: Comprehensive Chemometrics, vol. 96, pp. 635–654. Elsevier (2009) Daszykowski, M., Walczak, B.: Density-based clustering methods. In: Comprehensive Chemometrics, vol. 96, pp. 635–654. Elsevier (2009)
12.
Zurück zum Zitat Endert, A., Fiaux, P., North, C.: Semantic interaction for sensemaking: inferring analytical reasoning for model steering. IEEE Trans. Vis. Comput. Graph. 18(12), 2879–2888 (2012)CrossRef Endert, A., Fiaux, P., North, C.: Semantic interaction for sensemaking: inferring analytical reasoning for model steering. IEEE Trans. Vis. Comput. Graph. 18(12), 2879–2888 (2012)CrossRef
13.
Zurück zum Zitat Gauthier, I., Skudlarski, P., Gore, J.C., Anderson, A.W.: Expertise for cars and birds recruits brain areas involved in face recognition. Nat. Neurosci. 3(2), 191–197 (2000)CrossRef Gauthier, I., Skudlarski, P., Gore, J.C., Anderson, A.W.: Expertise for cars and birds recruits brain areas involved in face recognition. Nat. Neurosci. 3(2), 191–197 (2000)CrossRef
14.
Zurück zum Zitat Goldstone, R.: An efficient method for obtaining similarity data. Behav. Res. Methods, Instrum. Comput. 26(4), 381–386 (1994)CrossRef Goldstone, R.: An efficient method for obtaining similarity data. Behav. Res. Methods, Instrum. Comput. 26(4), 381–386 (1994)CrossRef
15.
16.
Zurück zum Zitat Kaynak, C.: Methods of combining multiple classifiers and their applications to handwritten digit recognition. Master’s thesis, Institute of Graduate Studies in Science and Engineering, Bogazici University (1995) Kaynak, C.: Methods of combining multiple classifiers and their applications to handwritten digit recognition. Master’s thesis, Institute of Graduate Studies in Science and Engineering, Bogazici University (1995)
17.
Zurück zum Zitat Kulesza, T., Amershi, S., Caruana, R., Fisher, D., Charles, D.: Structured labeling for facilitating concept evolution in machine learning. In: Proceedings of the 32nd Annual ACM Conference on Human Factors in Computing Systems - CHI 2014, pp. 3075–3084. ACM, New York (2014) Kulesza, T., Amershi, S., Caruana, R., Fisher, D., Charles, D.: Structured labeling for facilitating concept evolution in machine learning. In: Proceedings of the 32nd Annual ACM Conference on Human Factors in Computing Systems - CHI 2014, pp. 3075–3084. ACM, New York (2014)
18.
Zurück zum Zitat Lewis, D.D., Gale, W.A.: A sequential algorithm for training text classifiers. In: Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 3–12. Springer, New York (1994) Lewis, D.D., Gale, W.A.: A sequential algorithm for training text classifiers. In: Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 3–12. Springer, New York (1994)
20.
Zurück zum Zitat Poggio, T., Cauwenberghs, G.: Incremental and decremental support vector machine learning. Adv. Neural Inf. Process. Syst. 13, 409 (2001) Poggio, T., Cauwenberghs, G.: Incremental and decremental support vector machine learning. Adv. Neural Inf. Process. Syst. 13, 409 (2001)
21.
Zurück zum Zitat Roy, N., McCallum, A.: Toward optimal active learning through sampling estimation of error reduction. In: ICML, pp. 441–448 (2001) Roy, N., McCallum, A.: Toward optimal active learning through sampling estimation of error reduction. In: ICML, pp. 441–448 (2001)
22.
Zurück zum Zitat Tanahashi, Y., Hsueh, C.-H., Ma, K.-L.: An efficient framework for generating storyline visualizations from streaming data. IEEE Trans. Vis. Comput. Graph. 21(6), 730–742 (2015)CrossRef Tanahashi, Y., Hsueh, C.-H., Ma, K.-L.: An efficient framework for generating storyline visualizations from streaming data. IEEE Trans. Vis. Comput. Graph. 21(6), 730–742 (2015)CrossRef
23.
Zurück zum Zitat Seung, H.S., Opper, M., Sompolinsky, H.: Query by committee. In: Proceedings of the Fifth Annual Workshop on Computational Learning Theory, pp. 287–294. ACM (1992) Seung, H.S., Opper, M., Sompolinsky, H.: Query by committee. In: Proceedings of the Fifth Annual Workshop on Computational Learning Theory, pp. 287–294. ACM (1992)
24.
Zurück zum Zitat Smith, J.D., Berg, M.E., Cook, R.G., Murphy, M.S., Crossley, M.J., Boomer, J., Spiering, B., Beran, M.J., Church, B.A., Ashby, F.G., et al.: Implicit and explicit categorization: a tale of four species. Neurosci. Biobehav. Rev. 36(10), 2355–2369 (2012)CrossRef Smith, J.D., Berg, M.E., Cook, R.G., Murphy, M.S., Crossley, M.J., Boomer, J., Spiering, B., Beran, M.J., Church, B.A., Ashby, F.G., et al.: Implicit and explicit categorization: a tale of four species. Neurosci. Biobehav. Rev. 36(10), 2355–2369 (2012)CrossRef
25.
Zurück zum Zitat Srivastava, A., Zou, J., Sutton, C.: Clustering with a Reject Option: Interactive Clustering as Bayesian Prior Elicitation, pp. 120–126, February 2016 Srivastava, A., Zou, J., Sutton, C.: Clustering with a Reject Option: Interactive Clustering as Bayesian Prior Elicitation, pp. 120–126, February 2016
26.
Zurück zum Zitat Tanaka, J.W., Curran, T., Sheinberg, D.L.: The training and transfer of real-world perceptual expertise. Psychol. Sci. 16(2), 145–151 (2005)CrossRef Tanaka, J.W., Curran, T., Sheinberg, D.L.: The training and transfer of real-world perceptual expertise. Psychol. Sci. 16(2), 145–151 (2005)CrossRef
27.
Zurück zum Zitat Vapnik, V.N., Vapnik, V.: Statistical Learning Theory, vol. 1. Wiley, New York (1998)MATH Vapnik, V.N., Vapnik, V.: Statistical Learning Theory, vol. 1. Wiley, New York (1998)MATH
28.
Zurück zum Zitat Yu, H.F., Huang, F.L., Lin, C.J.: Dual coordinate descent methods for logistic regression and maximum entropy models. Mach. Learn. 85(1–2), 41–75 (2011)MathSciNetCrossRefMATH Yu, H.F., Huang, F.L., Lin, C.J.: Dual coordinate descent methods for logistic regression and maximum entropy models. Mach. Learn. 85(1–2), 41–75 (2011)MathSciNetCrossRefMATH
29.
Zurück zum Zitat Zhu, X.: Semi-supervised learning literature survey. Technical report 1530, Computer Sciences, University of Wisconsin-Madison (2005) Zhu, X.: Semi-supervised learning literature survey. Technical report 1530, Computer Sciences, University of Wisconsin-Madison (2005)
30.
Zurück zum Zitat Liu, S., et al.: Storyflow: tracking the evolution of stories. IEEE Trans. Vis. Comput. Graph. 19(12), 2436–2445 (2013)CrossRef Liu, S., et al.: Storyflow: tracking the evolution of stories. IEEE Trans. Vis. Comput. Graph. 19(12), 2436–2445 (2013)CrossRef
Metadaten
Titel
CHISSL: A Human-Machine Collaboration Space for Unsupervised Learning
verfasst von
Dustin Arendt
Caner Komurlu
Leslie M. Blaha
Copyright-Jahr
2017
DOI
https://doi.org/10.1007/978-3-319-58628-1_33

Neuer Inhalt