Skip to main content
Erschienen in:
Buchtitelbild

2016 | OriginalPaper | Buchkapitel

Adaptation Approaches in Unsupervised Learning: A Survey of the State-of-the-Art and Future Directions

verfasst von : JunHong Wang, YunQian Miao, Alaa Khamis, Fakhri Karray, Jiye Liang

Erschienen in: Image Analysis and Recognition

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In real applications, data continuously evolve over time and change from one setting to another. This inspires the development of adaptive learning algorithms to deal with this data dynamics. Adaptation mechanisms for unsupervised learning have received an increasing amount of attention from researchers. This research activity has produced a lot of results in tackling some of the challenging problems of the adaptation process that are still open. This paper is a brief review of adaptation mechanisms in unsupervised learning focusing on approaches recently reported in the literature for adaptive clustering and novelty detection and discussing some future directions. Although these approaches have able to cope with different levels of data non-stationarity, there is a crucial need to extend these approaches to be able to handle large amount of data in distributed resource-limited environments.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Chan, A., Gouvea, E., Singh, R., Mosur, R., Rosenfield, R., Sun, Y., Huggins-Daines, D.: (First Draft) Hieroglpyhs : Building speech applications using sphinx and related resources (2004) Chan, A., Gouvea, E., Singh, R., Mosur, R., Rosenfield, R., Sun, Y., Huggins-Daines, D.: (First Draft) Hieroglpyhs : Building speech applications using sphinx and related resources (2004)
2.
Zurück zum Zitat Lee, C.H., Lin, C.H., Juang, B.H.: A study on speaker adaptation of the parameters of continuous density hidden Markov models. IEEE Trans. Sig. Process. 39(4), 806–814 (1991)CrossRef Lee, C.H., Lin, C.H., Juang, B.H.: A study on speaker adaptation of the parameters of continuous density hidden Markov models. IEEE Trans. Sig. Process. 39(4), 806–814 (1991)CrossRef
3.
Zurück zum Zitat Leggetter, C.J., Woodland, P.C.: Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models. Comput. Speech Lang. 9(2), 171–185 (1995)CrossRef Leggetter, C.J., Woodland, P.C.: Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models. Comput. Speech Lang. 9(2), 171–185 (1995)CrossRef
4.
Zurück zum Zitat Miao, Y.Q.: Adaptive learning algorithms for non-stationary data, Ph.D. thesis, University of Waterloo (2015) Miao, Y.Q.: Adaptive learning algorithms for non-stationary data, Ph.D. thesis, University of Waterloo (2015)
5.
Zurück zum Zitat Miao, Y.Q., Farahat, A.K., Kamel, M.S.: Detecting emerging and evolving novelties with locally adaptive density ratio estimation. Knowl. Inf. Syst., 1–29 (2016) Miao, Y.Q., Farahat, A.K., Kamel, M.S.: Detecting emerging and evolving novelties with locally adaptive density ratio estimation. Knowl. Inf. Syst., 1–29 (2016)
6.
Zurück zum Zitat Farahat, A.K., Kamel, M.S.: Statistical semantics for enhancing document clustering. Knowl. Inf. Syst. 28(2), 365–393 (2011)CrossRef Farahat, A.K., Kamel, M.S.: Statistical semantics for enhancing document clustering. Knowl. Inf. Syst. 28(2), 365–393 (2011)CrossRef
7.
Zurück zum Zitat Pimentel, M.A.F., Clifton, D.A., Clifton, L., Tarassenko, L.: A review of novelty detection. Sig. Process. 99, 215–249 (2014)CrossRef Pimentel, M.A.F., Clifton, D.A., Clifton, L., Tarassenko, L.: A review of novelty detection. Sig. Process. 99, 215–249 (2014)CrossRef
8.
Zurück zum Zitat Chandola, V., Banerjee, A., Kumar, V.: Anomaly detection: a survey. ACM Comput. Surv. (CSUR) 41(3), 1–69 (2009)CrossRef Chandola, V., Banerjee, A., Kumar, V.: Anomaly detection: a survey. ACM Comput. Surv. (CSUR) 41(3), 1–69 (2009)CrossRef
9.
Zurück zum Zitat Sayed-Mouchaweh, M., Lughofer, E.: Learning in Non-Stationary Environments: Methods and Applications. Springer, New York (2012)CrossRefMATH Sayed-Mouchaweh, M., Lughofer, E.: Learning in Non-Stationary Environments: Methods and Applications. Springer, New York (2012)CrossRefMATH
10.
Zurück zum Zitat Sugiyama, M., Kawanabe, M.: Machine Learning in Non-Stationary Environments: Introduction to Covariate Shift Adaptation (2012) Sugiyama, M., Kawanabe, M.: Machine Learning in Non-Stationary Environments: Introduction to Covariate Shift Adaptation (2012)
11.
Zurück zum Zitat Chi, Y., Song, X., Zhou, D., Hino, K., Tseng, B.L.: Evolutionary spectral clustering by incorporating temporal smoothness. In: Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 153–162 (2007) Chi, Y., Song, X., Zhou, D., Hino, K., Tseng, B.L.: Evolutionary spectral clustering by incorporating temporal smoothness. In: Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 153–162 (2007)
12.
Zurück zum Zitat Langone, R.: Clustering evolving data using kernel-based methods. Ph.D. thesis, KU Leuven (2014) Langone, R.: Clustering evolving data using kernel-based methods. Ph.D. thesis, KU Leuven (2014)
13.
Zurück zum Zitat Langone, R., Alzate, C., Suykens, J.A.K.: Kernel spectral clustering with memory efect. Phy. A Stat. Mech. Appl. 392(10), 2588–2606 (2013)MathSciNetCrossRef Langone, R., Alzate, C., Suykens, J.A.K.: Kernel spectral clustering with memory efect. Phy. A Stat. Mech. Appl. 392(10), 2588–2606 (2013)MathSciNetCrossRef
14.
Zurück zum Zitat Alzate, C., Suykens, J.A.K.: Multiway spectral clustering with out-of-sample extensions through weighted kernel PCA. IEEE Trans. Pattern Anal. Mach. Intell. 32(2), 335–347 (2010)CrossRef Alzate, C., Suykens, J.A.K.: Multiway spectral clustering with out-of-sample extensions through weighted kernel PCA. IEEE Trans. Pattern Anal. Mach. Intell. 32(2), 335–347 (2010)CrossRef
16.
Zurück zum Zitat Wang, L., Leckie, C., Kotagiri, R., Bezdek, J.: Approximate pairwise clustering for large data sets via sampling plus extension. Pattern Recogn. 44(2), 222–235 (2011)CrossRef Wang, L., Leckie, C., Kotagiri, R., Bezdek, J.: Approximate pairwise clustering for large data sets via sampling plus extension. Pattern Recogn. 44(2), 222–235 (2011)CrossRef
17.
Zurück zum Zitat Cottrell, M., Fort, J.C., Pags, G.: Theoretical aspects of the SOM algorithm. Neurocomputing 21(1), 119–138 (1998)CrossRefMATH Cottrell, M., Fort, J.C., Pags, G.: Theoretical aspects of the SOM algorithm. Neurocomputing 21(1), 119–138 (1998)CrossRefMATH
18.
Zurück zum Zitat de Bodt, E., Verleysen, M., Cottrell, M.: Kohonen maps versus vector quantization for data analysis. In: ESANN, pp. 211–218 (1997) de Bodt, E., Verleysen, M., Cottrell, M.: Kohonen maps versus vector quantization for data analysis. In: ESANN, pp. 211–218 (1997)
19.
Zurück zum Zitat Barreto, G.A.: Time series prediction with the self-organizing map: A review. In: Hitzler, P., Hammer, B. (eds.) Perspectives on Neural-Symbolic Integration, pp. 135–158. Springer, Berlin (2007)CrossRef Barreto, G.A.: Time series prediction with the self-organizing map: A review. In: Hitzler, P., Hammer, B. (eds.) Perspectives on Neural-Symbolic Integration, pp. 135–158. Springer, Berlin (2007)CrossRef
21.
Zurück zum Zitat Sarlin, P.: Self-organizing time map: an abstraction of temporal multivariate patterns. Neurocomputing 99, 496–508 (2013)CrossRef Sarlin, P.: Self-organizing time map: an abstraction of temporal multivariate patterns. Neurocomputing 99, 496–508 (2013)CrossRef
22.
Zurück zum Zitat Chakraborty, S., Nagwani, N.K.: Analysis and Study of Incremental K-Means Clustering Algorithm. In: Mantri, A., Nandi, S., Kumar, G., Kumar, S. (eds.) HPAGC 2011. CCIS, vol. 169, pp. 338–341. Springer, Heidelberg (2011)CrossRef Chakraborty, S., Nagwani, N.K.: Analysis and Study of Incremental K-Means Clustering Algorithm. In: Mantri, A., Nandi, S., Kumar, G., Kumar, S. (eds.) HPAGC 2011. CCIS, vol. 169, pp. 338–341. Springer, Heidelberg (2011)CrossRef
23.
Zurück zum Zitat Langone, R., Agudelo, O.M., De Moor, B., et al.: Incremental kernel spectral clustering for online learning of non-stationary data. Neurocomputing 139, 246–260 (2014)CrossRef Langone, R., Agudelo, O.M., De Moor, B., et al.: Incremental kernel spectral clustering for online learning of non-stationary data. Neurocomputing 139, 246–260 (2014)CrossRef
24.
Zurück zum Zitat Valgren, C., Duckett, T., Lilienthal, A.: Incremental spectral clustering and its application to topological mapping. In: Proceedings of the IEEE International Conference on Robotics and Automation, pp. 4283–4288 (2007) Valgren, C., Duckett, T., Lilienthal, A.: Incremental spectral clustering and its application to topological mapping. In: Proceedings of the IEEE International Conference on Robotics and Automation, pp. 4283–4288 (2007)
25.
Zurück zum Zitat Valgren, C., Lilienthal, A.: Incremental spectral clustering and seasons: Appearance-based localization in outdoor environments. In: Robotics and Automation, ICRA, pp. 1856–1861 (2008) Valgren, C., Lilienthal, A.: Incremental spectral clustering and seasons: Appearance-based localization in outdoor environments. In: Robotics and Automation, ICRA, pp. 1856–1861 (2008)
26.
Zurück zum Zitat Cao, F., Ester, M., Qian, W., Zhou, A.: Density-based clustering over an evolving data stream with noise. In: Proceedings of the Sixth SIAM International Conference on Data Mining (SDM), pp. 328–339 (2006) Cao, F., Ester, M., Qian, W., Zhou, A.: Density-based clustering over an evolving data stream with noise. In: Proceedings of the Sixth SIAM International Conference on Data Mining (SDM), pp. 328–339 (2006)
27.
Zurück zum Zitat Chen, H.L., Chen, M.S., Lin, S.C.: Catching the trend: a framework for clustering concept-drifting categorical data. IEEE Trans. Knowl. Data Eng. 21(5), 652–665 (2009)CrossRef Chen, H.L., Chen, M.S., Lin, S.C.: Catching the trend: a framework for clustering concept-drifting categorical data. IEEE Trans. Knowl. Data Eng. 21(5), 652–665 (2009)CrossRef
28.
Zurück zum Zitat Cao, F.Y., Huang, Z.X., Liang, J.Y.: Trend analysis of categorical data streams with a concept change method. Inform. Sci. 276, 160–173 (2007)CrossRef Cao, F.Y., Huang, Z.X., Liang, J.Y.: Trend analysis of categorical data streams with a concept change method. Inform. Sci. 276, 160–173 (2007)CrossRef
29.
Zurück zum Zitat Nasraoui, O., Soliman, M., Saka, E., Badia, A., Germain, R.: A web usage mining framework for mining evolving user profiles in dynamic web sites. Trans. Knowl. Data Eng. 20(2), 202–215 (2008)CrossRef Nasraoui, O., Soliman, M., Saka, E., Badia, A., Germain, R.: A web usage mining framework for mining evolving user profiles in dynamic web sites. Trans. Knowl. Data Eng. 20(2), 202–215 (2008)CrossRef
30.
Zurück zum Zitat Shental, N., Bar-Hillel, A., Hertz, T., et al.: Computing Gaussian mixture models with EM using equivalence constraints. Adv. Neural Inform. Process. Syst. (NIPS) 16(8), 465–472 (2004) Shental, N., Bar-Hillel, A., Hertz, T., et al.: Computing Gaussian mixture models with EM using equivalence constraints. Adv. Neural Inform. Process. Syst. (NIPS) 16(8), 465–472 (2004)
31.
Zurück zum Zitat Park, C., Huang, J.Z., Ding, Y.: A computable plug-in estimator of minimum volume sets for novelty detection. Oper. Res. 58(5), 1469–1480 (2009)MathSciNetCrossRefMATH Park, C., Huang, J.Z., Ding, Y.: A computable plug-in estimator of minimum volume sets for novelty detection. Oper. Res. 58(5), 1469–1480 (2009)MathSciNetCrossRefMATH
32.
Zurück zum Zitat Breaban, M., Luchian, H.: Outlier detection with nonlinear projection pursuit. Int. J. Comput. Commun. Control 8(1), 30–36 (2013)MathSciNetCrossRef Breaban, M., Luchian, H.: Outlier detection with nonlinear projection pursuit. Int. J. Comput. Commun. Control 8(1), 30–36 (2013)MathSciNetCrossRef
33.
Zurück zum Zitat Zhang, K., Hutter, M., Jin, H.: A new local distance-based outlier detection approach for scattered real-word data. Adv. Knowl. Discov. Data Min. 5476, 813–822 (2009) Zhang, K., Hutter, M., Jin, H.: A new local distance-based outlier detection approach for scattered real-word data. Adv. Knowl. Discov. Data Min. 5476, 813–822 (2009)
34.
Zurück zum Zitat Cabral, G.G., Oliveira, A.L.I., Cahu, C.B.G.: Combining nearest neighbor data description and structural risk minimization for one-class classification. Neural Comput. Appl. 18(2), 175–183 (2009)CrossRef Cabral, G.G., Oliveira, A.L.I., Cahu, C.B.G.: Combining nearest neighbor data description and structural risk minimization for one-class classification. Neural Comput. Appl. 18(2), 175–183 (2009)CrossRef
35.
Zurück zum Zitat Bay, S.D., Schwabacher, M.: Mining distance-based outlier sinnear linear time with randomization, a simplep runing rule. In: Proceedings of the 9th ACM International Conference on Knowl edge Discovery and Data Mining (SIGKDD), pp. 29–38. ACM (2003) Bay, S.D., Schwabacher, M.: Mining distance-based outlier sinnear linear time with randomization, a simplep runing rule. In: Proceedings of the 9th ACM International Conference on Knowl edge Discovery and Data Mining (SIGKDD), pp. 29–38. ACM (2003)
36.
Zurück zum Zitat Miao, Y.Q., Farahat, A.K., Kamel, M.S.: Locally adaptive density ratio for detecting novelty in twitter streams. In: The 6th International Workshop on Modeling Social Media (MSM) - Behavioral Analytics in Social Media, Big Data and the Web, pp. 799–804 (2015) Miao, Y.Q., Farahat, A.K., Kamel, M.S.: Locally adaptive density ratio for detecting novelty in twitter streams. In: The 6th International Workshop on Modeling Social Media (MSM) - Behavioral Analytics in Social Media, Big Data and the Web, pp. 799–804 (2015)
37.
Zurück zum Zitat Song, L., Teo, C.H., Smola, A.J.: Relative novelty detection. In: International Conference on Artificial Intelligence and Statistics (AISTATS), pp. 536–543 (2009) Song, L., Teo, C.H., Smola, A.J.: Relative novelty detection. In: International Conference on Artificial Intelligence and Statistics (AISTATS), pp. 536–543 (2009)
38.
Zurück zum Zitat Scholkopf, B., Williamson, R.C., Smola, A.J., Shawe-Taylor, J., Platt, J.: Support vector method for novelty detection. Adv. Neural Inform. Process. Syst.(NIPS) 12, 582–588 (1999) Scholkopf, B., Williamson, R.C., Smola, A.J., Shawe-Taylor, J., Platt, J.: Support vector method for novelty detection. Adv. Neural Inform. Process. Syst.(NIPS) 12, 582–588 (1999)
39.
Zurück zum Zitat Hattori, Y., Inoue, S., Masaki, T., Hirakawa, G., Sudo, O.: Gathering large scale human activity information using mobile sensor devices. In: Proceedings of the Second International Workshop on Network Traffic Control, Analysis and Applications, pp. 708–713 (2010) Hattori, Y., Inoue, S., Masaki, T., Hirakawa, G., Sudo, O.: Gathering large scale human activity information using mobile sensor devices. In: Proceedings of the Second International Workshop on Network Traffic Control, Analysis and Applications, pp. 708–713 (2010)
40.
Zurück zum Zitat Sun, S., Shi, H., Wu, Y.: A survey of multi-source domain adaptation. Inform. Fusion 24, 84–92 (2015)CrossRef Sun, S., Shi, H., Wu, Y.: A survey of multi-source domain adaptation. Inform. Fusion 24, 84–92 (2015)CrossRef
Metadaten
Titel
Adaptation Approaches in Unsupervised Learning: A Survey of the State-of-the-Art and Future Directions
verfasst von
JunHong Wang
YunQian Miao
Alaa Khamis
Fakhri Karray
Jiye Liang
Copyright-Jahr
2016
DOI
https://doi.org/10.1007/978-3-319-41501-7_1