Skip to main content
Erschienen in: International Journal of Machine Learning and Cybernetics 5/2017

21.04.2016 | Original Article

Incremental enhanced α-expansion move for large data: a probability regularization perspective

verfasst von: Anqi Bi, Shitong Wang

Erschienen in: International Journal of Machine Learning and Cybernetics | Ausgabe 5/2017

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

To deal with large data clustering tasks, an incremental version of exemplar-based clustering algorithm is proposed in this paper. The novel clustering algorithm, called Incremental Enhanced α-Expansion Move (IEEM), processes large data chunk by chunk. The work here includes two aspects. First, in terms of the maximum a posteriori principle, a unified target function is developed to unify two typical exemplar-based clustering algorithms, namely Affinity Propagation (AP) and Enhanced α-Expansion Move (EEM). Secondly, with the proposed target function, the probability based regularization term is proposed and accordingly the proposed target function is extended to make IEEM have the ability to improve clustering performance of the entire dataset by leveraging the clustering result of previous chunks. Another outstanding characteristic of IEEM is that only by modifying the definitions of several variables used in EEM, the minimization procedure of EEM and its theoretical spirit can be easily kept in IEEM, and hence no more efforts are needed to develop a new optimization algorithm for IEEM. In contrast to AP, EEM and the existing incremental clustering algorithm IMMFC, our experimental results of synthetic and real-world datasets indicate the effectiveness of IEEM.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Weitere Produktempfehlungen anzeigen
Literatur
1.
Zurück zum Zitat Du KL, Swamy M (2014) Clustering I: basic clustering models and algorithms, In: Neural networks and statistical learning, Springer London, pp 215–258 Du KL, Swamy M (2014) Clustering I: basic clustering models and algorithms, In: Neural networks and statistical learning, Springer London, pp 215–258
2.
Zurück zum Zitat Yang MS, Wu KL, JN H, JY (2008) Alpha-Cut Implemented Fuzzy Clustering Algorithms and Switching Regressions, IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, vol.38, no.3, pp 588–603 Yang MS, Wu KL, JN H, JY (2008) Alpha-Cut Implemented Fuzzy Clustering Algorithms and Switching Regressions, IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, vol.38, no.3, pp 588–603
3.
Zurück zum Zitat Tasdemir K, Merenyi E (2011) A Validity Index for Prototype-Based Clustering of Data Sets With Complex Cluster Structures. IEEE Trans Syst Man Cybern B Cybern 41(4):1039–1053CrossRef Tasdemir K, Merenyi E (2011) A Validity Index for Prototype-Based Clustering of Data Sets With Complex Cluster Structures. IEEE Trans Syst Man Cybern B Cybern 41(4):1039–1053CrossRef
4.
Zurück zum Zitat Berkhin P (2006) A survey of clustering data mining techniques. In: Kogan J, Nicholas C, Teboulle M (eds) Grouping multidimensional data. Springer, Berlin Heidelberg, pp 25–71CrossRef Berkhin P (2006) A survey of clustering data mining techniques. In: Kogan J, Nicholas C, Teboulle M (eds) Grouping multidimensional data. Springer, Berlin Heidelberg, pp 25–71CrossRef
5.
Zurück zum Zitat Li B, Wang M, Li XL, Tan SQ, Huang JW (2015) A strategy of clustering modification directions in spatial image steganography. IEEE Trans Inf Forensics Secur 10(9):1905–1917CrossRef Li B, Wang M, Li XL, Tan SQ, Huang JW (2015) A strategy of clustering modification directions in spatial image steganography. IEEE Trans Inf Forensics Secur 10(9):1905–1917CrossRef
6.
Zurück zum Zitat Wang XZ, Xing HJ, Li Y, Hua Q, Dong CR, Pedrycz W (2015) A study on relationship between generalization abilities and fuzziness of base classifiers in ensemble learning. IEEE Trans Fuzzy Syst 23(5):1638–1654CrossRef Wang XZ, Xing HJ, Li Y, Hua Q, Dong CR, Pedrycz W (2015) A study on relationship between generalization abilities and fuzziness of base classifiers in ensemble learning. IEEE Trans Fuzzy Syst 23(5):1638–1654CrossRef
7.
Zurück zum Zitat Wang XZ, Ashfaq RAR, Fu AM (2015) Fuzziness based sample categorization for classifier performance improvement. J Intell Fuzzy Syst 29(3):1185–1196MathSciNetCrossRef Wang XZ, Ashfaq RAR, Fu AM (2015) Fuzziness based sample categorization for classifier performance improvement. J Intell Fuzzy Syst 29(3):1185–1196MathSciNetCrossRef
8.
Zurück zum Zitat Wang XZ (2015) Uncertainty in learning from big data-editorial. J Intell Fuzzy Syst 28(5):2329–2330CrossRef Wang XZ (2015) Uncertainty in learning from big data-editorial. J Intell Fuzzy Syst 28(5):2329–2330CrossRef
11.
Zurück zum Zitat Murphy KP, Weiss Y (1999) “M. I. Jordan. Loopy belief propagation for approximate inference: An empirical study”, Proc. of 15th Conf. on Uncertainty in Artificial Intelligence. Morgan Kaufmann Publishers Inc., pp 467–475 Murphy KP, Weiss Y (1999) “M. I. Jordan. Loopy belief propagation for approximate inference: An empirical study”, Proc. of 15th Conf. on Uncertainty in Artificial Intelligence. Morgan Kaufmann Publishers Inc., pp 467–475
12.
Zurück zum Zitat Tappen MF, Freeman WT (2003) “Comparison of graph cuts with belief propagation for stereo, using identical MRF parameters”, In: Proc. 9th IEEE Int. Conf. Computer Vision, pp. 900–906 Tappen MF, Freeman WT (2003) “Comparison of graph cuts with belief propagation for stereo, using identical MRF parameters”, In: Proc. 9th IEEE Int. Conf. Computer Vision, pp. 900–906
13.
Zurück zum Zitat Boykov Y, Veksler O, Zabih R (2001) Fast approximate energy minimization via graph cuts, IEEE Trans. on Pattern analysis and machine intelligence. 23(11):1222–1239 Boykov Y, Veksler O, Zabih R (2001) Fast approximate energy minimization via graph cuts, IEEE Trans. on Pattern analysis and machine intelligence. 23(11):1222–1239
14.
Zurück zum Zitat Kolmogorov V, Rother C (2006) Comparison of energy minimization algorithms for highly connected graphs. Proc Eur Conf Comp Vision 3952:1–15 Kolmogorov V, Rother C (2006) Comparison of energy minimization algorithms for highly connected graphs. Proc Eur Conf Comp Vision 3952:1–15
15.
Zurück zum Zitat Kolmogorov V, Zabih R (2004) What energy functions can be minimized via graph cuts?, IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 26, no. 2, pp 147–159 Kolmogorov V, Zabih R (2004) What energy functions can be minimized via graph cuts?, IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 26, no. 2, pp 147–159
16.
Zurück zum Zitat Zheng Y, Chen P (2013) Clustering based on enhanced α-expansion move, IEEE Trans. on knowledge and data. Engineering 25(10):2206–2216 Zheng Y, Chen P (2013) Clustering based on enhanced α-expansion move, IEEE Trans. on knowledge and data. Engineering 25(10):2206–2216
18.
Zurück zum Zitat Li W (2012) Clustering with uncertainties: an affinity propagation-based approach, Neural information processing. Springer, Berlin Heidelberg, pp 437–446 Li W (2012) Clustering with uncertainties: an affinity propagation-based approach, Neural information processing. Springer, Berlin Heidelberg, pp 437–446
19.
Zurück zum Zitat Wang CD, Lai JH, Suen CY (2013) Multi-exemplar affinity propagation, IEEE Trans. on Pattern Analysis and Machine Intelligence. 35(9):2223–2237 Wang CD, Lai JH, Suen CY (2013) Multi-exemplar affinity propagation, IEEE Trans. on Pattern Analysis and Machine Intelligence. 35(9):2223–2237
20.
Zurück zum Zitat Givoni IE, Frey BJ (2009) “Semi-supervised affinity propagation with instance-level constraints”, international conference on artificial intelligence and statistics. pp 161–168 Givoni IE, Frey BJ (2009) “Semi-supervised affinity propagation with instance-level constraints”, international conference on artificial intelligence and statistics. pp 161–168
21.
Zurück zum Zitat Sun L, Guo CH (2014) Incremental affinity propagation clustering based on message passing. IEEE Trans. on knowledge and data. Engineering 26(11):2731–2744 Sun L, Guo CH (2014) Incremental affinity propagation clustering based on message passing. IEEE Trans. on knowledge and data. Engineering 26(11):2731–2744
22.
Zurück zum Zitat Ott L, Ramos F (2012) “Unsupervised incremental learning for long-term autonomy,” Proc. IEEE Int’l Conf. Robotics and Automation, pp. 4022–4029 Ott L, Ramos F (2012) “Unsupervised incremental learning for long-term autonomy,” Proc. IEEE Int’l Conf. Robotics and Automation, pp. 4022–4029
23.
Zurück zum Zitat Shi XH, Guan RC, Wang LP, Pei ZL, Liang YC (2009) An incremental affinity propagation algorithm and its applications for text clustering, Proc. Int’l Joint Conf. Neural Networks pp 2914–2919 Shi XH, Guan RC, Wang LP, Pei ZL, Liang YC (2009) An incremental affinity propagation algorithm and its applications for text clustering, Proc. Int’l Joint Conf. Neural Networks pp 2914–2919
24.
Zurück zum Zitat Yang C, Bruzzone L, Guan RC, Lu L, Liang YC (2013) Incremental and decremental affinity propagation for semisupervised clustering in multispectral images. IEEE Trans Geosci Rem Sens 51(3):1666–1679CrossRef Yang C, Bruzzone L, Guan RC, Lu L, Liang YC (2013) Incremental and decremental affinity propagation for semisupervised clustering in multispectral images. IEEE Trans Geosci Rem Sens 51(3):1666–1679CrossRef
25.
Zurück zum Zitat Huber P (1997) Massive data sets workshop: the morning after, in massive data sets. National Academy Press. pp. 169–184 Huber P (1997) Massive data sets workshop: the morning after, in massive data sets. National Academy Press. pp. 169–184
26.
Zurück zum Zitat Bagirov AM, Ugon J, Webb D (2011) Fast modified global k-means algorithm for incremental cluster construction. Pattern Recogn 44(4):866–876CrossRefMATH Bagirov AM, Ugon J, Webb D (2011) Fast modified global k-means algorithm for incremental cluster construction. Pattern Recogn 44(4):866–876CrossRefMATH
27.
Zurück zum Zitat Zhang T, Ramakrishnan R, Livny M, Birch: an efficient data clustering method for very large databases. pp 103–114 Zhang T, Ramakrishnan R, Livny M, Birch: an efficient data clustering method for very large databases. pp 103–114
28.
Zurück zum Zitat Hore P, Hall L, Goldgof D, “Single pass fuzzy c means”, in Proc. IEEE Int. Fuzzy Syst. Conf. pp 1–7 Hore P, Hall L, Goldgof D, “Single pass fuzzy c means”, in Proc. IEEE Int. Fuzzy Syst. Conf. pp 1–7
29.
Zurück zum Zitat Hore P, Hall L, Goldgof D, Cheng W (2008) “Online fuzzy c means”, in Proc. IEEE Annu Meet North Amer Fuzzy Inf Process Soc. pp 1–5 Hore P, Hall L, Goldgof D, Cheng W (2008) “Online fuzzy c means”, in Proc. IEEE Annu Meet North Amer Fuzzy Inf Process Soc. pp 1–5
31.
Zurück zum Zitat Ma Z, Yang Y, Nie F (2015) Multitask spectral clustering by exploring intertask correlation. IEEE Trans Cybernetics 45(5):1069–1080 Ma Z, Yang Y, Nie F (2015) Multitask spectral clustering by exploring intertask correlation. IEEE Trans Cybernetics 45(5):1069–1080
34.
Zurück zum Zitat Cao F, Ester M, Qian W, “Density-based clustering over an evolving data stream with noise”, in Proc. SIAM Conf. Data Mining. pp 328–339 Cao F, Ester M, Qian W, “Density-based clustering over an evolving data stream with noise”, in Proc. SIAM Conf. Data Mining. pp 328–339
37.
Zurück zum Zitat Wang YT, Chen LH, Mei JP (2014) Incremental fuzzy clustering with multiple medoids for large data. IEEE Trans Fuzzy Syst 22(6):1557–1568CrossRef Wang YT, Chen LH, Mei JP (2014) Incremental fuzzy clustering with multiple medoids for large data. IEEE Trans Fuzzy Syst 22(6):1557–1568CrossRef
38.
Zurück zum Zitat Havens T, Bezdek J, Leckie C, Hall L, Palaniswami M (2012) Fuzzy c-means algorithms for very large data. IEEE Trans Fuzzy Syst 20(6):1130–1146CrossRef Havens T, Bezdek J, Leckie C, Hall L, Palaniswami M (2012) Fuzzy c-means algorithms for very large data. IEEE Trans Fuzzy Syst 20(6):1130–1146CrossRef
39.
Zurück zum Zitat Papadimitriou CH, Steiglitz K (1998) Combinatorial optimization: algorithms and complexity[M]. Dover Publications Papadimitriou CH, Steiglitz K (1998) Combinatorial optimization: algorithms and complexity[M]. Dover Publications
40.
Zurück zum Zitat Jiang YZ, Chung FL, Wang ST (2014) Enhanced fuzzy partitions vs data randomness in FCM. J Intell Fuzzy Syst 27(4):1639–1648MathSciNetMATH Jiang YZ, Chung FL, Wang ST (2014) Enhanced fuzzy partitions vs data randomness in FCM. J Intell Fuzzy Syst 27(4):1639–1648MathSciNetMATH
Metadaten
Titel
Incremental enhanced α-expansion move for large data: a probability regularization perspective
verfasst von
Anqi Bi
Shitong Wang
Publikationsdatum
21.04.2016
Verlag
Springer Berlin Heidelberg
Erschienen in
International Journal of Machine Learning and Cybernetics / Ausgabe 5/2017
Print ISSN: 1868-8071
Elektronische ISSN: 1868-808X
DOI
https://doi.org/10.1007/s13042-016-0532-0

Weitere Artikel der Ausgabe 5/2017

International Journal of Machine Learning and Cybernetics 5/2017 Zur Ausgabe

Neuer Inhalt