Skip to main content
Erschienen in: Artificial Intelligence Review 10/2023

24.03.2023

Cauchy balanced nonnegative matrix factorization

verfasst von: He Xiong, Deguang Kong, Feiping Nie

Erschienen in: Artificial Intelligence Review | Ausgabe 10/2023

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Nonnegative Matrix Factorization (NMF) plays an important role in many data mining and machine learning tasks. Standard NMF uses the Frobenius norm as the loss function which is well-known to be sensitive to noise. To address this issue, we propose a robust formulation of NMF, i.e., Cauchy-NMF, which is derived based on the assumption that the noise generally follows identical independent distributed (i.i.d.) Cauchy distribution. In particular, we derive the Cauchy Balanced NMF model (Cauchy-B-NMF) using Cauchy distribution, where (a) the numerical value of each element in the coefficient matrix is viewed as the posterior probability, which allows the clustering result to be obtained directly from the coefficient matrix without any additional post-processing; (b) a novel manifold regularization term is incorporated into the loss function, explicitly making the distant data points have dissimilar embeddings, while implicitly making the neighbouring data points have similar embeddings; (c) a balanced clustering term is enforced to achieve the desired equal number of data points across different clusters. We derive an efficient computational algorithm to solve the resultant optimization problem, and also provide a rigorous analysis of the algorithm convergence. Experimental results on several benchmarks demonstrate the effectiveness of our algorithms, which consistently provides better clustering results compared to many other NMF variants.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Fußnoten
1
It is also interesting to see the difference between Eqs. (19) and (20) is that the minuend is always non-negative in Eq. (19) although the absolute values are the same.
 
2
In the paper, let LHS be the left-hand-side of an equation, and RHS be the right hand-side of an equation.Biography The resumes of the three authors are listed as follows. He Xiong received the masters degree from the University of Science and Technology of China, in 2013. He is currently a lecture in the School of Computer and Information Engineering at BengBu University. His research interests include machine learning, data mining and computer vision. Deguang Kong received his Ph.D degree in Computer Science from University of Texas in 2013. He  currently works at Amazon, and ever worked in Google, Yahoo Research (Sunnyvale), Los Alamos national Lab, NEC research lab, Penn State University and Samsung Research America as a researcher. His research interests focus on feature learning and compressive sensing, user engagement understanding and recommendation,etc. He has published over 30 referred articles in top conferences, including ICML, NIPS, AAAI, CVPR, KDD, ICDM, SDM, WSDM, CIKM, ECML/PKDD, etc. He has served as a program committee member in NIPS, AAAI, IJCAI, KDD, SDM and a reviewer for TPAMI, TKDE, DMKD, TIFS, TNNLS, TDSC, etc. Feiping Nie received the Ph.D. degree in computer science from Tsinghua University, Beijing, China, in 2009. He is currently a professor with the Center for OPTical Imagery Analysis and Learning, Northwestern Polytechnical University, Xian, Shaanxi, China. He has published over 100 articles in the prestigious journals and conferences. His current research interests include machine learning and its applications fields, such as pattern recognition, data mining, computer vision, image processing, and information retrieval. Dr. Nie currently serves as an associate editor or a program committee member for several prestigious journals and conferences in the related fields.
 
Literatur
Zurück zum Zitat Althoff T, Ulges A, Dengel A (2011) Balanced clustering for content-based image browsing. In: Informatiktage 2011-Fachwissenschaftlicher Informatik-Kongress, 25. und 26. März 2011, B-IT Bonn-Aachen International Center for Information Technology in Bonn, Vol. S-10 of LNI, GI, pp. 27–30 Althoff T, Ulges A, Dengel A (2011) Balanced clustering for content-based image browsing. In: Informatiktage 2011-Fachwissenschaftlicher Informatik-Kongress, 25. und 26. März 2011, B-IT Bonn-Aachen International Center for Information Technology in Bonn, Vol. S-10 of LNI, GI, pp. 27–30
Zurück zum Zitat Belhumeur PN, Hespanha JP, Kriegman DJ (1997) Eigenfaces vs. fisherfaces: recognition using class specific linear projection. IEEE Trans Pattern Anal Mach Intell 19(7):711–720CrossRef Belhumeur PN, Hespanha JP, Kriegman DJ (1997) Eigenfaces vs. fisherfaces: recognition using class specific linear projection. IEEE Trans Pattern Anal Mach Intell 19(7):711–720CrossRef
Zurück zum Zitat Bojchevski A, Matkovic Y, ünnemann SG (2017) Robust spectral clustering for noisy data: modeling sparse corruptions improves latent embeddings, in: Proceedings of the 23rd ACM SIGKDD international conference on knowledge discovery and data mining, Halifax, August 13-17, pp 737–746. https://doi.org/10.1145/3097983.3098156 Bojchevski A, Matkovic Y, ünnemann SG (2017) Robust spectral clustering for noisy data: modeling sparse corruptions improves latent embeddings, in: Proceedings of the 23rd ACM SIGKDD international conference on knowledge discovery and data mining, Halifax, August 13-17, pp 737–746. https://​doi.​org/​10.​1145/​3097983.​3098156
Zurück zum Zitat Cao X, Chen Y, Zhao Q, Meng D, Wang Y, Wang D, Xu Z (2015) Low-rank matrix factorization under general mixture noise distributions. In: 2015 IEEE international conference on computer vision, ICCV 2015, Santiago, Chile, December 7–13, pp 1493–1501. https://doi.org/10.1109/ICCV.2015.175 Cao X, Chen Y, Zhao Q, Meng D, Wang Y, Wang D, Xu Z (2015) Low-rank matrix factorization under general mixture noise distributions. In: 2015 IEEE international conference on computer vision, ICCV 2015, Santiago, Chile, December 7–13, pp 1493–1501. https://​doi.​org/​10.​1109/​ICCV.​2015.​175
Zurück zum Zitat Gao H, Nie F, Cai W, Huang H (2015) Robust capped norm nonnegative matrix factorization: capped norm NMF. In: Proceedings of the 24th ACM international on conference on information and knowledge management, CIKM ’15, ACM, New York, pp 871–880. https://doi.org/10.1145/2806416.2806568 Gao H, Nie F, Cai W, Huang H (2015) Robust capped norm nonnegative matrix factorization: capped norm NMF. In: Proceedings of the 24th ACM international on conference on information and knowledge management, CIKM ’15, ACM, New York, pp 871–880. https://​doi.​org/​10.​1145/​2806416.​2806568
Zurück zum Zitat Graham DB, Allinson NM (1998) Characterising virtual eigensignatures for general purpose face recognition. In: Face recognition, Springer, pp 446–456 Graham DB, Allinson NM (1998) Characterising virtual eigensignatures for general purpose face recognition. In: Face recognition, Springer, pp 446–456
Zurück zum Zitat Guo X, Pan B, Cai D, He X (2017) Robust asymmetric bayesian adaptive matrix factorization. In: Proceedings of the twenty-sixth international joint conference on artificial intelligence, IJCAI 2017, Melbourne, August 19–25, pp 1760–1766. https://doi.org/10.24963/ijcai.2017/244 Guo X, Pan B, Cai D, He X (2017) Robust asymmetric bayesian adaptive matrix factorization. In: Proceedings of the twenty-sixth international joint conference on artificial intelligence, IJCAI 2017, Melbourne, August 19–25, pp 1760–1766. https://​doi.​org/​10.​24963/​ijcai.​2017/​244
Zurück zum Zitat Hadsell R, Chopra S, LeCun Y (2006) Dimensionality reduction by learning an invariant mapping. In: Proceedings of the 2006 IEEE computer society conference on computer vision and pattern recognition-volume 2, CVPR ’06, IEEE Computer Society, Washington, pp 1735–1742. https://doi.org/10.1109/CVPR.2006.100 Hadsell R, Chopra S, LeCun Y (2006) Dimensionality reduction by learning an invariant mapping. In: Proceedings of the 2006 IEEE computer society conference on computer vision and pattern recognition-volume 2, CVPR ’06, IEEE Computer Society, Washington, pp 1735–1742. https://​doi.​org/​10.​1109/​CVPR.​2006.​100
Zurück zum Zitat Kannan R, Ballard G, Park H (2016) A high-performance parallel algorithm for nonnegative matrix factorization. In: Asenjo R, Harris T(eds), Proceedings of the 21st ACM SIGPLAN symposium on principles and practice of parallel programming, PPoPP 2016, Barcelona, March 12–16, ACM, pp 9:1–9:11. https://doi.org/10.1145/2851141.2851152 Kannan R, Ballard G, Park H (2016) A high-performance parallel algorithm for nonnegative matrix factorization. In: Asenjo R, Harris T(eds), Proceedings of the 21st ACM SIGPLAN symposium on principles and practice of parallel programming, PPoPP 2016, Barcelona, March 12–16, ACM, pp 9:1–9:11. https://​doi.​org/​10.​1145/​2851141.​2851152
Zurück zum Zitat LeCun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324CrossRef LeCun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324CrossRef
Zurück zum Zitat Lian D, Liu R, Ge Y, Zheng K, Xie X, Cao L (2017) Discrete content-aware matrix factorization. In: Proceedings of the 23rd ACM SIGKDD international conference on knowledge discovery and data mining, KDD ’17, ACM, New York, pp 325–334. https://doi.org/10.1145/3097983.3098008 Lian D, Liu R, Ge Y, Zheng K, Xie X, Cao L (2017) Discrete content-aware matrix factorization. In: Proceedings of the 23rd ACM SIGKDD international conference on knowledge discovery and data mining, KDD ’17, ACM, New York, pp 325–334. https://​doi.​org/​10.​1145/​3097983.​3098008
Zurück zum Zitat Liao Q, Guan N, Zhang Q (2015) Logdet divergence based sparse non-negative matrix factorization for stable representation. In: 2015 IEEE international conference on data mining, ICDM 2015, Atlantic City, November 14–17, pp 871–876. https://doi.org/10.1109/ICDM.2015.52 Liao Q, Guan N, Zhang Q (2015) Logdet divergence based sparse non-negative matrix factorization for stable representation. In: 2015 IEEE international conference on data mining, ICDM 2015, Atlantic City, November 14–17, pp 871–876. https://​doi.​org/​10.​1109/​ICDM.​2015.​52
Zurück zum Zitat Lin W, He Z, Xiao M (2019) Balanced clustering: a uniform model and fast algorithm. In: Kraus S (ed), Proceedings of the twenty-eighth international joint conference on artificial intelligence, IJCAI 2019, Macao, August 10–16, ijcai.org, 2019, pp 2987–2993. https://doi.org/10.24963/ijcai.2019/414 Lin W, He Z, Xiao M (2019) Balanced clustering: a uniform model and fast algorithm. In: Kraus S (ed), Proceedings of the twenty-eighth international joint conference on artificial intelligence, IJCAI 2019, Macao, August 10–16, ijcai.org, 2019, pp 2987–2993. https://​doi.​org/​10.​24963/​ijcai.​2019/​414
Zurück zum Zitat Li Z, Nie F, Chang X, Ma Z, Yang Y (2018) Balanced clustering via exclusive lasso: a pragmatic approach. In: McIlraith SA, Weinberger KQ (eds), Proceedings of the thirty-second AAAI conference on artificial intelligence, (AAAI-18), the 30th innovative applications of artificial intelligence (IAAI-18), and the 8th AAAI symposium on educational advances in artificial intelligence (EAAI-18), New Orleans, February 2–7, 2018, AAAI Press, pp 3596–3603. https://www.aaai.org/ocs/index.php/AAAI/AAAI18/paper/view/16711 Li Z, Nie F, Chang X, Ma Z, Yang Y (2018) Balanced clustering via exclusive lasso: a pragmatic approach. In: McIlraith SA, Weinberger KQ (eds), Proceedings of the thirty-second AAAI conference on artificial intelligence, (AAAI-18), the 30th innovative applications of artificial intelligence (IAAI-18), and the 8th AAAI symposium on educational advances in artificial intelligence (EAAI-18), New Orleans, February 2–7, 2018, AAAI Press, pp 3596–3603. https://​www.​aaai.​org/​ocs/​index.​php/​AAAI/​AAAI18/​paper/​view/​16711
Zurück zum Zitat Liutkus A, Fitzgerald D, Badeau R (2015) Cauchy nonnegative matrix factorization. In: 2015 IEEE workshop on applications of signal processing to audio and acoustics (WASPAA) pp 1–5 Liutkus A, Fitzgerald D, Badeau R (2015) Cauchy nonnegative matrix factorization. In: 2015 IEEE workshop on applications of signal processing to audio and acoustics (WASPAA) pp 1–5
Zurück zum Zitat Luo L, Zhang Y, Huang H (2020) Adversarial nonnegative matrix factorization. In: Proceedings of the 37th international conference on machine learning, ICML 2020, 13–18 July, Virtual Event, Vol. 119 of Proceedings of machine learning research, PMLR, 2020, pp 6479–6488. http://proceedings.mlr.press/v119/luo20c.html Luo L, Zhang Y, Huang H (2020) Adversarial nonnegative matrix factorization. In: Proceedings of the 37th international conference on machine learning, ICML 2020, 13–18 July, Virtual Event, Vol. 119 of Proceedings of machine learning research, PMLR, 2020, pp 6479–6488. http://​proceedings.​mlr.​press/​v119/​luo20c.​html
Zurück zum Zitat Malinen MI, Fränti P (2014) Balanced k-means for clustering. In: Fränti P, Brown G, Loog M, Escolano F, Pelillo M (eds), Structural, syntactic, and statistical pattern recognition-joint IAPR international workshop, S+SSPR 2014, Joensuu, August 20–22, 2014. Proceedings, Vol. 8621 of Lecture Notes in Computer Science, Springer, 2014, pp 32–41. https://doi.org/10.1007/978-3-662-44415-3_4 Malinen MI, Fränti P (2014) Balanced k-means for clustering. In: Fränti P, Brown G, Loog M, Escolano F, Pelillo M (eds), Structural, syntactic, and statistical pattern recognition-joint IAPR international workshop, S+SSPR 2014, Joensuu, August 20–22, 2014. Proceedings, Vol. 8621 of Lecture Notes in Computer Science, Springer, 2014, pp 32–41. https://​doi.​org/​10.​1007/​978-3-662-44415-3_​4
Zurück zum Zitat Mitra A, Vijayan P, Parthasarathy S, Ravindran B (2020) A unified non-negative matrix factorization framework for semi supervised learning on graphs. In: Demeniconi C, Chawla NV (eds), Proceedings of the 2020 SIAM international conference on data mining, SDM 2020, Cincinnati, May 7–9, SIAM, pp 487–495. https://doi.org/10.1137/1.9781611976236.55 Mitra A, Vijayan P, Parthasarathy S, Ravindran B (2020) A unified non-negative matrix factorization framework for semi supervised learning on graphs. In: Demeniconi C, Chawla NV (eds), Proceedings of the 2020 SIAM international conference on data mining, SDM 2020, Cincinnati, May 7–9, SIAM, pp 487–495. https://​doi.​org/​10.​1137/​1.​9781611976236.​55
Zurück zum Zitat Moon GE, Ellis JA, Sukumaran-Rajam A, Parthasarathy S, Sadayappan P (2020) ALO-NMF: accelerated locality-optimized non-negative matrix factorization. In: Gupta R, Liu Y, Tang J, Prakash BA (eds), KDD ’20: the 26th ACM SIGKDD conference on knowledge discovery and data mining, virtual event, August 23–27, 2020, ACM, 2020, pp 1758–1767. https://doi.org/10.1145/3394486.3403227 Moon GE, Ellis JA, Sukumaran-Rajam A, Parthasarathy S, Sadayappan P (2020) ALO-NMF: accelerated locality-optimized non-negative matrix factorization. In: Gupta R, Liu Y, Tang J, Prakash BA (eds), KDD ’20: the 26th ACM SIGKDD conference on knowledge discovery and data mining, virtual event, August 23–27, 2020, ACM, 2020, pp 1758–1767. https://​doi.​org/​10.​1145/​3394486.​3403227
Zurück zum Zitat Nene SA, Nayar SK, Murase H et al. Columbia object image library (coil-20) Nene SA, Nayar SK, Murase H et al. Columbia object image library (coil-20)
Zurück zum Zitat Rodrguez-Domnguez U, Dalmau O (2022) Symmetric nonnegative matrix factorization with elastic-net regularized block-wise weighted representation for clustering. Pattern Anal Appl pp 1–11 Rodrguez-Domnguez U, Dalmau O (2022) Symmetric nonnegative matrix factorization with elastic-net regularized block-wise weighted representation for clustering. Pattern Anal Appl pp 1–11
Zurück zum Zitat Samaria FS, Harter AC (1994) Parameterisation of a stochastic model for human face identification. In: Applications of computer vision, 1994., Proceedings of the second IEEE workshop on, IEEE, pp 138–142 Samaria FS, Harter AC (1994) Parameterisation of a stochastic model for human face identification. In: Applications of computer vision, 1994., Proceedings of the second IEEE workshop on, IEEE, pp 138–142
Zurück zum Zitat Su W, Hu J, Lin C, Shen SX (2015) Sla-aware tenant placement and dynamic resource provision in SAAS. In: Miller JA, Zhu H (eds), 2015 IEEE international conference on web services, ICWS 2015, New York, June 27 - July 2, IEEE Computer Society, 2015, pp 615–622. https://doi.org/10.1109/ICWS.2015.87 Su W, Hu J, Lin C, Shen SX (2015) Sla-aware tenant placement and dynamic resource provision in SAAS. In: Miller JA, Zhu H (eds), 2015 IEEE international conference on web services, ICWS 2015, New York, June 27 - July 2, IEEE Computer Society, 2015, pp 615–622. https://​doi.​org/​10.​1109/​ICWS.​2015.​87
Zurück zum Zitat Wang J-Y, Almasri I, Gao X (2012) Adaptive graph regularized nonnegative matrix factorization via feature selection. In: Pattern recognition (ICPR), 2012 21st international conference on, IEEE, pp 963–966 Wang J-Y, Almasri I, Gao X (2012) Adaptive graph regularized nonnegative matrix factorization via feature selection. In: Pattern recognition (ICPR), 2012 21st international conference on, IEEE, pp 963–966
Zurück zum Zitat Xu W, Liu X, Gong Y (2003) Document clustering based on non-negative matrix factorization. In: Proceedings of the 26th annual international ACM SIGIR conference on research and development in information retrieval, SIGIR ’03, ACM, New York, 2003, pp 267–273. https://doi.org/10.1145/860435.860485 Xu W, Liu X, Gong Y (2003) Document clustering based on non-negative matrix factorization. In: Proceedings of the 26th annual international ACM SIGIR conference on research and development in information retrieval, SIGIR ’03, ACM, New York, 2003, pp 267–273. https://​doi.​org/​10.​1145/​860435.​860485
Metadaten
Titel
Cauchy balanced nonnegative matrix factorization
verfasst von
He Xiong
Deguang Kong
Feiping Nie
Publikationsdatum
24.03.2023
Verlag
Springer Netherlands
Erschienen in
Artificial Intelligence Review / Ausgabe 10/2023
Print ISSN: 0269-2821
Elektronische ISSN: 1573-7462
DOI
https://doi.org/10.1007/s10462-022-10379-y

Weitere Artikel der Ausgabe 10/2023

Artificial Intelligence Review 10/2023 Zur Ausgabe