Skip to main content
Erschienen in: Neural Processing Letters 2/2015

01.10.2015

Soft Margin Based Low-Rank Audio Signal Classification

verfasst von: ZiQiang Shi, JiQing Han, TieRan Zheng

Erschienen in: Neural Processing Letters | Ausgabe 2/2015

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

We propose an algorithm to do audio signal classification based on low-rank matrix representative audio data. Conventionally, the low-rank matrix data can be represented by a vector in high dimensional space. Some learning algorithms are then applied in such a vector space for matrix data classification. Particularly, maximum margin classifiers, such as support vector machine (SVM) etc. have received much attention due to their effectiveness. In this paper, we classify the data directly in the matrix space. Our methodology is built on recent studies about matrix classification with the trace norm constrained weight matrix and SVM’s large-margin linear discrimination principle. The resulting low-rank SVM is then designed to maximize the margin between classes whilst minimizing the complexity of the classifier in both original and low-rank space. We compared our proposed algorithm with SVM and other state-of-the-art matrix signal classification methods. Experimental studies on real life audio signal classification show the effectiveness of our algorithm.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Argyriou A, Evgeniou T, Pontil M (2008) Convex multi-task feature learning. Mach Learn 73(3):243–272CrossRefMATH Argyriou A, Evgeniou T, Pontil M (2008) Convex multi-task feature learning. Mach Learn 73(3):243–272CrossRefMATH
2.
Zurück zum Zitat Atrey P, Maddage N, Kankanhalli M (2006) Audio based event detection for multimedia surveillance. In: IEEE international conference on acoustics, speech and signal processing, 5:V-V. IEEE Atrey P, Maddage N, Kankanhalli M (2006) Audio based event detection for multimedia surveillance. In: IEEE international conference on acoustics, speech and signal processing, 5:V-V. IEEE
3.
Zurück zum Zitat Bickel P, Ritov Y, Tsybakov A (2009) Simultaneous analysis of lasso and dantzig selector. Ann Stat 37(4):1705–1732MathSciNetCrossRef Bickel P, Ritov Y, Tsybakov A (2009) Simultaneous analysis of lasso and dantzig selector. Ann Stat 37(4):1705–1732MathSciNetCrossRef
4.
Zurück zum Zitat Chang C, Lin C (2011) Libsvm: a library for support vector machines. ACM Trans Intell Syst Technol 2(3):27CrossRefMATH Chang C, Lin C (2011) Libsvm: a library for support vector machines. ACM Trans Intell Syst Technol 2(3):27CrossRefMATH
5.
Zurück zum Zitat Davis S, Mermelstein P (1980) Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Trans Acoust Speech Signal Process 28(4):357–366CrossRef Davis S, Mermelstein P (1980) Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Trans Acoust Speech Signal Process 28(4):357–366CrossRef
6.
Zurück zum Zitat Fazel M, Hindi H, Boyd S. (2001) A rank minimization heuristic with application to minimum order system approximation. In: Proceedings of the 2001 American control conference, vol 6. IEEE, pp 4734–4739 Fazel M, Hindi H, Boyd S. (2001) A rank minimization heuristic with application to minimum order system approximation. In: Proceedings of the 2001 American control conference, vol 6. IEEE, pp 4734–4739
8.
Zurück zum Zitat Ji S, Ye J (2009) An accelerated gradient method for trace norm minimization. In: Proceedings of the 26th annual international conference on machine learning. ACM, pp 457–464 Ji S, Ye J (2009) An accelerated gradient method for trace norm minimization. In: Proceedings of the 26th annual international conference on machine learning. ACM, pp 457–464
9.
Zurück zum Zitat Lin Z, Chen M, Wu L, Ma Y. (2010) The augmented lagrange multiplier method for exact recovery of corrupted low-rank matrices. Arxiv, preprint arXiv:1009.5055 Lin Z, Chen M, Wu L, Ma Y. (2010) The augmented lagrange multiplier method for exact recovery of corrupted low-rank matrices. Arxiv, preprint arXiv:​1009.​5055
10.
Zurück zum Zitat Shi Z, Han J, Zheng T (2011) A novel framework based on trace norm minimization for audio event detection. In: Proceedings of the 18th international conference On neural information processing. Springer, Berlin, pp 646–654 Shi Z, Han J, Zheng T (2011) A novel framework based on trace norm minimization for audio event detection. In: Proceedings of the 18th international conference On neural information processing. Springer, Berlin, pp 646–654
11.
Zurück zum Zitat Srebro N, Rennie J, Jaakkola T (2005) Maximum-margin matrix factorization. Adv Neural Inf Process Syst 17:1329–1336 Srebro N, Rennie J, Jaakkola T (2005) Maximum-margin matrix factorization. Adv Neural Inf Process Syst 17:1329–1336
12.
Zurück zum Zitat Tomioka R, Aihara K (2007) Classifying matrices with a spectral regularization. In: Proceedings of the 24th international conference on machine learning. ACM, pp 895–902 Tomioka R, Aihara K (2007) Classifying matrices with a spectral regularization. In: Proceedings of the 24th international conference on machine learning. ACM, pp 895–902
13.
Zurück zum Zitat Umapathy K, Krishnan S, Rao R (2007) Audio signal feature extraction and classification using local discriminant bases. IEEE Trans Audio Speech Lang Process 15(4):1236–1246CrossRef Umapathy K, Krishnan S, Rao R (2007) Audio signal feature extraction and classification using local discriminant bases. IEEE Trans Audio Speech Lang Process 15(4):1236–1246CrossRef
14.
Zurück zum Zitat Vapnik V (2000) The nature of statistical learning theory. Springer, BerlinCrossRef Vapnik V (2000) The nature of statistical learning theory. Springer, BerlinCrossRef
15.
Zurück zum Zitat Wright J, Ganesh A, Rao S, Ma Y (2009) Robust principal component analysis? In: Proceedings of the conference on 2009 neural information processing systems. Citeseer Wright J, Ganesh A, Rao S, Ma Y (2009) Robust principal component analysis? In: Proceedings of the conference on 2009 neural information processing systems. Citeseer
16.
Zurück zum Zitat Zhuang X, Zhou X, Huang T, Hasegawa-Johnson M (2008) Feature analysis and selection for acoustic event detection. In: IEEE international conference on acoustics, speech and signal processing. IEEE, pp 17–20 Zhuang X, Zhou X, Huang T, Hasegawa-Johnson M (2008) Feature analysis and selection for acoustic event detection. In: IEEE international conference on acoustics, speech and signal processing. IEEE, pp 17–20
Metadaten
Titel
Soft Margin Based Low-Rank Audio Signal Classification
verfasst von
ZiQiang Shi
JiQing Han
TieRan Zheng
Publikationsdatum
01.10.2015
Verlag
Springer US
Erschienen in
Neural Processing Letters / Ausgabe 2/2015
Print ISSN: 1370-4621
Elektronische ISSN: 1573-773X
DOI
https://doi.org/10.1007/s11063-014-9357-6

Weitere Artikel der Ausgabe 2/2015

Neural Processing Letters 2/2015 Zur Ausgabe

Neuer Inhalt