Skip to main content
Erschienen in: International Journal of Machine Learning and Cybernetics 2/2015

01.04.2015 | Original Article

Visual music score detection with unsupervised feature learning method based on K-means

verfasst von: Yang Fang, Teng Gui-fa

Erschienen in: International Journal of Machine Learning and Cybernetics | Ausgabe 2/2015

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Automatic music score detection plays important role in the optical music recognition (OMR). In a visual image, the characteristic of the music scores is frequently degraded by illumination, distortion and other background elements. In this paper, to reduce the influences to OMR caused by those degradations especially the interference of Chinese character, an unsupervised feature learning detection method is proposed for improving the correctness of music score detection. Firstly, a detection framework was constructed. Then sub-image block features were extracted by simple unsupervised feature learning (UFL) method based on K-means and classified by SVM. Finally, music score detection processing was completed by connecting component searching algorithm based on the sub-image block label. Taking Chinese text as the main interferences, the detection rate was compared between UFL method and texture feature method based on 2D Gabor filter in the same framework. The experiment results show that unsupervised feature learning method gets less error detection rate than Gabor texture feature method with limited training set.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Weitere Produktempfehlungen anzeigen
Literatur
2.
Zurück zum Zitat Szwoch M (2005) A robust detector for distorted music staves[C]. In: Computer analysis of images and patterns. Springer, Berlin, pp 701–708 Szwoch M (2005) A robust detector for distorted music staves[C]. In: Computer analysis of images and patterns. Springer, Berlin, pp 701–708
3.
Zurück zum Zitat Rebelo A, Capela A, da Costa JFP et al (2007) A shortest path approach for staff line detection[C]. In: The third international conference on automated production of cross media content for multi-channel distribution 2007 (AXMEDIS’07). IEEE, pp 79–85 Rebelo A, Capela A, da Costa JFP et al (2007) A shortest path approach for staff line detection[C]. In: The third international conference on automated production of cross media content for multi-channel distribution 2007 (AXMEDIS’07). IEEE, pp 79–85
6.
Zurück zum Zitat Dutta A, Pal U, Fornes A et al (2010) An efficient staff removal approach from printed musical documents[C]. In: 20th international conference on pattern recognition (ICPR), 2010. IEEE, pp 1965–1968 Dutta A, Pal U, Fornes A et al (2010) An efficient staff removal approach from printed musical documents[C]. In: 20th international conference on pattern recognition (ICPR), 2010. IEEE, pp 1965–1968
7.
Zurück zum Zitat Burgoyne JA, Pugin L, Eustace G et al (2007) A comparative survey of image binarization algorithms for optical recognition on degraded musical sources[C]. In; International society for music information retrieval conference (ISMIR), pp 509–512 Burgoyne JA, Pugin L, Eustace G et al (2007) A comparative survey of image binarization algorithms for optical recognition on degraded musical sources[C]. In; International society for music information retrieval conference (ISMIR), pp 509–512
8.
Zurück zum Zitat Pinto T, Rebelo A, Giraldi G et al (2011) Music score binarization based on domain knowledge[M]. In: Pattern recognition and image analysis. Springer, Berlin, pp 700–708 Pinto T, Rebelo A, Giraldi G et al (2011) Music score binarization based on domain knowledge[M]. In: Pattern recognition and image analysis. Springer, Berlin, pp 700–708
9.
Zurück zum Zitat Rebelo A, Cardoso JS (2013) Staff line detection and removal in the grayscale domain[C]. In: The 12th international conference on document analysis and recognition (ICDAR), pp 57–61 Rebelo A, Cardoso JS (2013) Staff line detection and removal in the grayscale domain[C]. In: The 12th international conference on document analysis and recognition (ICDAR), pp 57–61
10.
Zurück zum Zitat Timofe R, Gool LV (2013) Automatic stave discovery for musical facsimiles[C]. ACCV2012 4:510–523 Timofe R, Gool LV (2013) Automatic stave discovery for musical facsimiles[C]. ACCV2012 4:510–523
11.
Zurück zum Zitat Sun JD, Ma YY (2010) Summary of texture feature research[J]. Appl Comput Syst. 19(6):245–250 Sun JD, Ma YY (2010) Summary of texture feature research[J]. Appl Comput Syst. 19(6):245–250
12.
Zurück zum Zitat Zhang XZ (1992) Chinese character recognition technology [M]. Tsinghua university press, Beijing Zhang XZ (1992) Chinese character recognition technology [M]. Tsinghua university press, Beijing
13.
Zurück zum Zitat Sharma A, Imoto S, Miyano S et al (2012) Null space based feature selection method for gene expression data[J]. Int J Mach Learn Cybernet 3(4):269–276CrossRef Sharma A, Imoto S, Miyano S et al (2012) Null space based feature selection method for gene expression data[J]. Int J Mach Learn Cybernet 3(4):269–276CrossRef
14.
Zurück zum Zitat Subrahmanya N, Shin YC (2013) A variational Bayesian framework for group feature selection[J]. Int J Mach Learn Cybernet 4(6):609–619CrossRef Subrahmanya N, Shin YC (2013) A variational Bayesian framework for group feature selection[J]. Int J Mach Learn Cybernet 4(6):609–619CrossRef
15.
Zurück zum Zitat Xie ZX, Xu Y (2014) Sparse group LASSO based uncertain feature selection[J]. Int J Mach Learn Cybernet 5(2):201–210CrossRefMathSciNet Xie ZX, Xu Y (2014) Sparse group LASSO based uncertain feature selection[J]. Int J Mach Learn Cybernet 5(2):201–210CrossRefMathSciNet
16.
Zurück zum Zitat Coates A (2012) Demystifying unsupervised feature learning[D]. Stanford University, Stanford Coates A (2012) Demystifying unsupervised feature learning[D]. Stanford University, Stanford
17.
Zurück zum Zitat Netzer Y, Wang T, Coates A et al (2011) Reading digits in natural images with unsupervised feature learning[C]. In: NIPS workshop on deep learning and unsupervised feature learning 2011 Netzer Y, Wang T, Coates A et al (2011) Reading digits in natural images with unsupervised feature learning[C]. In: NIPS workshop on deep learning and unsupervised feature learning 2011
18.
Zurück zum Zitat Ranzato MA, Huang FJ, Boureau YL et al (2007) Unsupervised learning of invariant feature hierarchies with applications to object recognition[C]. IEEE Conf Comput Vis Pattern Recogn 2007:1–8 Ranzato MA, Huang FJ, Boureau YL et al (2007) Unsupervised learning of invariant feature hierarchies with applications to object recognition[C]. IEEE Conf Comput Vis Pattern Recogn 2007:1–8
19.
Zurück zum Zitat Kavukcuoglu K, Sermanet P, Boureau YL et al (2010) Learning convolutional feature hierarchies for visual recognition[C]. In: Advances in neural information processing systems, pp 1090–1098 Kavukcuoglu K, Sermanet P, Boureau YL et al (2010) Learning convolutional feature hierarchies for visual recognition[C]. In: Advances in neural information processing systems, pp 1090–1098
20.
Zurück zum Zitat Saxe A, Koh PW, Chen Z et al (2011) On random weights and unsupervised feature learning[C]. In: Twenty-eighth international conference on machine learning, pp 1–9 Saxe A, Koh PW, Chen Z et al (2011) On random weights and unsupervised feature learning[C]. In: Twenty-eighth international conference on machine learning, pp 1–9
21.
Zurück zum Zitat Coates A, Lee H, Ng AY (2011) An analysis of single-layer networks in unsupervised feature learning [J]. JMLR W&CP. 15:215–223 Coates A, Lee H, Ng AY (2011) An analysis of single-layer networks in unsupervised feature learning [J]. JMLR W&CP. 15:215–223
22.
Zurück zum Zitat Yeung D, Wang XZ (2002) Improving performance of similarity-based clustering by feature weight learning[J]. IEEE Trans Pattern Anal Mach Intell 24(4):556–561CrossRefMathSciNet Yeung D, Wang XZ (2002) Improving performance of similarity-based clustering by feature weight learning[J]. IEEE Trans Pattern Anal Mach Intell 24(4):556–561CrossRefMathSciNet
23.
Zurück zum Zitat Wang XZ, Wang YD, Wang LJ (2004) Improving fuzzy c-means clustering based on feature-weight learning[J]. Pattern Recogn Lett 25(10):1123–1132CrossRef Wang XZ, Wang YD, Wang LJ (2004) Improving fuzzy c-means clustering based on feature-weight learning[J]. Pattern Recogn Lett 25(10):1123–1132CrossRef
24.
Zurück zum Zitat Sarma TH, Viswanath P, Reddy BE (2013) A hybrid approach to speed-up the K-means clustering method [J]. Int J Mach Learn Cybernet 4(2):107–117CrossRef Sarma TH, Viswanath P, Reddy BE (2013) A hybrid approach to speed-up the K-means clustering method [J]. Int J Mach Learn Cybernet 4(2):107–117CrossRef
25.
Zurück zum Zitat Jan W, Riedmiller M (2012) Unsupervised learning of local features for music classification[C].In: 13th international society for music information retrieval conference (ISMIR2012), pp 139–144 Jan W, Riedmiller M (2012) Unsupervised learning of local features for music classification[C].In: 13th international society for music information retrieval conference (ISMIR2012), pp 139–144
26.
Zurück zum Zitat Musa AB (2013) Comparative study on classification performance between support vector machine and logistic regression[J]. Int J Mach Learn Cybernet 4(1):13–24CrossRef Musa AB (2013) Comparative study on classification performance between support vector machine and logistic regression[J]. Int J Mach Learn Cybernet 4(1):13–24CrossRef
27.
Zurück zum Zitat Zhang LF, Zhang LP, Tao DC et al (2012) On combining multiple features for hyperspectral remote sensing image classification[J]. IEEE Trans Geosci Remote Sens 50(3):879–893CrossRefMathSciNet Zhang LF, Zhang LP, Tao DC et al (2012) On combining multiple features for hyperspectral remote sensing image classification[J]. IEEE Trans Geosci Remote Sens 50(3):879–893CrossRefMathSciNet
28.
Zurück zum Zitat Manjunath BS, Ma WY (1996) Texture features for browsing and retrieval of image data[J]. IEEE Trans Pattern Anal Mach Intell. 18(8):837–842CrossRef Manjunath BS, Ma WY (1996) Texture features for browsing and retrieval of image data[J]. IEEE Trans Pattern Anal Mach Intell. 18(8):837–842CrossRef
29.
Zurück zum Zitat Qin LL, Li B (2006) Chinese and foreign music appreciation. Zhejiang University Press, Hangzhou Qin LL, Li B (2006) Chinese and foreign music appreciation. Zhejiang University Press, Hangzhou
30.
Zurück zum Zitat Zhu JX (2006) Music appreciation. Henan University Press, Kaifeng Zhu JX (2006) Music appreciation. Henan University Press, Kaifeng
31.
Zurück zum Zitat Coates A, Carpenter B, Case C et al (2011) Text detection and character recognition in scene images with unsupervised feature learning[C]. In: IEEE 2011 international conference on document analysis and recognition (ICDAR), pp 440–445 Coates A, Carpenter B, Case C et al (2011) Text detection and character recognition in scene images with unsupervised feature learning[C]. In: IEEE 2011 international conference on document analysis and recognition (ICDAR), pp 440–445
32.
Zurück zum Zitat Keerthi SS, Shevade SK, Bhattacharyya C et al (2001) Improvements to Platt’s SMO algorithm for SVM classifier design [J]. Neural Comput 13(3):637–649CrossRefMATH Keerthi SS, Shevade SK, Bhattacharyya C et al (2001) Improvements to Platt’s SMO algorithm for SVM classifier design [J]. Neural Comput 13(3):637–649CrossRefMATH
Metadaten
Titel
Visual music score detection with unsupervised feature learning method based on K-means
verfasst von
Yang Fang
Teng Gui-fa
Publikationsdatum
01.04.2015
Verlag
Springer Berlin Heidelberg
Erschienen in
International Journal of Machine Learning and Cybernetics / Ausgabe 2/2015
Print ISSN: 1868-8071
Elektronische ISSN: 1868-808X
DOI
https://doi.org/10.1007/s13042-014-0260-2

Weitere Artikel der Ausgabe 2/2015

International Journal of Machine Learning and Cybernetics 2/2015 Zur Ausgabe

Neuer Inhalt