Skip to main content
Top
Published in: International Journal of Machine Learning and Cybernetics 2/2015

01-04-2015 | Original Article

Visual music score detection with unsupervised feature learning method based on K-means

Authors: Yang Fang, Teng Gui-fa

Published in: International Journal of Machine Learning and Cybernetics | Issue 2/2015

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Automatic music score detection plays important role in the optical music recognition (OMR). In a visual image, the characteristic of the music scores is frequently degraded by illumination, distortion and other background elements. In this paper, to reduce the influences to OMR caused by those degradations especially the interference of Chinese character, an unsupervised feature learning detection method is proposed for improving the correctness of music score detection. Firstly, a detection framework was constructed. Then sub-image block features were extracted by simple unsupervised feature learning (UFL) method based on K-means and classified by SVM. Finally, music score detection processing was completed by connecting component searching algorithm based on the sub-image block label. Taking Chinese text as the main interferences, the detection rate was compared between UFL method and texture feature method based on 2D Gabor filter in the same framework. The experiment results show that unsupervised feature learning method gets less error detection rate than Gabor texture feature method with limited training set.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Show more products
Literature
2.
go back to reference Szwoch M (2005) A robust detector for distorted music staves[C]. In: Computer analysis of images and patterns. Springer, Berlin, pp 701–708 Szwoch M (2005) A robust detector for distorted music staves[C]. In: Computer analysis of images and patterns. Springer, Berlin, pp 701–708
3.
go back to reference Rebelo A, Capela A, da Costa JFP et al (2007) A shortest path approach for staff line detection[C]. In: The third international conference on automated production of cross media content for multi-channel distribution 2007 (AXMEDIS’07). IEEE, pp 79–85 Rebelo A, Capela A, da Costa JFP et al (2007) A shortest path approach for staff line detection[C]. In: The third international conference on automated production of cross media content for multi-channel distribution 2007 (AXMEDIS’07). IEEE, pp 79–85
6.
go back to reference Dutta A, Pal U, Fornes A et al (2010) An efficient staff removal approach from printed musical documents[C]. In: 20th international conference on pattern recognition (ICPR), 2010. IEEE, pp 1965–1968 Dutta A, Pal U, Fornes A et al (2010) An efficient staff removal approach from printed musical documents[C]. In: 20th international conference on pattern recognition (ICPR), 2010. IEEE, pp 1965–1968
7.
go back to reference Burgoyne JA, Pugin L, Eustace G et al (2007) A comparative survey of image binarization algorithms for optical recognition on degraded musical sources[C]. In; International society for music information retrieval conference (ISMIR), pp 509–512 Burgoyne JA, Pugin L, Eustace G et al (2007) A comparative survey of image binarization algorithms for optical recognition on degraded musical sources[C]. In; International society for music information retrieval conference (ISMIR), pp 509–512
8.
go back to reference Pinto T, Rebelo A, Giraldi G et al (2011) Music score binarization based on domain knowledge[M]. In: Pattern recognition and image analysis. Springer, Berlin, pp 700–708 Pinto T, Rebelo A, Giraldi G et al (2011) Music score binarization based on domain knowledge[M]. In: Pattern recognition and image analysis. Springer, Berlin, pp 700–708
9.
go back to reference Rebelo A, Cardoso JS (2013) Staff line detection and removal in the grayscale domain[C]. In: The 12th international conference on document analysis and recognition (ICDAR), pp 57–61 Rebelo A, Cardoso JS (2013) Staff line detection and removal in the grayscale domain[C]. In: The 12th international conference on document analysis and recognition (ICDAR), pp 57–61
10.
go back to reference Timofe R, Gool LV (2013) Automatic stave discovery for musical facsimiles[C]. ACCV2012 4:510–523 Timofe R, Gool LV (2013) Automatic stave discovery for musical facsimiles[C]. ACCV2012 4:510–523
11.
go back to reference Sun JD, Ma YY (2010) Summary of texture feature research[J]. Appl Comput Syst. 19(6):245–250 Sun JD, Ma YY (2010) Summary of texture feature research[J]. Appl Comput Syst. 19(6):245–250
12.
go back to reference Zhang XZ (1992) Chinese character recognition technology [M]. Tsinghua university press, Beijing Zhang XZ (1992) Chinese character recognition technology [M]. Tsinghua university press, Beijing
13.
go back to reference Sharma A, Imoto S, Miyano S et al (2012) Null space based feature selection method for gene expression data[J]. Int J Mach Learn Cybernet 3(4):269–276CrossRef Sharma A, Imoto S, Miyano S et al (2012) Null space based feature selection method for gene expression data[J]. Int J Mach Learn Cybernet 3(4):269–276CrossRef
14.
go back to reference Subrahmanya N, Shin YC (2013) A variational Bayesian framework for group feature selection[J]. Int J Mach Learn Cybernet 4(6):609–619CrossRef Subrahmanya N, Shin YC (2013) A variational Bayesian framework for group feature selection[J]. Int J Mach Learn Cybernet 4(6):609–619CrossRef
15.
go back to reference Xie ZX, Xu Y (2014) Sparse group LASSO based uncertain feature selection[J]. Int J Mach Learn Cybernet 5(2):201–210CrossRefMathSciNet Xie ZX, Xu Y (2014) Sparse group LASSO based uncertain feature selection[J]. Int J Mach Learn Cybernet 5(2):201–210CrossRefMathSciNet
16.
go back to reference Coates A (2012) Demystifying unsupervised feature learning[D]. Stanford University, Stanford Coates A (2012) Demystifying unsupervised feature learning[D]. Stanford University, Stanford
17.
go back to reference Netzer Y, Wang T, Coates A et al (2011) Reading digits in natural images with unsupervised feature learning[C]. In: NIPS workshop on deep learning and unsupervised feature learning 2011 Netzer Y, Wang T, Coates A et al (2011) Reading digits in natural images with unsupervised feature learning[C]. In: NIPS workshop on deep learning and unsupervised feature learning 2011
18.
go back to reference Ranzato MA, Huang FJ, Boureau YL et al (2007) Unsupervised learning of invariant feature hierarchies with applications to object recognition[C]. IEEE Conf Comput Vis Pattern Recogn 2007:1–8 Ranzato MA, Huang FJ, Boureau YL et al (2007) Unsupervised learning of invariant feature hierarchies with applications to object recognition[C]. IEEE Conf Comput Vis Pattern Recogn 2007:1–8
19.
go back to reference Kavukcuoglu K, Sermanet P, Boureau YL et al (2010) Learning convolutional feature hierarchies for visual recognition[C]. In: Advances in neural information processing systems, pp 1090–1098 Kavukcuoglu K, Sermanet P, Boureau YL et al (2010) Learning convolutional feature hierarchies for visual recognition[C]. In: Advances in neural information processing systems, pp 1090–1098
20.
go back to reference Saxe A, Koh PW, Chen Z et al (2011) On random weights and unsupervised feature learning[C]. In: Twenty-eighth international conference on machine learning, pp 1–9 Saxe A, Koh PW, Chen Z et al (2011) On random weights and unsupervised feature learning[C]. In: Twenty-eighth international conference on machine learning, pp 1–9
21.
go back to reference Coates A, Lee H, Ng AY (2011) An analysis of single-layer networks in unsupervised feature learning [J]. JMLR W&CP. 15:215–223 Coates A, Lee H, Ng AY (2011) An analysis of single-layer networks in unsupervised feature learning [J]. JMLR W&CP. 15:215–223
22.
go back to reference Yeung D, Wang XZ (2002) Improving performance of similarity-based clustering by feature weight learning[J]. IEEE Trans Pattern Anal Mach Intell 24(4):556–561CrossRefMathSciNet Yeung D, Wang XZ (2002) Improving performance of similarity-based clustering by feature weight learning[J]. IEEE Trans Pattern Anal Mach Intell 24(4):556–561CrossRefMathSciNet
23.
go back to reference Wang XZ, Wang YD, Wang LJ (2004) Improving fuzzy c-means clustering based on feature-weight learning[J]. Pattern Recogn Lett 25(10):1123–1132CrossRef Wang XZ, Wang YD, Wang LJ (2004) Improving fuzzy c-means clustering based on feature-weight learning[J]. Pattern Recogn Lett 25(10):1123–1132CrossRef
24.
go back to reference Sarma TH, Viswanath P, Reddy BE (2013) A hybrid approach to speed-up the K-means clustering method [J]. Int J Mach Learn Cybernet 4(2):107–117CrossRef Sarma TH, Viswanath P, Reddy BE (2013) A hybrid approach to speed-up the K-means clustering method [J]. Int J Mach Learn Cybernet 4(2):107–117CrossRef
25.
go back to reference Jan W, Riedmiller M (2012) Unsupervised learning of local features for music classification[C].In: 13th international society for music information retrieval conference (ISMIR2012), pp 139–144 Jan W, Riedmiller M (2012) Unsupervised learning of local features for music classification[C].In: 13th international society for music information retrieval conference (ISMIR2012), pp 139–144
26.
go back to reference Musa AB (2013) Comparative study on classification performance between support vector machine and logistic regression[J]. Int J Mach Learn Cybernet 4(1):13–24CrossRef Musa AB (2013) Comparative study on classification performance between support vector machine and logistic regression[J]. Int J Mach Learn Cybernet 4(1):13–24CrossRef
27.
go back to reference Zhang LF, Zhang LP, Tao DC et al (2012) On combining multiple features for hyperspectral remote sensing image classification[J]. IEEE Trans Geosci Remote Sens 50(3):879–893CrossRefMathSciNet Zhang LF, Zhang LP, Tao DC et al (2012) On combining multiple features for hyperspectral remote sensing image classification[J]. IEEE Trans Geosci Remote Sens 50(3):879–893CrossRefMathSciNet
28.
go back to reference Manjunath BS, Ma WY (1996) Texture features for browsing and retrieval of image data[J]. IEEE Trans Pattern Anal Mach Intell. 18(8):837–842CrossRef Manjunath BS, Ma WY (1996) Texture features for browsing and retrieval of image data[J]. IEEE Trans Pattern Anal Mach Intell. 18(8):837–842CrossRef
29.
go back to reference Qin LL, Li B (2006) Chinese and foreign music appreciation. Zhejiang University Press, Hangzhou Qin LL, Li B (2006) Chinese and foreign music appreciation. Zhejiang University Press, Hangzhou
30.
go back to reference Zhu JX (2006) Music appreciation. Henan University Press, Kaifeng Zhu JX (2006) Music appreciation. Henan University Press, Kaifeng
31.
go back to reference Coates A, Carpenter B, Case C et al (2011) Text detection and character recognition in scene images with unsupervised feature learning[C]. In: IEEE 2011 international conference on document analysis and recognition (ICDAR), pp 440–445 Coates A, Carpenter B, Case C et al (2011) Text detection and character recognition in scene images with unsupervised feature learning[C]. In: IEEE 2011 international conference on document analysis and recognition (ICDAR), pp 440–445
32.
go back to reference Keerthi SS, Shevade SK, Bhattacharyya C et al (2001) Improvements to Platt’s SMO algorithm for SVM classifier design [J]. Neural Comput 13(3):637–649CrossRefMATH Keerthi SS, Shevade SK, Bhattacharyya C et al (2001) Improvements to Platt’s SMO algorithm for SVM classifier design [J]. Neural Comput 13(3):637–649CrossRefMATH
Metadata
Title
Visual music score detection with unsupervised feature learning method based on K-means
Authors
Yang Fang
Teng Gui-fa
Publication date
01-04-2015
Publisher
Springer Berlin Heidelberg
Published in
International Journal of Machine Learning and Cybernetics / Issue 2/2015
Print ISSN: 1868-8071
Electronic ISSN: 1868-808X
DOI
https://doi.org/10.1007/s13042-014-0260-2

Other articles of this Issue 2/2015

International Journal of Machine Learning and Cybernetics 2/2015 Go to the issue