Skip to main content
Top
Published in: Pattern Analysis and Applications 4/2015

01-11-2015 | Short Paper

Avoiding staff removal stage in optical music recognition: application to scores written in white mensural notation

Authors: Jorge Calvo-Zaragoza, Isabel Barbancho, Lorenzo J. Tardón, Ana M. Barbancho

Published in: Pattern Analysis and Applications | Issue 4/2015

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Staff detection and removal is one of the most important issues in optical music recognition (OMR) tasks since common approaches for symbol detection and classification are based on this process. Due to its complexity, staff detection and removal is often inaccurate, leading to a great number of errors in posterior stages. For this reason, a new approach that avoids this stage is proposed in this paper, which is expected to overcome these drawbacks. Our approach is put into practice in a case of study focused on scores written in white mensural notation. Symbol detection is performed by using the vertical projection of the staves. The cross-correlation operator for template matching is used at the classification stage. The goodness of our proposal is shown in an experiment in which our proposal attains an extraction rate of 96 % and a classification rate of 92 %, on average. The results found have reinforced the idea of pursuing a new research line in OMR systems without the need of the removal of staff lines.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Bainbridge D, Bell T (2001) The challenge of optical music recognition. Lang Resour Eval 35:95–121 Bainbridge D, Bell T (2001) The challenge of optical music recognition. Lang Resour Eval 35:95–121
2.
go back to reference Barbancho I, Segura C, Tardon LJ, Barbancho AM (2010) Automatic selection of the region of interest in ancient scores. In: MELECON 2010–2010 15th IEEE Mediterranean Electrotechnical Conference, pp 326–331 Barbancho I, Segura C, Tardon LJ, Barbancho AM (2010) Automatic selection of the region of interest in ancient scores. In: MELECON 2010–2010 15th IEEE Mediterranean Electrotechnical Conference, pp 326–331
3.
4.
go back to reference Chen YS, Chen FS, Teng CH (2013) An optical music recognition system for skew or inverted musical scores. Int J Pattern Recognit Artif Intell 27(07):1–23 Chen YS, Chen FS, Teng CH (2013) An optical music recognition system for skew or inverted musical scores. Int J Pattern Recognit Artif Intell 27(07):1–23
5.
go back to reference Deza MM, Deza E (2009) Encyclopedia of Distances, first edn. Springer, New YorkCrossRef Deza MM, Deza E (2009) Encyclopedia of Distances, first edn. Springer, New YorkCrossRef
6.
go back to reference Duda RO, Hart PE (1973) Pattern classification and scene analysis, first edn. Wiley, Hoboken Duda RO, Hart PE (1973) Pattern classification and scene analysis, first edn. Wiley, Hoboken
7.
go back to reference Dutta A, Pal U, Fornes A, Llados J (2010) An efficient staff removal approach from printed musical documents. In: Pattern Recognition (ICPR), 2010 20th International Conference. pp 1965–1968 Dutta A, Pal U, Fornes A, Llados J (2010) An efficient staff removal approach from printed musical documents. In: Pattern Recognition (ICPR), 2010 20th International Conference. pp 1965–1968
8.
go back to reference Fornés A, Lladós J, Sánchez G (2005) Staff and graphical primitive segmentation in old handwritten music scores. In: Proceedings of the 2005 conference on Artificial Intelligence Research and Development. IOS Press, Amsterdam, pp 83–90 Fornés A, Lladós J, Sánchez G (2005) Staff and graphical primitive segmentation in old handwritten music scores. In: Proceedings of the 2005 conference on Artificial Intelligence Research and Development. IOS Press, Amsterdam, pp 83–90
9.
10.
go back to reference Gonzalez RC, Woods RE (2007) Digital Image Processing. Prentice-Hall, Upper Saddle River Gonzalez RC, Woods RE (2007) Digital Image Processing. Prentice-Hall, Upper Saddle River
11.
12.
go back to reference Hwang SK, Kim WY (2006) Fast and efficient method for computing art. Image Process IEEE Trans 15(1):112–117CrossRef Hwang SK, Kim WY (2006) Fast and efficient method for computing art. Image Process IEEE Trans 15(1):112–117CrossRef
13.
go back to reference Jelinek F (1998) Statistical methods for speech recognition. The MIT Press, Cambridge Jelinek F (1998) Statistical methods for speech recognition. The MIT Press, Cambridge
14.
go back to reference Levenshtein VI (1966) Binary codes capable of correcting deletions, insertions and reversals. Sov Phys Dokl 10:707MathSciNet Levenshtein VI (1966) Binary codes capable of correcting deletions, insertions and reversals. Sov Phys Dokl 10:707MathSciNet
15.
go back to reference Lewis JP (1995) Fast template matching. In: Vision Interface. Canadian Image Processing and Pattern Recognition Society, Quebec City, pp 120–123 Lewis JP (1995) Fast template matching. In: Vision Interface. Canadian Image Processing and Pattern Recognition Society, Quebec City, pp 120–123
16.
go back to reference Ng KC, Cooper D, Stefani E, Boyle RD, Bailey N (1999) Embracing the composer: optical recognition of handwritten manuscripts. In: Proceedings of the International Computer Music Conference, Beijing Ng KC, Cooper D, Stefani E, Boyle RD, Bailey N (1999) Embracing the composer: optical recognition of handwritten manuscripts. In: Proceedings of the International Computer Music Conference, Beijing
17.
go back to reference Otsu N (January 1979) A threshold selection method from gray-level histograms. IEEE Trans Syst Man Cybern 9(1):62–66MathSciNetCrossRef Otsu N (January 1979) A threshold selection method from gray-level histograms. IEEE Trans Syst Man Cybern 9(1):62–66MathSciNetCrossRef
18.
go back to reference Caldas Pinto JR, Vieira P, Ramalho M, Mengucci M, Pina P, Muge F (2000) Ancient music recovery for digital libraries. In: Proceedings of the 4th European Conference on Research and Advanced Technology for Digital Libraries, ECDL ’00. Springer, London, pp 24–34 Caldas Pinto JR, Vieira P, Ramalho M, Mengucci M, Pina P, Muge F (2000) Ancient music recovery for digital libraries. In: Proceedings of the 4th European Conference on Research and Advanced Technology for Digital Libraries, ECDL ’00. Springer, London, pp 24–34
19.
go back to reference João Rogério Caldas Pinto, Vieira P, João Miguel da Costa Sousa (2003) A new graph-like classification method applied to ancient handwritten musical symbols. IJDAR 6(1):10–22 João Rogério Caldas Pinto, Vieira P, João Miguel da Costa Sousa (2003) A new graph-like classification method applied to ancient handwritten musical symbols. IJDAR 6(1):10–22
20.
go back to reference Pizer SM, Johnston RE, Ericksen JP, Yankaskas BC, Muller KE (1990) Contrast-limited adaptive histogram equalization: speed and effectiveness. In: Visualization in Biomedical Computing, 1990, Proceedings of the First Conference, pp 337–345 Pizer SM, Johnston RE, Ericksen JP, Yankaskas BC, Muller KE (1990) Contrast-limited adaptive histogram equalization: speed and effectiveness. In: Visualization in Biomedical Computing, 1990, Proceedings of the First Conference, pp 337–345
21.
go back to reference Pruslin D (1966) Automatic recognition of sheet music. Sc.d. dissertation, Massachusetts Institute of Technology Pruslin D (1966) Automatic recognition of sheet music. Sc.d. dissertation, Massachusetts Institute of Technology
22.
go back to reference Pugin L (2006) Optical music recognition of early typographic prints using hidden markov models. In: ISMIR, pp 53–56 Pugin L (2006) Optical music recognition of early typographic prints using hidden markov models. In: ISMIR, pp 53–56
23.
go back to reference Rebelo A, Fujinaga I, Paszkiewicz F, Marcal ARS, Guedes C, Cardoso JS (2012) Optical music recognition: state-of-the-art and open issues. Int J Multimed Inf Retr 1(3):173–190 Rebelo A, Fujinaga I, Paszkiewicz F, Marcal ARS, Guedes C, Cardoso JS (2012) Optical music recognition: state-of-the-art and open issues. Int J Multimed Inf Retr 1(3):173–190
24.
go back to reference Sarvaiya JN, Patnaik S, Bombaywala S (2009) Image registration by template matching using normalized cross-correlation. In: Advances in Computing, Control, Telecommunication Technologies, 2009. ACT ’09. International Conference on, pp 819–822 Sarvaiya JN, Patnaik S, Bombaywala S (2009) Image registration by template matching using normalized cross-correlation. In: Advances in Computing, Control, Telecommunication Technologies, 2009. ACT ’09. International Conference on, pp 819–822
25.
go back to reference Sotoodeh M, Tajeripour F (2012) Staff detection and removal using derivation and connected component analysis. In: Artificial Intelligence and Signal Processing (AISP), 2012 16th CSI International Symposium, pp 054–057 Sotoodeh M, Tajeripour F (2012) Staff detection and removal using derivation and connected component analysis. In: Artificial Intelligence and Signal Processing (AISP), 2012 16th CSI International Symposium, pp 054–057
27.
go back to reference Su B, Lu S, Pal U, Tan CL (2012) An effective staff detection and removal technique for musical documents. In: Document analysis systems (DAS), 2012 10th IAPR International Workshop, pp 160–164 Su B, Lu S, Pal U, Tan CL (2012) An effective staff detection and removal technique for musical documents. In: Document analysis systems (DAS), 2012 10th IAPR International Workshop, pp 160–164
28.
go back to reference Szwoch M (2005) A robust detector for distorted music staves. In: Gagalowicz A, Philips W (eds) computer analysis of images and patterns, vol 3691, Lecture notes in computer science. Springer, Berlin Heidelberg, pp 701–708CrossRef Szwoch M (2005) A robust detector for distorted music staves. In: Gagalowicz A, Philips W (eds) computer analysis of images and patterns, vol 3691, Lecture notes in computer science. Springer, Berlin Heidelberg, pp 701–708CrossRef
29.
go back to reference Tardón LJ, Sammartino S, Barbancho I, Gómez V, Oliver A (2009) Optical music recognition for scores written in white mensural notation. J Image Video Process 6 Tardón LJ, Sammartino S, Barbancho I, Gómez V, Oliver A (2009) Optical music recognition for scores written in white mensural notation. J Image Video Process 6
30.
go back to reference Toyama F, Shoji K, Miyamichi J (2006) Symbol recognition of printed piano scores with touching symbols. In: Pattern Recognition, 2006. ICPR 2006. 18th International Conference, vol 2, pp 480–483 Toyama F, Shoji K, Miyamichi J (2006) Symbol recognition of printed piano scores with touching symbols. In: Pattern Recognition, 2006. ICPR 2006. 18th International Conference, vol 2, pp 480–483
31.
go back to reference Trier OD, Taxt T (1995) Evaluation of binarization methods for document images. Pattern Analysis Mach Intell IEEE Trans 17(3):312–315CrossRef Trier OD, Taxt T (1995) Evaluation of binarization methods for document images. Pattern Analysis Mach Intell IEEE Trans 17(3):312–315CrossRef
32.
go back to reference Wei SD, Lai SH (Nov 2008) Fast template matching based on normalized cross correlation with adaptive multilevel winner update. Image Process IEEE Trans 17(11):2227–2235MathSciNetCrossRef Wei SD, Lai SH (Nov 2008) Fast template matching based on normalized cross correlation with adaptive multilevel winner update. Image Process IEEE Trans 17(11):2227–2235MathSciNetCrossRef
Metadata
Title
Avoiding staff removal stage in optical music recognition: application to scores written in white mensural notation
Authors
Jorge Calvo-Zaragoza
Isabel Barbancho
Lorenzo J. Tardón
Ana M. Barbancho
Publication date
01-11-2015
Publisher
Springer London
Published in
Pattern Analysis and Applications / Issue 4/2015
Print ISSN: 1433-7541
Electronic ISSN: 1433-755X
DOI
https://doi.org/10.1007/s10044-014-0415-5

Other articles of this Issue 4/2015

Pattern Analysis and Applications 4/2015 Go to the issue

Premium Partner