Skip to main content
Erschienen in: Pattern Analysis and Applications 4/2015

01.11.2015 | Short Paper

Avoiding staff removal stage in optical music recognition: application to scores written in white mensural notation

verfasst von: Jorge Calvo-Zaragoza, Isabel Barbancho, Lorenzo J. Tardón, Ana M. Barbancho

Erschienen in: Pattern Analysis and Applications | Ausgabe 4/2015

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Staff detection and removal is one of the most important issues in optical music recognition (OMR) tasks since common approaches for symbol detection and classification are based on this process. Due to its complexity, staff detection and removal is often inaccurate, leading to a great number of errors in posterior stages. For this reason, a new approach that avoids this stage is proposed in this paper, which is expected to overcome these drawbacks. Our approach is put into practice in a case of study focused on scores written in white mensural notation. Symbol detection is performed by using the vertical projection of the staves. The cross-correlation operator for template matching is used at the classification stage. The goodness of our proposal is shown in an experiment in which our proposal attains an extraction rate of 96 % and a classification rate of 92 %, on average. The results found have reinforced the idea of pursuing a new research line in OMR systems without the need of the removal of staff lines.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Bainbridge D, Bell T (2001) The challenge of optical music recognition. Lang Resour Eval 35:95–121 Bainbridge D, Bell T (2001) The challenge of optical music recognition. Lang Resour Eval 35:95–121
2.
Zurück zum Zitat Barbancho I, Segura C, Tardon LJ, Barbancho AM (2010) Automatic selection of the region of interest in ancient scores. In: MELECON 2010–2010 15th IEEE Mediterranean Electrotechnical Conference, pp 326–331 Barbancho I, Segura C, Tardon LJ, Barbancho AM (2010) Automatic selection of the region of interest in ancient scores. In: MELECON 2010–2010 15th IEEE Mediterranean Electrotechnical Conference, pp 326–331
3.
Zurück zum Zitat Bribiesca E (1999) A new chain code. Pattern Recognit 32(2):235–251CrossRef Bribiesca E (1999) A new chain code. Pattern Recognit 32(2):235–251CrossRef
4.
Zurück zum Zitat Chen YS, Chen FS, Teng CH (2013) An optical music recognition system for skew or inverted musical scores. Int J Pattern Recognit Artif Intell 27(07):1–23 Chen YS, Chen FS, Teng CH (2013) An optical music recognition system for skew or inverted musical scores. Int J Pattern Recognit Artif Intell 27(07):1–23
5.
Zurück zum Zitat Deza MM, Deza E (2009) Encyclopedia of Distances, first edn. Springer, New YorkCrossRef Deza MM, Deza E (2009) Encyclopedia of Distances, first edn. Springer, New YorkCrossRef
6.
Zurück zum Zitat Duda RO, Hart PE (1973) Pattern classification and scene analysis, first edn. Wiley, Hoboken Duda RO, Hart PE (1973) Pattern classification and scene analysis, first edn. Wiley, Hoboken
7.
Zurück zum Zitat Dutta A, Pal U, Fornes A, Llados J (2010) An efficient staff removal approach from printed musical documents. In: Pattern Recognition (ICPR), 2010 20th International Conference. pp 1965–1968 Dutta A, Pal U, Fornes A, Llados J (2010) An efficient staff removal approach from printed musical documents. In: Pattern Recognition (ICPR), 2010 20th International Conference. pp 1965–1968
8.
Zurück zum Zitat Fornés A, Lladós J, Sánchez G (2005) Staff and graphical primitive segmentation in old handwritten music scores. In: Proceedings of the 2005 conference on Artificial Intelligence Research and Development. IOS Press, Amsterdam, pp 83–90 Fornés A, Lladós J, Sánchez G (2005) Staff and graphical primitive segmentation in old handwritten music scores. In: Proceedings of the 2005 conference on Artificial Intelligence Research and Development. IOS Press, Amsterdam, pp 83–90
9.
Zurück zum Zitat Freeman H (1961) On the encoding of arbitrary geometric configurations. Electr Comput IRE Trans EC 10(2):260–268MathSciNetCrossRef Freeman H (1961) On the encoding of arbitrary geometric configurations. Electr Comput IRE Trans EC 10(2):260–268MathSciNetCrossRef
10.
Zurück zum Zitat Gonzalez RC, Woods RE (2007) Digital Image Processing. Prentice-Hall, Upper Saddle River Gonzalez RC, Woods RE (2007) Digital Image Processing. Prentice-Hall, Upper Saddle River
11.
Zurück zum Zitat Hartigan JA (1975) Clustering algorithms. Wiley, HobokenMATH Hartigan JA (1975) Clustering algorithms. Wiley, HobokenMATH
12.
Zurück zum Zitat Hwang SK, Kim WY (2006) Fast and efficient method for computing art. Image Process IEEE Trans 15(1):112–117CrossRef Hwang SK, Kim WY (2006) Fast and efficient method for computing art. Image Process IEEE Trans 15(1):112–117CrossRef
13.
Zurück zum Zitat Jelinek F (1998) Statistical methods for speech recognition. The MIT Press, Cambridge Jelinek F (1998) Statistical methods for speech recognition. The MIT Press, Cambridge
14.
Zurück zum Zitat Levenshtein VI (1966) Binary codes capable of correcting deletions, insertions and reversals. Sov Phys Dokl 10:707MathSciNet Levenshtein VI (1966) Binary codes capable of correcting deletions, insertions and reversals. Sov Phys Dokl 10:707MathSciNet
15.
Zurück zum Zitat Lewis JP (1995) Fast template matching. In: Vision Interface. Canadian Image Processing and Pattern Recognition Society, Quebec City, pp 120–123 Lewis JP (1995) Fast template matching. In: Vision Interface. Canadian Image Processing and Pattern Recognition Society, Quebec City, pp 120–123
16.
Zurück zum Zitat Ng KC, Cooper D, Stefani E, Boyle RD, Bailey N (1999) Embracing the composer: optical recognition of handwritten manuscripts. In: Proceedings of the International Computer Music Conference, Beijing Ng KC, Cooper D, Stefani E, Boyle RD, Bailey N (1999) Embracing the composer: optical recognition of handwritten manuscripts. In: Proceedings of the International Computer Music Conference, Beijing
17.
Zurück zum Zitat Otsu N (January 1979) A threshold selection method from gray-level histograms. IEEE Trans Syst Man Cybern 9(1):62–66MathSciNetCrossRef Otsu N (January 1979) A threshold selection method from gray-level histograms. IEEE Trans Syst Man Cybern 9(1):62–66MathSciNetCrossRef
18.
Zurück zum Zitat Caldas Pinto JR, Vieira P, Ramalho M, Mengucci M, Pina P, Muge F (2000) Ancient music recovery for digital libraries. In: Proceedings of the 4th European Conference on Research and Advanced Technology for Digital Libraries, ECDL ’00. Springer, London, pp 24–34 Caldas Pinto JR, Vieira P, Ramalho M, Mengucci M, Pina P, Muge F (2000) Ancient music recovery for digital libraries. In: Proceedings of the 4th European Conference on Research and Advanced Technology for Digital Libraries, ECDL ’00. Springer, London, pp 24–34
19.
Zurück zum Zitat João Rogério Caldas Pinto, Vieira P, João Miguel da Costa Sousa (2003) A new graph-like classification method applied to ancient handwritten musical symbols. IJDAR 6(1):10–22 João Rogério Caldas Pinto, Vieira P, João Miguel da Costa Sousa (2003) A new graph-like classification method applied to ancient handwritten musical symbols. IJDAR 6(1):10–22
20.
Zurück zum Zitat Pizer SM, Johnston RE, Ericksen JP, Yankaskas BC, Muller KE (1990) Contrast-limited adaptive histogram equalization: speed and effectiveness. In: Visualization in Biomedical Computing, 1990, Proceedings of the First Conference, pp 337–345 Pizer SM, Johnston RE, Ericksen JP, Yankaskas BC, Muller KE (1990) Contrast-limited adaptive histogram equalization: speed and effectiveness. In: Visualization in Biomedical Computing, 1990, Proceedings of the First Conference, pp 337–345
21.
Zurück zum Zitat Pruslin D (1966) Automatic recognition of sheet music. Sc.d. dissertation, Massachusetts Institute of Technology Pruslin D (1966) Automatic recognition of sheet music. Sc.d. dissertation, Massachusetts Institute of Technology
22.
Zurück zum Zitat Pugin L (2006) Optical music recognition of early typographic prints using hidden markov models. In: ISMIR, pp 53–56 Pugin L (2006) Optical music recognition of early typographic prints using hidden markov models. In: ISMIR, pp 53–56
23.
Zurück zum Zitat Rebelo A, Fujinaga I, Paszkiewicz F, Marcal ARS, Guedes C, Cardoso JS (2012) Optical music recognition: state-of-the-art and open issues. Int J Multimed Inf Retr 1(3):173–190 Rebelo A, Fujinaga I, Paszkiewicz F, Marcal ARS, Guedes C, Cardoso JS (2012) Optical music recognition: state-of-the-art and open issues. Int J Multimed Inf Retr 1(3):173–190
24.
Zurück zum Zitat Sarvaiya JN, Patnaik S, Bombaywala S (2009) Image registration by template matching using normalized cross-correlation. In: Advances in Computing, Control, Telecommunication Technologies, 2009. ACT ’09. International Conference on, pp 819–822 Sarvaiya JN, Patnaik S, Bombaywala S (2009) Image registration by template matching using normalized cross-correlation. In: Advances in Computing, Control, Telecommunication Technologies, 2009. ACT ’09. International Conference on, pp 819–822
25.
Zurück zum Zitat Sotoodeh M, Tajeripour F (2012) Staff detection and removal using derivation and connected component analysis. In: Artificial Intelligence and Signal Processing (AISP), 2012 16th CSI International Symposium, pp 054–057 Sotoodeh M, Tajeripour F (2012) Staff detection and removal using derivation and connected component analysis. In: Artificial Intelligence and Signal Processing (AISP), 2012 16th CSI International Symposium, pp 054–057
27.
Zurück zum Zitat Su B, Lu S, Pal U, Tan CL (2012) An effective staff detection and removal technique for musical documents. In: Document analysis systems (DAS), 2012 10th IAPR International Workshop, pp 160–164 Su B, Lu S, Pal U, Tan CL (2012) An effective staff detection and removal technique for musical documents. In: Document analysis systems (DAS), 2012 10th IAPR International Workshop, pp 160–164
28.
Zurück zum Zitat Szwoch M (2005) A robust detector for distorted music staves. In: Gagalowicz A, Philips W (eds) computer analysis of images and patterns, vol 3691, Lecture notes in computer science. Springer, Berlin Heidelberg, pp 701–708CrossRef Szwoch M (2005) A robust detector for distorted music staves. In: Gagalowicz A, Philips W (eds) computer analysis of images and patterns, vol 3691, Lecture notes in computer science. Springer, Berlin Heidelberg, pp 701–708CrossRef
29.
Zurück zum Zitat Tardón LJ, Sammartino S, Barbancho I, Gómez V, Oliver A (2009) Optical music recognition for scores written in white mensural notation. J Image Video Process 6 Tardón LJ, Sammartino S, Barbancho I, Gómez V, Oliver A (2009) Optical music recognition for scores written in white mensural notation. J Image Video Process 6
30.
Zurück zum Zitat Toyama F, Shoji K, Miyamichi J (2006) Symbol recognition of printed piano scores with touching symbols. In: Pattern Recognition, 2006. ICPR 2006. 18th International Conference, vol 2, pp 480–483 Toyama F, Shoji K, Miyamichi J (2006) Symbol recognition of printed piano scores with touching symbols. In: Pattern Recognition, 2006. ICPR 2006. 18th International Conference, vol 2, pp 480–483
31.
Zurück zum Zitat Trier OD, Taxt T (1995) Evaluation of binarization methods for document images. Pattern Analysis Mach Intell IEEE Trans 17(3):312–315CrossRef Trier OD, Taxt T (1995) Evaluation of binarization methods for document images. Pattern Analysis Mach Intell IEEE Trans 17(3):312–315CrossRef
32.
Zurück zum Zitat Wei SD, Lai SH (Nov 2008) Fast template matching based on normalized cross correlation with adaptive multilevel winner update. Image Process IEEE Trans 17(11):2227–2235MathSciNetCrossRef Wei SD, Lai SH (Nov 2008) Fast template matching based on normalized cross correlation with adaptive multilevel winner update. Image Process IEEE Trans 17(11):2227–2235MathSciNetCrossRef
Metadaten
Titel
Avoiding staff removal stage in optical music recognition: application to scores written in white mensural notation
verfasst von
Jorge Calvo-Zaragoza
Isabel Barbancho
Lorenzo J. Tardón
Ana M. Barbancho
Publikationsdatum
01.11.2015
Verlag
Springer London
Erschienen in
Pattern Analysis and Applications / Ausgabe 4/2015
Print ISSN: 1433-7541
Elektronische ISSN: 1433-755X
DOI
https://doi.org/10.1007/s10044-014-0415-5

Weitere Artikel der Ausgabe 4/2015

Pattern Analysis and Applications 4/2015 Zur Ausgabe

Premium Partner