Skip to main content
Top
Published in: International Journal on Document Analysis and Recognition (IJDAR) 1/2019

17-01-2019 | OriginalPaper

Stroke order normalization for improving recognition of online handwritten mathematical expressions

Authors: Anh Duc Le, Hai Dai Nguyen, Bipin Indurkhya, Masaki Nakagawa

Published in: International Journal on Document Analysis and Recognition (IJDAR) | Issue 1/2019

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

We present a technique based on stroke order normalization for improving recognition of online handwritten mathematical expressions (ME). The stroke order dependent system has less time complexity than the stroke order free system, but it must incorporate special grammar rules to cope with stroke order variations. The stroke order normalization technique solves this problem and also the problem of unexpected stroke order variations without increasing the time complexity of ME recognition. In order to normalize stroke order, the XY cut method is modified since its original form causes problems when structural components in ME overlap. First, vertically ordered strokes are located by detecting vertical symbols and their upper/lower components, which are treated as MEs and reordered recursively. Second, unordered strokes on the left side of the vertical symbols are reordered as horizontally ordered strokes. Third, the remaining strokes are reordered recursively. The horizontally ordered strokes are reordered from left to right, and the vertically ordered strokes are reordered from top to bottom. Finally, the proposed stroke order normalization is combined with the stroke order dependent ME recognition system. The evaluations on the CROHME 2014 database show that the ME recognition system incorporating the stroke order normalization outperforms all other systems that use only CROHME 2014 for training while the processing time is kept low.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Chan, K., Yeung, D.: Mathematical expression recognition: a survey. Int. J. Doc. Anal. Recognit. 3, 3–15 (2000)CrossRef Chan, K., Yeung, D.: Mathematical expression recognition: a survey. Int. J. Doc. Anal. Recognit. 3, 3–15 (2000)CrossRef
2.
go back to reference Zanibbi, R., Blostein, D.: Recognition and retrieval of mathematical expressions. Int. J. Doc. Anal. Recognit. 15, 331–357 (2012)CrossRef Zanibbi, R., Blostein, D.: Recognition and retrieval of mathematical expressions. Int. J. Doc. Anal. Recognit. 15, 331–357 (2012)CrossRef
3.
go back to reference Mouchere, H., Viard-Gaudin, C., Zanibbi, R., Garain, U.: ICFHR 2014 competition on recognition of on-line handwritten mathematical expressions (CROHME 2014). In: International Conference Frontiers in Handwriting Recognition, pp. 791–796 (2014) Mouchere, H., Viard-Gaudin, C., Zanibbi, R., Garain, U.: ICFHR 2014 competition on recognition of on-line handwritten mathematical expressions (CROHME 2014). In: International Conference Frontiers in Handwriting Recognition, pp. 791–796 (2014)
4.
go back to reference Lehmberg, S., Winkler, H.J., Lang, M.: A soft-decision approach for symbol segmentation within handwritten mathematical expressions, International conference on acoustics, speech, and signal processing, vol. 6, pp. 3434–3437, Atlanta (1996) Lehmberg, S., Winkler, H.J., Lang, M.: A soft-decision approach for symbol segmentation within handwritten mathematical expressions, International conference on acoustics, speech, and signal processing, vol. 6, pp. 3434–3437, Atlanta (1996)
5.
go back to reference Toyozumi, K., et al.: A study of symbol segmentation method for handwritten mathematical formula recognition using mathematical structure information. In: International Conference on Pattern Recognition, vol. 2, pp. 630–633, Cambridge (2004) Toyozumi, K., et al.: A study of symbol segmentation method for handwritten mathematical formula recognition using mathematical structure information. In: International Conference on Pattern Recognition, vol. 2, pp. 630–633, Cambridge (2004)
6.
go back to reference Hu, L., Zanibbi, R.: Segmenting handwritten math symbols using adaboost and multi-scale shape context features. In: International Conference on Document Analysis and Recognition, pp. 1180–1184, Washington (2013) Hu, L., Zanibbi, R.: Segmenting handwritten math symbols using adaboost and multi-scale shape context features. In: International Conference on Document Analysis and Recognition, pp. 1180–1184, Washington (2013)
7.
go back to reference MacLean, S., Labahn, G.: Elastic matching in linear time and constant space. In: IAPR Workshop on Document Analysis Systems (2010) MacLean, S., Labahn, G.: Elastic matching in linear time and constant space. In: IAPR Workshop on Document Analysis Systems (2010)
8.
go back to reference Hu, L., Zanibbi, R.: HMM-based recognition of online hand-written mathematical symbols using segmental K-means initialization and a modified pen-up/down feature. In: International Conference on Document Analysis and Recognition, pp. 457–462, Beijing (2011) Hu, L., Zanibbi, R.: HMM-based recognition of online hand-written mathematical symbols using segmental K-means initialization and a modified pen-up/down feature. In: International Conference on Document Analysis and Recognition, pp. 457–462, Beijing (2011)
9.
go back to reference Alvaro, F., Sanchez, J.A., Benedi, J.M.: Classification of on-line mathematical symbols with hybrid features and recurrent neural networks, International Conference on Document Analysis and Recognition, pp. 1012–1016, Washington (2013) Alvaro, F., Sanchez, J.A., Benedi, J.M.: Classification of on-line mathematical symbols with hybrid features and recurrent neural networks, International Conference on Document Analysis and Recognition, pp. 1012–1016, Washington (2013)
10.
go back to reference Davila, K.M., Ludi, S., Zanibbi R.: Using off-line features and synthetic data for on-line handwritten math symbol recognition, International Conference on Frontiers in Handwriting Recognition, pp. 323–328, Crete (2014) Davila, K.M., Ludi, S., Zanibbi R.: Using off-line features and synthetic data for on-line handwritten math symbol recognition, International Conference on Frontiers in Handwriting Recognition, pp. 323–328, Crete (2014)
11.
go back to reference Garain, U., Chaudhuri, B.B.: Recognition of online handwritten mathematical expressions. IEEE Trans. Syst. Man Cybern. Part B Cybern. 34, 2366–2376 (2004)CrossRef Garain, U., Chaudhuri, B.B.: Recognition of online handwritten mathematical expressions. IEEE Trans. Syst. Man Cybern. Part B Cybern. 34, 2366–2376 (2004)CrossRef
12.
go back to reference Alvaro, F., Sanchez, J.A., Benedi, J.M.: Offline features for classifying handwritten math symbols with recurrent neural networks. In: International Conference on Pattern Recognition, pp. 2944–2949, Stockholm (2014) Alvaro, F., Sanchez, J.A., Benedi, J.M.: Offline features for classifying handwritten math symbols with recurrent neural networks. In: International Conference on Pattern Recognition, pp. 2944–2949, Stockholm (2014)
13.
go back to reference Nguyen, H.D., Le, A.D., Nakagawa, M.: Deep neural network for recognizing online handwritten mathematical symbols. In: IAPR Asian Conference on Pattern Recognition, pp. 121–125 (2015) Nguyen, H.D., Le, A.D., Nakagawa, M.: Deep neural network for recognizing online handwritten mathematical symbols. In: IAPR Asian Conference on Pattern Recognition, pp. 121–125 (2015)
14.
go back to reference Nguyen, H.D., Le, A.D., Nakagawa, M.: Recognition of online handwritten math symbols using deep neural networks, IEICE Transactions on Information and Systems, vol. E99.D, pp. 3110–3118 (2016) Nguyen, H.D., Le, A.D., Nakagawa, M.: Recognition of online handwritten math symbols using deep neural networks, IEICE Transactions on Information and Systems, vol. E99.D, pp. 3110–3118 (2016)
15.
go back to reference Yamamoto, R., Sako, S., Nishimoto, T., Sagayama, S.: Online recognition of handwritten mathematical expressions based on stroke-based stochastic context-free grammar. In: International Workshop on Frontiers in Handwriting Recognition, pp. 249–254, La Baule, France (2006) Yamamoto, R., Sako, S., Nishimoto, T., Sagayama, S.: Online recognition of handwritten mathematical expressions based on stroke-based stochastic context-free grammar. In: International Workshop on Frontiers in Handwriting Recognition, pp. 249–254, La Baule, France (2006)
16.
go back to reference Simistira, F., Katsouros, V., Carayannis, G.: Recognition of online handwritten mathematical formulas using probabilistic SVMs and stochastic context free grammars. Pattern Recognit. Lett. 53, 85–92 (2015)CrossRef Simistira, F., Katsouros, V., Carayannis, G.: Recognition of online handwritten mathematical formulas using probabilistic SVMs and stochastic context free grammars. Pattern Recognit. Lett. 53, 85–92 (2015)CrossRef
17.
go back to reference Le, A.D., Nakagawa, M.: A system for recognizing online handwritten mathematical expressions by using improved structural analysis. Int. J. Doc. Anal. Recognit. 19, 305–319 (2016)CrossRef Le, A.D., Nakagawa, M.: A system for recognizing online handwritten mathematical expressions by using improved structural analysis. Int. J. Doc. Anal. Recognit. 19, 305–319 (2016)CrossRef
18.
go back to reference Le, A.D., Phan, T.V., Nakagawa, M.: A system for recognizing online handwritten mathematical expressions and improvement of structure analysis. In: IAPR Workshop on Document Analysis Systems, pp. 51–55 (2014) Le, A.D., Phan, T.V., Nakagawa, M.: A system for recognizing online handwritten mathematical expressions and improvement of structure analysis. In: IAPR Workshop on Document Analysis Systems, pp. 51–55 (2014)
19.
go back to reference MacLean, S., Labahn, G.: A new approach for recognizing handwritten mathematics using relational grammars and fuzzy sets. Int. J. Doc. Anal. Recognit. 16, 139–163 (2013)CrossRef MacLean, S., Labahn, G.: A new approach for recognizing handwritten mathematics using relational grammars and fuzzy sets. Int. J. Doc. Anal. Recognit. 16, 139–163 (2013)CrossRef
20.
go back to reference Alvaro, F., Sanchez, J., Benedi, J.: Recognition of on-line handwritten mathematical expressions using 2D stochastic context-free grammars and hidden markov models. Pattern Recognit. Lett. 35, 58–67 (2014)CrossRef Alvaro, F., Sanchez, J., Benedi, J.: Recognition of on-line handwritten mathematical expressions using 2D stochastic context-free grammars and hidden markov models. Pattern Recognit. Lett. 35, 58–67 (2014)CrossRef
21.
go back to reference Lee, H.-J., Wang Lee, J.-S.: Design of a mathematical expression understanding system. Pattern Recognit. Lett. 18, 289–298 (1997)CrossRef Lee, H.-J., Wang Lee, J.-S.: Design of a mathematical expression understanding system. Pattern Recognit. Lett. 18, 289–298 (1997)CrossRef
22.
go back to reference Zanibbi, R., Blostein, D., Cordy, J.R.: Recognizing mathematical expressions using tree transformation. IEEE Trans. Pattern Anal. Mach. Intell. 24(11), 1455–1467 (2002)CrossRef Zanibbi, R., Blostein, D., Cordy, J.R.: Recognizing mathematical expressions using tree transformation. IEEE Trans. Pattern Anal. Mach. Intell. 24(11), 1455–1467 (2002)CrossRef
23.
go back to reference Nagy, G., Seth, S.: Hierarchical representation of optically scanned documents. In: International Conference on Pattern Recognition, pp. 347–349, Montreal, Canada (1984) Nagy, G., Seth, S.: Hierarchical representation of optically scanned documents. In: International Conference on Pattern Recognition, pp. 347–349, Montreal, Canada (1984)
24.
go back to reference Meunier, J.: Optimized XY-cut for determining a page reading order. In: International Conference on Document Analysis and Recognition, pp. 347–351, Seoul, Korea (2005) Meunier, J.: Optimized XY-cut for determining a page reading order. In: International Conference on Document Analysis and Recognition, pp. 347–351, Seoul, Korea (2005)
25.
go back to reference Le, A.D., Nguyen, H.D., Nakagawa, M.: Modified X–Y cut for re-ordering strokes of online handwritten mathematical expressions. In: IAPR Workshop on Document Analysis Systems, pp. 233–238, Greece (2016) Le, A.D., Nguyen, H.D., Nakagawa, M.: Modified XY cut for re-ordering strokes of online handwritten mathematical expressions. In: IAPR Workshop on Document Analysis Systems, pp. 233–238, Greece (2016)
26.
go back to reference Eto, Y., Suzuki, M.: Mathematical formula recognition using virtual link network. In: International Conference on Document Analysis and Recognition, pp. 430–437, USA (2001) Eto, Y., Suzuki, M.: Mathematical formula recognition using virtual link network. In: International Conference on Document Analysis and Recognition, pp. 430–437, USA (2001)
27.
go back to reference Aly, W., Uchida, S., Suzuki, M.: Identifying subscripts and superscripts in mathematical documents. Math. Comput. Sci. 2, 195–209 (2008)CrossRefMATH Aly, W., Uchida, S., Suzuki, M.: Identifying subscripts and superscripts in mathematical documents. Math. Comput. Sci. 2, 195–209 (2008)CrossRefMATH
28.
go back to reference Xu, K., Ba, J., Kiros, R., Cho, K., Courville, A., Salakhudinov, R., Zemel, R., Bengio, Y.: Show, attend and tell: neural image caption generation with visual attention. In: International Conference on Machine Learning, vol. 37, pp 2048–2057 (2015) Xu, K., Ba, J., Kiros, R., Cho, K., Courville, A., Salakhudinov, R., Zemel, R., Bengio, Y.: Show, attend and tell: neural image caption generation with visual attention. In: International Conference on Machine Learning, vol. 37, pp 2048–2057 (2015)
29.
go back to reference Luong, T., Pham, H., Manning, C.: Effective approaches to attention-based neural machine translation. In: The 2015 Conference on Empirical Methods in Natural Language Processing (2015) Luong, T., Pham, H., Manning, C.: Effective approaches to attention-based neural machine translation. In: The 2015 Conference on Empirical Methods in Natural Language Processing (2015)
30.
go back to reference Deng, Y., Kanervisto, A., Ling, J., Rush, A.: Image-to-Markup Generation with Coarse-to-Fine Attention, pp. 980–989. ICML, Pittsburgh (2017) Deng, Y., Kanervisto, A., Ling, J., Rush, A.: Image-to-Markup Generation with Coarse-to-Fine Attention, pp. 980–989. ICML, Pittsburgh (2017)
31.
go back to reference Zhang, J., Du, J., Zhang, S., Liu, D., Hu, Y., Hu, J., Wei, S., Dai, L.: Watch, attend and parse: an end-to-end neural network based approach to handwritten mathematical expression recognition. Pattern Recognit. 71, 196–206 (2017)CrossRef Zhang, J., Du, J., Zhang, S., Liu, D., Hu, Y., Hu, J., Wei, S., Dai, L.: Watch, attend and parse: an end-to-end neural network based approach to handwritten mathematical expression recognition. Pattern Recognit. 71, 196–206 (2017)CrossRef
32.
go back to reference Le, A.D., Nakagawa, M.: Training an end-to-end system for handwritten mathematical expression recognition by generated patterns. In: International Conference on Document Analysis and Recognition, pp. 1056–1061 (2017) Le, A.D., Nakagawa, M.: Training an end-to-end system for handwritten mathematical expression recognition by generated patterns. In: International Conference on Document Analysis and Recognition, pp. 1056–1061 (2017)
Metadata
Title
Stroke order normalization for improving recognition of online handwritten mathematical expressions
Authors
Anh Duc Le
Hai Dai Nguyen
Bipin Indurkhya
Masaki Nakagawa
Publication date
17-01-2019
Publisher
Springer Berlin Heidelberg
Published in
International Journal on Document Analysis and Recognition (IJDAR) / Issue 1/2019
Print ISSN: 1433-2833
Electronic ISSN: 1433-2825
DOI
https://doi.org/10.1007/s10032-019-00315-2

Other articles of this Issue 1/2019

International Journal on Document Analysis and Recognition (IJDAR) 1/2019 Go to the issue

Premium Partner