Published in: International Journal on Document Analysis and Recognition (IJDAR), Issue 1/2014

1 March 2014 | Original Paper

An over-segmentation method for single-touching Chinese handwriting with learning-based filtering

Authors: Liang Xu, Fei Yin, Qiu-Feng Wang, Cheng-Lin Liu

Abstract

The segmentation of touching characters is still a challenging task, posing a bottleneck for offline Chinese handwriting recognition. In this paper, we propose an effective over-segmentation method with learning-based filtering using geometric features for single-touching Chinese handwriting. First, we detect candidate cuts by skeleton and contour analysis to guarantee a high recall rate of character separation. A filter is designed by supervised learning and used to prune implausible cuts to improve the precision. Since the segmentation rules and features are independent of the string length, the proposed method can deal with touching strings with more than two characters. The proposed method is evaluated on both the character segmentation task and the text line recognition task. The results on two large databases demonstrate the superiority of the proposed method in dealing with single-touching Chinese handwriting.
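
The filtering stage described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not name the classifier or the geometric features, so the logistic-regression filter, the two features (`cut_length`, `center_offset`), the toy training data, and the threshold below are all assumptions chosen for illustration. The candidate cuts themselves are assumed to come from an earlier skeleton and contour analysis step that is not shown.

```python
import math
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class CandidateCut:
    # Hypothetical geometric features; the paper's actual feature set is not given here.
    cut_length: float     # length of the cut, in units of estimated stroke width
    center_offset: float  # |x - string centre| / string width, in [0, 0.5]

    def features(self) -> List[float]:
        # The leading 1.0 acts as the bias term.
        return [1.0, self.cut_length, self.center_offset]


def sigmoid(z: float) -> float:
    return 1.0 / (1.0 + math.exp(-z))


def plausibility(weights: Sequence[float], cut: CandidateCut) -> float:
    """Estimated probability that a candidate cut is a true character boundary."""
    return sigmoid(sum(w * x for w, x in zip(weights, cut.features())))


def train_filter(cuts: Sequence[CandidateCut], labels: Sequence[int],
                 lr: float = 0.1, epochs: int = 5000) -> List[float]:
    """Fit logistic regression by batch gradient descent (assumed classifier)."""
    weights = [0.0, 0.0, 0.0]
    n = len(cuts)
    for _ in range(epochs):
        grad = [0.0, 0.0, 0.0]
        for cut, label in zip(cuts, labels):
            err = plausibility(weights, cut) - label
            for j, x in enumerate(cut.features()):
                grad[j] += err * x
        weights = [w - lr * g / n for w, g in zip(weights, grad)]
    return weights


def prune(weights: Sequence[float], cuts: Sequence[CandidateCut],
          threshold: float = 0.3) -> List[CandidateCut]:
    """Drop implausible cuts; a threshold below 0.5 favours recall over precision."""
    return [c for c in cuts if plausibility(weights, c) >= threshold]


# Toy labelled data (invented): 1 = true boundary, 0 = spurious cut.
train_cuts = [
    CandidateCut(1.0, 0.05), CandidateCut(1.2, 0.10),
    CandidateCut(0.9, 0.02), CandidateCut(1.1, 0.08),
    CandidateCut(3.5, 0.30), CandidateCut(4.0, 0.40),
    CandidateCut(3.0, 0.35), CandidateCut(4.5, 0.25),
]
train_labels = [1, 1, 1, 1, 0, 0, 0, 0]
weights = train_filter(train_cuts, train_labels)

# Over-segmentation output for one touching string: one short, one long cut.
candidates = [CandidateCut(1.0, 0.05), CandidateCut(4.0, 0.40)]
kept = prune(weights, candidates)
```

Because each cut is scored on its own features, the same filter applies regardless of how many characters the touching string contains, which mirrors the abstract's point that the rules and features are independent of string length.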

Metadata
Title
An over-segmentation method for single-touching Chinese handwriting with learning-based filtering
Authors
Liang Xu
Fei Yin
Qiu-Feng Wang
Cheng-Lin Liu
Publication date
1 March 2014
Publisher
Springer Berlin Heidelberg
Published in
International Journal on Document Analysis and Recognition (IJDAR), Issue 1/2014
Print ISSN: 1433-2833
Electronic ISSN: 1433-2825
DOI
https://doi.org/10.1007/s10032-013-0208-1