Published in: Pattern Analysis and Applications 3/2014

1 August 2014 | Theoretical Advances

Performance enhancement of online handwritten Tamil symbol recognition with reevaluation techniques

Authors: Suresh Sundaram, A. G. Ramakrishnan

Abstract

In this article, we aim to reduce the error rate of the online Tamil symbol recognition system by employing multiple experts to reevaluate certain decisions of the primary support vector machine (SVM) classifier. First, motivated by the relatively high frequency of base consonants in the script, we propose a reevaluation technique to resolve ambiguities arising among the base consonants. Second, we propose a dynamic time warping (DTW) method to automatically extract the discriminative regions for each set of confused characters; class-specific features derived from these regions help reduce the degree of confusion. Third, we propose statistics of specific features for resolving confusions among vowel modifiers. The reevaluation approaches are tested on two databases: (a) the isolated Tamil symbols in the IWFHR test set, and (b) the symbols segmented from a set of 10,000 Tamil words. The recognition rate on the isolated test symbols of the IWFHR database improves by 1.9 %. For the word database, the reevaluation step improves the symbol recognition rate by 3.5 percentage points (from 88.4 % to 91.9 %). This, in turn, boosts the word recognition rate by 11.9 percentage points (from 65.0 % to 76.9 %). The reduction in word error rate is achieved using a generic approach, without incorporating language models.
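The abstract names dynamic time warping as the tool for locating discriminative regions but gives no procedural detail. The following is a minimal sketch of the general idea only, not the authors' method: it aligns two pen traces with textbook dynamic time warping and reports the points of the first trace that remain farthest from their aligned partners. The function names `dtw_path` and `most_different_points`, the `top_k` parameter, and the toy traces are all assumptions made for illustration.

```python
import math


def dtw_path(a, b):
    """Align two point sequences with classic dynamic time warping.

    Returns (total_cost, path); path is a list of (i, j) index pairs.
    Illustrative helper, not taken from the paper.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = minimal accumulated distance aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(a[i - 1], b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j],
                                 cost[i][j - 1],
                                 cost[i - 1][j - 1])
    # Backtrack from the end to recover the warping path.
    path = []
    i, j = n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        steps = [(cost[i - 1][j - 1], i - 1, j - 1),
                 (cost[i - 1][j], i - 1, j),
                 (cost[i][j - 1], i, j - 1)]
        _, i, j = min(steps)
    path.reverse()
    return cost[n][m], path


def most_different_points(a, b, path, top_k=3):
    """Return sorted indices into `a` whose aligned points in `b` are farthest.

    A crude stand-in for a "discriminative region" (assumption).
    """
    worst = {}
    for i, j in path:
        d = math.dist(a[i], b[j])
        worst[i] = max(worst.get(i, 0.0), d)
    ranked = sorted(worst, key=lambda i: worst[i], reverse=True)
    return sorted(ranked[:top_k])


# Toy traces (invented): identical at the start, diverging in the tail.
trace_a = [(0, 0), (1, 0), (2, 0), (3, 0), (4, 0)]
trace_b = [(0, 0), (1, 0), (2, 0), (3, 1), (4, 2)]

total, path = dtw_path(trace_a, trace_b)
region = most_different_points(trace_a, trace_b, path, top_k=2)
print(total, region)
```

On these toy traces the alignment is diagonal and the flagged indices fall in the tail, where the two shapes diverge. A real system would presumably work on preprocessed, resampled strokes and aggregate over many samples per class; those steps are not described in the abstract and are left out here.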

Metadata
Title
Performance enhancement of online handwritten Tamil symbol recognition with reevaluation techniques
Authors
Suresh Sundaram
A. G. Ramakrishnan
Publication date
1 August 2014
Publisher
Springer London
Published in
Pattern Analysis and Applications, Issue 3/2014
Print ISSN: 1433-7541
Electronic ISSN: 1433-755X
DOI
https://doi.org/10.1007/s10044-013-0353-7
