Skip to main content
Top
Published in: Soft Computing 9/2015

01-09-2015 | Methodologies and Application

An approach to the script discrimination in the Slavic documents

Script discrimination

Authors: Darko Brodić, Zoran N. Milivojević, Čedomir A. Maluckov

Published in: Soft Computing | Issue 9/2015

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The paper deals with the problem of the script discrimination in old Slavic printed documents. Therefore, an algorithm for script classification and identification is proposed. It creates coded text from initial document. Then, the coded text is subjected to statistical analysis. As a result, the texture feature extraction is carried out. Obtained texture features are used as criteria for script classification and identification. The proposed method is tested on the samples of old Slavic printed documents written in Glagolitic, Cyrillic and Latin script.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literature
go back to reference Bharati MH, Liu JJ, MacGregor JF (2004) Image texture analysis: methods and comparisons. Chemom Intell Lab Systems 72(1):57–71CrossRef Bharati MH, Liu JJ, MacGregor JF (2004) Image texture analysis: methods and comparisons. Chemom Intell Lab Systems 72(1):57–71CrossRef
go back to reference Brodić D, Milivojević ZN, Maluckov Č (2013) Recognition of the script in Serbian documents using frequency occurrence and co-occurrence analysis. Sci World J 2013(896328):1–14CrossRef Brodić D, Milivojević ZN, Maluckov Č (2013) Recognition of the script in Serbian documents using frequency occurrence and co-occurrence analysis. Sci World J 2013(896328):1–14CrossRef
go back to reference Brodić D, Milivojević Z, Maluckov Č A (2014) Script characterization in the old Slavic documents. In: Elmoataz A, Lezoray O, Nouboud F, Mammass D (eds) Image and Signal Processing, LNCS 8509, pp 230–238. Springer, Berlin Brodić D, Milivojević Z, Maluckov Č A (2014) Script characterization in the old Slavic documents. In: Elmoataz A, Lezoray O, Nouboud F, Mammass D (eds) Image and Signal Processing, LNCS 8509, pp 230–238. Springer, Berlin
go back to reference Busch A, Boles WW, Sridharan S (2006) Texture for script identification. IEEE Trans Pattern Anal Mach Intell 27(11):1720–1732CrossRef Busch A, Boles WW, Sridharan S (2006) Texture for script identification. IEEE Trans Pattern Anal Mach Intell 27(11):1720–1732CrossRef
go back to reference Chaudhuri BB, Pal U, Mitra M (2002) Automatic recognition of printed Oriya script. Sadhana 27(1):23–34CrossRef Chaudhuri BB, Pal U, Mitra M (2002) Automatic recognition of printed Oriya script. Sadhana 27(1):23–34CrossRef
go back to reference Clausi DA (2002) An analysis of co-occurrence texture statistics as a function of grey level quantization. Can J Remote Sens 28(1):45–62CrossRef Clausi DA (2002) An analysis of co-occurrence texture statistics as a function of grey level quantization. Can J Remote Sens 28(1):45–62CrossRef
go back to reference Del Bimbo A (2001) Visual information retrieval. Morgan Kaufmann Publishers Inc, San Francisco Del Bimbo A (2001) Visual information retrieval. Morgan Kaufmann Publishers Inc, San Francisco
go back to reference Eleyan A, Demirel H (2011) Co-occurrence matrix and its statistical features as a new approach for face recognition. Turkish J Electrical Eng Comput Sci 19(1):98–107 Eleyan A, Demirel H (2011) Co-occurrence matrix and its statistical features as a new approach for face recognition. Turkish J Electrical Eng Comput Sci 19(1):98–107
go back to reference Ghosh D, Dube T, Shivaprasad AP (2010) Script recognition—a review. IEEE Trans Pattern Anal Mach Intell 32(12):2142–2161CrossRef Ghosh D, Dube T, Shivaprasad AP (2010) Script recognition—a review. IEEE Trans Pattern Anal Mach Intell 32(12):2142–2161CrossRef
go back to reference Haralick R, Shanmugam K, Dinstein I (1973) Textural features for image classification. IEEE Trans Systems Man Cybern 3(6):610–621CrossRef Haralick R, Shanmugam K, Dinstein I (1973) Textural features for image classification. IEEE Trans Systems Man Cybern 3(6):610–621CrossRef
go back to reference Haralick RM (1979) Statistical and structural approaches to texture. Proc IEEE 67(5):786–804CrossRef Haralick RM (1979) Statistical and structural approaches to texture. Proc IEEE 67(5):786–804CrossRef
go back to reference Joshi GD, Garg S, Sivaswamy J (2007) A generalised framework for script identification. Int J Document Anal Recogn ( IJDAR) 10(2):55–68CrossRef Joshi GD, Garg S, Sivaswamy J (2007) A generalised framework for script identification. Int J Document Anal Recogn ( IJDAR) 10(2):55–68CrossRef
go back to reference Pal U, Chaudhury BB (2002) Identification of different script lines from multi-script documents. Image Vis Comput 20(13–14):945–954 Pal U, Chaudhury BB (2002) Identification of different script lines from multi-script documents. Image Vis Comput 20(13–14):945–954
go back to reference Silva C, Ribeiro B (2007) On text-based mining with active learning and background knowledge using SVM. Soft Comput 11(6):519–530CrossRef Silva C, Ribeiro B (2007) On text-based mining with active learning and background knowledge using SVM. Soft Comput 11(6):519–530CrossRef
go back to reference Tolambiya A, Venkatraman S, Kalra PK (2010) Content-based image classification with wavelet relevance vector machines. Soft Comput 14(2):129–136CrossRef Tolambiya A, Venkatraman S, Kalra PK (2010) Content-based image classification with wavelet relevance vector machines. Soft Comput 14(2):129–136CrossRef
go back to reference Valkealahti K, Oja E (1998) Reduced multidimensional co-occurrence histograms in texture classification. IEEE Trans Pattern Anal Mach Intell 20(1):90–94CrossRef Valkealahti K, Oja E (1998) Reduced multidimensional co-occurrence histograms in texture classification. IEEE Trans Pattern Anal Mach Intell 20(1):90–94CrossRef
go back to reference Yang Z, Purves D (2004) The statistical structure of natural light patterns determines perceived light intensity. In: Proceedings of the National Academy of sciences of the United States of America 101(23):8745–8750 Yang Z, Purves D (2004) The statistical structure of natural light patterns determines perceived light intensity. In: Proceedings of the National Academy of sciences of the United States of America 101(23):8745–8750
go back to reference Zhang J, Tan T (2002) Brief review of invariant texture analysis methods. Pattern Recogn 35(3):735–747CrossRefMATH Zhang J, Tan T (2002) Brief review of invariant texture analysis methods. Pattern Recogn 35(3):735–747CrossRefMATH
go back to reference Zramdini AW, Ingold R (1998) Optical font recognition using typographical features. IEEE Trans Pattern Anal Mach Intell 20(8):877–882CrossRef Zramdini AW, Ingold R (1998) Optical font recognition using typographical features. IEEE Trans Pattern Anal Mach Intell 20(8):877–882CrossRef
Metadata
Title
An approach to the script discrimination in the Slavic documents
Script discrimination
Authors
Darko Brodić
Zoran N. Milivojević
Čedomir A. Maluckov
Publication date
01-09-2015
Publisher
Springer Berlin Heidelberg
Published in
Soft Computing / Issue 9/2015
Print ISSN: 1432-7643
Electronic ISSN: 1433-7479
DOI
https://doi.org/10.1007/s00500-014-1435-1

Other articles of this Issue 9/2015

Soft Computing 9/2015 Go to the issue

Premium Partner