Skip to main content

2015 | OriginalPaper | Buchkapitel

Classification of the Scripts in Medieval Documents from Balkan Region by Run-Length Texture Analysis

verfasst von : Darko Brodić, Alessia Amelio, Zoran N. Milivojević

Erschienen in: Neural Information Processing

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The paper presents a script classification method of the medieval documents originated from the Balkan region. It consists in a multi-step procedure which includes the text mapping according to typographical features, creation of equivalent image patterns, run-length pattern analysis in order to establish a feature vector and state-of-the art classification method Genetic Algorithms Image Clustering for Document Analysis (GA-ICDA) which successfully disseminates the documents written in different scripts. The proposed method is evaluated on custom oriented document databases, which include the handprinted or printed documents written in old Cyrillic, angular and round Glagolitic, ancient Latin and Greek scripts. The experiment demonstrates very good results.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Ghosh, D., Dube, T., Shivaprasad, A.: Script recognition - a review. IEEE Trans. Pattern Anal. Mach. Intell. 32(12), 2142–2161 (2010)CrossRef Ghosh, D., Dube, T., Shivaprasad, A.: Script recognition - a review. IEEE Trans. Pattern Anal. Mach. Intell. 32(12), 2142–2161 (2010)CrossRef
2.
Zurück zum Zitat Joshi, G.D., Garg, S., Sivaswamy, J.: A generalised framework for script identification. Int. J. Doc. Anal. Recogn. 10(2), 55–68 (2007)CrossRef Joshi, G.D., Garg, S., Sivaswamy, J.: A generalised framework for script identification. Int. J. Doc. Anal. Recogn. 10(2), 55–68 (2007)CrossRef
4.
Zurück zum Zitat Brodić, D., Maluckov, Č.A., Milivojević, Z.N., Draganov, I.R.: Differentiation of the script using adjacent local binary patterns. In: Agre, G., Hitzler, P., Krisnadhi, A.A., Kuznetsov, S.O. (eds.) AIMSA 2014. LNCS, vol. 8722, pp. 162–169. Springer, Heidelberg (2014) Brodić, D., Maluckov, Č.A., Milivojević, Z.N., Draganov, I.R.: Differentiation of the script using adjacent local binary patterns. In: Agre, G., Hitzler, P., Krisnadhi, A.A., Kuznetsov, S.O. (eds.) AIMSA 2014. LNCS, vol. 8722, pp. 162–169. Springer, Heidelberg (2014)
5.
Zurück zum Zitat Zramdini, A.W., Ingold, R.: Optical font recognition using typographical features. IEEE Trans. Pattern Anal. Mach. Intell. 20(8), 877–882 (1998)CrossRef Zramdini, A.W., Ingold, R.: Optical font recognition using typographical features. IEEE Trans. Pattern Anal. Mach. Intell. 20(8), 877–882 (1998)CrossRef
6.
Zurück zum Zitat Galloway, M.M.: Texture analysis using gray level run lengths. Comput. Graph. Image Process. 4(2), 172–179 (1975)CrossRef Galloway, M.M.: Texture analysis using gray level run lengths. Comput. Graph. Image Process. 4(2), 172–179 (1975)CrossRef
7.
Zurück zum Zitat Chu, A., Sehgal, C.M., Greenleaf, J.F.: Use of gray value distribution of run lengths for texture analysis. Pattern Recogn. Lett. 11(6), 415–419 (1990)CrossRefMATH Chu, A., Sehgal, C.M., Greenleaf, J.F.: Use of gray value distribution of run lengths for texture analysis. Pattern Recogn. Lett. 11(6), 415–419 (1990)CrossRefMATH
8.
Zurück zum Zitat Dasarathy, B.R., Holder, E.B.: Image characterizations based on joint gray-level run-length distributions. Pattern Recogn. Lett. 12(8), 497–502 (1991)CrossRef Dasarathy, B.R., Holder, E.B.: Image characterizations based on joint gray-level run-length distributions. Pattern Recogn. Lett. 12(8), 497–502 (1991)CrossRef
9.
Zurück zum Zitat Brodić, D., Amelio, A., Milivojević, Z.N.: Characterization and distinction between closely related south Slavic languages on the example of Serbian and Croatian. In: Azzopardi, G., Petkov, N., Yamagiwa, S. (eds.) CAIP 2015. LNCS, vol. 9256, pp. 654–666. Springer, Heidelberg (2015) CrossRef Brodić, D., Amelio, A., Milivojević, Z.N.: Characterization and distinction between closely related south Slavic languages on the example of Serbian and Croatian. In: Azzopardi, G., Petkov, N., Yamagiwa, S. (eds.) CAIP 2015. LNCS, vol. 9256, pp. 654–666. Springer, Heidelberg (2015) CrossRef
10.
Zurück zum Zitat Amelio, A., Pizzuti, C.: A new evolutionary-based clustering framework for image databases. In: Elmoataz, A., Lezoray, O., Nouboud, F., Mammass, D. (eds.) ICISP 2014. LNCS, vol. 8509, pp. 322–331. Springer, Heidelberg (2014) Amelio, A., Pizzuti, C.: A new evolutionary-based clustering framework for image databases. In: Elmoataz, A., Lezoray, O., Nouboud, F., Mammass, D. (eds.) ICISP 2014. LNCS, vol. 8509, pp. 322–331. Springer, Heidelberg (2014)
11.
Zurück zum Zitat Marti, R., Laguna, M., Glover, F., Campos, V.: Reducing the bandwidth of a sparse matrix with tabu search. Eur. J. Oper. Res. 135(2), 450–280 (2001)MathSciNetCrossRefMATH Marti, R., Laguna, M., Glover, F., Campos, V.: Reducing the bandwidth of a sparse matrix with tabu search. Eur. J. Oper. Res. 135(2), 450–280 (2001)MathSciNetCrossRefMATH
12.
Zurück zum Zitat Marinai, S., Marino, E., Soda, G.: Self-organizing maps for clustering in document image analysis, machine learning in document analysis and recognition. In: Marinai, S., Fujisawa, H. (eds.) Machine Learning in Document Analysis and Recognition. LNCS (SCI), vol. 90, pp. 193–219. Springer, Heidelberg (2008) CrossRef Marinai, S., Marino, E., Soda, G.: Self-organizing maps for clustering in document image analysis, machine learning in document analysis and recognition. In: Marinai, S., Fujisawa, H. (eds.) Machine Learning in Document Analysis and Recognition. LNCS (SCI), vol. 90, pp. 193–219. Springer, Heidelberg (2008) CrossRef
13.
Zurück zum Zitat Pu, Y., Shi, J., Guo, L.: A hierarchical method for clustering binary text image. In: Yuan, Y., Wu, X., Lu, Y. (eds.) ISCTCS 2012. CCIS, vol. 320, pp. 388–396. Springer, Heidelberg (2013) CrossRef Pu, Y., Shi, J., Guo, L.: A hierarchical method for clustering binary text image. In: Yuan, Y., Wu, X., Lu, Y. (eds.) ISCTCS 2012. CCIS, vol. 320, pp. 388–396. Springer, Heidelberg (2013) CrossRef
14.
Zurück zum Zitat Rigutini, L., Maggini, M.: A semi-supervised document clustering algorithm based on EM. In: Proceedings of the International Conference on 2005 IEEE/WIC/ACM on Web Intelligence, pp. 200–206 (2005) Rigutini, L., Maggini, M.: A semi-supervised document clustering algorithm based on EM. In: Proceedings of the International Conference on 2005 IEEE/WIC/ACM on Web Intelligence, pp. 200–206 (2005)
15.
Zurück zum Zitat Hu, X., Yoo, I.: A comprehensive comparison study of document clustering for a biomedical digital library medline. In: Proceedings of the 6th ACM/IEEE-CS Joint Conference on Digital Libraries, pp. 220–229 (2006) Hu, X., Yoo, I.: A comprehensive comparison study of document clustering for a biomedical digital library medline. In: Proceedings of the 6th ACM/IEEE-CS Joint Conference on Digital Libraries, pp. 220–229 (2006)
16.
Zurück zum Zitat De Vargas, R.R., Bedregal, B.R.C.: A way to obtain the quality of a partition by adjusted rand index. In: Workshop-School on Theoretical Computer Science, pp. 67–71 (2013) De Vargas, R.R., Bedregal, B.R.C.: A way to obtain the quality of a partition by adjusted rand index. In: Workshop-School on Theoretical Computer Science, pp. 67–71 (2013)
Metadaten
Titel
Classification of the Scripts in Medieval Documents from Balkan Region by Run-Length Texture Analysis
verfasst von
Darko Brodić
Alessia Amelio
Zoran N. Milivojević
Copyright-Jahr
2015
DOI
https://doi.org/10.1007/978-3-319-26532-2_48