2019 | OriginalPaper | Chapter

Minimizing Training Data for Reliable Writer Identification in Medieval Manuscripts

Authors: Nicole Dalia Cilia, Claudio De Stefano, Francesco Fontanella, Mario Molinara, Alessandra Scotto di Freca

Published in: New Trends in Image Analysis and Processing – ICIAP 2019

Publisher: Springer International Publishing

Abstract

Palaeography is the study of ancient documents, and one of its most important problems is identifying the people who took part in writing a given document. To this end, expert palaeographers typically analyze handwriting features such as letter heights and widths, distances between characters, and angles of inclination. To obtain more precise measurements, and thanks to the availability of high-quality digital images, palaeographers are starting to use digital tools. In previous studies, we proposed a pattern recognition system for distinguishing the writers of medieval books and investigated the minimum amount of training data needed to achieve satisfactory accuracy. In this paper, we present a reject option that allows us to implement a highly reliable writer identification system trained on a reduced set of data. Experimental results on two sets of digital images from medieval Bibles show that rejecting only a few samples strongly reduces the error rate.
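
The abstract does not say how the reject option is defined, so the following is only a minimal sketch of one common form: a confidence threshold on the classifier's top score. The function names, the toy scores, and the choice to measure the error rate over accepted samples only are all assumptions made for illustration, not the authors' method or data.

```python
from typing import List, Optional, Sequence, Tuple


def classify_with_reject(scores: Sequence[float], threshold: float) -> Optional[int]:
    """Return the index of the top-scoring class, or None to reject.

    Hypothetical rule: a sample is rejected when the highest class score
    falls below `threshold`. The paper may use a different criterion.
    """
    best = max(range(len(scores)), key=lambda i: scores[i])
    return best if scores[best] >= threshold else None


def error_and_reject_rates(
    all_scores: Sequence[Sequence[float]],
    labels: Sequence[int],
    threshold: float,
) -> Tuple[float, float]:
    """Compute (error rate on accepted samples, reject rate on all samples)."""
    total = len(labels)
    rejected = 0
    errors = 0
    for scores, label in zip(all_scores, labels):
        prediction = classify_with_reject(scores, threshold)
        if prediction is None:
            rejected += 1
        elif prediction != label:
            errors += 1
    accepted = total - rejected
    # Assumption: errors are counted only among samples that were not rejected.
    error_rate = errors / accepted if accepted else 0.0
    return error_rate, rejected / total


# Made-up per-class scores for two hypothetical writers (0 and 1).
TOY_SCORES: List[List[float]] = [
    [0.90, 0.10],
    [0.80, 0.20],
    [0.55, 0.45],  # low-confidence sample that the classifier gets wrong
    [0.30, 0.70],
    [0.60, 0.40],
]
TOY_LABELS: List[int] = [0, 0, 1, 1, 0]

if __name__ == "__main__":
    for t in (0.0, 0.58):
        err, rej = error_and_reject_rates(TOY_SCORES, TOY_LABELS, t)
        print(f"threshold={t:.2f}  error rate={err:.2f}  reject rate={rej:.2f}")
```

On this toy data, raising the threshold from 0.0 to 0.58 rejects the single low-confidence sample, which is also the only misclassified one. That is the trade-off the abstract describes: a small reject rate in exchange for a lower error rate on the samples that are accepted.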

Metadata
DOI
https://doi.org/10.1007/978-3-030-30754-7_20