Skip to main content

2002 | OriginalPaper | Buchkapitel

An Integrated System for the Analysis and the Recognition of Characters in Ancient Documents

verfasst von : Stefano Vezzosi, Luigi Bedini, Anna Tonazzini

Erschienen in: Document Analysis Systems V

Verlag: Springer Berlin Heidelberg

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

This paper describes an integrated system for processing and analyzing highly degraded ancient printed documents. For each page, the system reduces noise by wavelet-based filtering, extracts and segments the text lines into characters by a fast adaptive thresholding, and performs OCR by a feed-forward back-propagation multilayer neural network. The probability recognition is used as a discriminant parameter for determining the automatic activation of a feed-back process, leading back to a block for refining segmentation. This block acts only on the small portions of the text where the recognition was not trustable, and makes use of blind deconvolution and MRF-based segmentation techniques. The experimental results highlight the good performance of the whole system in the analysis of even strongly degraded texts.

Metadaten
Titel
An Integrated System for the Analysis and the Recognition of Characters in Ancient Documents
verfasst von
Stefano Vezzosi
Luigi Bedini
Anna Tonazzini
Copyright-Jahr
2002
Verlag
Springer Berlin Heidelberg
DOI
https://doi.org/10.1007/3-540-45869-7_5

Premium Partner