Skip to main content
Erschienen in: International Journal on Document Analysis and Recognition (IJDAR) 3/2020

10.03.2020 | Special Issue Paper

Hyperkernel-based intuitionistic fuzzy c-means for denoising color archival document images

verfasst von: Walid Elhedda, Maroua Mehri, Mohamed Ali Mahjoub

Erschienen in: International Journal on Document Analysis and Recognition (IJDAR) | Ausgabe 3/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this article, we have addressed the problem of denoising and enhancement of color archival handwritten document images by separating noise from text and background. Indeed, archival document images that originated from scanning or photographing paper documents are mainly digitized in full color mode. Thus, it is necessary to preserve and exploit color information when applying an enhancement method or a denoising technique. Thus, the focus of our work has been to model a color image using a hyperspace. The defined hyperspace formed by the image pixels is obtained by using both topological and color spaces. The novelty of our work lies in exploiting the obtained hyperspace to cluster the extracted low-level features (topological and color) and, thereafter, to separate noise from text and background. Indeed, based on combining the obtained hyperspace with an adapted kernel-based intuitionistic fuzzy c-means (KIFCM) algorithm we have proposed a novel hyper-KIFCM (HKIFCM) method for denoising color historical document images. To illustrate the effectiveness of the HKIFCM method, a thorough experimental study has been firstly conducted with qualitative and quantitative observations obtained from color archival handwritten document images collected from both the Tunisian national archives and two datasets provided in the context of open competitions at ICDAR and ICFHR conferences. Then, we have compared the results achieved with those obtained using the state-of-the-art methods.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
Some image samples among those we have used in our experiments are temporarily available on https://​drive.​google.​com/​open?​id=​1X-SDB2CmT3cfkB8dTd​S3mEWR5-KLkwsa and on request subject to the agreement from the ANT.
 
Literatur
4.
Zurück zum Zitat Elhedda, W., Mehri, M., Mahjoub, M.A.: A comparative study of filtering approaches applied to color archival document images. In: Proceedings of the International Arab Conference on Information Technology (2017) Elhedda, W., Mehri, M., Mahjoub, M.A.: A comparative study of filtering approaches applied to color archival document images. In: Proceedings of the International Arab Conference on Information Technology (2017)
5.
Zurück zum Zitat Stanco, F., Tenze, L., Ramponi, G.: Technique to correct yellowing and foxing in antique books. IET Image Process. 1(2), 123–133 (2007) Stanco, F., Tenze, L., Ramponi, G.: Technique to correct yellowing and foxing in antique books. IET Image Process. 1(2), 123–133 (2007)
6.
Zurück zum Zitat Drira, F., LeBourgeois, F., Emptoz, H.: Restoring ink bleed-through degraded document images using a recursive unsupervised classification technique. In: Lecture Notes in Computer Science (2006) Drira, F., LeBourgeois, F., Emptoz, H.: Restoring ink bleed-through degraded document images using a recursive unsupervised classification technique. In: Lecture Notes in Computer Science (2006)
7.
Zurück zum Zitat Tan, C.L., Shen, P.: Restoration of archival documents using a wavelet technique. IEEE Trans. Pattern Anal. Mach. Intell. 24, 10 (2002) Tan, C.L., Shen, P.: Restoration of archival documents using a wavelet technique. IEEE Trans. Pattern Anal. Mach. Intell. 24, 10 (2002)
8.
Zurück zum Zitat Charrada, M.A., Benamara, N.E.: Old document image denoising using bilateral filter. In: International Document Image Processing (2013) Charrada, M.A., Benamara, N.E.: Old document image denoising using bilateral filter. In: International Document Image Processing (2013)
9.
Zurück zum Zitat Ganbold, G.: History document image background noise and removal methods. Int. J. Knowl. Content Dev. Technol. 5(2), 11–24 (2015) Ganbold, G.: History document image background noise and removal methods. Int. J. Knowl. Content Dev. Technol. 5(2), 11–24 (2015)
10.
Zurück zum Zitat Chaira, T.: A novel intuitionistic fuzzy c-means color clustering on human cell images. In: Proceedings of World Congress on Nature and Biologically Inspired Computing, pp. 736–741 (2009) Chaira, T.: A novel intuitionistic fuzzy c-means color clustering on human cell images. In: Proceedings of World Congress on Nature and Biologically Inspired Computing, pp. 736–741 (2009)
11.
Zurück zum Zitat Lin, K.P.: A novel evolutionary kernel intuitionistic fuzzy c-means clustering algorithm. IEEE Trans. Fuzzy Syst. 22(5), 1074–1087 (2014) Lin, K.P.: A novel evolutionary kernel intuitionistic fuzzy c-means clustering algorithm. IEEE Trans. Fuzzy Syst. 22(5), 1074–1087 (2014)
12.
Zurück zum Zitat Sugeno, M.: Fuzzy measures and fuzzy integrals: a survey. In: Readings in Fuzzy Sets for Intelligent Systems, pp. 251–257. Morgan Kaufmann, Los Altos (1993) Sugeno, M.: Fuzzy measures and fuzzy integrals: a survey. In: Readings in Fuzzy Sets for Intelligent Systems, pp. 251–257. Morgan Kaufmann, Los Altos (1993)
13.
Zurück zum Zitat Leydier, Y., LeBourgeois, F., Emptoz, H.: Serialized unsupervised classifier for adaptative color image segmentation: application to digitized ancient manuscripts. In: Proceedings of International Conference on Pattern Recognition, vol. 1, pp. 494–497 (2004) Leydier, Y., LeBourgeois, F., Emptoz, H.: Serialized unsupervised classifier for adaptative color image segmentation: application to digitized ancient manuscripts. In: Proceedings of International Conference on Pattern Recognition, vol. 1, pp. 494–497 (2004)
14.
Zurück zum Zitat Sangwine, S.J., Ell, T.A.: Hypercomplex auto- and cross-correlation of color images. In: Proceedings of IEEE International Conference on Image Processing (1999) Sangwine, S.J., Ell, T.A.: Hypercomplex auto- and cross-correlation of color images. In: Proceedings of IEEE International Conference on Image Processing (1999)
15.
Zurück zum Zitat Sangwine, S.J., Ell, T.A.: The discrete Fourier transform of a colour image. In: Proceedings of Image Processing II Mathematical Methods, Algorithms and Applications, pp. 430–441 (2000) Sangwine, S.J., Ell, T.A.: The discrete Fourier transform of a colour image. In: Proceedings of Image Processing II Mathematical Methods, Algorithms and Applications, pp. 430–441 (2000)
16.
Zurück zum Zitat Jangra, S., Rani, P.: A survey on STING and CLIQUE grid based clustering methods. Int. J. Adv. Res. Comput. Sci. 8, 5 (2017) Jangra, S., Rani, P.: A survey on STING and CLIQUE grid based clustering methods. Int. J. Adv. Res. Comput. Sci. 8, 5 (2017)
17.
Zurück zum Zitat Babur, I.H., Ahmed, J., Ahmed, B., Habib, M.: Analysis of DBSCAN clustering technique on different datasets using WekaTools. Sci. Int. 27(6), 5087–5090 (2015) Babur, I.H., Ahmed, J., Ahmed, B., Habib, M.: Analysis of DBSCAN clustering technique on different datasets using WekaTools. Sci. Int. 27(6), 5087–5090 (2015)
18.
Zurück zum Zitat Mehri, M., Gomez-Krämer, P., Héroux, P., Boucher, A., Mullot, R.: A texture-based pixel labeling approach for historical books. In: Proceedings of Pattern Analysis and Applications, pp. 325–364 (2017) Mehri, M., Gomez-Krämer, P., Héroux, P., Boucher, A., Mullot, R.: A texture-based pixel labeling approach for historical books. In: Proceedings of Pattern Analysis and Applications, pp. 325–364 (2017)
19.
Zurück zum Zitat Tonazzini, A., Bedini, L.: Restoration of recto-verso colour documents using correlated component analysis. EURASIP J. Adv. Signal Process. 2013, 58 (2013) Tonazzini, A., Bedini, L.: Restoration of recto-verso colour documents using correlated component analysis. EURASIP J. Adv. Signal Process. 2013, 58 (2013)
20.
Zurück zum Zitat Chaira, T., Panwar, A.: An Atanassov’s intuitionistic fuzzy kernel clustering for medical image segmentation. Int. J. Comput. Intell. Syst. 7(2), 360–370 (2014) Chaira, T., Panwar, A.: An Atanassov’s intuitionistic fuzzy kernel clustering for medical image segmentation. Int. J. Comput. Intell. Syst. 7(2), 360–370 (2014)
21.
Zurück zum Zitat Bezdek, J.C., Ehrlich, R., Full, W.: FCM: the fuzzy c-means clustering algorithm. Comput. Geosci. 10(2–3), 191–203 (1984) Bezdek, J.C., Ehrlich, R., Full, W.: FCM: the fuzzy c-means clustering algorithm. Comput. Geosci. 10(2–3), 191–203 (1984)
22.
Zurück zum Zitat Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Comput. Surv. 31(3), 264–323 (1999) Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Comput. Surv. 31(3), 264–323 (1999)
23.
Zurück zum Zitat Kannan, S.R., Ramathilagam, S., Sathya, A., Pandiyarajan, R.: Effective fuzzy c-means based kernel function in segmenting medical images. Comput. Biol. Med. 40(6), 572–579 (2010) Kannan, S.R., Ramathilagam, S., Sathya, A., Pandiyarajan, R.: Effective fuzzy c-means based kernel function in segmenting medical images. Comput. Biol. Med. 40(6), 572–579 (2010)
24.
Zurück zum Zitat Kannan, S.R., Ramathilagam, S., Devi, R., Sathya, A.: Robust kernel FCM in segmentation of breast medical images. Expert Syst. Appl. 38(4), 4382–4389 (2011) Kannan, S.R., Ramathilagam, S., Devi, R., Sathya, A.: Robust kernel FCM in segmentation of breast medical images. Expert Syst. Appl. 38(4), 4382–4389 (2011)
25.
Zurück zum Zitat Atanassov, K.T.: Intuitionistic fuzzy set. Fuzzy Set Syst. 20(1), 87–96 (1986)MATH Atanassov, K.T.: Intuitionistic fuzzy set. Fuzzy Set Syst. 20(1), 87–96 (1986)MATH
26.
Zurück zum Zitat Kaur, P., Soni, A.K., Gosain, A.: Robust intuitionistic fuzzy c-means clustering for linearly and nonlinearly separable data. In: Proceedings of International Conference on Image Information Processing (2011) Kaur, P., Soni, A.K., Gosain, A.: Robust intuitionistic fuzzy c-means clustering for linearly and nonlinearly separable data. In: Proceedings of International Conference on Image Information Processing (2011)
27.
Zurück zum Zitat Bezdek, J.C.: A convergence theorem for the fuzzy ISODATA clustering algorithms. IEEE Trans. Pattern Anal. Mach. Intell. 2(1), 1–8 (1980)MATH Bezdek, J.C.: A convergence theorem for the fuzzy ISODATA clustering algorithms. IEEE Trans. Pattern Anal. Mach. Intell. 2(1), 1–8 (1980)MATH
28.
Zurück zum Zitat Xu, Z., Chen, J., Wu, J.: Clustering algorithm for intuitionistic fuzzy sets. Inf. Sci. 178(19), 3775–3790 (2008)MathSciNetMATH Xu, Z., Chen, J., Wu, J.: Clustering algorithm for intuitionistic fuzzy sets. Inf. Sci. 178(19), 3775–3790 (2008)MathSciNetMATH
29.
Zurück zum Zitat Atanassov, K.T., Stoeva, S.: Intuitionistic fuzzy sets. In: Proceedings of Polish Symposium on Interval and Fuzzy Mathematics, pp. 23–26 (1983) Atanassov, K.T., Stoeva, S.: Intuitionistic fuzzy sets. In: Proceedings of Polish Symposium on Interval and Fuzzy Mathematics, pp. 23–26 (1983)
30.
Zurück zum Zitat Yager, R.R.: Some aspects of intuitionistic fuzzy sets. Fuzzy Optim. Decis. Mak. 8, 67–90 (2009)MathSciNetMATH Yager, R.R.: Some aspects of intuitionistic fuzzy sets. Fuzzy Optim. Decis. Mak. 8, 67–90 (2009)MathSciNetMATH
31.
Zurück zum Zitat Xu, Z., Hui, H.: Entropy-based procedures for intuitionistic fuzzy multiple attribute decision making. J. Syst. Eng. Electron. 20(5), 1001–1011 (2009) Xu, Z., Hui, H.: Entropy-based procedures for intuitionistic fuzzy multiple attribute decision making. J. Syst. Eng. Electron. 20(5), 1001–1011 (2009)
32.
Zurück zum Zitat Xu, Z., Wu, J.: Intuitionistic fuzzy c-means clustering algorithms. J.Syst. Eng. Electron. 21(4), 580–590 (2010) Xu, Z., Wu, J.: Intuitionistic fuzzy c-means clustering algorithms. J.Syst. Eng. Electron. 21(4), 580–590 (2010)
33.
Zurück zum Zitat Chaira, T.: A novel intuitionistic fuzzy c-means clustering algorithm and its application to medical images. Appl. Soft Comput. 11, 1711–1717 (2011) Chaira, T.: A novel intuitionistic fuzzy c-means clustering algorithm and its application to medical images. Appl. Soft Comput. 11, 1711–1717 (2011)
34.
Zurück zum Zitat Jiang, H., Zhou, X., Feng, B., Zhang, M.: A new intuitionistic fuzzy c-means clustering algorithm. In: Proceedings of International Conference on Mechatronic Sciences, Electric Engineering and Computer (2013) Jiang, H., Zhou, X., Feng, B., Zhang, M.: A new intuitionistic fuzzy c-means clustering algorithm. In: Proceedings of International Conference on Mechatronic Sciences, Electric Engineering and Computer (2013)
35.
Zurück zum Zitat Jiang, H., Zhou, X., Feng, B., Zhang, M.: A new intuitionistic fuzzy c-means clustering algorithm. In: Proceedings of International Conference on Mechatronic Sciences, Electric Engineering and Computer (2013) Jiang, H., Zhou, X., Feng, B., Zhang, M.: A new intuitionistic fuzzy c-means clustering algorithm. In: Proceedings of International Conference on Mechatronic Sciences, Electric Engineering and Computer (2013)
36.
Zurück zum Zitat Gatos, B., Ntirogiannis, K., Pratikakis., I.: ICDAR 2009 document image binarization contest (DIBCO 2009). In: Proceedings of International Conference on Document Analysis and Recognition, pp. 1375–1382 (2009) Gatos, B., Ntirogiannis, K., Pratikakis., I.: ICDAR 2009 document image binarization contest (DIBCO 2009). In: Proceedings of International Conference on Document Analysis and Recognition, pp. 1375–1382 (2009)
37.
Zurück zum Zitat Pratikakis, I., Zagoris, K., Barlas, G., Gatos., B.: ICFHR 2016 handwritten document image binarization contest (H-DIBCO 2016). In: Proceedings of International Conference on Frontiers in Handwriting Recognition, pp. 619–623 (2016) Pratikakis, I., Zagoris, K., Barlas, G., Gatos., B.: ICFHR 2016 handwritten document image binarization contest (H-DIBCO 2016). In: Proceedings of International Conference on Frontiers in Handwriting Recognition, pp. 619–623 (2016)
38.
Zurück zum Zitat Cheng, H., Sun, Y.: A hierarchical approach to color image segmentation using homogeneity. IEEE Trans. Image Process. 9(12), 2071–2082 (2000) Cheng, H., Sun, Y.: A hierarchical approach to color image segmentation using homogeneity. IEEE Trans. Image Process. 9(12), 2071–2082 (2000)
39.
Zurück zum Zitat Rendón, E., Abundez, I., Arizmendi, A., Quiroz, E.M.: Internal versus external cluster validation indexes. Int. J. Comput. Commun. 5(1), 27–34 (2011) Rendón, E., Abundez, I., Arizmendi, A., Quiroz, E.M.: Internal versus external cluster validation indexes. Int. J. Comput. Commun. 5(1), 27–34 (2011)
40.
Zurück zum Zitat Rendón, E., Abundez, I., Gutierrez, C., Zagal, S.D., Arizmendi, A., Quiroz, E.M., Arzate, H.E.: A comparison of internal and external cluster validation indexes. In: Proceedings of Applications of Mathematics and Computer Engineering, pp. 158–163 (2011) Rendón, E., Abundez, I., Gutierrez, C., Zagal, S.D., Arizmendi, A., Quiroz, E.M., Arzate, H.E.: A comparison of internal and external cluster validation indexes. In: Proceedings of Applications of Mathematics and Computer Engineering, pp. 158–163 (2011)
41.
Zurück zum Zitat Powers, D.M.W.: Evaluation: from precision, recall and F-factor to ROC, informedness, markedness and correlation. J. Mach. Learn. Technol. 2(1), 37–63 (2011)MathSciNet Powers, D.M.W.: Evaluation: from precision, recall and F-factor to ROC, informedness, markedness and correlation. J. Mach. Learn. Technol. 2(1), 37–63 (2011)MathSciNet
42.
Zurück zum Zitat Pitas, I., Venetsanopoulos, A.N.: Nonlinear filters in image processing: principles and applications. In: The Springer International Series in Engineering and Computer Science. Academic Publishers, Boston (1990) Pitas, I., Venetsanopoulos, A.N.: Nonlinear filters in image processing: principles and applications. In: The Springer International Series in Engineering and Computer Science. Academic Publishers, Boston (1990)
43.
Zurück zum Zitat Sharma, S.: Applied multivariate techniques. In: University of South Carolina, Wiley, NewYork (1996) Sharma, S.: Applied multivariate techniques. In: University of South Carolina, Wiley, NewYork (1996)
Metadaten
Titel
Hyperkernel-based intuitionistic fuzzy c-means for denoising color archival document images
verfasst von
Walid Elhedda
Maroua Mehri
Mohamed Ali Mahjoub
Publikationsdatum
10.03.2020
Verlag
Springer Berlin Heidelberg
Erschienen in
International Journal on Document Analysis and Recognition (IJDAR) / Ausgabe 3/2020
Print ISSN: 1433-2833
Elektronische ISSN: 1433-2825
DOI
https://doi.org/10.1007/s10032-020-00352-2