Skip to main content

2015 | OriginalPaper | Buchkapitel

A Semi-automatic Solution Archive for Cross-Cut Shredded Text Documents Reconstruction

verfasst von : Shuxuan Guo, Songyang Lao, Jinlin Guo, Hang Xiang

Erschienen in: Image and Graphics

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Automatic reconstruction of cross-cut shredded text documents (RCCSTD) is important in some areas and it is still a highly challenging problem so far. In this work, we propose a novel semi-automatic reconstruction solution archive for RCCSTD. This solution archive consists of five components, namely preprocessing, row clustering, error evaluation function (EEF), optimal reconstructing route searching and human mediation (HM). Specifically, a row clustering algorithm based on signal correlation coefficient and cross-correlation sequence, and an improved EEF based on gradient vector is separately evaluated by combining with HM and without HM. Experimental results show that row clustering is effective for identifying and grouping shreds belonging to a same row of text documents. The EEF proposed in this work improves the precision and produces high performance in RCCSTD regardless of using HM or not. Overall, extra HM boosts both of the performance of row clustering and shred reconstructing.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Biesinger, B.: Enhancing an evolutionary algorithm with a solution archive to reconstruct cross cut shredded text documents, na (2012) Biesinger, B.: Enhancing an evolutionary algorithm with a solution archive to reconstruct cross cut shredded text documents, na (2012)
2.
Zurück zum Zitat Chung, M.G., Fleck, M., Forsyth, D.: Jigsaw puzzle solver using shape and color. In: 1998 Fourth International Conference on Signal Processing Proceedings, 1998, ICSP 1998, vol. 2, pp. 877–880 (1998) Chung, M.G., Fleck, M., Forsyth, D.: Jigsaw puzzle solver using shape and color. In: 1998 Fourth International Conference on Signal Processing Proceedings, 1998, ICSP 1998, vol. 2, pp. 877–880 (1998)
3.
Zurück zum Zitat De Smet, P.: Reconstruction of ripped-up documents using fragment stack analysis procedures. Forensic Sci. Int. 176(2), 124–136 (2008)CrossRef De Smet, P.: Reconstruction of ripped-up documents using fragment stack analysis procedures. Forensic Sci. Int. 176(2), 124–136 (2008)CrossRef
4.
Zurück zum Zitat De Smet, P.: Semi-automatic forensic reconstruction of ripped-up documents. In: 10th International Conference on Document Analysis and Recognition, 2009, ICDAR 2009, pp. 703–707. IEEE (2009) De Smet, P.: Semi-automatic forensic reconstruction of ripped-up documents. In: 10th International Conference on Document Analysis and Recognition, 2009, ICDAR 2009, pp. 703–707. IEEE (2009)
5.
Zurück zum Zitat Goldberg, D., Malon, C., Bern, M.: A global approach to automatic solution of jigsaw puzzles. In: Proceedings of the Eighteenth Annual Symposium on Computational Geometry, pp. 82–87. ACM (2002) Goldberg, D., Malon, C., Bern, M.: A global approach to automatic solution of jigsaw puzzles. In: Proceedings of the Eighteenth Annual Symposium on Computational Geometry, pp. 82–87. ACM (2002)
6.
Zurück zum Zitat Justino, E., Oliveira, L.S., Freitas, C.: Reconstructing shredded documents through feature matching. Forensic Sci. Int. 160(2), 140–147 (2006)CrossRef Justino, E., Oliveira, L.S., Freitas, C.: Reconstructing shredded documents through feature matching. Forensic Sci. Int. 160(2), 140–147 (2006)CrossRef
7.
Zurück zum Zitat Lawler, E.L., Lenstra, J.K., Kan, A.R., Shmoys, D.B.: The Traveling Salesman Problem. A Guided Tour of Combinatorial Optimisation. Wiley, Chichester (1985) Lawler, E.L., Lenstra, J.K., Kan, A.R., Shmoys, D.B.: The Traveling Salesman Problem. A Guided Tour of Combinatorial Optimisation. Wiley, Chichester (1985)
8.
Zurück zum Zitat Perl, J., Diem, M., Kleber, F., Sablatnig, R.: Strip shredded document reconstruction using optical character recognition (2011) Perl, J., Diem, M., Kleber, F., Sablatnig, R.: Strip shredded document reconstruction using optical character recognition (2011)
9.
Zurück zum Zitat Prandtstetter, M., Raidl, G.R.: Combining forces to reconstruct strip shredded text documents. In: Blesa, M.J., Blum, C., Cotta, C., Fernández, A.J., Gallardo, J.E., Roli, A., Sampels, M. (eds.) HM 2008. LNCS, vol. 5296, pp. 175–189. Springer, Heidelberg (2008) CrossRef Prandtstetter, M., Raidl, G.R.: Combining forces to reconstruct strip shredded text documents. In: Blesa, M.J., Blum, C., Cotta, C., Fernández, A.J., Gallardo, J.E., Roli, A., Sampels, M. (eds.) HM 2008. LNCS, vol. 5296, pp. 175–189. Springer, Heidelberg (2008) CrossRef
10.
Zurück zum Zitat Prandtstetter, M., Raidl, G.R.: Meta-heuristics for reconstructing cross cut shredded text documents. In: Proceedings of the 11th Annual Conference on Genetic and Evolutionary Computation, pp. 349–356. ACM (2009) Prandtstetter, M., Raidl, G.R.: Meta-heuristics for reconstructing cross cut shredded text documents. In: Proceedings of the 11th Annual Conference on Genetic and Evolutionary Computation, pp. 349–356. ACM (2009)
11.
Zurück zum Zitat Ranca, R.: A modular framework for the automatic reconstruction of shredded documents. In: AAAI (Late-Breaking Developments) (2013) Ranca, R.: A modular framework for the automatic reconstruction of shredded documents. In: AAAI (Late-Breaking Developments) (2013)
12.
Zurück zum Zitat Ross, G.T., Soland, R.M.: A branch and bound algorithm for the generalized assignment problem. Math. Program. 8(1), 91–103 (1975)CrossRefMathSciNetMATH Ross, G.T., Soland, R.M.: A branch and bound algorithm for the generalized assignment problem. Math. Program. 8(1), 91–103 (1975)CrossRefMathSciNetMATH
13.
Zurück zum Zitat Schauer, C.: Reconstructing cross-cut shredded documents by means of evolutionary algorithms, na (2010) Schauer, C.: Reconstructing cross-cut shredded documents by means of evolutionary algorithms, na (2010)
14.
Zurück zum Zitat Schauer, C., Prandtstetter, M., Raidl, G.R.: A memetic algorithm for reconstructing cross-cut shredded text documents. In: Blesa, M.J., Blum, C., Raidl, G., Roli, A., Sampels, M. (eds.) HM 2010. LNCS, vol. 6373, pp. 103–117. Springer, Heidelberg (2010) CrossRef Schauer, C., Prandtstetter, M., Raidl, G.R.: A memetic algorithm for reconstructing cross-cut shredded text documents. In: Blesa, M.J., Blum, C., Raidl, G., Roli, A., Sampels, M. (eds.) HM 2010. LNCS, vol. 6373, pp. 103–117. Springer, Heidelberg (2010) CrossRef
15.
Zurück zum Zitat Ukovich, A., Ramponi, G., Doulaverakis, H., Kompatsiaris, Y., Strintzis, M.: Shredded document reconstruction using MPEG-7 standard descriptors. In: Proceedings of the Fourth IEEE International Symposium on Signal Processing and Information Technology, 2004, pp. 334–337. IEEE (2004) Ukovich, A., Ramponi, G., Doulaverakis, H., Kompatsiaris, Y., Strintzis, M.: Shredded document reconstruction using MPEG-7 standard descriptors. In: Proceedings of the Fourth IEEE International Symposium on Signal Processing and Information Technology, 2004, pp. 334–337. IEEE (2004)
Metadaten
Titel
A Semi-automatic Solution Archive for Cross-Cut Shredded Text Documents Reconstruction
verfasst von
Shuxuan Guo
Songyang Lao
Jinlin Guo
Hang Xiang
Copyright-Jahr
2015
DOI
https://doi.org/10.1007/978-3-319-21978-3_39

Premium Partner