Skip to main content

2017 | OriginalPaper | Buchkapitel

SPODS: A Dataset of Color-Official Documents and Detection of Logo, Stamp, and Signature

verfasst von : Amit Vijay Nandedkar, Jayanta Mukherjee, Shamik Sural

Erschienen in: Computer Vision, Graphics, and Image Processing

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Office automation is an active area of research. It involves archival and retrieval of official documents. For developing a system for this purpose, it is necessary to have an extensive benchmark dataset consisting various types of official documents. However, it is hard to make available real world official documents as they are mostly confidential. In the absence of such benchmark datasets, it is difficult to evaluate newly developed algorithms. Hence, efforts have been made to build dataset consisting of different categories of documents that resemble real world official documents. In this work, we present a dataset called as scanned pseudo-official data-set (SPODS) which is created by us and made available online. Official documents are usually distinguished by presence of logo, stamp, signature, date, etc. The paper also presents a new approach for the detection of logo, stamp, and signature using spectral filtering and part based features. A comparative study on performances of the proposed method and existing algorithms on the SPODS dataset demonstrates the effectiveness of the proposed technique.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
3.
Zurück zum Zitat Ahmed, S., Malik, M.I., Liwicki, M., Dengel, A.: Signature segmentation from document images. In: International Conference on Frontiers in Handwriting Recognition (ICFHR), pp. 425–429. IEEE (2012) Ahmed, S., Malik, M.I., Liwicki, M., Dengel, A.: Signature segmentation from document images. In: International Conference on Frontiers in Handwriting Recognition (ICFHR), pp. 425–429. IEEE (2012)
4.
Zurück zum Zitat Ahmed, S., Shafait, F., Liwicki, M., Dengel, A.: A generic method for stamp segmentation using part-based features. In: Proceedings of the 12th International Conference on Document Analysis and Recognition (ICDAR), pp. 708–712. IEEE (2013) Ahmed, S., Shafait, F., Liwicki, M., Dengel, A.: A generic method for stamp segmentation using part-based features. In: Proceedings of the 12th International Conference on Document Analysis and Recognition (ICDAR), pp. 708–712. IEEE (2013)
5.
Zurück zum Zitat Bay, H., Ess, A., Tuytelaars, T., Van Gool, L.: Speeded-up robust features (surf). Comput. Vis. Image Underst. 110(3), 346–359 (2008)CrossRef Bay, H., Ess, A., Tuytelaars, T., Van Gool, L.: Speeded-up robust features (surf). Comput. Vis. Image Underst. 110(3), 346–359 (2008)CrossRef
6.
Zurück zum Zitat Comaniciu, D., Meer, P.: Mean shift: a robust approach toward feature space analysis. IEEE Trans. Pattern Anal. Mach. Intell. 24(5), 603–619 (2002)CrossRef Comaniciu, D., Meer, P.: Mean shift: a robust approach toward feature space analysis. IEEE Trans. Pattern Anal. Mach. Intell. 24(5), 603–619 (2002)CrossRef
7.
Zurück zum Zitat Dey, S., Mukherjee, J., Sural, S.: Logo and stamp detection from document images by finding outliers. In: Proceedings of the 5th National Conference on Computer Vision, Pattern Recognition, Image Processing and Graphics (NCVPRIPG). IEEE (2015) Dey, S., Mukherjee, J., Sural, S.: Logo and stamp detection from document images by finding outliers. In: Proceedings of the 5th National Conference on Computer Vision, Pattern Recognition, Image Processing and Graphics (NCVPRIPG). IEEE (2015)
8.
Zurück zum Zitat Dey, S., Mukherjee, J., Sural, S., Bhowmick, P.: Colored rubber stamp removal from document images. In: Maji, P., Ghosh, A., Murty, M.N., Ghosh, K., Pal, S.K. (eds.) PReMI 2013. LNCS, vol. 8251, pp. 545–550. Springer, Heidelberg (2013). doi:10.1007/978-3-642-45062-4_75 CrossRef Dey, S., Mukherjee, J., Sural, S., Bhowmick, P.: Colored rubber stamp removal from document images. In: Maji, P., Ghosh, A., Murty, M.N., Ghosh, K., Pal, S.K. (eds.) PReMI 2013. LNCS, vol. 8251, pp. 545–550. Springer, Heidelberg (2013). doi:10.​1007/​978-3-642-45062-4_​75 CrossRef
9.
Zurück zum Zitat Doermann, D., Tombre, K., et al.: Handbook of Document Image Processing and Recognition. Springer, London (2014)CrossRefMATH Doermann, D., Tombre, K., et al.: Handbook of Document Image Processing and Recognition. Springer, London (2014)CrossRefMATH
10.
Zurück zum Zitat Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification. Wiley, Hoboken (2012)MATH Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification. Wiley, Hoboken (2012)MATH
11.
Zurück zum Zitat Jain, R., Doermann, D.: Logo retrieval in document images. In: Proceedings of the 10th IAPR International Workshop on Document Analysis Systems, pp. 135–139. IEEE (2012) Jain, R., Doermann, D.: Logo retrieval in document images. In: Proceedings of the 10th IAPR International Workshop on Document Analysis Systems, pp. 135–139. IEEE (2012)
12.
Zurück zum Zitat Le, V.P., Nayef, N., Visani, M., Ogier, J.M., De Tran, C.: Document retrieval based on logo spotting using key-point matching. In: Proceedings of the 22nd International Conference on Pattern Recognition (ICPR), pp. 3056–3061. IEEE (2014) Le, V.P., Nayef, N., Visani, M., Ogier, J.M., De Tran, C.: Document retrieval based on logo spotting using key-point matching. In: Proceedings of the 22nd International Conference on Pattern Recognition (ICPR), pp. 3056–3061. IEEE (2014)
13.
Zurück zum Zitat Liu, L., Yu, M., Shao, L.: Multiview alignment hashing for efficient image search. IEEE Trans. Image Process. 24(3), 956–966 (2015)CrossRefMathSciNet Liu, L., Yu, M., Shao, L.: Multiview alignment hashing for efficient image search. IEEE Trans. Image Process. 24(3), 956–966 (2015)CrossRefMathSciNet
14.
Zurück zum Zitat Mandal, R., Roy, P.P., Pal, U.: Signature segmentation from machine printed documents using conditional random field. In: Proceedings of the 11th International Conference on Document Analysis and Recognition (ICDAR), pp. 1170–1174. IEEE (2011) Mandal, R., Roy, P.P., Pal, U.: Signature segmentation from machine printed documents using conditional random field. In: Proceedings of the 11th International Conference on Document Analysis and Recognition (ICDAR), pp. 1170–1174. IEEE (2011)
15.
Zurück zum Zitat Micenková, B., van Beusekom, J.: Stamp detection in color document images. In: Proceedings of the 11th International Conference on Document Analysis and Recognition (ICDAR), pp. 1125–1129. IEEE (2011) Micenková, B., van Beusekom, J.: Stamp detection in color document images. In: Proceedings of the 11th International Conference on Document Analysis and Recognition (ICDAR), pp. 1125–1129. IEEE (2011)
16.
Zurück zum Zitat Nandedkar, A.V., Mukhopadhyay, J., Sural, S.: Text-graphics separation to detect logo and stamp from color document images: a spectral approach. In: Proceedings of the 13th International Conference on Document Analysis and Recognition (ICDAR), pp. 571–575. IEEE (2015) Nandedkar, A.V., Mukhopadhyay, J., Sural, S.: Text-graphics separation to detect logo and stamp from color document images: a spectral approach. In: Proceedings of the 13th International Conference on Document Analysis and Recognition (ICDAR), pp. 571–575. IEEE (2015)
17.
Zurück zum Zitat Roy, P.P., Pal, U., Lladós, J.: Document seal detection using GHT and character proximity graphs. Pattern Recogn. 44(6), 1282–1295 (2011)CrossRef Roy, P.P., Pal, U., Lladós, J.: Document seal detection using GHT and character proximity graphs. Pattern Recogn. 44(6), 1282–1295 (2011)CrossRef
18.
Zurück zum Zitat Rusiñol, M., Lladós, J.: Efficient logo retrieval through hashing shape context descriptors. In: Proceedings of the 9th IAPR International Workshop on Document Analysis Systems, pp. 215–222. ACM (2010) Rusiñol, M., Lladós, J.: Efficient logo retrieval through hashing shape context descriptors. In: Proceedings of the 9th IAPR International Workshop on Document Analysis Systems, pp. 215–222. ACM (2010)
19.
Zurück zum Zitat Smeulders, A.W., Worring, M., Santini, S., Gupta, A., Jain, R.: Content-based image retrieval at the end of the early years. IEEE Trans. Pattern Anal. Mach. Intell. 22(12), 1349–1380 (2000)CrossRef Smeulders, A.W., Worring, M., Santini, S., Gupta, A., Jain, R.: Content-based image retrieval at the end of the early years. IEEE Trans. Pattern Anal. Mach. Intell. 22(12), 1349–1380 (2000)CrossRef
20.
Zurück zum Zitat Srihari, S.N., Shetty, S., Chen, S., Srinivasan, H., Huang, C., Agam, G., Frieder, O.: Document image retrieval using signatures as queries. In: Proceedings of the 2nd International Conference on Document Image Analysis for Libraries (DIAL), pp. 198–203. IEEE (2006) Srihari, S.N., Shetty, S., Chen, S., Srinivasan, H., Huang, C., Agam, G., Frieder, O.: Document image retrieval using signatures as queries. In: Proceedings of the 2nd International Conference on Document Image Analysis for Libraries (DIAL), pp. 198–203. IEEE (2006)
21.
Zurück zum Zitat Wang, H., Chen, Y.: Logo detection in document images based on boundary extension of feature rectangles. In: Proceedings of the 10th International Conference on Document Analysis and Recognition (ICDAR), pp. 1335–1339. IEEE (2009) Wang, H., Chen, Y.: Logo detection in document images based on boundary extension of feature rectangles. In: Proceedings of the 10th International Conference on Document Analysis and Recognition (ICDAR), pp. 1335–1339. IEEE (2009)
22.
Zurück zum Zitat Wong, K.Y., Casey, R.G., Wahl, F.M.: Document analysis system. IBM J. Res. Dev. 26(6), 647–656 (1982)CrossRef Wong, K.Y., Casey, R.G., Wahl, F.M.: Document analysis system. IBM J. Res. Dev. 26(6), 647–656 (1982)CrossRef
23.
Zurück zum Zitat Zhu, G., Doermann, D.: Automatic document logo detection. In: Proceedings of the 9th International Conference on Document Analysis and Recognition (ICDAR), vol. 2, pp. 864–868. IEEE (2007) Zhu, G., Doermann, D.: Automatic document logo detection. In: Proceedings of the 9th International Conference on Document Analysis and Recognition (ICDAR), vol. 2, pp. 864–868. IEEE (2007)
24.
Zurück zum Zitat Zhu, G., Zheng, Y., Doermann, D., Jaeger, S.: Signature detection and matching for document image retrieval. IEEE Trans. Pattern Anal. Mach. Intell. 31(11), 2015–2031 (2009)CrossRef Zhu, G., Zheng, Y., Doermann, D., Jaeger, S.: Signature detection and matching for document image retrieval. IEEE Trans. Pattern Anal. Mach. Intell. 31(11), 2015–2031 (2009)CrossRef
Metadaten
Titel
SPODS: A Dataset of Color-Official Documents and Detection of Logo, Stamp, and Signature
verfasst von
Amit Vijay Nandedkar
Jayanta Mukherjee
Shamik Sural
Copyright-Jahr
2017
DOI
https://doi.org/10.1007/978-3-319-68124-5_19

Premium Partner