2020 | Original Paper | Book Chapter

Effective Barcode Hunter via Semantic Segmentation in the Wild

Authors: Feng Ni, Xixin Cao

Published in: MultiMedia Modeling

Publisher: Springer International Publishing

Abstract

Barcodes are widely used for product identification in many scenarios, but locating them in product images is challenging. Partial occlusion, distortion, poor lighting, and targets too small to recognize all add to the difficulty for conventional methods. In this paper, we introduce a large-scale, diverse barcode dataset and adopt a deep learning-based semantic segmentation approach to address these problems. Specifically, we use an efficient method to synthesize 30,000 well-annotated images containing diverse barcode labels, yielding Barcode-30k, a large-scale dataset of barcodes in the wild with accurate pixel-level annotations. To locate barcodes more precisely, we further propose an effective barcode hunter, BarcodeNet. It is a semantic segmentation model based on a convolutional neural network (CNN) and is built mainly from two novel modules: the Prior Pyramid Pooling Module (P3M) and the Pyramid Refine Module (PRM). Ablation studies demonstrate the effectiveness of BarcodeNet, which achieves a mean intersection over union (mIoU) of 95.36% on the validation set of the proposed synthetic Barcode-30k dataset. To demonstrate the practical value of the whole system, we test BarcodeNet, trained on the Barcode-30k training set, on a manually annotated test set collected only from cameras. It achieves an mIoU of 90.3%, a highly accurate result for practical applications.
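
For readers unfamiliar with the mIoU metric reported above, the following is a minimal sketch of how mean intersection over union is commonly computed for a two-class (background vs. barcode) segmentation mask. It is not taken from the paper: the function name `mean_iou`, the list-of-lists mask representation, and the label convention (0 for background, 1 for barcode) are assumptions made for illustration, and the authors' evaluation code may differ in details such as how absent classes are handled.

```python
def mean_iou(pred, target, num_classes=2):
    """Mean IoU over classes for two equally sized 2-D label masks.

    Masks are lists of rows holding integer class labels.
    Assumed convention (illustrative only): 0 = background, 1 = barcode.
    """
    intersections = [0] * num_classes
    unions = [0] * num_classes
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            if p == t:
                # Pixel agrees: it counts once in both intersection and union.
                intersections[p] += 1
                unions[p] += 1
            else:
                # Pixel disagrees: it enlarges the union of both classes involved.
                unions[p] += 1
                unions[t] += 1
    # Skip classes that appear in neither mask to avoid dividing by zero.
    ious = [i / u for i, u in zip(intersections, unions) if u > 0]
    if not ious:
        raise ValueError("masks contain no pixels")
    return sum(ious) / len(ious)


# Toy 2x4 example: one barcode pixel is missed by the prediction.
target = [[0, 0, 1, 1],
          [0, 0, 1, 1]]
pred = [[0, 0, 0, 1],
        [0, 0, 1, 1]]
score = mean_iou(pred, target)  # background IoU 4/5, barcode IoU 3/4
```

In this toy case the per-class IoUs are 0.8 and 0.75, so the mean is 0.775. In practice the intersections and unions are usually accumulated over the whole evaluation set before dividing, rather than averaged image by image.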

Metadata
Title
Effective Barcode Hunter via Semantic Segmentation in the Wild
Authors
Feng Ni
Xixin Cao
Copyright year
2020
DOI
https://doi.org/10.1007/978-3-030-37731-1_35
