Skip to main content
Top

2020 | OriginalPaper | Chapter

Effective Barcode Hunter via Semantic Segmentation in the Wild

Authors : Feng Ni, Xixin Cao

Published in: MultiMedia Modeling

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Barcodes are popularly used for product identification in many scenarios. However, locating them on product images is challenging. Half-occlusion, distortion, darkness or targets being too small to recognize can often add to the difficulties using conventional methods. In this paper, we introduce a large-scale diverse barcode dataset and adopt a deep learning-based semantic segmentation approach to address these problems. Specifically, we use an efficient method to synthesize 30000 well-annotated images containing diverse barcode labels, and get Barcode-30 k, a large-scale dataset with accurate pixel-level annotated barcode in the wild. Moreover, to locate barcode more precisely, we further propose an Effective Barcode Hunter - BarcodeNet. It is a semantic segmentation model based on CNN (Convolutional Neural Network) and is mainly formed with two novel modules, Prior Pyramid Pooling Module (P3M) and Pyramid Refine Module (PRM). Additional ablation studies further demonstrate the effectiveness of BarcodeNet, and it yields a high mIoU result of 95.36% on the proposed synthetic Barcode-30 k validation-set. To prove the practical value of the whole system, we test the BarcodeNet trained on train-set of Barcode-30 k on a manually annotated testing set that only collected from cameras, it achieves mIoU of 90.3%, which is a very accurate result for practical applications.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Badrinarayanan, V., Kendall, A., Cipolla, R.: SegNet: a deep convolutional encoder-decoder architecture for scene segmentation. TPAMI 39(12), 2481–2495 (2017)CrossRef Badrinarayanan, V., Kendall, A., Cipolla, R.: SegNet: a deep convolutional encoder-decoder architecture for scene segmentation. TPAMI 39(12), 2481–2495 (2017)CrossRef
3.
go back to reference Chen, L.C., Papandreou, G., Kokkinos, I., Murphy, K., Yuille, A.L.: DeepLab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs. TPAMI 40(4), 834–848 (2018)CrossRef Chen, L.C., Papandreou, G., Kokkinos, I., Murphy, K., Yuille, A.L.: DeepLab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs. TPAMI 40(4), 834–848 (2018)CrossRef
4.
go back to reference Chen, L.C., Papandreou, G., Kokkinos, I., Murphy, K., Yuille, A.L.: Semantic image segmentation with deep convolutional nets and fully connected CRFs. In: ICLR (2015) Chen, L.C., Papandreou, G., Kokkinos, I., Murphy, K., Yuille, A.L.: Semantic image segmentation with deep convolutional nets and fully connected CRFs. In: ICLR (2015)
5.
go back to reference Chen, L., Papandreou, G., Schroff, F., Adam, H.: Rethinking atrous convolution for semantic image segmentation. CoRR. arXiv:1706.05587 (2017) Chen, L., Papandreou, G., Schroff, F., Adam, H.: Rethinking atrous convolution for semantic image segmentation. CoRR. arXiv:​1706.​05587 (2017)
6.
go back to reference Cordts, M., et al.: The cityscapes dataset for semantic urban scene understanding. In: CVPR, pp. 3213–3223 (2016) Cordts, M., et al.: The cityscapes dataset for semantic urban scene understanding. In: CVPR, pp. 3213–3223 (2016)
7.
go back to reference Cortes, C., Vapnik, V.: Support-vector networks. Mach. Learn. 20(3), 273–297 (1995)MATH Cortes, C., Vapnik, V.: Support-vector networks. Mach. Learn. 20(3), 273–297 (1995)MATH
8.
go back to reference Dai, J., He, K., Sun, J.: Convolutional feature masking for joint object and stuff segmentation, pp. 3992–4000 (2014) Dai, J., He, K., Sun, J.: Convolutional feature masking for joint object and stuff segmentation, pp. 3992–4000 (2014)
9.
go back to reference Garcia-Garcia, A., Orts-Escolano, S., Oprea, S., Villena-Martinez, V., Garcia-Rodriguez, J.: A review on deep learning techniques applied to semantic segmentation (2017) Garcia-Garcia, A., Orts-Escolano, S., Oprea, S., Villena-Martinez, V., Garcia-Rodriguez, J.: A review on deep learning techniques applied to semantic segmentation (2017)
10.
go back to reference He, K., Zhang, X., Ren, S., Sun, J.: Spatial pyramid pooling in deep convolutional networks for visual recognition. TPAMI 37(9), 1904–16 (2015) CrossRef He, K., Zhang, X., Ren, S., Sun, J.: Spatial pyramid pooling in deep convolutional networks for visual recognition. TPAMI 37(9), 1904–16 (2015) CrossRef
11.
go back to reference He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: CVPR, pp. 770–778 (2016) He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: CVPR, pp. 770–778 (2016)
12.
go back to reference Johnson, M., Shotton, J., Cipolla, R.: Semantic texton forests for image categorization and segmentation 5(7), 1–8 (2008) Johnson, M., Shotton, J., Cipolla, R.: Semantic texton forests for image categorization and segmentation 5(7), 1–8 (2008)
13.
go back to reference Lafferty, J.D., Mccallum, A., Pereira, F.C.N.: Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In: ICML, pp. 282–289 (2001) Lafferty, J.D., Mccallum, A., Pereira, F.C.N.: Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In: ICML, pp. 282–289 (2001)
14.
go back to reference Li, J.H., Wang, W.H., Rao, T.T., Zhu, W.B., Liu, C.J.: Morphological segmentation of 2-D barcode gray scale image. In: ICISAI, pp. 62–68 (2017) Li, J.H., Wang, W.H., Rao, T.T., Zhu, W.B., Liu, C.J.: Morphological segmentation of 2-D barcode gray scale image. In: ICISAI, pp. 62–68 (2017)
15.
go back to reference Li, J., Wang, Y.W., Chen, Y., Wang, G.: Adaptive segmentation method for 2-D barcode image base on mathematic morphological. Res. J. Appl. Sci. Eng. Technol. 6(18), 3335–3342 (2013)CrossRef Li, J., Wang, Y.W., Chen, Y., Wang, G.: Adaptive segmentation method for 2-D barcode image base on mathematic morphological. Res. J. Appl. Sci. Eng. Technol. 6(18), 3335–3342 (2013)CrossRef
16.
go back to reference Li, J., Zhao, Q., Tan, X., Luo, Z., Tang, Z.: Using deep convnet for robust 1D barcode detection. In: ICIISA, pp. 261–267 (2017) Li, J., Zhao, Q., Tan, X., Luo, Z., Tang, Z.: Using deep convnet for robust 1D barcode detection. In: ICIISA, pp. 261–267 (2017)
17.
go back to reference Lin, G., Milan, A., Shen, C., Reid, I.: RefineNet: multi-path refinement networks for high-resolution semantic segmentation (2016) Lin, G., Milan, A., Shen, C., Reid, I.: RefineNet: multi-path refinement networks for high-resolution semantic segmentation (2016)
18.
go back to reference Liu, W., Rabinovich, A., Berg, A.C.: ParseNet: looking wider to see better. Computer Science (2015) Liu, W., Rabinovich, A., Berg, A.C.: ParseNet: looking wider to see better. Computer Science (2015)
19.
go back to reference Liu, Z., He, L., Liang, R.: Research on segmentation algorithm of 2D color barcode based on mobile phone. In: ICCEE (2012) Liu, Z., He, L., Liang, R.: Research on segmentation algorithm of 2D color barcode based on mobile phone. In: ICCEE (2012)
20.
go back to reference Noh, H., Hong, S., Han, B.: Learning deconvolution network for semantic segmentation, pp. 1520–1528 (2015) Noh, H., Hong, S., Han, B.: Learning deconvolution network for semantic segmentation, pp. 1520–1528 (2015)
21.
go back to reference Shelhamer, E., Long, J., Darrell, T.: Fully convolutional networks for semantic segmentation. TPAMI 39(4), 640–651 (2017)CrossRef Shelhamer, E., Long, J., Darrell, T.: Fully convolutional networks for semantic segmentation. TPAMI 39(4), 640–651 (2017)CrossRef
22.
go back to reference Thoma, M.: A survey of semantic segmentation (2016) Thoma, M.: A survey of semantic segmentation (2016)
23.
go back to reference Wachenfeld, S., Terlunen, S., Jiang, X.: Robust 1-D barcode recognition on camera phones and mobile product information display (2010)CrossRef Wachenfeld, S., Terlunen, S., Jiang, X.: Robust 1-D barcode recognition on camera phones and mobile product information display (2010)CrossRef
24.
go back to reference Xie, S., Girshick, R., Dollar, P., Tu, Z., He, K.: Aggregated residual transformations for deep neural networks, pp. 5987–5995 (2016) Xie, S., Girshick, R., Dollar, P., Tu, Z., He, K.: Aggregated residual transformations for deep neural networks, pp. 5987–5995 (2016)
25.
go back to reference Xu, W., Mccloskey, S.: 2D barcode localization and motion deblurring using a flutter shutter camera. In: Applications of Computer Vision, pp. 159–165 (2011) Xu, W., Mccloskey, S.: 2D barcode localization and motion deblurring using a flutter shutter camera. In: Applications of Computer Vision, pp. 159–165 (2011)
26.
go back to reference Yu, F., Koltun, V.: Multi-scale context aggregation by dilated convolutions (2015) Yu, F., Koltun, V.: Multi-scale context aggregation by dilated convolutions (2015)
27.
go back to reference Zamberletti, A., Gallo, I., Carullo, M., Binaghi, E.: Neural image restoration for decoding 1-D barcodes using common camera phones. In: VISAPP 2010 - ICCV, pp. 5–11 (2010) Zamberletti, A., Gallo, I., Carullo, M., Binaghi, E.: Neural image restoration for decoding 1-D barcodes using common camera phones. In: VISAPP 2010 - ICCV, pp. 5–11 (2010)
28.
go back to reference Zamberletti, A., Gallo, I., Albertini, S.: Robust angle invariant 1D barcode detection. In: ACPR, pp. 160–164 (2013) Zamberletti, A., Gallo, I., Albertini, S.: Robust angle invariant 1D barcode detection. In: ACPR, pp. 160–164 (2013)
30.
go back to reference Zhao, H., Shi, J., Qi, X., Wang, X., Jia, J.: Pyramid scene parsing network. In: CVPR, pp. 6230–6239 (2017) Zhao, H., Shi, J., Qi, X., Wang, X., Jia, J.: Pyramid scene parsing network. In: CVPR, pp. 6230–6239 (2017)
Metadata
Title
Effective Barcode Hunter via Semantic Segmentation in the Wild
Authors
Feng Ni
Xixin Cao
Copyright Year
2020
DOI
https://doi.org/10.1007/978-3-030-37731-1_35