Skip to main content
Erschienen in: Multimedia Systems 3/2023

25.01.2023 | Regular Paper

Auto ROI & mask R-CNN model for QR code beautification (ARM-QR)

verfasst von: Min-Jen Tsai, Hung-Yu Wu, Di-Ting Lin

Erschienen in: Multimedia Systems | Ausgabe 3/2023

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The development of the Internet has enabled the QR code to become the most frequently applied two-dimensional barcode in daily life and in commercial advertisements, and its application continues to be more diversified to include warehouse management, electronic tickets, mobile payments, etc. The standard QR code consists of black and white modules, which display a monotonous visual effect. Since graph patterns are much easier to understand than text characters, showing the subject by patterns inside the QR code is the easiest way to understand implicit content.
This research involves the development of a methodology called ARM-QR, in which the QR code is integrated with full-color images, and deep learning technology is used to beautify it. First, the region of interest (ROI) of the color image is automatically identified using Mask R-CNN. The QR code’s visual beautification is further adjusted by the content of the object. Discrete wavelet transform and contrast sensitivity functions are also used to strengthen the visual perception of the QR code and reduce the impact of a low print resolution on the graphic legibility. The ARM-QR code’s visual quality is intensively verified by visual quality indices, which include the Peak Signal-to-Noise Ratio (PSNR), Mean-Square Error (MSE), Structural Similarity Index Metric (SSIM), and Gradient Magnitude Similarity Deviation (GMSD) based on evaluating the experimental data. The results of the experiment confirm that the visual beautification of the QR code generated in this research is of higher quality than that in other QR code beautification studies.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
2.
Zurück zum Zitat Brostow, G.J., Fauqueur, J., Cipolla, R.: Semantic object classes in video: a high-definition ground truth database. Pattern Recogn. Lett. 30(2), 88–97 (2009)CrossRef Brostow, G.J., Fauqueur, J., Cipolla, R.: Semantic object classes in video: a high-definition ground truth database. Pattern Recogn. Lett. 30(2), 88–97 (2009)CrossRef
3.
Zurück zum Zitat Chang, J., Alain B., and Ostromoukhov, V. “Structure-aware error diffusion,” in Proc. ACM Trans Graph (TOG) 28(5): no. 162:1–162:8, Dec 2009. Chang, J., Alain B., and Ostromoukhov, V. “Structure-aware error diffusion,” in Proc. ACM Trans Graph (TOG) 28(5): no. 162:1–162:8, Dec 2009.
4.
Zurück zum Zitat Chen, H., Sun, K., Tian, Z., Shen, C. et al., “BlendMask: Top-down meets bottom-up for instance segmentation,” in Conf. IEEE Conference on Computer Vision and Pattern Recognition (CVPR) pp. 8573–8581, Jan 2020. Chen, H., Sun, K., Tian, Z., Shen, C. et al., “BlendMask: Top-down meets bottom-up for instance segmentation,” in Conf. IEEE Conference on Computer Vision and Pattern Recognition (CVPR) pp. 8573–8581, Jan 2020.
5.
Zurück zum Zitat Chen, L.C., Zhu, Y., Papandreou, G. et al., “Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation,” in Conf. Computer Vision – ECCV 2018, pp. 833–851. Chen, L.C., Zhu, Y., Papandreou, G. et al., “Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation,” in Conf. Computer Vision – ECCV 2018, pp. 833–851.
11.
Zurück zum Zitat Kyprianidis J.E., and Döllner, J. “Image abstraction by structure adaptive filtering,” in Proc. EG UKTheory and Practice of Computer Graphics, pp 51–58, 2008. Kyprianidis J.E., and Döllner, J. “Image abstraction by structure adaptive filtering,” in Proc. EG UKTheory and Practice of Computer Graphics, pp 51–58, 2008.
12.
Zurück zum Zitat Levicky, D., Foris, P.: Human Visual System Models in Digital Image Watermarking. Radioengineering 13(4), 38–43 (2004) Levicky, D., Foris, P.: Human Visual System Models in Digital Image Watermarking. Radioengineering 13(4), 38–43 (2004)
13.
Zurück zum Zitat Li, L., Li, Y., Wang, B., et al.: “A new aesthetic QR code algorithm based on salient region detection and SPBVM”, in Conf, pp. 20–32. Security with Intelligent Computing and Big-data Services, Springer, Cham (2017) Li, L., Li, Y., Wang, B., et al.: “A new aesthetic QR code algorithm based on salient region detection and SPBVM”, in Conf, pp. 20–32. Security with Intelligent Computing and Big-data Services, Springer, Cham (2017)
14.
Zurück zum Zitat Li, L., Wang, B., Lu, J., Zhang, S., et al.: A new aesthetic QR code algorithm based on salient region detection and SPBVM. J. Int Technol 20(3), 935–946 (2019) Li, L., Wang, B., Lu, J., Zhang, S., et al.: A new aesthetic QR code algorithm based on salient region detection and SPBVM. J. Int Technol 20(3), 935–946 (2019)
15.
Zurück zum Zitat Lin, S.S., Chang, Y.F., Le, T.N.H. et al. Generation of Photorealistic QR codes,” in Conf. SIGGRAPH Asia 2019 Posters, Nov 2019. Lin, S.S., Chang, Y.F., Le, T.N.H. et al. Generation of Photorealistic QR codes,” in Conf. SIGGRAPH Asia 2019 Posters, Nov 2019.
17.
Zurück zum Zitat Lin, L., Zou, X., He, L. et al. “Aesthetic QR code generation with background contrast enhancement and user interaction,” in Conf. Third International Workshop on Pattern Recognition, July 2018. Available: https://doi.org/10.1117/12.2502054 Lin, L., Zou, X., He, L. et al. “Aesthetic QR code generation with background contrast enhancement and user interaction,” in Conf. Third International Workshop on Pattern Recognition, July 2018. Available: https://​doi.​org/​10.​1117/​12.​2502054
18.
Zurück zum Zitat Lin, L., Wu, S., Liu, S., et al., “Interactive QR code beautification with full background image embedding,” in Proc. SPIE 10443, Second International Workshop on Pattern Recognition, 1044317, June 2017. Available: https://doi.org/10.1117/12.2280282 Lin, L., Wu, S., Liu, S., et al., “Interactive QR code beautification with full background image embedding,” in Proc. SPIE 10443, Second International Workshop on Pattern Recognition, 1044317, June 2017. Available: https://​doi.​org/​10.​1117/​12.​2280282
19.
Zurück zum Zitat Lin, Y.S., Luo, S.J., Chen, B.Y.: Artistic QR code embellishment. Computer. Graph. Forum 32(7), 137–146 (2013)CrossRef Lin, Y.S., Luo, S.J., Chen, B.Y.: Artistic QR code embellishment. Computer. Graph. Forum 32(7), 137–146 (2013)CrossRef
23.
Zurück zum Zitat Ono, S., Morinaga, K., Nakayama, S.: Two-dimensional barcode decoration based on real-coded genetic algorithm. In: Proceedings of IEEE CEC, Hong-Kong, China, pp. 1068–1073 (2008) Ono, S., Morinaga, K., Nakayama, S.: Two-dimensional barcode decoration based on real-coded genetic algorithm. In: Proceedings of IEEE CEC, Hong-Kong, China, pp. 1068–1073 (2008)
25.
Zurück zum Zitat Rathi, J., Grewal, S.K.: Aesthetic QR: approaches for beautified, fast decoding, and secured QR codes”. IJ Inform. Eng. Elect. Bus. 3, 10–18 (2022) Rathi, J., Grewal, S.K.: Aesthetic QR: approaches for beautified, fast decoding, and secured QR codes”. IJ Inform. Eng. Elect. Bus. 3, 10–18 (2022)
28.
Zurück zum Zitat Shelhamer, E., Long, J. and Darrell, T. “Fully Convolutional Networks for Semantic Segmentation,” in Conf. IEEE Transactions on Computer Vision and Pattern Recognition (CVPR), pp. 3431–3440, June 2015. Shelhamer, E., Long, J. and Darrell, T. “Fully Convolutional Networks for Semantic Segmentation,” in Conf. IEEE Transactions on Computer Vision and Pattern Recognition (CVPR), pp. 3431–3440, June 2015.
33.
Zurück zum Zitat Viola, P., Jones, M.J.: “Rapid object detection using a boosted cascade of simple features”, in Conf. IEEE Comp Soc Conf Comp Vis Patt Recogn 1, 511–518 (2001) Viola, P., Jones, M.J.: “Rapid object detection using a boosted cascade of simple features”, in Conf. IEEE Comp Soc Conf Comp Vis Patt Recogn 1, 511–518 (2001)
37.
Zurück zum Zitat Xu, M., Su, H., Li, Y., et al.: Stylized aesthetic QR code. IEEE Trans. Multimed. 21(8), 1960–1970 (2018)CrossRef Xu, M., Su, H., Li, Y., et al.: Stylized aesthetic QR code. IEEE Trans. Multimed. 21(8), 1960–1970 (2018)CrossRef
38.
Zurück zum Zitat Xue, W., Zhang, L., Mou, X., et al.: Gradient magnitude similarity deviation: a highly efficient perceptual image quality index. IEEE Trans. Image Process. 23(2), 684–695 (2014)MathSciNetCrossRefMATH Xue, W., Zhang, L., Mou, X., et al.: Gradient magnitude similarity deviation: a highly efficient perceptual image quality index. IEEE Trans. Image Process. 23(2), 684–695 (2014)MathSciNetCrossRefMATH
Metadaten
Titel
Auto ROI & mask R-CNN model for QR code beautification (ARM-QR)
verfasst von
Min-Jen Tsai
Hung-Yu Wu
Di-Ting Lin
Publikationsdatum
25.01.2023
Verlag
Springer Berlin Heidelberg
Erschienen in
Multimedia Systems / Ausgabe 3/2023
Print ISSN: 0942-4962
Elektronische ISSN: 1432-1882
DOI
https://doi.org/10.1007/s00530-022-01046-x

Weitere Artikel der Ausgabe 3/2023

Multimedia Systems 3/2023 Zur Ausgabe

Neuer Inhalt