Multimedia Systems

25 January 2023 | Regular Paper

Auto ROI & mask R-CNN model for QR code beautification (ARM-QR)

Authors: Min-Jen Tsai, Hung-Yu Wu, Di-Ting Lin

Published in: Multimedia Systems | Issue 3/2023

Abstract

The development of the Internet has made the QR code the most frequently used two-dimensional barcode in daily life and in commercial advertising, and its applications continue to diversify to include warehouse management, electronic tickets, mobile payments, and more. The standard QR code consists of black and white modules, which produce a monotonous visual effect. Because graphic patterns are much easier to understand than text characters, showing the subject as a pattern inside the QR code is the easiest way to convey its otherwise implicit content.

This research develops a methodology called ARM-QR, which integrates the QR code with full-color images and uses deep learning to beautify it. First, the region of interest (ROI) of the color image is identified automatically using Mask R-CNN. The visual beautification of the QR code is then adjusted according to the content of the detected object. The discrete wavelet transform and contrast sensitivity functions are also used to strengthen the visual perception of the QR code and to reduce the impact of low print resolution on graphic legibility. The visual quality of the ARM-QR code is verified extensively on the experimental data with several visual quality indices: Peak Signal-to-Noise Ratio (PSNR), Mean-Square Error (MSE), Structural Similarity Index Metric (SSIM), and Gradient Magnitude Similarity Deviation (GMSD). The experimental results confirm that the QR codes generated in this research are of higher visual quality than those in other QR code beautification studies.
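Two of the quality indices named in the abstract, MSE and PSNR, have standard textbook definitions, and a minimal sketch of them may help readers interpret the reported scores. This is not the authors' evaluation code: the function names `mse` and `psnr`, the use of NumPy, and the assumption of 8-bit images with a peak value of 255 are illustrative choices made here.

```python
import numpy as np


def mse(reference, test):
    """Mean-square error between two images of identical shape."""
    # Convert to float first so that uint8 subtraction cannot wrap around.
    ref = np.asarray(reference, dtype=np.float64)
    tst = np.asarray(test, dtype=np.float64)
    if ref.shape != tst.shape:
        raise ValueError("images must have the same shape")
    return float(np.mean((ref - tst) ** 2))


def psnr(reference, test, peak=255.0):
    """Peak signal-to-noise ratio in dB; `peak` assumes 8-bit pixel data."""
    err = mse(reference, test)
    if err == 0.0:
        return float("inf")  # identical images: PSNR is unbounded
    return float(10.0 * np.log10(peak ** 2 / err))


# Toy data (hypothetical): a flat 4x4 patch with one pixel changed by 10 levels.
original = np.full((4, 4), 100, dtype=np.uint8)
beautified = original.copy()
beautified[0, 0] = 110

print("MSE :", mse(original, beautified))   # 10**2 / 16 = 6.25
print("PSNR:", psnr(original, beautified), "dB")
```

A lower MSE and a higher PSNR both indicate that the beautified code stays closer to the original image. SSIM and GMSD capture structural and gradient similarity respectively and are not covered by this sketch.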

Title: Auto ROI & mask R-CNN model for QR code beautification (ARM-QR)
Authors: Min-Jen Tsai, Hung-Yu Wu, Di-Ting Lin
Publication date: 25 January 2023
Publisher: Springer Berlin Heidelberg
Published in: Multimedia Systems / Issue 3/2023
Print ISSN: 0942-4962
Electronic ISSN: 1432-1882
DOI: https://doi.org/10.1007/s00530-022-01046-x
