Published in: International Journal of Computer Vision, Issue 7/2023

18 April 2023

Advanced Binary Neural Network for Single Image Super Resolution

Authors: Jingwei Xin, Nannan Wang, Xinrui Jiang, Jie Li, Xinbo Gao


Abstract

Binary neural networks (BNNs) are an effective approach to accelerating model inference and have seen initial application in single image super resolution (SISR). However, balancing efficiency and accuracy remains a major challenge for further improvement. Existing BNN-based SR methods address SISR with a quantization mechanism oriented toward residual blocks, but they ignore the quantization process in the up-sampling stage and the representation tendency of binary super resolution networks. In this paper, we propose an Advanced Binary Super Resolution (ABSR) method that optimizes the binary generator in terms of both quantization mechanism and up-sampling strategy. Specifically, we first design an excitation-selection mechanism for binary inference, which lets activations self-adjust and significantly reduces inference errors. Furthermore, we construct a binary up-sampling strategy that achieves performance almost equal to that of real-valued up-sampling modules and fully unlocks the inference speed of the binary network. Extensive experiments show that ABSR not only reaches state-of-the-art performance among BNN-based SR methods in terms of objective metrics and visual quality, but also drastically reduces computational cost.
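
To illustrate why binarization can accelerate inference, here is a minimal Python sketch of the general BNN idea. It is not the ABSR method and not the authors' code: the function names (`binarize`, `pack_bits`, `binary_dot`, `approx_dot`) and the mean-absolute-value scaling are assumptions chosen for illustration. Values are mapped to +1/-1, packed into bits, and the dot product is then computed with an XNOR and a bit count instead of floating-point multiplications.

```python
def binarize(values):
    """Map each real value to +1 or -1 (zero maps to +1)."""
    return [1 if v >= 0 else -1 for v in values]


def pack_bits(signs):
    """Pack +1/-1 signs into an int; bit i is set when signs[i] == +1."""
    word = 0
    for i, s in enumerate(signs):
        if s == 1:
            word |= 1 << i
    return word


def binary_dot(a_signs, b_signs):
    """Dot product of two equal-length +1/-1 vectors via XNOR and popcount."""
    n = len(a_signs)
    mask = (1 << n) - 1
    # XNOR marks the positions where both signs agree.
    xnor = ~(pack_bits(a_signs) ^ pack_bits(b_signs)) & mask
    matches = bin(xnor).count("1")
    # Each match contributes +1, each mismatch -1.
    return 2 * matches - n


def approx_dot(weights, activations):
    """Approximate a real-valued dot product with binarized operands.

    Assumption: each operand is rescaled by its mean absolute value,
    a simple illustrative heuristic rather than the scheme of the paper.
    """
    alpha = sum(abs(w) for w in weights) / len(weights)
    beta = sum(abs(x) for x in activations) / len(activations)
    return alpha * beta * binary_dot(binarize(weights), binarize(activations))


weights = [0.5, -1.5, 1.0, -1.0]
activations = [2.0, -1.0, 1.0, 2.0]
exact = sum(w * x for w, x in zip(weights, activations))
print(exact, approx_dot(weights, activations))
```

The gap between the exact and the approximated value in this toy example is the kind of inference error that quantization mechanisms for binary networks aim to reduce.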

Metadata
Title
Advanced Binary Neural Network for Single Image Super Resolution
Authors
Jingwei Xin
Nannan Wang
Xinrui Jiang
Jie Li
Xinbo Gao
Publication date
18 April 2023
Publisher
Springer US
Published in
International Journal of Computer Vision, Issue 7/2023
Print ISSN: 0920-5691
Electronic ISSN: 1573-1405
DOI
https://doi.org/10.1007/s11263-023-01789-8
