Top

Neural Processing Letters

Published in:

25-07-2021

Video Super-Resolution with Frame-Wise Dynamic Fusion and Self-Calibrated Deformable Alignment

Authors: Wenjie Xu, Huihui Song, Yutong Jin, Fei Yan

Published in: Neural Processing Letters | Issue 4/2022

Activate our intelligent search to find suitable subject content or patents.

search-config

AI-assisted search

Off

Abstract

In video super-resolution, exploiting spatial information of reference frame and temporal information from neighbouring frames is significant but challenging. Since existing image super-resolution (SR) methods have achieved remarkable reconstruction results, in this paper, we propose a generic frame-wise dynamic fusion module (DFM) to fully aggregate temporal information into reference frame. Specifically, we employ dynamic convolution to flexibly fuse element-wise temporal information frame by frame. Before that, to handle large motion across frames, we propose a self-calibrated deformable (SCD) alignment module, in which motion offsets are predicted via self-calibrated convolution that explicitly expand receptive field of each convolutional layer through internal communications in a multi-resolution manner. The aligned features of each neighbouring frame are then fed to the DFM to make a temporal information fusion. Finally, the reference features containing spatial and temporal information are sent into SR reconstruction module for the high-resolution frame. Experimental results on several datasets demonstrate superior performance to state-of-the-art published methods on video super-resolution.

previous article PPIS-JOIN: A Novel Privacy-Preserving Image Similarity Join Method

next article Multimodal Orthodontic Corpus Construction Based on Semantic Tag Classification Method

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

Bao W, Lai WS, Zhang X, Gao Z, Yang MH (2021) Memc-net: Motion estimation and motion compensation driven neural network for video interpolation and enhancement. IEEE Tran Patt Anal Mach Intell 43(3):933–948. https://doi.org/10.1109/TPAMI.2019.2941941CrossRef

Bertasius G, Torresani L, Shi, J (2018) Object detection in video with spatiotemporal sampling networks. In: ECCV, pp. 331–346

Caballero J, Ledig C, Aitken A, Acosta A, Totz J, Wang Z, Shi W (2017) Real-time video super-resolution with spatio-temporal networks and motion compensation. In: CVPR, pp. 4778–4787

Chen Y, Dai X, Liu M, Chen D, Yuan L, Liu Z (2020) Dynamic convolution: Attention over convolution kernels. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) . https://doi.org/10.1109/cvpr42600.2020.01104

Dai J, Qi H, Xiong Y, Li Y, Zhang G, Hu H, Wei Y (2017) Deformable convolutional networks. In: ICCV, pp. 764–773

Guo Y, Wu Z, Shen D (2019) Learning longitudinal classification-regression model for infant hippocampus segmentation. Neurocomputing p. https://doi.org/10.1016/j.neucom.2019.01.108

Hao S, Zhou Y, Guo Y (2020) A brief survey on semantic segmentation with deep learning. Neurocomputing p. https://doi.org/10.1016/j.neucom.2019.11.118

Haris M, Shakhnarovich G, Ukita N (2018) Deep back-projection networks for super-resolution. In: CVPR, pp. 1664–1673

Haris M, Shakhnarovich G, Ukita N (2020) Space-time-aware multi-resolution video enhancement. In: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2856–2865 . https://doi.org/10.1109/CVPR42600.2020.00293

10.

He K, Zhang X, Ren S, Sun J (2015) Delving deep into rectifiers: Surpassing human-level performance on imagenet classification. In: ICCV, pp. 1026–1034

11.

He K, Zhang X, Ren S, Sun J(2016) Deep residual learning for image recognition. In: CVPR, pp. 770–778

12.

Hu R, Zhu X, Zhu Y, Gan J (2020) Robust svm with adaptive graph learning. World Wide Web 23:19451968

13.

Huang Y, Wang W, Wang L(2015) Bidirectional recurrent convolutional networks for multi-frame super-resolution. In: NIPS, pp. 235–243

14.

Jia X, De Brabandere B, Tuytelaars T, Gool LV (2016) Dynamic filter networks. In: D.D. Lee, M. Sugiyama, U.V. Luxburg, I. Guyon, R. Garnett (eds.) Advances in Neural Information Processing Systems 29, pp. 667–675. Curran Associates, Inc. (2016). http://papers.nips.cc/paper/6578-dynamic-filter-networks.pdf

15.

Jian M, Jia T, Wu L, Zhang L, Wang D (2020) Content-based bipartite user-image correlation for image recommendation. Neural Process Lett (1)

16.

Jo Y, Wug Oh S, Kang J, Joo Kim S (2018) Deep video super-resolution network using dynamic upsampling filters without explicit motion compensation. In: CVPR, pp. 3224–3232

17.

Kappeler A, Yoo S, Dai Q, Katsaggelos AK (2016) Video super-resolution with convolutional neural networks. IEEE Trans Comput Imag 2(2):109–122MathSciNetCrossRef

18.

Kim TH, Sajjadi MSM, Hirsch M, Scholkopf B (2018) Spatio-temporal transformer network for video restoration. In: Proceedings of the European Conference on Computer Vision (ECCV)

19.

Kingma DP, Ba J (2014) Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980

20.

Lai WS, Huang JB, Ahuja N, Yang MH (2018) Fast and accurate image super-resolution with deep laplacian pyramid networks. IEEE Transactions on Pattern Analysis and Machine Intelligence pp. 1

21.

Li S, He F, Du B, Zhang L, Xu Y, Tao D (2019) Fast spatio-temporal residual network for video super-resolution. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

22.

Liu C, Sun D (2011) A bayesian approach to adaptive video super resolution. In: CVPR, pp. 209–216. IEEE

23.

Park SC, Park MK, Kang MG (2003) Super-resolution image reconstruction: a technical overview. IEEE Sig Process Mag 20(3):21–36CrossRef

24.

Ren S, Li J, Guo K, Li F (2021) Medical video super-resolution based on asymmetric back-projection network with multilevel error feedback. IEEE Access 9:17909–17920. https://doi.org/10.1109/ACCESS.2021.3054433CrossRef

25.

Sajjadi MS, Vemulapalli R, Brown M (2018) Frame-recurrent video super-resolution. In: CVPR, pp. 6626–6634

26.

Song H, Xu W, Liu D, Liu B, Liu Q, Metaxas DN (2021) Multi-stage feature fusion network for video super-resolution. IEEE Trans Image Process 30:2923–2934. https://doi.org/10.1109/TIP.2021.3056868CrossRef

27.

Tao X, Gao H, Liao R, Wang J, Jia J (2017) Detail-revealing deep video super-resolution. In: ICCV, pp. 4472–4480

28.

Tian Y, Zhang Y, Fu Y, Xu C (2018) Tdan: Temporally deformable alignment network for video super-resolution. arXiv preprint arXiv:1812.02898

29.

Wang L, Guo Y, Lin Z, Deng X, An W(2018) Learning for video super-resolution through hr optical flow estimation. In: ACCV, pp. 514–529. Springer

30.

Wang X, Chan KC, Yu K, Dong C, Change Loy C (2019) Edvr: Video restoration with enhanced deformable convolutional networks. In: CVPRW, pp

31.

Wang X, Yu K, Dong C, Change Loy C (2018) Recovering realistic texture in image super-resolution by deep spatial feature transform. In: CVPR, pp. 606–615

32.

Wang Z, Bovik AC, Sheikh HR, Simoncelli EP (2004) Image quality assessment: from error visibility to structural similarity. TIP 13(4):600–612

33.

Xingjian SHI, Chen Z, Wang H, Yeung DY, Wong WK, Woo WC (2015) Convolutional LSTM network: A machine learning approach for precipitation nowcasting. In: Advances in neural information processing systems, pp 802–810

34.

Xu YS, Tseng SYR, Tseng Y, Kuo HK, Tsai YM (2020) Unified dynamic convolutional network for super-resolution with variational degradations

35.

Xue T, Chen B, Wu J, Wei D, Freeman WT (2017) Video enhancement with task-oriented flow. IJCV pp. 1–20

36.

Yang B, Bender G, Le QV, Ngiam J(2019) Condconv: Conditionally parameterized convolutions for efficient inference

37.

Zhang Y, Li K, Li K, Wang L, Zhong B, Fu Y (2018) Image super-resolution using very deep residual channel attention networks. In: ECCV, pp. 286–301

38.

Zhao Y, Xiong Y, Lin D (2018) Trajectory convolution for action recognition. In: NIPS, pp. 2204–2215

39.

Zhou S, Zhang J, Pan J, Zuo W, Xie H, Ren J (2019) Spatio-temporal filter adaptive network for video deblurring. In: 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 2482–2491

40.

Zhu X, Gan J, Lu G, Li J, Zhang S (2020) Spectral clustering via half-quadratic optimization. World Wide Web 23:19691988CrossRef

41.

Zhu X, Hu H, Lin S, Dai J (2019) Deformable convnets v2: More deformable, better results. In: CVPR, pp. 9308–9316

Title: Video Super-Resolution with Frame-Wise Dynamic Fusion and Self-Calibrated Deformable Alignment
Authors: Wenjie Xu
Huihui Song
Yutong Jin
Fei Yan
Publication date: 25-07-2021
Publisher: Springer US
Published in: Neural Processing Letters / Issue 4/2022
Print ISSN: 1370-4621
Electronic ISSN: 1573-773X
DOI: https://doi.org/10.1007/s11063-021-10593-9

Springer Professional

Abstract

Please log in to get access to your license.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Other articles of this Issue 4/2022

Adaptive Synchronization-Based Approach for Finite-Time Parameters Identification of Genetic Regulatory Networks

A Performance Comparison of Robust Models in Wind Turbines Power Curve Estimation: A Case Study

Research for an Adaptive Classifier Based on Dynamic Graph Learning

Multimodal Orthodontic Corpus Construction Based on Semantic Tag Classification Method

Long Time Series Deep Forecasting with Multiscale Feature Extraction and Seq2seq Attention Mechanism

Adaptive Graph Learning for Semi-supervised Self-paced Classification