Skip to main content
Top
Published in: Neural Processing Letters 4/2022

25-07-2021

Video Super-Resolution with Frame-Wise Dynamic Fusion and Self-Calibrated Deformable Alignment

Authors: Wenjie Xu, Huihui Song, Yutong Jin, Fei Yan

Published in: Neural Processing Letters | Issue 4/2022

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In video super-resolution, exploiting spatial information of reference frame and temporal information from neighbouring frames is significant but challenging. Since existing image super-resolution (SR) methods have achieved remarkable reconstruction results, in this paper, we propose a generic frame-wise dynamic fusion module (DFM) to fully aggregate temporal information into reference frame. Specifically, we employ dynamic convolution to flexibly fuse element-wise temporal information frame by frame. Before that, to handle large motion across frames, we propose a self-calibrated deformable (SCD) alignment module, in which motion offsets are predicted via self-calibrated convolution that explicitly expand receptive field of each convolutional layer through internal communications in a multi-resolution manner. The aligned features of each neighbouring frame are then fed to the DFM to make a temporal information fusion. Finally, the reference features containing spatial and temporal information are sent into SR reconstruction module for the high-resolution frame. Experimental results on several datasets demonstrate superior performance to state-of-the-art published methods on video super-resolution.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
2.
go back to reference Bertasius G, Torresani L, Shi, J (2018) Object detection in video with spatiotemporal sampling networks. In: ECCV, pp. 331–346 Bertasius G, Torresani L, Shi, J (2018) Object detection in video with spatiotemporal sampling networks. In: ECCV, pp. 331–346
3.
go back to reference Caballero J, Ledig C, Aitken A, Acosta A, Totz J, Wang Z, Shi W (2017) Real-time video super-resolution with spatio-temporal networks and motion compensation. In: CVPR, pp. 4778–4787 Caballero J, Ledig C, Aitken A, Acosta A, Totz J, Wang Z, Shi W (2017) Real-time video super-resolution with spatio-temporal networks and motion compensation. In: CVPR, pp. 4778–4787
5.
go back to reference Dai J, Qi H, Xiong Y, Li Y, Zhang G, Hu H, Wei Y (2017) Deformable convolutional networks. In: ICCV, pp. 764–773 Dai J, Qi H, Xiong Y, Li Y, Zhang G, Hu H, Wei Y (2017) Deformable convolutional networks. In: ICCV, pp. 764–773
8.
go back to reference Haris M, Shakhnarovich G, Ukita N (2018) Deep back-projection networks for super-resolution. In: CVPR, pp. 1664–1673 Haris M, Shakhnarovich G, Ukita N (2018) Deep back-projection networks for super-resolution. In: CVPR, pp. 1664–1673
10.
go back to reference He K, Zhang X, Ren S, Sun J (2015) Delving deep into rectifiers: Surpassing human-level performance on imagenet classification. In: ICCV, pp. 1026–1034 He K, Zhang X, Ren S, Sun J (2015) Delving deep into rectifiers: Surpassing human-level performance on imagenet classification. In: ICCV, pp. 1026–1034
11.
go back to reference He K, Zhang X, Ren S, Sun J(2016) Deep residual learning for image recognition. In: CVPR, pp. 770–778 He K, Zhang X, Ren S, Sun J(2016) Deep residual learning for image recognition. In: CVPR, pp. 770–778
12.
go back to reference Hu R, Zhu X, Zhu Y, Gan J (2020) Robust svm with adaptive graph learning. World Wide Web 23:19451968 Hu R, Zhu X, Zhu Y, Gan J (2020) Robust svm with adaptive graph learning. World Wide Web 23:19451968
13.
go back to reference Huang Y, Wang W, Wang L(2015) Bidirectional recurrent convolutional networks for multi-frame super-resolution. In: NIPS, pp. 235–243 Huang Y, Wang W, Wang L(2015) Bidirectional recurrent convolutional networks for multi-frame super-resolution. In: NIPS, pp. 235–243
15.
go back to reference Jian M, Jia T, Wu L, Zhang L, Wang D (2020) Content-based bipartite user-image correlation for image recommendation. Neural Process Lett (1) Jian M, Jia T, Wu L, Zhang L, Wang D (2020) Content-based bipartite user-image correlation for image recommendation. Neural Process Lett (1)
16.
go back to reference Jo Y, Wug Oh S, Kang J, Joo Kim S (2018) Deep video super-resolution network using dynamic upsampling filters without explicit motion compensation. In: CVPR, pp. 3224–3232 Jo Y, Wug Oh S, Kang J, Joo Kim S (2018) Deep video super-resolution network using dynamic upsampling filters without explicit motion compensation. In: CVPR, pp. 3224–3232
17.
go back to reference Kappeler A, Yoo S, Dai Q, Katsaggelos AK (2016) Video super-resolution with convolutional neural networks. IEEE Trans Comput Imag 2(2):109–122MathSciNetCrossRef Kappeler A, Yoo S, Dai Q, Katsaggelos AK (2016) Video super-resolution with convolutional neural networks. IEEE Trans Comput Imag 2(2):109–122MathSciNetCrossRef
18.
go back to reference Kim TH, Sajjadi MSM, Hirsch M, Scholkopf B (2018) Spatio-temporal transformer network for video restoration. In: Proceedings of the European Conference on Computer Vision (ECCV) Kim TH, Sajjadi MSM, Hirsch M, Scholkopf B (2018) Spatio-temporal transformer network for video restoration. In: Proceedings of the European Conference on Computer Vision (ECCV)
20.
go back to reference Lai WS, Huang JB, Ahuja N, Yang MH (2018) Fast and accurate image super-resolution with deep laplacian pyramid networks. IEEE Transactions on Pattern Analysis and Machine Intelligence pp. 1 Lai WS, Huang JB, Ahuja N, Yang MH (2018) Fast and accurate image super-resolution with deep laplacian pyramid networks. IEEE Transactions on Pattern Analysis and Machine Intelligence pp. 1
21.
go back to reference Li S, He F, Du B, Zhang L, Xu Y, Tao D (2019) Fast spatio-temporal residual network for video super-resolution. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Li S, He F, Du B, Zhang L, Xu Y, Tao D (2019) Fast spatio-temporal residual network for video super-resolution. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
22.
go back to reference Liu C, Sun D (2011) A bayesian approach to adaptive video super resolution. In: CVPR, pp. 209–216. IEEE Liu C, Sun D (2011) A bayesian approach to adaptive video super resolution. In: CVPR, pp. 209–216. IEEE
23.
go back to reference Park SC, Park MK, Kang MG (2003) Super-resolution image reconstruction: a technical overview. IEEE Sig Process Mag 20(3):21–36CrossRef Park SC, Park MK, Kang MG (2003) Super-resolution image reconstruction: a technical overview. IEEE Sig Process Mag 20(3):21–36CrossRef
25.
go back to reference Sajjadi MS, Vemulapalli R, Brown M (2018) Frame-recurrent video super-resolution. In: CVPR, pp. 6626–6634 Sajjadi MS, Vemulapalli R, Brown M (2018) Frame-recurrent video super-resolution. In: CVPR, pp. 6626–6634
27.
go back to reference Tao X, Gao H, Liao R, Wang J, Jia J (2017) Detail-revealing deep video super-resolution. In: ICCV, pp. 4472–4480 Tao X, Gao H, Liao R, Wang J, Jia J (2017) Detail-revealing deep video super-resolution. In: ICCV, pp. 4472–4480
28.
go back to reference Tian Y, Zhang Y, Fu Y, Xu C (2018) Tdan: Temporally deformable alignment network for video super-resolution. arXiv preprint arXiv:1812.02898 Tian Y, Zhang Y, Fu Y, Xu C (2018) Tdan: Temporally deformable alignment network for video super-resolution. arXiv preprint arXiv:​1812.​02898
29.
go back to reference Wang L, Guo Y, Lin Z, Deng X, An W(2018) Learning for video super-resolution through hr optical flow estimation. In: ACCV, pp. 514–529. Springer Wang L, Guo Y, Lin Z, Deng X, An W(2018) Learning for video super-resolution through hr optical flow estimation. In: ACCV, pp. 514–529. Springer
30.
go back to reference Wang X, Chan KC, Yu K, Dong C, Change Loy C (2019) Edvr: Video restoration with enhanced deformable convolutional networks. In: CVPRW, pp Wang X, Chan KC, Yu K, Dong C, Change Loy C (2019) Edvr: Video restoration with enhanced deformable convolutional networks. In: CVPRW, pp
31.
go back to reference Wang X, Yu K, Dong C, Change Loy C (2018) Recovering realistic texture in image super-resolution by deep spatial feature transform. In: CVPR, pp. 606–615 Wang X, Yu K, Dong C, Change Loy C (2018) Recovering realistic texture in image super-resolution by deep spatial feature transform. In: CVPR, pp. 606–615
32.
go back to reference Wang Z, Bovik AC, Sheikh HR, Simoncelli EP (2004) Image quality assessment: from error visibility to structural similarity. TIP 13(4):600–612 Wang Z, Bovik AC, Sheikh HR, Simoncelli EP (2004) Image quality assessment: from error visibility to structural similarity. TIP 13(4):600–612
33.
go back to reference Xingjian SHI, Chen Z, Wang H, Yeung DY, Wong WK, Woo WC (2015) Convolutional LSTM network: A machine learning approach for precipitation nowcasting. In: Advances in neural information processing systems, pp 802–810 Xingjian SHI, Chen Z, Wang H, Yeung DY, Wong WK, Woo WC (2015) Convolutional LSTM network: A machine learning approach for precipitation nowcasting. In: Advances in neural information processing systems, pp 802–810
34.
go back to reference Xu YS, Tseng SYR, Tseng Y, Kuo HK, Tsai YM (2020) Unified dynamic convolutional network for super-resolution with variational degradations Xu YS, Tseng SYR, Tseng Y, Kuo HK, Tsai YM (2020) Unified dynamic convolutional network for super-resolution with variational degradations
35.
go back to reference Xue T, Chen B, Wu J, Wei D, Freeman WT (2017) Video enhancement with task-oriented flow. IJCV pp. 1–20 Xue T, Chen B, Wu J, Wei D, Freeman WT (2017) Video enhancement with task-oriented flow. IJCV pp. 1–20
36.
go back to reference Yang B, Bender G, Le QV, Ngiam J(2019) Condconv: Conditionally parameterized convolutions for efficient inference Yang B, Bender G, Le QV, Ngiam J(2019) Condconv: Conditionally parameterized convolutions for efficient inference
37.
go back to reference Zhang Y, Li K, Li K, Wang L, Zhong B, Fu Y (2018) Image super-resolution using very deep residual channel attention networks. In: ECCV, pp. 286–301 Zhang Y, Li K, Li K, Wang L, Zhong B, Fu Y (2018) Image super-resolution using very deep residual channel attention networks. In: ECCV, pp. 286–301
38.
go back to reference Zhao Y, Xiong Y, Lin D (2018) Trajectory convolution for action recognition. In: NIPS, pp. 2204–2215 Zhao Y, Xiong Y, Lin D (2018) Trajectory convolution for action recognition. In: NIPS, pp. 2204–2215
39.
go back to reference Zhou S, Zhang J, Pan J, Zuo W, Xie H, Ren J (2019) Spatio-temporal filter adaptive network for video deblurring. In: 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 2482–2491 Zhou S, Zhang J, Pan J, Zuo W, Xie H, Ren J (2019) Spatio-temporal filter adaptive network for video deblurring. In: 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 2482–2491
40.
go back to reference Zhu X, Gan J, Lu G, Li J, Zhang S (2020) Spectral clustering via half-quadratic optimization. World Wide Web 23:19691988CrossRef Zhu X, Gan J, Lu G, Li J, Zhang S (2020) Spectral clustering via half-quadratic optimization. World Wide Web 23:19691988CrossRef
41.
go back to reference Zhu X, Hu H, Lin S, Dai J (2019) Deformable convnets v2: More deformable, better results. In: CVPR, pp. 9308–9316 Zhu X, Hu H, Lin S, Dai J (2019) Deformable convnets v2: More deformable, better results. In: CVPR, pp. 9308–9316
Metadata
Title
Video Super-Resolution with Frame-Wise Dynamic Fusion and Self-Calibrated Deformable Alignment
Authors
Wenjie Xu
Huihui Song
Yutong Jin
Fei Yan
Publication date
25-07-2021
Publisher
Springer US
Published in
Neural Processing Letters / Issue 4/2022
Print ISSN: 1370-4621
Electronic ISSN: 1573-773X
DOI
https://doi.org/10.1007/s11063-021-10593-9

Other articles of this Issue 4/2022

Neural Processing Letters 4/2022 Go to the issue