Skip to main content
Top
Published in: Pattern Analysis and Applications 1/2023

18-09-2022 | Theoretical Advances

Correlation-based and content-enhanced network for video style transfer

Authors: Honglin Lin, Mengmeng Wang, Yong Liu, Jiaxin Kou

Published in: Pattern Analysis and Applications | Issue 1/2023

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Artistic style transfer aims to migrate the style pattern from a referenced style image to a given content image, which has achieved significant advances in recent years. However, producing temporally coherent and visually pleasing stylized frames is still challenging. Although existing works have made some effort, they rely on the inefficient optical flow or other cumbersome operations to model spatiotemporal information. In this paper, we propose an arbitrary video style transfer network that can generate consistent results with reasonable style patterns and clear content structure. We adopt multi-channel correlation module to render the input images stably according to cross-domain feature correlation. Meanwhile, Earth Movers’ Distance is used to capture the main characteristics of style images. To maintain the semantic structure during the stylization, we also employ the AdaIN-based skip connections and self-similarity loss, which can further improve the temporal consistency. Qualitative and quantitative experiments have demonstrated the effectiveness of our framework.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Appendix
Available only for authorised users
Footnotes
Literature
1.
go back to reference Gatys LA, Ecker AS, Bethge M (2016) Image style transfer using convolutional neural networks. In: IEEE conference on Computer Vision and Pattern Recognition, pp. 2414–2423 Gatys LA, Ecker AS, Bethge M (2016) Image style transfer using convolutional neural networks. In: IEEE conference on Computer Vision and Pattern Recognition, pp. 2414–2423
2.
3.
go back to reference Ruder M, Dosovitskiy A, Brox T (2016) Artistic style transfer for videos. In: German Conference on Pattern Recognition, pp. 26–36. Springer Ruder M, Dosovitskiy A, Brox T (2016) Artistic style transfer for videos. In: German Conference on Pattern Recognition, pp. 26–36. Springer
4.
go back to reference Huang H, Wang H, Luo W, Ma L, Jiang W, Zhu X, Li Z, Liu W (2017) Real-time neural style transfer for videos. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 783–791 Huang H, Wang H, Luo W, Ma L, Jiang W, Zhu X, Li Z, Liu W (2017) Real-time neural style transfer for videos. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 783–791
5.
go back to reference Gupta A, Johnson J, Alahi A, Fei-Fei L (2017) Characterizing and improving stability in neural style transfer. In: IEEE International Conference on Computer Vision, pp. 4067–4076 Gupta A, Johnson J, Alahi A, Fei-Fei L (2017) Characterizing and improving stability in neural style transfer. In: IEEE International Conference on Computer Vision, pp. 4067–4076
6.
go back to reference Chen X, Zhang Y, Wang Y, Shu H, Xu C, Xu C (2020) Optical flow distillation: towards efficient and stable video style transfer. In: European Conference on Computer Vision, pp. 614–630. Springer Chen X, Zhang Y, Wang Y, Shu H, Xu C, Xu C (2020) Optical flow distillation: towards efficient and stable video style transfer. In: European Conference on Computer Vision, pp. 614–630. Springer
7.
go back to reference Chen D, Liao J, Yuan L, Yu N, Hua G (2017) Coherent online video style transfer. In: IEEE International Conference on Computer Vision, pp. 1105–1114 Chen D, Liao J, Yuan L, Yu N, Hua G (2017) Coherent online video style transfer. In: IEEE International Conference on Computer Vision, pp. 1105–1114
8.
go back to reference Wang W, Yang S, Xu J, Liu J (2020) Consistent video style transfer via relaxation and regularization. IEEE Trans Image Process 29:9125–9139CrossRefMATH Wang W, Yang S, Xu J, Liu J (2020) Consistent video style transfer via relaxation and regularization. IEEE Trans Image Process 29:9125–9139CrossRefMATH
9.
go back to reference Deng Y, Tang F, Dong W, Huang H, Ma C, Xu C (2020) Arbitrary video style transfer via multi-channel correlation. arXiv preprint arXiv:2009.08003 Deng Y, Tang F, Dong W, Huang H, Ma C, Xu C (2020) Arbitrary video style transfer via multi-channel correlation. arXiv preprint arXiv:​2009.​08003
10.
go back to reference Johnson J, Alahi A, Fei-Fei L (2016) Perceptual losses for real-time style transfer and super-resolution. In: European Conference on Computer Vision, pp. 694–711. Springer Johnson J, Alahi A, Fei-Fei L (2016) Perceptual losses for real-time style transfer and super-resolution. In: European Conference on Computer Vision, pp. 694–711. Springer
11.
go back to reference Chen D, Yuan L, Liao J, Yu N, Hua G (2017) Stylebank: an explicit representation for neural image style transfer. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1897–1906 Chen D, Yuan L, Liao J, Yu N, Hua G (2017) Stylebank: an explicit representation for neural image style transfer. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1897–1906
13.
go back to reference Huang X, Belongie S (2017) Arbitrary style transfer in real-time with adaptive instance normalization. In: IEEE International Conference on Computer Vision, pp. 1501–1510 Huang X, Belongie S (2017) Arbitrary style transfer in real-time with adaptive instance normalization. In: IEEE International Conference on Computer Vision, pp. 1501–1510
14.
15.
go back to reference Li X, Liu S, Kautz J, Yang M-H (2019) Learning linear transformations for fast image and video style transfer. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3809–3817 Li X, Liu S, Kautz J, Yang M-H (2019) Learning linear transformations for fast image and video style transfer. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3809–3817
16.
go back to reference Park DY, Lee KH (2019) Arbitrary style transfer with style-attentional networks. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 5880–5888 Park DY, Lee KH (2019) Arbitrary style transfer with style-attentional networks. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 5880–5888
17.
go back to reference Yao Y, Ren J, Xie X, Liu W, Liu Y-J, Wang J (2019) Attention-aware multi-stroke style transfer. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1467–1475 Yao Y, Ren J, Xie X, Liu W, Liu Y-J, Wang J (2019) Attention-aware multi-stroke style transfer. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1467–1475
18.
go back to reference Liu S, Lin T, He D, Li F, Wang M, Li X, Sun Z, Li Q, Ding E (2021) Adaattn: revisit attention mechanism in arbitrary neural style transfer. In: IEEE International Conference on Computer Vision, pp. 6649–6658 Liu S, Lin T, He D, Li F, Wang M, Li X, Sun Z, Li Q, Ding E (2021) Adaattn: revisit attention mechanism in arbitrary neural style transfer. In: IEEE International Conference on Computer Vision, pp. 6649–6658
19.
go back to reference Lai W-S, Huang J-B, Wang O, Shechtman E, Yumer E, Yang M-H (2018) Learning blind video temporal consistency. In: European Conference on Computer Vision, pp. 170–185 Lai W-S, Huang J-B, Wang O, Shechtman E, Yumer E, Yang M-H (2018) Learning blind video temporal consistency. In: European Conference on Computer Vision, pp. 170–185
20.
go back to reference Gao W, Li Y, Yin Y, Yang M-H (2020) Fast video multi-style transfer. In: IEEE Winter Conference on Applications of Computer Vision, pp. 3222–3230 Gao W, Li Y, Yin Y, Yang M-H (2020) Fast video multi-style transfer. In: IEEE Winter Conference on Applications of Computer Vision, pp. 3222–3230
21.
22.
go back to reference Zhou Y, Xu X, Shen F, Gao L, Lu H, Shen, HT (2020) Temporal denoising mask synthesis network for learning blind video temporal consistency. In: ACM International Conference on Multimedia, pp. 475–483 Zhou Y, Xu X, Shen F, Gao L, Lu H, Shen, HT (2020) Temporal denoising mask synthesis network for learning blind video temporal consistency. In: ACM International Conference on Multimedia, pp. 475–483
23.
go back to reference He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778
24.
go back to reference Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser Ł, Polosukhin I (2017) Attention is all you need. In: Advances in Neural Information Processing Systems, pp. 5998–6008 Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser Ł, Polosukhin I (2017) Attention is all you need. In: Advances in Neural Information Processing Systems, pp. 5998–6008
25.
go back to reference Arjovsky M, Chintala S, Bottou L (2017) Wasserstein generative adversarial networks. In: International Conference on Machine Learning, pp. 214–223. PMLR Arjovsky M, Chintala S, Bottou L (2017) Wasserstein generative adversarial networks. In: International Conference on Machine Learning, pp. 214–223. PMLR
27.
go back to reference Kolkin N, Salavon J, Shakhnarovich G. (2019) Style transfer by relaxed optimal transport and self-similarity. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 10051–10060 Kolkin N, Salavon J, Shakhnarovich G. (2019) Style transfer by relaxed optimal transport and self-similarity. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 10051–10060
28.
go back to reference Xu K, Wen L, Li G, Qi H, Bo L, Huang Q (2021) Learning self-supervised space-time cnn for fast video style transfer. IEEE Trans Image Process 30:2501–2512CrossRef Xu K, Wen L, Li G, Qi H, Bo L, Huang Q (2021) Learning self-supervised space-time cnn for fast video style transfer. IEEE Trans Image Process 30:2501–2512CrossRef
29.
go back to reference Lin T-Y, Maire M, Belongie S, Hays J, Perona P, Ramanan D, Dollár P, Zitnick CL (2014) Microsoft coco: common objects in context. In: European Conference on Computer Vision, pp. 740–755. Springer Lin T-Y, Maire M, Belongie S, Hays J, Perona P, Ramanan D, Dollár P, Zitnick CL (2014) Microsoft coco: common objects in context. In: European Conference on Computer Vision, pp. 740–755. Springer
30.
go back to reference Saleh B, Elgammal A (2015) Large-scale classification of fine-art paintings: learning the right metric on the right feature. arXiv preprint arXiv:1505.00855 Saleh B, Elgammal A (2015) Large-scale classification of fine-art paintings: learning the right metric on the right feature. arXiv preprint arXiv:​1505.​00855
31.
go back to reference Butler DJ, Wulff J, Stanley GB, Black MJ (2012) A naturalistic open source movie for optical flow evaluation. In: European Conference on Computer Vision, pp. 611–625. Springer Butler DJ, Wulff J, Stanley GB, Black MJ (2012) A naturalistic open source movie for optical flow evaluation. In: European Conference on Computer Vision, pp. 611–625. Springer
32.
go back to reference Pont-Tuset J, Perazzi F, Caelles S, Arbeláez P, Sorkine-Hornung A, Gool LV (2018) The 2017 davis challenge on video object segmentation. arXiv preprint arXiv:1704.00675 Pont-Tuset J, Perazzi F, Caelles S, Arbeláez P, Sorkine-Hornung A, Gool LV (2018) The 2017 davis challenge on video object segmentation. arXiv preprint arXiv:​1704.​00675
Metadata
Title
Correlation-based and content-enhanced network for video style transfer
Authors
Honglin Lin
Mengmeng Wang
Yong Liu
Jiaxin Kou
Publication date
18-09-2022
Publisher
Springer London
Published in
Pattern Analysis and Applications / Issue 1/2023
Print ISSN: 1433-7541
Electronic ISSN: 1433-755X
DOI
https://doi.org/10.1007/s10044-022-01106-y

Other articles of this Issue 1/2023

Pattern Analysis and Applications 1/2023 Go to the issue

Premium Partner