Skip to main content
Top
Published in: Multimedia Systems 4/2023

04-06-2023 | Regular Paper

FDS_2D: rethinking magnitude-phase features for DeepFake detection

Authors: Gaoming Yang, Anxing Wei, Xianjin Fang, Ji Zhang

Published in: Multimedia Systems | Issue 4/2023

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

To reduce the harm of forged information, more and more detection methods use frequency domain information. They mostly take spectra as clues to identify fake content. However, the current work tends to use only one of the magnitude and phase spectra for learning. In this paper, we notice that the magnitude and phase spectrum contain different image information. Only one spectrum is easily disturbed by noise, and the robustness of the method is difficult to guarantee. Therefore, we propose the Frequency Domain Separable DeepFake Detection (FDS_2D), which is a multi-branch network to obtain features in different frequency spectra. In FDS_2D, the spectral information is divided into three categories: the magnitude spectrum, the phase spectrum, and the relationship between the two spectra. According to their characteristics, we design independent modules for feature extraction from them. Moreover, to improve the utilization efficiency of multi-features, we propose a multi-input multi-output attention mechanism for information interaction between branches. The experimental results show that each part of FDS_2D effectively extracts and applies spectral information; The comprehensive performance of our model is verified on FaceForensic +  + , Celeb-DF, and DFDC. It proves that the ability of FDS_2D to detect DeepFake is not inferior to existing models.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
4.
go back to reference Kingma, D.P., Welling, M.: Auto-encoding variational bayes. Stat 1050, 1 (2014)MATH Kingma, D.P., Welling, M.: Auto-encoding variational bayes. Stat 1050, 1 (2014)MATH
5.
go back to reference Ho, J., Jain, A., Abbeel, P.: Denoising diffusion probabilistic models. Adv. Neural. Inf. Process. Syst. 33, 6840–6851 (2020) Ho, J., Jain, A., Abbeel, P.: Denoising diffusion probabilistic models. Adv. Neural. Inf. Process. Syst. 33, 6840–6851 (2020)
13.
go back to reference Coccomini DA, Messina N, Gennaro C, et al (2022) Combining efficientnet and vision transformers for video deepfake detection. In: Image Analysis and Processing–ICIAP 2022: 21st International Conference, Lecce, Italy, May 23–27, 2022, Proceedings, Part III, Springer, pp 219–229, https://doi.org/10.1007/978-3-031-06433-3 19 Coccomini DA, Messina N, Gennaro C, et al (2022) Combining efficientnet and vision transformers for video deepfake detection. In: Image Analysis and Processing–ICIAP 2022: 21st International Conference, Lecce, Italy, May 23–27, 2022, Proceedings, Part III, Springer, pp 219–229, https://​doi.​org/​10.​1007/​978-3-031-06433-3 19
14.
go back to reference Durall R, Keuper M, Pfreundt FJ, et al (2019) Unmasking deepfakes with simple features. CoRR abs/1911.00686 Durall R, Keuper M, Pfreundt FJ, et al (2019) Unmasking deepfakes with simple features. CoRR abs/1911.00686
17.
go back to reference Qian Y, Yin G, Sheng L, et al (2020) Thinking in frequency: Face forgery detection by mining frequency-aware clues. In: Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XII, Springer, pp 86–103, https://doi.org/10.1007/978-3-030-58610-2 6 Qian Y, Yin G, Sheng L, et al (2020) Thinking in frequency: Face forgery detection by mining frequency-aware clues. In: Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XII, Springer, pp 86–103, https://​doi.​org/​10.​1007/​978-3-030-58610-2 6
18.
go back to reference Odena, A., Dumoulin, V., Olah, C.: Deconvolution and checkerboard artifacts. Distill 1(10), e3 (2016)CrossRef Odena, A., Dumoulin, V., Olah, C.: Deconvolution and checkerboard artifacts. Distill 1(10), e3 (2016)CrossRef
19.
go back to reference Azulay, A., Weiss, Y.: Why do deep convolutional networks generalize so poorly to small image transformations? J. Mach. Learn. Res. 20, 1–25 (2019)MathSciNetMATH Azulay, A., Weiss, Y.: Why do deep convolutional networks generalize so poorly to small image transformations? J. Mach. Learn. Res. 20, 1–25 (2019)MathSciNetMATH
24.
go back to reference Kaiser L, Gomez AN, Chollet F (2018) Depthwise separable convolutions for neural machine translation. In: International Conference on Learning Representations Kaiser L, Gomez AN, Chollet F (2018) Depthwise separable convolutions for neural machine translation. In: International Conference on Learning Representations
29.
go back to reference Vaswani A, Shazeer N, Parmar N, et al (2017) Attention is all you need. Advances in Neural Information Processing Systems 30 Vaswani A, Shazeer N, Parmar N, et al (2017) Attention is all you need. Advances in Neural Information Processing Systems 30
30.
go back to reference Dosovitskiy A, Beyer L, Kolesnikov A, et al (2020) An image is worth 16x16 words: Transformers for image recognition at scale. CoRR abs/2010.11929 Dosovitskiy A, Beyer L, Kolesnikov A, et al (2020) An image is worth 16x16 words: Transformers for image recognition at scale. CoRR abs/2010.11929
Metadata
Title
FDS_2D: rethinking magnitude-phase features for DeepFake detection
Authors
Gaoming Yang
Anxing Wei
Xianjin Fang
Ji Zhang
Publication date
04-06-2023
Publisher
Springer Berlin Heidelberg
Published in
Multimedia Systems / Issue 4/2023
Print ISSN: 0942-4962
Electronic ISSN: 1432-1882
DOI
https://doi.org/10.1007/s00530-023-01118-6

Other articles of this Issue 4/2023

Multimedia Systems 4/2023 Go to the issue