Full Text (PDF)
Original Article

Decoding Deepfake Voices: Investigating Relationship Between Vocal Feature Parameters of Original and Deep Fake Voices by using Advanced Machine Learning Techniques and Statistical Analyses

Shikha Upadhyay, R Murugan

Author Information

Licence:

Attribution-Non-commercial 4.0 International (CC BY-NC 4.0)

This license enables reusers to distribute, remix, adapt, and build upon the material in any medium or format for noncommercial purposes only, and only so long as attribution is given to the creator.


Indian Journal of Forensic Medicine and Pathology 17(4):p 241-247, OCT. DEC. 2024. | DOI: https://doi.org/10.21088/ijfmp.0974.3383.17424.3

How Cite This Article:

Upadhyay S, Murugan R. Decoding Deepfake Voices: Investigating Relationship Between Vocal Feature Parameters of Original and Deep Fake Voices by using Advanced Machine Learning Techniques and Statistical Analyses. Indian J Forensic Med Pathol. 2024;17(4):241-247.

Timeline

Received : June 19, 2024         Accepted : August 21, 2024          Published : December 15, 2024

Abstract

Context: The rapid advancement of deepfake technology has posed a significant challenge in the field of audio authentication, blurring the line between real and manipulated voices. Aims: This research aims to understand the intricate relationship between vocal feature parameters of authentic and deepfake voices and contribute to deepfake detection by analyzing vocal features and employing AI methodologies. Settings and Design: The study utilized the “DEEP-VOICE: DeepFake Voice Recognition” dataset and employed correlation analysis, descriptive statistics, visualization techniques and machine learning models. Methods and Material: Vocal features like duration, frequency, MFCCs and others were extracted. Correlation matrices, descriptive statistics and comparative visualizations were used to analyze differences between real and deepfake voices. Logistic Regression models were trained on the vocal features. Statistical analysis used: Correlation coefficients, means, standard deviations and machine learning model evaluation metrics were employed. Results: Features like Duration, Number of Pulses and Voice Breaks exhibited negative correlations with deepfakes, while Mean Frequency and Median Frequency showed positive correlations with real voices. Descriptive statistics and visualizations highlighted differences in pitch and spectral characteristics. Logistic Regression models achieved promising performance, with pitch-related features being most effective. Conclusions: The study identified key vocal feature parameters associated with real and deepfake voices, providing insights for developing robust voice authentication techniques and advancing deepfake detection.


References

  • 1.   Chaudhuri S, Bhattacharjee D, Jana S. Voice Authentication Techniques: A Comprehensive Review. IEEE Access. 2022;10:85455-85475.
  • 2.   Wang Y, Zhang L, Liu Y. Investigating Vocal Characteristics in Deepfake Voice Recognition: An Experimental Study. J Artif Intell Res. 2024;40:521-537.
  • 3.   Li X, Li Q. Deepfake Detection: A Survey. IEEE Trans Inf Forensics Secur. 2023;15:2543-2565.
  • 4.   Patel D, Gupta R. Exploring Voice Biometrics for Deepfake Detection: A Review. IEEE Trans Multimed. 2022;24(9):2411-2425.
  • 5.   Todisco M, Wang X, Sahidullah M, Delgado H, Nautsch A, Yamagishi J, et al. ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection. Interspeech. 2019:1008-1012.
  • 6.   Garcia R, Kumar V. Deep Learning Approaches for Voice Authentication: A Survey. IEEE Trans Neural Netw Learn Syst. 2023;35(8):3789-3806.
  • 7.   Johnson B, Smith A. Investigating Vocal Feature Extraction Techniques: A Comparative Analysis. J Pattern Recognit. 2021;45(3):211-225.
  • 8.   Alzantot M, Wang Z, Srivastava MB.Deep Residual Neural Networks for Audio Spoofing Detection. Interspeech. 2019:1078-1082.
  • 9.   Liu Y, Zhang L, Wang Y. Analyzing Vocal Characteristics in Deepfake Voice Recognition: An Experimental Study. J Artif Intell Res. 2024;40:521-537.
  • 10.   Kinnunen T, Sahidullah M, Delgado H, Todisco M, Evans N, Yamagishi J, et al. The ASVspoof 2017 Challenge: Assessing the Limits of Replay Spoofing Attack Detection. Interspeech. 2017:2-6.
  • 11.   Smith A, Johnson B, Williams C. Analyzing Vocal Features for Voice Biometrics: A Comparative Study. J Signal Process Syst. 2021;25(2):315-332.
  • 12.   Wang H, Zhang M. Voice Authentication Using Machine Learning Techniques: A Review. Expert Syst Appl. 2023;98:58-72.
  • 13.   Farid H. Detecting Deep Fakes: A New Approach to Uncovering AI-Generated Synthetic Media. IEEE Signal Process Mag. 2021;38(6):76-84.
  • 14.   Williams C, Smith A, Johnson B. Deepfake Voice Detection: A Comprehensive Survey. IEEE Trans Multimed. 2022;24(10):2789-2803.
  • 15.   Zhang H, Liu J, Zhou Y. Deep Learning-Based Voice Conversion Techniques: A Review. IEEE Trans Audio Speech Lang Process. 2023;31:789-805.
  • 16.   DEEP-VOICE: DeepFake Voice Recognition dataset [Internet]. Kaggle. [Cited 2024 Aug 3]. Available from: https://www.kaggle. com/datasets/mountainanalytics/ deep-voice-deepfake-voice-recognition.
  • 17.   Mishra A, Singh C, Dwivedi A, Singh D, Biswal AK. Network forensics: an approach towards detecting cyber crime. In2021 International Conference in Advances in Power, Signal Information Technology (APSIT) 2021 Oct 8 (pp. 1-6). IEEE.
  • 18.   Tara H, Mishra A. A comparative study of digital forensic tools for data extraction from electronic devices. J. Punjab Acad. Forensic Med. Toxicol. 2021,21(1):97-104.
  • 19.   Singh SK, Mishra A. Digital Forensics and Cybersecurity Tools. InAdvancements in Cybercrime Investigation and Digital Forensics 2023 (pp. 367-382). Apple Academic Press.
  • 20.   National Institute of Justice (US). Technical Working Group on Crime Scene Investigation. Crime Scene Investigation: A Guide for Law Enforcement. US Department of Justice, Office of Justice Programs, National Institute of Justice; 2000.
  • 21.   Dwivedi A, Mishra A, Singh D. Cybersecurity and privacy issues of Blockchain technology. InBlockchain for Information Security and Privacy 2021 Nov 30 (pp. 69-94). Auerbach Publications.
  • 22.   Singla S, Subhash S, Mishra A. Network and Data Analysis Tools for Forensic Science. Modern Forensic Tools and Devices: Trends in Criminal Investigation. 2023 Nov 13:23-39.
  • 23.   Harisha A, Mishra A, Singh C, editors. Advancements in Cybercrime Investigation and Digital Forensics. CRC Press; 2023 Oct 6.
  • 24.   Tyagi V, Singh C, Singh SK, Mishra A. Tools and Protocols for the Analysis of Mobile Phones in Digital Forensics. InAdvancements in Cybercrime Investigation and Digital Forensics 2023 Oct 6 (pp. 229-245). Apple Academic Press.
  • 25.   Barbaro A, Mishra A, editors. Manual of Crime Scene Investigation. CRC Press; 2022 Oct 28.

Data Sharing Statement

There are no additional data available.

Funding

This research received no funding.

Author Contributions

All authors contributed significantly to the work and approve its publication.

Ethics Declaration

This article does not involve any human or animal subjects, and therefore does not require ethics approval.

Acknowledgements

Information not provide.

Conflicts of Interest

The authors report no conflicts of interest in this work.


About this article


Cite this article

Upadhyay S, Murugan R. Decoding Deepfake Voices: Investigating Relationship Between Vocal Feature Parameters of Original and Deep Fake Voices by using Advanced Machine Learning Techniques and Statistical Analyses. Indian J Forensic Med Pathol. 2024;17(4):241-247.


Licence:

Attribution-Non-commercial 4.0 International (CC BY-NC 4.0)

This license enables reusers to distribute, remix, adapt, and build upon the material in any medium or format for noncommercial purposes only, and only so long as attribution is given to the creator.


Received Accepted Published
June 19, 2024 August 21, 2024 December 15, 2024

DOI: https://doi.org/10.21088/ijfmp.0974.3383.17424.3

Keywords

DeepfakeVoice AuthenticationVocal feature parametersArtificial IntelligenceStatistical AnalysisVisualization TechniquesMachine Learning

Article Level Metrics

Last Updated

Thursday 18 June 2026, 02:39:25 (IST)


7110

Accesses

24
2030
00

Citations


NA
NA
NA

Download citation


Article Keywords


Keyword Highlighting

Highlight selected keywords in the article text.


Timeline


Received June 19, 2024
Accepted August 21, 2024
Published December 15, 2024

licence


Attribution-Non-commercial 4.0 International (CC BY-NC 4.0)

This license enables reusers to distribute, remix, adapt, and build upon the material in any medium or format for noncommercial purposes only, and only so long as attribution is given to the creator.


Access this article



Share