Decoding Deepfake Voices: Investigating Relationship Between Vocal Feature Parameters of Original and Deep Fake Voices by using Advanced Machine Learning Techniques and Statistical Analyses
Shikha Upadhyay Research Scholar, Department of Forensic Science, JAIN (Deemed-to-be University), Bangalore 560069, Karnataka, India
R Murugan Professor, School of Computer Science and IT, 3Assistant Professor, Department of Forensic Science, JAIN (Deemed-to-be University), Bangalore 560069, Karnataka, India
Address for correspondence: Shikha Upadhyay, Research Scholar, Department of Forensic Science, JAIN (Deemed-to-be University), Bangalore 560069, Karnataka, India E-mail: shikhasivi@gmail.com
This license enables reusers to distribute, remix, adapt, and build upon the material in any medium or format for noncommercial purposes only, and only so long as attribution is given to the creator.
Upadhyay S, Murugan R. Decoding Deepfake Voices: Investigating Relationship Between Vocal Feature Parameters of Original and Deep Fake Voices by using Advanced Machine Learning Techniques and Statistical Analyses. Indian J Forensic Med Pathol. 2024;17(4):241-247.
Timeline
Received : June 19, 2024
Accepted : August 21, 2024
Published : December 15, 2024
Abstract
Context: The rapid advancement of deepfake technology has posed a significant challenge in the field of audio authentication, blurring the line between real and manipulated voices.
Aims: This research aims to understand the intricate relationship between vocal feature parameters of authentic and deepfake voices and contribute to deepfake detection by analyzing vocal features and employing AI methodologies.
Settings and Design: The study utilized the “DEEP-VOICE: DeepFake Voice Recognition” dataset and employed correlation analysis, descriptive statistics, visualization techniques and machine learning models.
Methods and Material: Vocal features like duration, frequency, MFCCs and others were extracted. Correlation matrices, descriptive statistics and comparative visualizations were used to analyze differences between real and deepfake voices. Logistic Regression models were trained on the vocal features.
Statistical analysis used: Correlation coefficients, means, standard deviations and machine learning model evaluation metrics were employed.
Results: Features like Duration, Number of Pulses and Voice Breaks exhibited negative correlations with deepfakes, while Mean Frequency and Median Frequency showed positive correlations with real voices. Descriptive statistics and visualizations highlighted differences in pitch and spectral characteristics. Logistic Regression models achieved promising performance, with pitch-related features being most effective.
Conclusions: The study identified key vocal feature parameters associated with real and deepfake voices, providing insights for developing robust voice authentication techniques and advancing deepfake detection.
References
1. Chaudhuri S, Bhattacharjee D, Jana S. Voice Authentication Techniques: A Comprehensive Review. IEEE Access. 2022;10:85455-85475.
2. Wang Y, Zhang L, Liu Y. Investigating Vocal Characteristics in Deepfake Voice Recognition: An Experimental Study. J Artif Intell Res. 2024;40:521-537.
3. Li X, Li Q. Deepfake Detection: A Survey. IEEE Trans Inf Forensics Secur. 2023;15:2543-2565.
4. Patel D, Gupta R. Exploring Voice Biometrics for Deepfake Detection: A Review. IEEE Trans Multimed. 2022;24(9):2411-2425.
5. Todisco M, Wang X, Sahidullah M, Delgado H, Nautsch A, Yamagishi J, et al. ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection. Interspeech. 2019:1008-1012.
6. Garcia R, Kumar V. Deep Learning Approaches for Voice Authentication: A Survey. IEEE Trans Neural Netw Learn Syst. 2023;35(8):3789-3806.
7. Johnson B, Smith A. Investigating Vocal Feature Extraction Techniques: A Comparative Analysis. J Pattern Recognit. 2021;45(3):211-225.
8. Alzantot M, Wang Z, Srivastava MB.Deep Residual Neural Networks for Audio Spoofing Detection. Interspeech. 2019:1078-1082.
9. Liu Y, Zhang L, Wang Y. Analyzing Vocal Characteristics in Deepfake Voice Recognition: An Experimental Study. J Artif Intell Res. 2024;40:521-537.
10. Kinnunen T, Sahidullah M, Delgado H, Todisco M, Evans N, Yamagishi J, et al. The ASVspoof 2017 Challenge: Assessing the Limits of Replay Spoofing Attack Detection. Interspeech. 2017:2-6.
11. Smith A, Johnson B, Williams C. Analyzing Vocal Features for Voice Biometrics: A Comparative Study. J Signal Process Syst. 2021;25(2):315-332.
12. Wang H, Zhang M. Voice Authentication Using Machine Learning Techniques: A Review. Expert Syst Appl. 2023;98:58-72.
13. Farid H. Detecting Deep Fakes: A New Approach to Uncovering AI-Generated Synthetic Media. IEEE Signal Process Mag. 2021;38(6):76-84.
14. Williams C, Smith A, Johnson B. Deepfake Voice Detection: A Comprehensive Survey. IEEE Trans Multimed. 2022;24(10):2789-2803.
15. Zhang H, Liu J, Zhou Y. Deep Learning-Based Voice Conversion Techniques: A Review. IEEE Trans Audio Speech Lang Process. 2023;31:789-805.
16. DEEP-VOICE: DeepFake Voice Recognition dataset [Internet]. Kaggle. [Cited 2024 Aug 3]. Available from: https://www.kaggle. com/datasets/mountainanalytics/ deep-voice-deepfake-voice-recognition.
17. Mishra A, Singh C, Dwivedi A, Singh D, Biswal AK. Network forensics: an approach towards detecting cyber crime. In2021 International Conference in Advances in Power, Signal Information Technology (APSIT) 2021 Oct 8 (pp. 1-6). IEEE.
18. Tara H, Mishra A. A comparative study of digital forensic tools for data extraction from electronic devices. J. Punjab Acad. Forensic Med. Toxicol. 2021,21(1):97-104.
19. Singh SK, Mishra A. Digital Forensics and Cybersecurity Tools. InAdvancements in Cybercrime Investigation and Digital Forensics 2023 (pp. 367-382). Apple Academic Press.
20. National Institute of Justice (US). Technical Working Group on Crime Scene Investigation. Crime Scene Investigation: A Guide for Law Enforcement. US Department of Justice, Office of Justice Programs, National Institute of Justice; 2000.
21. Dwivedi A, Mishra A, Singh D. Cybersecurity and privacy issues of Blockchain technology. InBlockchain for Information Security and Privacy 2021 Nov 30 (pp. 69-94). Auerbach Publications.
22. Singla S, Subhash S, Mishra A. Network and Data Analysis Tools for Forensic Science. Modern Forensic Tools and Devices: Trends in Criminal Investigation. 2023 Nov 13:23-39.
23. Harisha A, Mishra A, Singh C, editors. Advancements in Cybercrime Investigation and Digital Forensics. CRC Press; 2023 Oct 6.
24. Tyagi V, Singh C, Singh SK, Mishra A. Tools and Protocols for the Analysis of Mobile Phones in Digital Forensics. InAdvancements in Cybercrime Investigation and Digital Forensics 2023 Oct 6 (pp. 229-245). Apple Academic Press.
25. Barbaro A, Mishra A, editors. Manual of Crime Scene Investigation. CRC Press; 2022 Oct 28.
Data Sharing Statement
There are no additional data available.
Funding
This research received no funding.
Author Contributions
All authors contributed significantly to the work and approve its publication.
Ethics Declaration
This article does not involve any human or animal subjects, and therefore does not require ethics approval.
Acknowledgements
Information not provide.
Conflicts of Interest
The authors report no conflicts of interest in this work.
About this article
Cite this article
Upadhyay S, Murugan R. Decoding Deepfake Voices: Investigating Relationship Between Vocal Feature Parameters of Original and Deep Fake Voices by using Advanced Machine Learning Techniques and Statistical Analyses. Indian J Forensic Med Pathol. 2024;17(4):241-247.
This license enables reusers to distribute, remix, adapt, and build upon the material in any medium or format for noncommercial purposes only, and only so long as attribution is given to the creator.
This license enables reusers to distribute, remix, adapt, and build upon the material in any medium or format for noncommercial purposes only, and only so long as attribution is given to the creator.