Advertisement!
Author Information Pack
Editorial Board
Submit article
Special Issue
Editor's selection process
Join as Reviewer/Editor
List of Reviewer
Indexing Information
Most popular articles
Purchase Single Articles
Archive
Free Online Access
Current Issue
Recommend this journal to your library
Advertiser
Accepted Articles
Search Articles
Email Alerts
FAQ
Contact Us
Indian Journal of Forensic Medicine and Pathology

Volume  17, Issue 4, Oct - Dec. 2024, Pages 241-247
 

Original Article

Decoding Deepfake Voices: Investigating Relationship Between Vocal Feature Parameters of Original and Deep Fake Voices by using Advanced Machine Learning Techniques and Statistical Analyses

Shikha Upadhyay1, R Murugan

1Research Scholar, Department of Forensic Science, 2Professor, School of Computer Science and IT,  Department of Forensic Science, JAIN (Deemed-to-be University), Bangalore 560069, Karnataka, India.
 

Choose an option to locate / access this Article:
days Access
Check if you have access through your login credentials.        PDF      |
|

Open Access: View PDF

DOI: https://dx.doi.org/10.21088/ijfmp.0974.3383.17424.3

Abstract

Context: The rapid advancement of deepfake technology has posed a significant challenge in the field of audio authentication, blurring the line between real and manipulated voices. Aims: This research aims to understand the intricate relationship between vocal feature parameters of authentic and deepfake voices and contribute to deepfake detection by analyzing vocal features and employing AI methodologies  Settings and Design: The study utilized the “DEEP-VOICE: DeepFake Voice Recognition” dataset and employed correlation analysis, descriptive statistics, visualization techniques and machine learning models. Methods and Material: Vocal features like duration, frequency, MFCCs and others were extracted. Correlation matrices, descriptive statistics and comparative visualizations were used to analyze differences between real and deepfake voices. Logistic Regression models were trained on the vocal features. Statistical analysis used: Correlation coefficients, means, standard deviations and machine learning model evaluation metrics were employed.  Results: Features like Duration, Number of Pulses and Voice Breaks exhibited negative correlations with deepfakes, while Mean Frequency and Median Frequency showed positive correlations with real voices. Descriptive statistics and visualizations highlighted differences in pitch and spectral characteristics. Logistic Regression models achieved promising performance, with pitch-related features being most effective.  Conclusions: The study identified key vocal feature parameters associated with real and deepfake voices, providing insights for developing robust voice authentication techniques and advancing deepfake detection.


Keywords : Deepfake; Voice Authentication; Vocal feature parameters; Artificial Intelligence; Statistical Analysis; Visualization Techniques; Machine Learning.
Corresponding Author : Shikha Upadhyay