Mental Health Diagnosis From Voice Data Using Convolutional Neural Networks and Vision Transformers

R Islam, MT Ahad, F Ahmed, B Song, Y Li - Journal of Voice, 2024 - Elsevier
Summary Integrating Convolutional Neural Networks and Vision Transformers in voice
analysis has unveiled a new horizon in mental health identification. Human voice, a …

Advances in Automated Voice Pathology Detection: A Comprehensive Review of Speech Signal Analysis Techniques

A Sankaran, LS Kumar - IEEE Access, 2024 - ieeexplore.ieee.org
Speech is the principal means of communication among humans for centuries. It arises
when the voice tone produced by the vocal cords is modulated by the articulators giving rise …

Pathological voice classification using MEEL features and SVM-TabNet model

M Zakariah, M Al-Razgan, T Alfakih - Speech Communication, 2024 - Elsevier
In clinical settings, early diagnosis and objective assessment depend on the detection of
voice pathology. To classify anomalous voices, this work uses an approach that combines …

Voice pathology detection on spontaneous speech data using deep learning models

S Farazi, Y Shekofteh - International Journal of Speech Technology, 2024 - Springer
Speech problems are a common issue that affects people everywhere and can affect the
quality of their lives. The human speech production system involves various components …

Pathological voice classification system based on CNN-BiLSTM network using speech enhancement and multi-stream approach

S Belabbas, D Addou, SA Selouani - International Journal of Speech …, 2024 - Springer
The paper developing a resilient speech classification system for individuals with voice
disorders poses a formidable challenge due to the significant variability and distortions …

Resilient embedded system for classification respiratory diseases in a real time

AF Mahmood, AM Alkababji, A Daood - Biomedical Signal Processing and …, 2024 - Elsevier
Listening to lung sounds using a stethoscope is still one of the most important methods to
diagnose respiratory diseases. These sounds are complex and challenging to diagnose, as …

Detecting Forged Audio Files Using “Mixed Paste” Command: A Deep Learning Approach Based on Korean Phonemic Features

Y Son, JW Park - Sensors, 2024 - mdpi.com
The ubiquity of smartphones today enables the widespread utilization of voice recording for
diverse purposes. Consequently, the submission of voice recordings as digital evidence in …

MSDFEN: Multi-scale dynamic feature extraction network for pathological voice detection

Z Dai, Y Jiang, L Cao, X Zhang, Z Tao - Applied Acoustics, 2025 - Elsevier
With the rapid development of deep learning methods, their application in pathological voice
detection has become increasingly extensive, yielding promising results. However, most …

A modular deep learning architecture for voice pathology classification

I Miliaresi, A Pikrakis - IEEE Access, 2023 - ieeexplore.ieee.org
The development of methods that combine different sources of information for medical
diagnosis is an essential challenge in the field of medical informatics. In this context, we …

Voice Pathology Detection Based on Canonical Correlation Analysis Method Using Hilbert–Huang Transform and LSTM Features

MB Er, N İlhan - Arabian Journal for Science and Engineering, 2024 - Springer
Voice disorders, affecting pitch, intensity, and quality of sound from the larynx, significantly
impact individuals' quality of life. The growing attention from researchers underscores the …