Automatic audio feature extraction for keyword spotting

P Vitolo, R Liguori, L Di Benedetto… - IEEE Signal …, 2023 - ieeexplore.ieee.org
The accuracy and computational complexity of keyword spotting (KWS) systems are heavily
influenced by the choice of audio features in speech signals. This letter introduces a novel …

[PDF][PDF] Machine learning for Arabic phonemes recognition using electrolarynx speech

ZJM Ameen, AA Kadhim - International Journal of Electrical and …, 2023 - academia.edu
Automatic speech recognition system is one of the essential ways of interaction with
machines. Interests in speech based intelligent systems have grown in the past few …

[PDF][PDF] Classification of three pathological voices based on specific features groups using support vector machine

M Altayeb, A Al-Ghraibah - International Journal of Electrical and …, 2022 - academia.edu
Determining and classifying pathological human sounds are still an interesting area of
research in the field of speech processing. This paper explores different methods of voice …

Privacy and security of smart systems

K Suresh Kumar, D Prabakaran… - … for sustainable smart …, 2022 - Wiley Online Library
A smart city is a digitized urban area with a collection of electronic sensor nodes for data
collection to maintain the resources and various features efficiently. The smart systems …

Advancing Cough Classification: Swin Transformer vs. 2D CNN with STFT and Augmentation Techniques

M Ghourabi, F Mourad-Chehade, A Chkeir - Electronics, 2024 - mdpi.com
Coughing, a common symptom associated with various respiratory problems, is a crucial
indicator for diagnosing and tracking respiratory diseases. Accurate identification and …

Audio Classifier for Endangered Language Analysis and Education

M Reddy, M Chen - International Conference on Artificial Intelligence in …, 2023 - Springer
Around 42% of the world languages are considered endangered due to the decline in the
number of speakers. MeTILDA (Melodic Transcription in Language Documentation and …

Application and Improvement of MFCC in Gesture Recognition with Surface Electromyography

S Zhu, D Wang, Q Hu, H Wu, F Fang… - International Journal of …, 2024 - World Scientific
As a physiological signal reflecting the state of muscle activation, surface electromyography
(sEMG) plays a vital role in the assessment of neuromuscular health, human–computer …

Optimizing the configuration of deep learning models for music genre classification

T Li - Heliyon, 2024 - cell.com
Music genre categorization is a fundamental use of sound processing methods in the realm
of music retrieval. Typically, people are responsible for categorizing music genres. Machine …

[PDF][PDF] Short Time Fourier Transform in Reinvigorating Distinctive Facts of Individual Spectral Centroid of Mel Frequency Numeric for Security Authentication

HI Pratiwi, W Budiharto, IH Kartowisastro… - International Journal of …, 2024 - ijicic.org
Human throat and mouth anatomy attribute to the uniqueness of a human voice, speech
patterns or in Mel Scale is called spectral centroids. In general, speeches frequencies differ …

Gammatonegram representation for end-to-end dysarthric speech processing tasks: Speech recognition, speaker identification, and intelligibility assessment

A Farhadipour, H Veisi - Iran Journal of Computer Science, 2024 - Springer
Dysarthria is a disability that causes a disturbance in the human speech system and reduces
the quality and intelligibility of a person's speech. Because of this effect, the normal speech …