Speech processing: MFCC based feature extraction techniques-an investigation

P Vitolo, R Liguori, L Di Benedetto… - IEEE Signal …, 2023 - ieeexplore.ieee.org

The accuracy and computational complexity of keyword spotting (KWS) systems are heavily
influenced by the choice of audio features in speech signals. This letter introduces a novel …

被引用次数：5 相关文章所有 3 个版本

[PDF] academia.edu

[PDF][PDF] Machine learning for Arabic phonemes recognition using electrolarynx speech

ZJM Ameen, AA Kadhim - International Journal of Electrical and …, 2023 - academia.edu

Automatic speech recognition system is one of the essential ways of interaction with
machines. Interests in speech based intelligent systems have grown in the past few …

被引用次数：8 相关文章所有 6 个版本

[PDF] academia.edu

[PDF][PDF] Classification of three pathological voices based on specific features groups using support vector machine

M Altayeb, A Al-Ghraibah - International Journal of Electrical and …, 2022 - academia.edu

Determining and classifying pathological human sounds are still an interesting area of
research in the field of speech processing. This paper explores different methods of voice …

被引用次数：11 相关文章所有 5 个版本

Privacy and security of smart systems

K Suresh Kumar, D Prabakaran… - … for sustainable smart …, 2022 - Wiley Online Library

A smart city is a digitized urban area with a collection of electronic sensor nodes for data
collection to maintain the resources and various features efficiently. The smart systems …

被引用次数：5 相关文章

[PDF] mdpi.com

Advancing Cough Classification: Swin Transformer vs. 2D CNN with STFT and Augmentation Techniques

M Ghourabi, F Mourad-Chehade, A Chkeir - Electronics, 2024 - mdpi.com

Coughing, a common symptom associated with various respiratory problems, is a crucial
indicator for diagnosing and tracking respiratory diseases. Accurate identification and …

被引用次数：2 相关文章所有 2 个版本

[PDF] nsf.gov

Audio Classifier for Endangered Language Analysis and Education

M Reddy, M Chen - International Conference on Artificial Intelligence in …, 2023 - Springer

Around 42% of the world languages are considered endangered due to the decline in the
number of speakers. MeTILDA (Melodic Transcription in Language Documentation and …

被引用次数：1 相关文章所有 2 个版本

Application and Improvement of MFCC in Gesture Recognition with Surface Electromyography

S Zhu, D Wang, Q Hu, H Wu, F Fang… - International Journal of …, 2024 - World Scientific

As a physiological signal reflecting the state of muscle activation, surface electromyography
(sEMG) plays a vital role in the assessment of neuromuscular health, human–computer …

被引用次数：1 相关文章

[PDF] cell.com Full View

Optimizing the configuration of deep learning models for music genre classification

T Li - Heliyon, 2024 - cell.com

Music genre categorization is a fundamental use of sound processing methods in the realm
of music retrieval. Typically, people are responsible for categorizing music genres. Machine …

被引用次数：5 相关文章所有 7 个版本

[PDF] ijicic.org

[PDF][PDF] Short Time Fourier Transform in Reinvigorating Distinctive Facts of Individual Spectral Centroid of Mel Frequency Numeric for Security Authentication

HI Pratiwi, W Budiharto, IH Kartowisastro… - International Journal of …, 2024 - ijicic.org

Human throat and mouth anatomy attribute to the uniqueness of a human voice, speech
patterns or in Mel Scale is called spectral centroids. In general, speeches frequencies differ …

被引用次数：1 相关文章

[PDF] arxiv.org

Gammatonegram representation for end-to-end dysarthric speech processing tasks: Speech recognition, speaker identification, and intelligibility assessment

A Farhadipour, H Veisi - Iran Journal of Computer Science, 2024 - Springer

Dysarthria is a disability that causes a disturbance in the human speech system and reduces
the quality and intelligibility of a person's speech. Because of this effect, the normal speech …

被引用次数：3 相关文章所有 4 个版本