A comprehensive survey on automatic speech recognition using neural networks
AS Dhanjal, W Singh - Multimedia Tools and Applications, 2024 - Springer
The continuous development in Automatic Speech Recognition has grown and
demonstrated its enormous potential in Human Interaction Communication systems. It is …
demonstrated its enormous potential in Human Interaction Communication systems. It is …
Trends in audio signal feature extraction methods
G Sharma, K Umapathy, S Krishnan - Applied Acoustics, 2020 - Elsevier
Audio signal processing algorithms generally involves analysis of signal, extracting its
properties, predicting its behaviour, recognizing if any pattern is present in the signal, and …
properties, predicting its behaviour, recognizing if any pattern is present in the signal, and …
A review on classifying abnormal behavior in crowd scene
AA Afiq, MA Zakariya, MN Saad, AA Nurfarzana… - Journal of Visual …, 2019 - Elsevier
Crowd behavior analysis has become one of the new areas of interest in the computer vision
community due to the increasing demands from surveillance and security industries. It is …
community due to the increasing demands from surveillance and security industries. It is …
Detecting respiratory pathologies using convolutional neural networks and variational autoencoders for unbalancing data
MT García-Ordás, JA Benítez-Andrades… - Sensors, 2020 - mdpi.com
The aim of this paper was the detection of pathologies through respiratory sounds. The
ICBHI (International Conference on Biomedical and Health Informatics) Benchmark was …
ICBHI (International Conference on Biomedical and Health Informatics) Benchmark was …
[HTML][HTML] Smart home security solutions using facial authentication and speaker recognition through artificial neural networks
N Saxena, D Varshney - International Journal of Cognitive Computing in …, 2021 - Elsevier
In this paper, a holistic solution for Smart Home Security is implemented which helps in
improving privacy and security using two independent and emerging technologies of facial …
improving privacy and security using two independent and emerging technologies of facial …
Fast evaluation of crack growth path using time series forecasting
This paper aims at forecasting the crack propagation in risk assessment of engineering
structures based on time series algorithms named “long short-term memory” and “multi-layer …
structures based on time series algorithms named “long short-term memory” and “multi-layer …
[PDF][PDF] A review on voice-based interface for human-robot interaction
AA Badr, AK Abdul-Hassan - Iraqi Journal for Electrical and Electronic …, 2020 - iasj.net
With the recent developments of technology and the advances in artificial intelligence and
machine learning techniques, it has become possible for the robot to understand and …
machine learning techniques, it has become possible for the robot to understand and …
Attention-block deep learning based features fusion in wearable social sensor for mental wellbeing evaluations
With the progressive increase of stress, anxiety and depression in working and living
environment, mental health assessment becomes an important social interaction research …
environment, mental health assessment becomes an important social interaction research …
A large-scale uav audio dataset and audio-based uav classification using cnn
The increased popularity and accessibility of UAVs may create potential threats.
Researchers have been developing UAV detection and classification systems with different …
Researchers have been developing UAV detection and classification systems with different …
Bimodal variational autoencoder for audiovisual speech recognition
Multimodal fusion is the idea of combining information in a joint representation of multiple
modalities. The goal of multimodal fusion is to improve the accuracy of results from …
modalities. The goal of multimodal fusion is to improve the accuracy of results from …