[HTML] Overview of voice conversion methods based on deep learning
T Walczyna, Z Piotrowski - Applied sciences, 2023 - mdpi.com
Voice conversion is a process where the essence of a speaker's identity is seamlessly
transferred to another speaker, all while preserving the content of their speech. This usage is …
[HTML] A review of recent advances on deep learning methods for audio-visual speech recognition
This article provides a detailed review of recent advances in audio-visual speech
recognition (AVSR) methods that have been developed over the last decade (2013–2023) …
[HTML] FusionNet: a convolution–transformer fusion network for hyperspectral image classification
L Yang, Y Yang, J Yang, N Zhao, L Wu, L Wang… - Remote Sensing, 2022 - mdpi.com
In recent years, deep-learning-based hyperspectral image (HSI) classification networks
have become one of the most dominant implementations in HSI classification tasks. Among …
[HTML] Detecting deepfake voice using explainable deep learning techniques
Fake media, generated by methods such as deepfakes, have become indistinguishable from
real media, but their detection has not improved at the same pace. Furthermore, the absence …
[HTML] A hybrid algorithm with Swin transformer and convolution for cloud detection
Cloud detection is critical in remote sensing image processing, and convolutional neural
networks (CNNs) have significantly advanced this field. However, traditional CNNs primarily …
[HTML] Multi-modal adaptive fusion transformer network for the estimation of depression level
Depression is a severe psychological condition that affects millions of people worldwide. As
depression has received more attention in recent years, it has become imperative to develop …
[HTML] Advances and challenges in deep learning-based change detection for remote sensing images: A review through various learning paradigms
Change detection (CD) in remote sensing (RS) imagery is a pivotal method for detecting
changes in the Earth's surface, finding wide applications in urban planning, disaster …
[HTML] Automatic speech recognition for Uyghur, Kazakh, and Kyrgyz: An overview
With the emergence of deep learning, the performance of automatic speech recognition
(ASR) systems has remarkably improved. Especially for resource-rich languages such as …
[HTML] A CNN-based method for enhancing boring vibration with time-domain convolution-augmented transformer
H Zhang, J Li, G Cai, Z Chen, H Zhang - Insects, 2023 - mdpi.com
Simple Summary Trunk-boring insects have emerged as one of the most threatening forest
pests globally, causing significant damage to forests. Certain groups of larvae reside within …
[HTML] Interp-sum: Unsupervised video summarization with piecewise linear interpolation
UN Yoon, MD Hong, GS Jo - Sensors, 2021 - mdpi.com
This paper addresses the problem of unsupervised video summarization. Video
summarization helps people browse large-scale videos easily with a summary from the …