[HTML][HTML] Overview of voice conversion methods based on deep learning

T Walczyna, Z Piotrowski - Applied sciences, 2023 - mdpi.com
Voice conversion is a process where the essence of a speaker's identity is seamlessly
transferred to another speaker, all while preserving the content of their speech. This usage is …

[HTML][HTML] A review of recent advances on deep learning methods for audio-visual speech recognition

D Ivanko, D Ryumin, A Karpov - Mathematics, 2023 - mdpi.com
This article provides a detailed review of recent advances in audio-visual speech
recognition (AVSR) methods that have been developed over the last decade (2013–2023) …

[HTML][HTML] FusionNet: a convolution–transformer fusion network for hyperspectral image classification

L Yang, Y Yang, J Yang, N Zhao, L Wu, L Wang… - Remote Sensing, 2022 - mdpi.com
In recent years, deep-learning-based hyperspectral image (HSI) classification networks
have become one of the most dominant implementations in HSI classification tasks. Among …

[HTML][HTML] Detecting deepfake voice using explainable deep learning techniques

SY Lim, DK Chae, SC Lee - Applied Sciences, 2022 - mdpi.com
Fake media, generated by methods such as deepfakes, have become indistinguishable from
real media, but their detection has not improved at the same pace. Furthermore, the absence …

[HTML][HTML] A hybrid algorithm with Swin transformer and convolution for cloud detection

C Gong, T Long, R Yin, W Jiao, G Wang - Remote Sensing, 2023 - mdpi.com
Cloud detection is critical in remote sensing image processing, and convolutional neural
networks (CNNs) have significantly advanced this field. However, traditional CNNs primarily …

[HTML][HTML] Multi-modal adaptive fusion transformer network for the estimation of depression level

H Sun, J Liu, S Chai, Z Qiu, L Lin, X Huang, Y Chen - Sensors, 2021 - mdpi.com
Depression is a severe psychological condition that affects millions of people worldwide. As
depression has received more attention in recent years, it has become imperative to develop …

[HTML][HTML] Advances and challenges in deep learning-based change detection for remote sensing images: A review through various learning paradigms

L Wang, M Zhang, X Gao, W Shi - Remote Sensing, 2024 - mdpi.com
Change detection (CD) in remote sensing (RS) imagery is a pivotal method for detecting
changes in the Earth's surface, finding wide applications in urban planning, disaster …

[HTML][HTML] Automatic speech recognition for Uyghur, Kazakh, and Kyrgyz: An overview

W Du, Y Maimaitiyiming, M Nijat, L Li, A Hamdulla… - Applied Sciences, 2022 - mdpi.com
With the emergence of deep learning, the performance of automatic speech recognition
(ASR) systems has remarkably improved. Especially for resource-rich languages such as …

[HTML][HTML] A CNN-based method for enhancing boring vibration with time-domain convolution-augmented transformer

H Zhang, J Li, G Cai, Z Chen, H Zhang - Insects, 2023 - mdpi.com
Simple Summary Trunk-boring insects have emerged as one of the most threatening forest
pests globally, causing significant damage to forests. Certain groups of larvae reside within …

[HTML][HTML] Interp-sum: Unsupervised video summarization with piecewise linear interpolation

UN Yoon, MD Hong, GS Jo - Sensors, 2021 - mdpi.com
This paper addresses the problem of unsupervised video summarization. Video
summarization helps people browse large-scale videos easily with a summary from the …