Reimagining speech: a scoping review of deep learning-based methods for non-parallel voice conversion

AR Bargum, S Serafin, C Erkut - Frontiers in Signal Processing, 2024 - frontiersin.org
Research on deep learning-powered voice conversion (VC) in speech-to-speech scenarios
are gaining increasing popularity. Although many of the works in the field of voice …

Unsupervised domain adaptation with and without access to source data for estimating occupancy and recognizing activities in smart buildings

J Dridi, M Amayri, N Bouguila - Building and Environment, 2023 - Elsevier
Energy-efficient buildings have gained increasing interest in the last decades as they
provide optimal energy management. With the emergence of smart homes, many smart tools …

Reimagining Speech: A Scoping Review of Deep Learning-Powered Voice Conversion

AR Bargum, S Serafin, C Erkut - arXiv preprint arXiv:2311.08104, 2023 - arxiv.org
Research on deep learning-powered voice conversion (VC) in speech-to-speech scenarios
is getting increasingly popular. Although many of the works in the field of voice conversion …

GLGAN-VC: A guided loss-based generative adversarial network for many-to-many voice conversion

S Dhar, ND Jana, S Das - IEEE Transactions on Neural …, 2023 - ieeexplore.ieee.org
Many-to-many voice conversion (VC) is a technique aimed at mapping speech features
between multiple speakers during training and transferring the vocal characteristics of one …

RAGAN: Regression attention generative adversarial networks

X Jiang, Z Ge - IEEE Transactions on Artificial Intelligence, 2022 - ieeexplore.ieee.org
Despite surrounding by Big Data, we still need to learn from insufficient data in many
scenarios. Building an accurate regression model for a small amount of data is a pretty tricky …

[PDF][PDF] Fighting Disinformation: Overview of Recent AI-Based Collaborative Human-Computer Interaction for Intelligent Decision Support Systems.

T Polzehl, V Schmitt, N Feldhus, J Meyer… - VISIGRAPP (2 …, 2023 - scitepress.org
Methods for automatic disinformation detection have gained much attention in recent years,
as false information can have a severe impact on societal cohesion. Disinformation can …

Region normalized capsule network based generative adversarial network for non-parallel voice conversion

MT Akhter, P Banerjee, S Dhar, S Ghosh… - … Conference on Speech …, 2023 - Springer
Voice conversion (VC) involves altering the vocal characteristics of a source speaker to
resemble those of a target speaker while maintaining the same linguistic content. Recently …

Generating long financial report using conditional variational autoencoders with knowledge distillation

Z Wang, Y Ren, X Zhang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Generating financial reports from a piece of news is a challenging task due to the lack of
sufficient background knowledge to effectively generate long financial reports. To address …

Voice conversion using feature specific loss function based self-attentive generative adversarial network

S Dhar, P Banerjee, ND Jana… - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org
Voice conversion (VC) is the process of converting the vocal texture of a source speaker
similar to that of a target speaker without altering the content of the source speaker's speech …

An analysis of performance evaluation metrics for voice conversion models

MT Akhter, P Banerjee, S Dhar… - 2022 IEEE 19th India …, 2022 - ieeexplore.ieee.org
The process of transforming a source speaker's vocal style or vocal feature to that of a target
speaker while keeping the linguistic information of the source speaker unchanged is known …