Reimagining speech: a scoping review of deep learning-based methods for non-parallel voice conversion
Research on deep learning-powered voice conversion (VC) in speech-to-speech scenarios
is gaining increasing popularity. Although many of the works in the field of voice …
Unsupervised domain adaptation with and without access to source data for estimating occupancy and recognizing activities in smart buildings
Energy-efficient buildings have gained increasing interest in the last decades as they
provide optimal energy management. With the emergence of smart homes, many smart tools …
Reimagining Speech: A Scoping Review of Deep Learning-Powered Voice Conversion
Research on deep learning-powered voice conversion (VC) in speech-to-speech scenarios
is getting increasingly popular. Although many of the works in the field of voice conversion …
GLGAN-VC: A guided loss-based generative adversarial network for many-to-many voice conversion
Many-to-many voice conversion (VC) is a technique aimed at mapping speech features
between multiple speakers during training and transferring the vocal characteristics of one …
RAGAN: Regression attention generative adversarial networks
Despite being surrounded by Big Data, we still need to learn from insufficient data in many
scenarios. Building an accurate regression model for a small amount of data is a pretty tricky …
Fighting Disinformation: Overview of Recent AI-Based Collaborative Human-Computer Interaction for Intelligent Decision Support Systems.
Methods for automatic disinformation detection have gained much attention in recent years,
as false information can have a severe impact on societal cohesion. Disinformation can …
Region normalized capsule network based generative adversarial network for non-parallel voice conversion
Voice conversion (VC) involves altering the vocal characteristics of a source speaker to
resemble those of a target speaker while maintaining the same linguistic content. Recently …
Generating long financial report using conditional variational autoencoders with knowledge distillation
Generating financial reports from a piece of news is a challenging task due to the lack of
sufficient background knowledge to effectively generate long financial reports. To address …
Voice conversion using feature specific loss function based self-attentive generative adversarial network
Voice conversion (VC) is the process of converting the vocal texture of a source speaker
similar to that of a target speaker without altering the content of the source speaker's speech …
An analysis of performance evaluation metrics for voice conversion models
The process of transforming a source speaker's vocal style or vocal feature to that of a target
speaker while keeping the linguistic information of the source speaker unchanged is known …