Real-time voice conversion using artificial neural networks with rectified linear units.

SH Mohammadi, A Kain - Speech Communication, 2017 - Elsevier

Voice transformation (VT) aims to change one or more aspects of a speech signal while
preserving linguistic information. A subset of VT, Voice conversion (VC) specifically aims to …

Cited by 352

Deepfakes as a threat to a speaker and facial recognition: An overview of tools and attack vectors

A Firc, K Malinka, P Hanáček - Heliyon, 2023 - cell.com

Deepfakes present an emerging threat in cyberspace. Recent developments in machine
learning make deepfakes highly believable, and very difficult to differentiate between what is …

Cited by 29

Utilizing AlexNet deep transfer learning for ear recognition

A Abd Almisreb, N Jamil, NM Din - 2018 fourth international …, 2018 - ieeexplore.ieee.org

Transfer Learning is an efficient approach of solving classification problem with little amount
of data. In this paper, we applied Transfer Learning to the well-known AlexNet Convolution …

Cited by 140

Generation and detection of manipulated multimodal audiovisual content: Advances, trends and open challenges

H Liz-Lopez, M Keita, A Taleb-Ahmed, A Hadid… - Information …, 2024 - Elsevier

Generative deep learning techniques have invaded the public discourse recently. Despite
the advantages, the applications to disinformation are concerning as the counter-measures …

Cited by 21

Voice conversion using deep neural networks with speaker-independent pre-training

SH Mohammadi, A Kain - 2014 IEEE Spoken Language …, 2014 - ieeexplore.ieee.org

In this study, we trained a deep autoencoder to build compact representations of short-term
spectra of multiple speakers. Using this compact representation as mapping features, we …

Cited by 124


Voice Conversion Across Arbitrary Speakers Based on a Single Target-Speaker Utterance.

S Liu, J Zhong, L Sun, X Wu, X Liu, H Meng - Interspeech, 2018 - se.cuhk.edu.hk

Developing a voice conversion (VC) system for a particular speaker typically requires
considerable data from both the source and target speakers. This paper aims to effectuate …

Cited by 68

GPU-based parallel optimization of immune convolutional neural network and embedded system

T Gong, T Fan, J Guo, Z Cai - Engineering Applications of Artificial …, 2017 - Elsevier

Up to now, the image recognition system has been utilized more and more widely in the
security monitoring, the industrial intelligent monitoring, the unmanned vehicle, and even the …

Cited by 55

Towards low-resource StarGAN voice conversion using weight adaptive instance normalization

M Chen, Y Shi, T Hain - ICASSP 2021-2021 IEEE International …, 2021 - ieeexplore.ieee.org

Many-to-many voice conversion with non-parallel training data has seen significant progress
in recent years. It is challenging because of lacking of ground truth parallel data. StarGAN …

Cited by 16

The protection of megascience projects from deepfake technologies threats: information law aspects

EI Galyashina, VD Nikishin - Journal of Physics: Conference …, 2022 - iopscience.iop.org

The paper examines the potential threats of the malicious use of deepfake technology to
destabilize and discredit megascience projects in the global information space. The …

Cited by 8


Deep transfer learning for human identification based on footprint: A comparative study

MMA Abuqadumah, MAM Ali… - … of Engineering and …, 2019 - researchgate.net

Identifying people based on their footprint has not yet gained enough attention from the
researchers. Therefore, in this paper, an investigation of human identification conducted …

Cited by 12