Online, loudness-invariant vocal detection in mixed music signals

A comprehensive empirical review of modern voice activity detection approaches for movies and TV shows

M Sharma, S Joshi, T Chatterjee, R Hamid - Neurocomputing, 2022 - Elsevier

A robust and language agnostic Voice Activity Detection (VAD) is crucial for Digital
Entertainment Content (DEC). Primary examples of DEC include movies and TV series …

Cited by 13 · Related articles · All 4 versions

Surrogate modelling for an aircraft dynamic landing loads simulation using an LSTM AutoEncoder-based dimensionality reduction approach

M Lazzara, M Chevalier, M Colombo, JG Garcia… - Aerospace Science and …, 2022 - Elsevier

Surrogate modelling can alleviate the computational burden of design activities as they rely
on multiple evaluations of high-fidelity models. However, the learning task can be adversely …

Cited by 38 · Related articles · All 3 versions

A fusion methodology to bridge GPS outages for INS/GPS integrated navigation system

Y Zhang - IEEE access, 2019 - ieeexplore.ieee.org

The performance of an inertial navigation system (INS) and global positioning system (GPS)
integrated navigation system is reduced during GPS outages. To bridge GPS outages, a …

Cited by 79 · Related articles · All 4 versions

Music generation system for adversarial training based on deep learning

J Min, Z Liu, L Wang, D Li, M Zhang, Y Huang - Processes, 2022 - mdpi.com

With the rapid development of artificial intelligence, the application of this new technology to
music generation has attracted more attention and achieved gratifying results. This study …

Cited by 17 · Related articles · All 5 versions

An introduction to signal processing for singing-voice analysis: High notes in the effort to automate the understanding of vocals in music

EJ Humphrey, S Reddy, P Seetharaman… - IEEE Signal …, 2018 - ieeexplore.ieee.org

Humans have devised a vast array of musical instruments, but the most prevalent instrument
remains the human voice. Thus, techniques for applying audio signal processing methods to …

Cited by 52 · Related articles · All 5 versions

Deep learning approaches for thermographic imaging

P Kovács, B Lehner, G Thummerer, G Mayr… - Journal of Applied …, 2020 - pubs.aip.org

In this paper, we investigate two deep learning approaches to recovering initial temperature
profiles from thermographic images in non-destructive material testing. First, we trained a …

Cited by 28 · Related articles · All 8 versions

Zero-Mean Convolutions for Level-Invariant Singing Voice Detection

J Schlüter, B Lehner - ISMIR, 2018 - ismir2018.ismir.net

State-of-the-art singing voice detectors are based on classifiers trained on annotated
examples. As recently shown, such detectors have an important weakness: Since singing …

Cited by 39 · Related articles · All 8 versions

A Real-Time Framework for Automatic Sarcasm Detection Using Proposed Tensor-DNN-50 Algorithm

JS Murthy, GM Siddesh - … Conference on Frontiers in Computing and …, 2023 - Springer

Social media platforms such as Twitter, Facebook, and Instagram are vast repositories of
trending global news. They generate an enormous amount of data, offering a valuable …

Cited by 6 · Related articles · All 3 versions

Singing voice detection in opera recordings: A case study on robustness and generalization

M Krause, M Müller, C Weiß - Electronics, 2021 - mdpi.com

Automatically detecting the presence of singing in music audio recordings is a central task
within music information retrieval. While modern machine-learning systems produce high …

Cited by 16 · Related articles · All 7 versions

Basic filters for convolutional neural networks applied to music: Training or design?

M Dörfler, T Grill, R Bammer, A Flexer - Neural Computing and …, 2020 - Springer

When convolutional neural networks are used to tackle learning problems based on music
or other time series, raw one-dimensional data are commonly preprocessed to obtain …

Cited by 29 · Related articles · All 7 versions