A comprehensive empirical review of modern voice activity detection approaches for movies and TV shows

M Sharma, S Joshi, T Chatterjee, R Hamid - Neurocomputing, 2022 - Elsevier
A robust and language agnostic Voice Activity Detection (VAD) is crucial for Digital
Entertainment Content (DEC). Primary examples of DEC include movies and TV series …

Surrogate modelling for an aircraft dynamic landing loads simulation using an LSTM AutoEncoder-based dimensionality reduction approach

M Lazzara, M Chevalier, M Colombo, JG Garcia… - Aerospace Science and …, 2022 - Elsevier
Surrogate modelling can alleviate the computational burden of design activities as they rely
on multiple evaluations of high-fidelity models. However, the learning task can be adversely …

A fusion methodology to bridge GPS outages for INS/GPS integrated navigation system

Y Zhang - IEEE access, 2019 - ieeexplore.ieee.org
The performance of an inertial navigation system (INS) and global positioning system (GPS)
integrated navigation system is reduced during GPS outages. To bridge GPS outages, a …

[HTML][HTML] Music generation system for adversarial training based on deep learning

J Min, Z Liu, L Wang, D Li, M Zhang, Y Huang - Processes, 2022 - mdpi.com
With the rapid development of artificial intelligence, the application of this new technology to
music generation has attracted more attention and achieved gratifying results. This study …

An introduction to signal processing for singing-voice analysis: High notes in the effort to automate the understanding of vocals in music

EJ Humphrey, S Reddy, P Seetharaman… - IEEE Signal …, 2018 - ieeexplore.ieee.org
Humans have devised a vast array of musical instruments, but the most prevalent instrument
remains the human voice. Thus, techniques for applying audio signal processing methods to …

[HTML][HTML] Deep learning approaches for thermographic imaging

P Kovács, B Lehner, G Thummerer, G Mayr… - Journal of Applied …, 2020 - pubs.aip.org
In this paper, we investigate two deep learning approaches to recovering initial temperature
profiles from thermographic images in non-destructive material testing. First, we trained a …

[PDF][PDF] Zero-Mean Convolutions for Level-Invariant Singing Voice Detection.

J Schlüter, B Lehner - ISMIR, 2018 - ismir2018.ismir.net
State-of-the-art singing voice detectors are based on classifiers trained on annotated
examples. As recently shown, such detectors have an important weakness: Since singing …

A Real-Time Framework for Automatic Sarcasm Detection Using Proposed Tensor-DNN-50 Algorithm

JS Murthy, GM Siddesh - … Conference on Frontiers in Computing and …, 2023 - Springer
Social media platforms such as Twitter, Facebook, and Instagram are vast repositories of
trending global news. They generate an enormous amount of data, offering a valuable …

[HTML][HTML] Singing voice detection in opera recordings: A case study on robustness and generalization

M Krause, M Müller, C Weiß - Electronics, 2021 - mdpi.com
Automatically detecting the presence of singing in music audio recordings is a central task
within music information retrieval. While modern machine-learning systems produce high …

Basic filters for convolutional neural networks applied to music: Training or design?

M Dörfler, T Grill, R Bammer, A Flexer - Neural Computing and …, 2020 - Springer
When convolutional neural networks are used to tackle learning problems based on music
or other time series, raw one-dimensional data are commonly preprocessed to obtain …