Imposing higher-level structure in polyphonic music generation using convolutional restricted boltzmann machines and constraints

S Lattner, M Grachten, G Widmer - Journal of Creative Music …, 2018 - search.informit.org
We introduce a method for imposing higher-level structure on generated, polyphonic music.
A Convolutional Restricted Boltzmann Machine (C-RBM) as a generative model is combined …

Bassnet: A variational gated autoencoder for conditional generation of bass guitar tracks with learned interactive control

M Grachten, S Lattner, E Deruty - Applied Sciences, 2020 - mdpi.com
Deep learning has given AI-based methods for music creation a boost over the past
years. An important challenge in this field is to balance user control and autonomy in music …

Autoencoder-based articulatory-to-acoustic mapping for ultrasound silent speech interfaces

G Gosztolya, Á Pintér, L Tóth, T Grósz… - … Joint Conference on …, 2019 - ieeexplore.ieee.org
When using ultrasound video as input, Deep Neural Network-based Silent Speech
Interfaces usually rely on the whole image to estimate the spectral parameters required for …

Learning transposition-invariant interval features from symbolic music and audio

S Lattner, M Grachten, G Widmer - arXiv preprint arXiv:1806.08236, 2018 - arxiv.org
Many music-theoretical constructs (such as scale types, modes, cadences, and chord types)
are defined in terms of pitch intervals, that is, relative distances between pitches. Therefore, when …

A predictive model for music based on learned interval representations

S Lattner, M Grachten, G Widmer - arXiv preprint arXiv:1806.08686, 2018 - arxiv.org
Connectionist sequence models (e.g., RNNs) applied to musical sequences suffer from two
known problems: First, they have strictly "absolute pitch perception". Therefore, they fail to …

Multi-scale and multi-dimensional modelling of music structure using polytopic graphs

C Louboutin - 2019 - theses.hal.science
In this thesis, we approach these questions by defining and implementing a multi-scale
model for music segment structure description, called Polytopic Graph of Latent Relations …

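[PDF] Autoenkóderen alapuló jellemzőreprezentáció mély neuronhálós, ultrahang-alapú némabeszéd-interfészekben [Autoencoder-based feature representation in deep neural network-based, ultrasound-based silent speech interfaces]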

P Ádám, G Gábor, T László, G Tamás, CT Gábor… - smartlab.tmit.bme.hu
Abstract: Neural network-based silent speech interfaces usually estimate the spectral parameters
from the whole ultrasound image, and the vocoder then turns these parameters into speech …

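[CITATION] Autoenkóderen alapuló jellemzőreprezentáció mély neuronhálós, ultrahang-alapú némabeszéd-interfészekben [Autoencoder-based feature representation in deep neural network-based, ultrasound-based silent speech interfaces]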

Á Pintér, G Gosztolya, L Tóth, T Grósz… - 2019 - Szegedi Tudományegyetem …