Learning Musical Relations using Gated Autoencoders

Imposing higher-level structure in polyphonic music generation using convolutional restricted boltzmann machines and constraints

S Lattner, M Grachten, G Widmer - Journal of Creative Music …, 2018 - search.informit.org

We introduce a method for imposing higher-level structure on generated, polyphonic music.
A Convolutional Restricted Boltzmann Machine (C-RBM) as a generative model is combined …

被引用次数：89 相关文章所有 9 个版本

[PDF] mdpi.com

Bassnet: A variational gated autoencoder for conditional generation of bass guitar tracks with learned interactive control

M Grachten, S Lattner, E Deruty - Applied Sciences, 2020 - mdpi.com

Deep learning has given AI-based methods for music creation a boost by over the past
years. An important challenge in this field is to balance user control and autonomy in music …

被引用次数：20 相关文章所有 8 个版本

[PDF] arxiv.org

Autoencoder-based articulatory-to-acoustic mapping for ultrasound silent speech interfaces

G Gosztolya, Á Pintér, L Tóth, T Grósz… - … Joint Conference on …, 2019 - ieeexplore.ieee.org

When using ultrasound video as input, Deep Neural Network-based Silent Speech
Interfaces usually rely on the whole image to estimate the spectral parameters required for …

被引用次数：22 相关文章所有 8 个版本

[PDF] arxiv.org

Learning transposition-invariant interval features from symbolic music and audio

S Lattner, M Grachten, G Widmer - arXiv preprint arXiv:1806.08236, 2018 - arxiv.org

Many music theoretical constructs (such as scale types, modes, cadences, and chord types)
are defined in terms of pitch intervals---relative distances between pitches. Therefore, when …

被引用次数：17 相关文章所有 7 个版本

[PDF] arxiv.org

A predictive model for music based on learned interval representations

S Lattner, M Grachten, G Widmer - arXiv preprint arXiv:1806.08686, 2018 - arxiv.org

Connectionist sequence models (eg, RNNs) applied to musical sequences suffer from two
known problems: First, they have strictly" absolute pitch perception". Therefore, they fail to …

被引用次数：12 相关文章所有 8 个版本

[PDF] hal.science

Multi-scale and multi-dimensional modelling of music structure using polytopic graphs

C Louboutin - 2019 - theses.hal.science

In this thesis, we approach these questions by defining and implementing a multi-scale
model for music segment structure description, called Polytopic Graph of Latent Relations …

被引用次数：4 相关文章所有 3 个版本

[PDF] bme.hu

[PDF][PDF] Autoenkóderen alapuló jellemzőreprezentáció mély neuronhálós, ultrahang-alapú némabeszéd-interfészekben

P Ádám, G Gábor, T László, G Tamás, CT Gábor… - smartlab.tmit.bme.hu

Kivonat A neurális hálón alapuló némabeszéd-interfészek általában a teljes ultrahangkép
alapján becslik meg a spektrális paramétereket, melyekből a vokóder aztán beszédet …

[引用][C] Autoenkóderen alapuló jellemzőreprezentáció mély neuronhálós, ultrahang-alapú némabeszéd-interfészekben

Á Pintér, G Gosztolya, L Tóth, T Grósz… - 2019 - Szegedi Tudományegyetem …