[PDF][PDF] Pairwise decomposition with deep neural networks and multiscale kernel subspace learning for acoustic scene classification

E Marchi, D Tonelli, X Xu, F Ringeval, J Deng… - 2016 - opus.bibliothek.uni-augsburg.de
2016opus.bibliothek.uni-augsburg.de
We propose a system for acoustic scene classification using pairwise decomposition with
deep neural networks and dimensionality reduction by multiscale kernel subspace learning.
It is our contribution to the Acoustic Scene Classification task of the IEEE AASP Challenge
on Detection and Classification of Acoustic Scenes and Events (DCASE2016). The system
classifies 15 different acoustic scenes. First, auditory spectral features are extracted and fed
into 15 binary deep multilayer perceptron neural networks (MLP). MLP are trained with the …
Abstract
We propose a system for acoustic scene classification using pairwise decomposition with deep neural networks and dimensionality reduction by multiscale kernel subspace learning. It is our contribution to the Acoustic Scene Classification task of the IEEE AASP Challenge on Detection and Classification of Acoustic Scenes and Events (DCASE2016). The system classifies 15 different acoustic scenes. First, auditory spectral features are extracted and fed into 15 binary deep multilayer perceptron neural networks (MLP). MLP are trained with the ‘one-against-all’paradigm to perform a pairwise decomposition. In a second stage, a large number of spectral, cepstral, energy and voicing-related audio features are extracted. Multiscale Gaussian kernels are then used in constructing optimal linear combination of Gram matrices for multiple kernel subspace learning. The reduced feature set is fed into a nearest-neighbour classifier. Predictions from the two systems are then combined by a threshold-based decision function. On the official development set of the challenge, an accuracy of 81.4% is achieved.
opus.bibliothek.uni-augsburg.de
以上显示的是最相近的搜索结果。 查看全部搜索结果