Breaking the glass ceiling for embedding-based classifiers for large output spaces

C Guo, A Mousavi, X Wu… - Advances in …, 2019 - proceedings.neurips.cc
C Guo, A Mousavi, X Wu, DN Holtmann-Rice, S Kale, S Reddi, S Kumar
Advances in Neural Information Processing Systems, 2019proceedings.neurips.cc
In extreme classification settings, embedding-based neural network models are currently not
competitive with sparse linear and tree-based methods in terms of accuracy. Most prior
works attribute this poor performance to the low-dimensional bottleneck in embedding-
based methods. In this paper, we demonstrate that theoretically there is no limitation to using
low-dimensional embedding-based methods, and provide experimental evidence that
overfitting is the root cause of the poor performance of embedding-based methods. These …
Abstract
In extreme classification settings, embedding-based neural network models are currently not competitive with sparse linear and tree-based methods in terms of accuracy. Most prior works attribute this poor performance to the low-dimensional bottleneck in embedding-based methods. In this paper, we demonstrate that theoretically there is no limitation to using low-dimensional embedding-based methods, and provide experimental evidence that overfitting is the root cause of the poor performance of embedding-based methods. These findings motivate us to investigate novel data augmentation and regularization techniques to mitigate overfitting. To this end, we propose GLaS, a new regularizer for embedding-based neural network approaches. It is a natural generalization from the graph Laplacian and spread-out regularizers, and empirically it addresses the drawback of each regularizer alone when applied to the extreme classification setup. With the proposed techniques, we attain or improve upon the state-of-the-art on most widely tested public extreme classification datasets with hundreds of thousands of labels.
proceedings.neurips.cc
以上显示的是最相近的搜索结果。 查看全部搜索结果