Data-dependent coresets for compressing neural networks with applications to generalization bounds
We present an efficient coreset-based neural network compression algorithm that sparsifies
the parameters of a trained fully-connected neural network in a manner that provably …
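The compression scheme described here boils down to importance sampling over a layer's edges. Below is a minimal sketch of that idea for one fully-connected layer, assuming a simplified data-dependent score |w_ij| * mean|a_j| in place of the paper's sensitivity terms; the function name and score are illustrative, not the authors' exact algorithm.

```python
import numpy as np

def coreset_sparsify_layer(W, A, m, seed=0):
    """Sparsify one weight matrix W (out x in) by importance sampling.

    A: sample activations (n x in) feeding the layer; m: samples to draw.
    The score |w_ij| * mean|a_j| is a simplified, data-dependent proxy
    for the paper's sensitivities (illustrative only).
    """
    rng = np.random.default_rng(seed)
    a_scale = np.abs(A).mean(axis=0)            # per-input magnitude
    s = np.abs(W) * a_scale[None, :]            # importance of each edge
    p = (s / s.sum()).ravel()                   # sampling distribution
    idx = rng.choice(p.size, size=m, replace=True, p=p)
    W_hat = np.zeros(W.size)
    np.add.at(W_hat, idx, W.ravel()[idx] / (m * p[idx]))  # unbiased reweighting
    return W_hat.reshape(W.shape)

W = np.random.randn(64, 128)
A = np.random.rand(1000, 128)
W_sparse = coreset_sparsify_layer(W, A, m=2000)
print(np.count_nonzero(W_sparse), "nonzeros kept of", W.size)
```

Sampling with replacement and dividing each kept weight by m * p keeps the sparsified layer an unbiased estimator of the original pre-activations, which is the kind of property such guarantees build on.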
The unreasonable effectiveness of structured random orthogonal embeddings
KM Choromanski, M Rowland… - Advances in Neural Information Processing Systems, 2017 - proceedings.neurips.cc
We examine a class of embeddings based on structured random matrices with orthogonal
rows which can be applied in many machine learning applications including dimensionality …
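The core contrast studied here, iid Gaussian rows versus orthogonal rows with matched norms, is easy to reproduce. A sketch under those assumptions, using a QR-based construction (not necessarily the authors' exact estimator):

```python
import numpy as np

def gaussian_orthogonal_matrix(m, d, rng):
    """Random m x d matrix with orthogonal rows (per d-row block), each row
    rescaled so its norm is distributed like an iid Gaussian row's."""
    blocks, rows = [], m
    while rows > 0:
        Q, _ = np.linalg.qr(rng.standard_normal((d, d)))     # orthonormal rows
        norms = np.linalg.norm(rng.standard_normal((d, d)), axis=1)
        k = min(rows, d)
        blocks.append(Q[:k] * norms[:k, None])
        rows -= d
    return np.vstack(blocks)

rng = np.random.default_rng(0)
d = m = 64
x = rng.standard_normal(d)
for name, S in [("iid", rng.standard_normal((m, d))),
                ("orthogonal", gaussian_orthogonal_matrix(m, d, rng))]:
    est = np.linalg.norm(S @ x) ** 2 / m        # JL-style squared-norm estimate
    print(name, "relative error:", abs(est - x @ x) / (x @ x))
```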
On the expressive power of self-attention matrices
Transformer networks are able to capture patterns in data coming from many domains (text,
images, videos, proteins, etc.) with little or no change to architecture components. We …
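The object under study is the n x n row-stochastic self-attention matrix softmax(QK^T / sqrt(d)). A minimal sketch of how that matrix is formed (variable names are illustrative):

```python
import numpy as np

def self_attention_matrix(X, Wq, Wk):
    """Row-stochastic n x n attention matrix softmax(Q K^T / sqrt(d))."""
    Q, K = X @ Wq, X @ Wk
    scores = Q @ K.T / np.sqrt(Q.shape[1])
    scores -= scores.max(axis=1, keepdims=True)   # numerical stability
    P = np.exp(scores)
    return P / P.sum(axis=1, keepdims=True)

rng = np.random.default_rng(0)
n, d = 8, 16
X = rng.standard_normal((n, d))
A = self_attention_matrix(X, rng.standard_normal((d, d)), rng.standard_normal((d, d)))
print(A.shape, A.sum(axis=1))   # each row sums to 1
```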
Sensitivity-informed provable pruning of neural networks
We introduce a family of pruning algorithms that sparsifies the parameters of a trained model
in a way that approximately preserves the model's predictive accuracy. Our algorithms use a …
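A simplified illustration of data-informed pruning: score each edge by its largest observed share of its neuron's pre-activation over a data sample, then keep only the top fraction. The score is a hypothetical stand-in for the paper's sensitivity definition, not the authors' algorithm.

```python
import numpy as np

def prune_by_sensitivity(W, A, keep_frac=0.1):
    """Zero all edges of W (out x in) except the most 'sensitive' ones.

    Here an edge's sensitivity is its maximum share, over the sample A
    (n x in), of the absolute contributions to its neuron (illustrative)."""
    contrib = np.abs(W[None, :, :] * A[:, None, :])          # n x out x in
    share = contrib / (contrib.sum(axis=2, keepdims=True) + 1e-12)
    s = share.max(axis=0)                                    # out x in
    k = int(keep_frac * W.size)
    thresh = np.partition(s.ravel(), -k)[-k]                 # k-th largest score
    return np.where(s >= thresh, W, 0.0)

W = np.random.randn(32, 64)
A = np.random.rand(200, 64)
W_pruned = prune_by_sensitivity(W, A, keep_frac=0.1)
print(np.count_nonzero(W_pruned), "of", W.size, "weights kept")
```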
The geometry of random features
We present an in-depth examination of the effectiveness of radial basis function kernel
(beyond Gaussian) estimators based on orthogonal random feature maps. We show that …
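For the Gaussian kernel specifically, the effect is easy to observe: build random Fourier features from iid frequencies and from orthogonal frequencies with matched row norms, then compare kernel-approximation error; the orthogonal variant is typically more accurate. A sketch under those assumptions (bandwidth 1, m <= d):

```python
import numpy as np

def rff(X, W):
    """Random Fourier feature map for the Gaussian kernel exp(-||x-y||^2 / 2)."""
    Z = X @ W.T
    return np.hstack([np.cos(Z), np.sin(Z)]) / np.sqrt(W.shape[0])

rng = np.random.default_rng(1)
d, m, n = 16, 16, 200
X = rng.standard_normal((n, d))
K = np.exp(-0.5 * np.sum((X[:, None] - X[None, :]) ** 2, axis=-1))  # exact kernel

W_iid = rng.standard_normal((m, d))
Q, _ = np.linalg.qr(rng.standard_normal((d, d)))
norms = np.linalg.norm(rng.standard_normal((m, d)), axis=1)
W_ort = Q[:m] * norms[:, None]      # orthogonal rows, Gaussian-like row norms

for name, W in [("iid", W_iid), ("orthogonal", W_ort)]:
    Phi = rff(X, W)
    print(name, "kernel MSE:", np.mean((Phi @ Phi.T - K) ** 2))
```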
Recycling randomness with structure for sublinear time kernel expansions
K Choromanski, V Sindhwani - International Conference on Machine Learning, 2016 - proceedings.mlr.press
We propose a scheme for recycling Gaussian random vectors into structured matrices to
approximate various kernel functions in sublinear time via random embeddings. Our framework …
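The recycling idea is easiest to see with a circulant matrix: one Gaussian vector g defines all d rows of C(g), and C(g)x costs O(d log d) via the FFT instead of O(d^2). A minimal sketch of that building block (not the paper's full framework):

```python
import numpy as np

def circulant_project(x, g):
    """Compute C(g) @ x in O(d log d) via the circular convolution theorem,
    where C(g) is the circulant matrix whose first column is g."""
    return np.real(np.fft.ifft(np.fft.fft(g) * np.fft.fft(x)))

rng = np.random.default_rng(0)
d = 8
g, x = rng.standard_normal(d), rng.standard_normal(d)

fast = circulant_project(x, g)
C = np.stack([np.roll(g, i) for i in range(d)], axis=1)  # explicit circulant
print(np.allclose(fast, C @ x))                          # True
```

One stored vector thus stands in for a full d x d projection, which is where the memory and time savings come from.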
Structured adaptive and random spinners for fast machine learning computations
M Bojarski, A Choromanska… - Artificial Intelligence and Statistics, 2017 - proceedings.mlr.press
We consider an efficient computational framework for speeding up several machine learning
algorithms with almost no loss of accuracy. The proposed framework relies on projections …
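Spinner-style projections compose blocks of the form HD, a Hadamard matrix times a random ±1 diagonal, each applicable in O(d log d) via the fast Walsh-Hadamard transform. A sketch of a three-block spinner HD3 HD2 HD1 (illustrative normalization, not the authors' exact construction):

```python
import numpy as np

def fwht(x):
    """Fast Walsh-Hadamard transform (unnormalized); len(x) a power of 2."""
    x = x.copy()
    h = 1
    while h < len(x):
        for i in range(0, len(x), 2 * h):
            a, b = x[i:i + h].copy(), x[i + h:i + 2 * h].copy()
            x[i:i + h], x[i + h:i + 2 * h] = a + b, a - b
        h *= 2
    return x

def spinner(x, diags):
    """Apply HD3 HD2 HD1 to x, where each D is a random sign diagonal."""
    for D in diags:
        x = fwht(D * x)
    return x / np.sqrt(len(x)) ** 3     # undo the three unnormalized H's

rng = np.random.default_rng(0)
d = 16
diags = [rng.choice([-1.0, 1.0], size=d) for _ in range(3)]
x = rng.standard_normal(d)
y = spinner(x, diags)
print(np.isclose(np.linalg.norm(x), np.linalg.norm(y)))  # norm is preserved
```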
FROSH: FasteR Online Sketching Hashing
Many hashing methods, especially those in the data-dependent category with good
learning accuracy, are still inefficient when dealing with three critical problems in modern …
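Online sketching hashing maintains a small sketch of the data stream and derives hash projections from it rather than from the full data. The sketch below illustrates that setting with the frequent-directions algorithm plus sign projections onto the sketch's top directions; it shows the flavor of the approach, not the FROSH algorithm itself.

```python
import numpy as np

def frequent_directions(stream, ell):
    """Streaming ell x d sketch B with B^T B approximating X^T X."""
    B = np.zeros((2 * ell, stream.shape[1]))
    nxt = 0
    for row in stream:
        if nxt == 2 * ell:                                  # sketch full: shrink
            _, s, Vt = np.linalg.svd(B, full_matrices=False)
            s = np.sqrt(np.maximum(s**2 - s[ell - 1] ** 2, 0.0))
            B[: len(s)] = s[:, None] * Vt
            B[len(s):] = 0.0
            nxt = ell - 1                                   # rows below are now zero
        B[nxt] = row
        nxt += 1
    return B[:ell]

rng = np.random.default_rng(0)
X = rng.standard_normal((500, 32))
B = frequent_directions(X, ell=8)
_, _, Vt = np.linalg.svd(B, full_matrices=False)
codes = (X @ Vt[:8].T > 0).astype(np.uint8)   # 8-bit binary code per point
print(codes.shape)
```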
Binary vectors for fast distance and similarity estimation
DA Rachkovskij - Cybernetics and Systems Analysis, 2017 - Springer
This review considers methods and algorithms for fast estimation of distance/similarity
measures between initial data from vector representations with binary or integer-valued …
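One classical member of this family is sign random projections (SimHash), where the normalized Hamming distance between two codes is an unbiased estimator of the angle between the original vectors divided by pi. A minimal sketch:

```python
import numpy as np

rng = np.random.default_rng(0)
d, m = 64, 4096                        # m bits per code
R = rng.standard_normal((m, d))        # shared random hyperplanes

def code(x):
    return R @ x > 0                   # m-bit binary representation

x = rng.standard_normal(d)
y = x + 0.5 * rng.standard_normal(d)

ham = np.count_nonzero(code(x) ^ code(y))
angle_est = np.pi * ham / m            # E[ham / m] = angle / pi
angle_true = np.arccos(x @ y / (np.linalg.norm(x) * np.linalg.norm(y)))
print(angle_est, angle_true)
```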
On binary embedding using circulant matrices
Binary embeddings provide efficient and powerful ways to perform operations on large-scale
data. However, binary embedding typically requires long codes in order to preserve the …
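A sketch of the circulant construction: apply random sign flips, project with C(g) using the FFT (only the single vector g is stored), and binarize; the Hamming distance between codes again estimates the angle. Parameters here are illustrative, not the paper's tuned setup.

```python
import numpy as np

def circulant_binary_embed(x, g, D):
    """sign(C(g) D x) via FFT: O(d) memory and O(d log d) time,
    versus O(d^2) for an unstructured projection matrix."""
    v = np.real(np.fft.ifft(np.fft.fft(g) * np.fft.fft(D * x)))
    return v > 0

rng = np.random.default_rng(0)
d = 256
g = rng.standard_normal(d)             # one Gaussian vector defines C(g)
D = rng.choice([-1.0, 1.0], size=d)    # random signs decorrelate the rows

x = rng.standard_normal(d)
y = x + 0.3 * rng.standard_normal(d)
ham = np.count_nonzero(circulant_binary_embed(x, g, D) ^ circulant_binary_embed(y, g, D))
angle_true = np.arccos(x @ y / (np.linalg.norm(x) * np.linalg.norm(y)))
print(np.pi * ham / d, angle_true)     # Hamming-based angle estimate vs. truth
```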