[图书][B] Foundations of data science
A Blum, J Hopcroft, R Kannan - 2020 - books.google.com
This book provides an introduction to the mathematical and algorithmic foundations of data
science, including machine learning, high-dimensional geometry, and analysis of large …
science, including machine learning, high-dimensional geometry, and analysis of large …
Settling the polynomial learnability of mixtures of gaussians
Given data drawn from a mixture of multivariate Gaussians, a basic problem is to accurately
estimate the mixture parameters. We give an algorithm for this problem that has running time …
estimate the mixture parameters. We give an algorithm for this problem that has running time …
Clustering with spectral norm and the k-means algorithm
A Kumar, R Kannan - 2010 IEEE 51st Annual Symposium on …, 2010 - ieeexplore.ieee.org
There has been much progress on efficient algorithms for clustering data points generated
by a mixture of k probability distributions under the assumption that the means of the …
by a mixture of k probability distributions under the assumption that the means of the …
Polynomial learning of distribution families
The question of polynomial learn ability of probability distributions, particularly Gaussian
mixture distributions, has recently received significant attention in theoretical computer …
mixture distributions, has recently received significant attention in theoretical computer …
A discriminative framework for clustering via similarity functions
Problems of clustering data from pairwise similarity information are ubiquitous in Computer
Science. Theoretical treatments typically view the similarity information as ground-truth and …
Science. Theoretical treatments typically view the similarity information as ground-truth and …
Isotropic PCA and affine-invariant clustering
SC Brubaker, SS Vempala - Building Bridges: Between Mathematics and …, 2008 - Springer
We present an extension of Principal Component Analysis (PCA) and a new algorithm for
clustering points in R n based on it. The key property of the algorithm is that it is affine …
clustering points in R n based on it. The key property of the algorithm is that it is affine …
Sketching for large-scale learning of mixture models
N Keriven, A Bourrier, R Gribonval… - … and Inference: A …, 2018 - academic.oup.com
Learning parameters from voluminous data can be prohibitive in terms of memory and
computational requirements. We propose a 'compressive learning'framework, where we …
computational requirements. We propose a 'compressive learning'framework, where we …
Spectral algorithms
R Kannan, S Vempala - Foundations and Trends® in …, 2009 - nowpublishers.com
Spectral methods refer to the use of eigenvalues, eigenvectors, singular values, and
singular vectors. They are widely used in Engineering, Applied Mathematics, and Statistics …
singular vectors. They are widely used in Engineering, Applied Mathematics, and Statistics …
[图书][B] Algorithmic aspects of machine learning
A Moitra - 2018 - books.google.com
This book bridges theoretical computer science and machine learning by exploring what the
two sides can teach each other. It emphasizes the need for flexible, tractable models that …
two sides can teach each other. It emphasizes the need for flexible, tractable models that …
Clustering with interactive feedback
In this paper, we initiate a theoretical study of the problem of clustering data under
interactive feedback. We introduce a query-based model in which users can provide …
interactive feedback. We introduce a query-based model in which users can provide …