[图书][B] Foundations of data science

A Blum, J Hopcroft, R Kannan - 2020 - books.google.com
This book provides an introduction to the mathematical and algorithmic foundations of data
science, including machine learning, high-dimensional geometry, and analysis of large …

Settling the polynomial learnability of mixtures of gaussians

A Moitra, G Valiant - 2010 IEEE 51st Annual Symposium on …, 2010 - ieeexplore.ieee.org
Given data drawn from a mixture of multivariate Gaussians, a basic problem is to accurately
estimate the mixture parameters. We give an algorithm for this problem that has running time …

Clustering with spectral norm and the k-means algorithm

A Kumar, R Kannan - 2010 IEEE 51st Annual Symposium on …, 2010 - ieeexplore.ieee.org
There has been much progress on efficient algorithms for clustering data points generated
by a mixture of k probability distributions under the assumption that the means of the …

Polynomial learning of distribution families

M Belkin, K Sinha - 2010 IEEE 51st Annual Symposium on …, 2010 - ieeexplore.ieee.org
The question of polynomial learn ability of probability distributions, particularly Gaussian
mixture distributions, has recently received significant attention in theoretical computer …

A discriminative framework for clustering via similarity functions

MF Balcan, A Blum, S Vempala - Proceedings of the fortieth annual ACM …, 2008 - dl.acm.org
Problems of clustering data from pairwise similarity information are ubiquitous in Computer
Science. Theoretical treatments typically view the similarity information as ground-truth and …

Isotropic PCA and affine-invariant clustering

SC Brubaker, SS Vempala - Building Bridges: Between Mathematics and …, 2008 - Springer
We present an extension of Principal Component Analysis (PCA) and a new algorithm for
clustering points in R n based on it. The key property of the algorithm is that it is affine …

Sketching for large-scale learning of mixture models

N Keriven, A Bourrier, R Gribonval… - … and Inference: A …, 2018 - academic.oup.com
Learning parameters from voluminous data can be prohibitive in terms of memory and
computational requirements. We propose a 'compressive learning'framework, where we …

Spectral algorithms

R Kannan, S Vempala - Foundations and Trends® in …, 2009 - nowpublishers.com
Spectral methods refer to the use of eigenvalues, eigenvectors, singular values, and
singular vectors. They are widely used in Engineering, Applied Mathematics, and Statistics …

[图书][B] Algorithmic aspects of machine learning

A Moitra - 2018 - books.google.com
This book bridges theoretical computer science and machine learning by exploring what the
two sides can teach each other. It emphasizes the need for flexible, tractable models that …

Clustering with interactive feedback

MF Balcan, A Blum - International Conference on Algorithmic Learning …, 2008 - Springer
In this paper, we initiate a theoretical study of the problem of clustering data under
interactive feedback. We introduce a query-based model in which users can provide …