Not too little, not too much: a theoretical analysis of graph (over) smoothing

N Keriven - Advances in Neural Information Processing …, 2022 - proceedings.neurips.cc
We analyze graph smoothing with mean aggregation, where each node successively
receives the average of the features of its neighbors. Indeed, it has quickly been observed …

[PDF][PDF] Tensor decompositions for learning latent variable models.

A Anandkumar, R Ge, DJ Hsu, SM Kakade… - J. Mach. Learn. Res …, 2014 - jmlr.org
This work considers a computationally and statistically efficient parameter estimation method
for a wide class of latent variable models—including Gaussian mixture models, hidden …

Learning from untrusted data

M Charikar, J Steinhardt, G Valiant - … of the 49th Annual ACM SIGACT …, 2017 - dl.acm.org
The vast majority of theoretical results in machine learning and statistics assume that the
training data is a reliable reflection of the phenomena to be learned. Similarly, most learning …

[图书][B] Foundations of data science

A Blum, J Hopcroft, R Kannan - 2020 - books.google.com
This book provides an introduction to the mathematical and algorithmic foundations of data
science, including machine learning, high-dimensional geometry, and analysis of large …

Low-rank approximation and regression in input sparsity time

KL Clarkson, DP Woodruff - Journal of the ACM (JACM), 2017 - dl.acm.org
We design a new distribution over m× n matrices S so that, for any fixed n× d matrix A of rank
r, with probability at least 9/10,∥ SAx∥ 2=(1±ε)∥ Ax∥ 2 simultaneously for all x∈ R d …

Statistical query lower bounds for robust estimation of high-dimensional gaussians and gaussian mixtures

I Diakonikolas, DM Kane… - 2017 IEEE 58th Annual …, 2017 - ieeexplore.ieee.org
We describe a general technique that yields the first Statistical Query lower bounds for a
range of fundamental high-dimensional learning problems involving Gaussian distributions …

[PDF][PDF] Multi-objective reinforcement learning using sets of pareto dominating policies

K Van Moffaert, A Nowé - The Journal of Machine Learning Research, 2014 - jmlr.org
Many real-world problems involve the optimization of multiple, possibly conflicting
objectives. Multi-objective reinforcement learning (MORL) is a generalization of standard …

Multi-view clustering via canonical correlation analysis

K Chaudhuri, SM Kakade, K Livescu… - Proceedings of the 26th …, 2009 - dl.acm.org
Clustering data in high dimensions is believed to be a hard problem in general. A number of
efficient clustering algorithms developed in recent years address this problem by projecting …

Improved approximation algorithms for large matrices via random projections

T Sarlos - 2006 47th annual IEEE symposium on foundations of …, 2006 - ieeexplore.ieee.org
Several results appeared that show significant reduction in time for matrix multiplication,
singular value decomposition as well as linear (lscr 2) regression, all based on data …

Mixture models, robustness, and sum of squares proofs

SB Hopkins, J Li - Proceedings of the 50th Annual ACM SIGACT …, 2018 - dl.acm.org
We use the Sum of Squares method to develop new efficient algorithms for learning well-
separated mixtures of Gaussians and robust mean estimation, both in high dimensions, that …