Connecting the Dots--Density-Connectivity Distance unifies DBSCAN, k-Center and Spectral Clustering

A Beer, A Draganov, E Hohma, P Jahn… - Proceedings of the 29th …, 2023 - dl.acm.org
Despite the popularity of density-based clustering, its procedural definition makes it difficult
to analyze compared to clustering methods that minimize a loss function. In this paper, we …

Hamilton-Jacobi equations on graphs with applications to semi-supervised learning and data depth

J Calder, M Ettehad - Journal of Machine Learning Research, 2022 - jmlr.org
Shortest path graph distances are widely used in data science and machine learning, since
they can approximate the underlying geodesic distance on the data manifold. However, the …

Geometric scattering on measure spaces

J Chew, M Hirn, S Krishnaswamy, D Needell… - arXiv preprint arXiv …, 2022 - arxiv.org
The scattering transform is a multilayered, wavelet-based transform initially introduced as a
model of convolutional neural networks (CNNs) that has played a foundational role in our …

Continuum Limits of Ollivier's Ricci Curvature on data clouds: pointwise consistency and global lower bounds

NG Trillos, M Weber - arXiv preprint arXiv:2307.02378, 2023 - arxiv.org
Let $\mathcal {M}\subseteq\mathbb {R}^ d $ denote a low-dimensional manifold and let
$\mathcal {X}=\{x_1,\dots, x_n\} $ be a collection of points uniformly sampled from $\mathcal …

Ratio convergence rates for Euclidean first-passage percolation: applications to the graph infinity Laplacian

L Bungert, J Calder, T Roith - The Annals of Applied Probability, 2024 - projecteuclid.org
Ratio convergence rates for Euclidean first-passage percolation: Applications to the graph
infinity Laplacian Page 1 The Annals of Applied Probability 2024, Vol. 34, No. 4, 3870–3910 …

Fermat Distances: Metric Approximation, Spectral Convergence, and Clustering Algorithms

NG Trillos, A Little, D McKenzie, JM Murphy - arXiv preprint arXiv …, 2023 - arxiv.org
We analyze the convergence properties of Fermat distances, a family of density-driven
metrics defined on Riemannian manifolds with an associated probability measure. Fermat …

Intrinsic persistent homology via density-based metric learning

X Fernández, E Borghini, G Mindlin… - Journal of Machine …, 2023 - jmlr.org
We address the problem of estimating topological features from data in high dimensional
Euclidean spaces under the manifold assumption. Our approach is based on the …

Learning distances from data with normalizing flows and score matching

P Sorrenson, D Behrend-Uriarte, C Schnörr… - arXiv preprint arXiv …, 2024 - arxiv.org
Density-based distances (DBDs) offer an elegant solution to the problem of metric learning.
By defining a Riemannian metric which increases with decreasing probability density …

A convergence rate for manifold neural networks

JA Chew, D Needell… - … Conference on Sampling …, 2023 - ieeexplore.ieee.org
High-dimensional data arises in numerous applications, and the rapidly developing field of
geometric deep learning seeks to develop neural network architectures to analyze such data …

Large sample spectral analysis of graph-based multi-manifold clustering

NG Trillos, P He, C Li - Journal of Machine Learning Research, 2023 - jmlr.org
In this work we study statistical properties of graph-based algorithms for multimanifold
clustering (MMC). In MMC the goal is to retrieve the multi-manifold structure underlying a …