Stochastic geometry to generalize the Mondrian process

E O'Reilly, NM Tran - SIAM Journal on Mathematics of Data Science, 2022 - SIAM
The stable under iteration (STIT) tessellation process is a stochastic process that produces a
recursive partition of space with cut directions drawn independently from a distribution over …

Stochastic geometry to generalize the Mondrian process

E O'Reilly, N Tran - arXiv preprint arXiv:2002.00797, 2020 - arxiv.org
The stable under iterated tessellation (STIT) process is a stochastic process that produces a
recursive partition of space with cut directions drawn independently from a distribution over …

The Mondrian process for machine learning

M Balog, YW Teh - arXiv preprint arXiv:1507.05181, 2015 - arxiv.org
This report is concerned with the Mondrian process and its applications in machine learning.
The Mondrian process is a guillotine-partition-valued stochastic process that possesses an …

Minimax optimal rates for mondrian trees and forests

J Mourtada, S Gaïffas, E Scornet - arXiv preprint arXiv:1803.05784, 2018 - arxiv.org
Introduced by Breiman, Random Forests are widely used classification and regression
algorithms. While being initially designed as batch algorithms, several variants have been …

Neural Implicit Manifold Learning for Topology-Aware Density Estimation

BL Ross, G Loaiza-Ganem, AL Caterini… - … on Machine Learning …, 2023 - openreview.net
Natural data observed in $\mathbb {R}^ n $ is often constrained to an $ m $-dimensional
manifold $\mathcal {M} $, where $ m< n $. This work focuses on the task of building …

Normal-bundle bootstrap

R Zhang, R Ghanem - SIAM Journal on Mathematics of Data Science, 2021 - SIAM
Probabilistic models of data sets often exhibit salient geometric structure. Such a
phenomenon is summed up in the manifold distribution hypothesis and can be exploited in …

Density estimation on smooth manifolds with normalizing flows

D Kalatzis, JZ Ye, A Pouplin, J Wohlert… - arXiv preprint arXiv …, 2021 - arxiv.org
We present a framework for learning probability distributions on topologically non-trivial
manifolds, utilizing normalizing flows. Current methods focus on manifolds that are …

Minimax rates for high-dimensional random tessellation forests

E O'Reilly, NM Tran - Journal of Machine Learning Research, 2024 - jmlr.org
Random forests are a popular class of algorithms used for regression and classification. The
algorithm introduced by Breiman in 2001 and many of its variants are ensembles of …

Random tessellation forests

S Ge, S Wang, YW Teh, L Wang… - Advances in Neural …, 2019 - proceedings.neurips.cc
Abstract Space partitioning methods such as random forests and the Mondrian process are
powerful machine learning methods for multi-dimensional and relational data, and are …

Uniform convergence rate of the kernel density estimator adaptive to intrinsic volume dimension

J Kim, J Shin, A Rinaldo… - … Conference on Machine …, 2019 - proceedings.mlr.press
We derive concentration inequalities for the supremum norm of the difference between a
kernel density estimator (KDE) and its point-wise expectation that hold uniformly over the …