A review of domain adaptation without target labels
Domain adaptation has become a prominent problem setting in machine learning and
related fields. This review asks the question: How can a classifier learn from a source …
related fields. This review asks the question: How can a classifier learn from a source …
A review on quantification learning
The task of quantification consists in providing an aggregate estimation (eg, the class
distribution in a classification problem) for unseen test sets, applying a model that is trained …
distribution in a classification problem) for unseen test sets, applying a model that is trained …
Confident learning: Estimating uncertainty in dataset labels
Learning exists in the context of data, yet notions of confidence typically focus on model
predictions, not label quality. Confident learning (CL) is an alternative approach which …
predictions, not label quality. Confident learning (CL) is an alternative approach which …
Isolating sources of disentanglement in variational autoencoders
We decompose the evidence lower bound to show the existence of a term measuring the
total correlation between latent variables. We use this to motivate the beta-TCVAE (Total …
total correlation between latent variables. We use this to motivate the beta-TCVAE (Total …
Compositional foundation models for hierarchical planning
To make effective decisions in novel environments with long-horizon goals, it is crucial to
engage in hierarchical reasoning across spatial and temporal scales. This entails planning …
engage in hierarchical reasoning across spatial and temporal scales. This entails planning …
Conformal prediction under covariate shift
RJ Tibshirani, R Foygel Barber… - Advances in neural …, 2019 - proceedings.neurips.cc
We extend conformal prediction methodology beyond the case of exchangeable data. In
particular, we show that a weighted version of conformal prediction can be used to compute …
particular, we show that a weighted version of conformal prediction can be used to compute …
Veegan: Reducing mode collapse in gans using implicit variational learning
Deep generative models provide powerful tools for distributions over complicated manifolds,
such as those of natural images. But many of these methods, including generative …
such as those of natural images. But many of these methods, including generative …
Dualdice: Behavior-agnostic estimation of discounted stationary distribution corrections
In many real-world reinforcement learning applications, access to the environment is limited
to a fixed dataset, instead of direct (online) interaction with the environment. When using this …
to a fixed dataset, instead of direct (online) interaction with the environment. When using this …
Breaking the curse of horizon: Infinite-horizon off-policy estimation
We consider the off-policy estimation problem of estimating the expected reward of a target
policy using samples collected by a different behavior policy. Importance sampling (IS) has …
policy using samples collected by a different behavior policy. Importance sampling (IS) has …
Learning in implicit generative models
S Mohamed, B Lakshminarayanan - arXiv preprint arXiv:1610.03483, 2016 - arxiv.org
Generative adversarial networks (GANs) provide an algorithmic framework for constructing
generative models with several appealing properties: they do not require a likelihood …
generative models with several appealing properties: they do not require a likelihood …