There and back again: Outlier detection between statistical reasoning and data mining algorithms

A Zimek, P Filzmoser - Wiley Interdisciplinary Reviews: Data …, 2018 - Wiley Online Library
Outlier detection has been a topic in statistics for centuries. Over mainly the last two
decades, there has been also an increasing interest in the database and data mining …

Density-ratio matching under the bregman divergence: a unified framework of density-ratio estimation

M Sugiyama, T Suzuki, T Kanamori - Annals of the Institute of Statistical …, 2012 - Springer
Estimation of the ratio of probability densities has attracted a great deal of attention since it
can be used for addressing various statistical paradigms. A naive approach to density-ratio …

Learning from positive and unlabeled data: A survey

J Bekker, J Davis - Machine Learning, 2020 - Springer
Learning from positive and unlabeled data or PU learning is the setting where a learner only
has access to positive examples and unlabeled data. The assumption is that the unlabeled …

Analysis of learning from positive and unlabeled data

MC Du Plessis, G Niu… - Advances in neural …, 2014 - proceedings.neurips.cc
Learning a classifier from positive and unlabeled data is an important class of classification
problems that are conceivable in many practical applications. In this paper, we first show that …

Domain adaptation via transfer component analysis

SJ Pan, IW Tsang, JT Kwok… - IEEE transactions on …, 2010 - ieeexplore.ieee.org
Domain adaptation allows knowledge from a source domain to be transferred to a different
but related target domain. Intuitively, discovering a good feature representation across …

Convex formulation for learning from positive and unlabeled data

M Du Plessis, G Niu… - … conference on machine …, 2015 - proceedings.mlr.press
We discuss binary classification from only from positive and unlabeled data (PU
classification), which is conceivable in various real-world machine learning problems. Since …

[图书][B] Machine learning in non-stationary environments: Introduction to covariate shift adaptation

M Sugiyama, M Kawanabe - 2012 - books.google.com
Theory, algorithms, and applications of machine learning techniques to overcome" covariate
shift" non-stationarity. As the power of computing has grown over the past few decades, the …

[PDF][PDF] A least-squares approach to direct importance estimation

T Kanamori, S Hido, M Sugiyama - The Journal of Machine Learning …, 2009 - jmlr.org
We address the problem of estimating the ratio of two probability density functions, which is
often referred to as the importance. The importance values can be used for various …

Statistical outlier detection using direct density ratio estimation

S Hido, Y Tsuboi, H Kashima, M Sugiyama… - … and information systems, 2011 - Springer
We propose a new statistical approach to the problem of inlier-based outlier detection, ie,
finding outliers in the test set based on the training set consisting only of inliers. Our key idea …

On the evaluation of outlier detection and one-class classification: a comparative study of algorithms, model selection, and ensembles

HO Marques, L Swersky, J Sander… - Data Mining and …, 2023 - Springer
It has been shown that unsupervised outlier detection methods can be adapted to the one-
class classification problem (Janssens and Postma, in: Proceedings of the 18th annual …