There and back again: Outlier detection between statistical reasoning and data mining algorithms
A Zimek, P Filzmoser - Wiley Interdisciplinary Reviews: Data …, 2018 - Wiley Online Library
Outlier detection has been a topic in statistics for centuries. Over mainly the last two
decades, there has been also an increasing interest in the database and data mining …
decades, there has been also an increasing interest in the database and data mining …
Density-ratio matching under the bregman divergence: a unified framework of density-ratio estimation
Estimation of the ratio of probability densities has attracted a great deal of attention since it
can be used for addressing various statistical paradigms. A naive approach to density-ratio …
can be used for addressing various statistical paradigms. A naive approach to density-ratio …
Learning from positive and unlabeled data: A survey
Learning from positive and unlabeled data or PU learning is the setting where a learner only
has access to positive examples and unlabeled data. The assumption is that the unlabeled …
has access to positive examples and unlabeled data. The assumption is that the unlabeled …
Analysis of learning from positive and unlabeled data
MC Du Plessis, G Niu… - Advances in neural …, 2014 - proceedings.neurips.cc
Learning a classifier from positive and unlabeled data is an important class of classification
problems that are conceivable in many practical applications. In this paper, we first show that …
problems that are conceivable in many practical applications. In this paper, we first show that …
Domain adaptation via transfer component analysis
Domain adaptation allows knowledge from a source domain to be transferred to a different
but related target domain. Intuitively, discovering a good feature representation across …
but related target domain. Intuitively, discovering a good feature representation across …
Convex formulation for learning from positive and unlabeled data
M Du Plessis, G Niu… - … conference on machine …, 2015 - proceedings.mlr.press
We discuss binary classification from only from positive and unlabeled data (PU
classification), which is conceivable in various real-world machine learning problems. Since …
classification), which is conceivable in various real-world machine learning problems. Since …
[图书][B] Machine learning in non-stationary environments: Introduction to covariate shift adaptation
M Sugiyama, M Kawanabe - 2012 - books.google.com
Theory, algorithms, and applications of machine learning techniques to overcome" covariate
shift" non-stationarity. As the power of computing has grown over the past few decades, the …
shift" non-stationarity. As the power of computing has grown over the past few decades, the …
[PDF][PDF] A least-squares approach to direct importance estimation
We address the problem of estimating the ratio of two probability density functions, which is
often referred to as the importance. The importance values can be used for various …
often referred to as the importance. The importance values can be used for various …
Statistical outlier detection using direct density ratio estimation
We propose a new statistical approach to the problem of inlier-based outlier detection, ie,
finding outliers in the test set based on the training set consisting only of inliers. Our key idea …
finding outliers in the test set based on the training set consisting only of inliers. Our key idea …
On the evaluation of outlier detection and one-class classification: a comparative study of algorithms, model selection, and ensembles
HO Marques, L Swersky, J Sander… - Data Mining and …, 2023 - Springer
It has been shown that unsupervised outlier detection methods can be adapted to the one-
class classification problem (Janssens and Postma, in: Proceedings of the 18th annual …
class classification problem (Janssens and Postma, in: Proceedings of the 18th annual …