Towards total recall in industrial anomaly detection

K Roth, L Pemula, J Zepeda… - Proceedings of the …, 2022 - openaccess.thecvf.com
Being able to spot defective parts is a critical component in large-scale industrial
manufacturing. A particular challenge that we address in this work is the cold-start problem …

Deep learning on a data diet: Finding important examples early in training

M Paul, S Ganguli… - Advances in neural …, 2021 - proceedings.neurips.cc
Recent success in deep learning has partially been driven by training increasingly
overparametrized networks on ever larger datasets. It is therefore natural to ask: how much …

Revisiting training strategies and generalization performance in deep metric learning

K Roth, T Milbich, S Sinha, P Gupta… - International …, 2020 - proceedings.mlr.press
Abstract Deep Metric Learning (DML) is arguably one of the most influential lines of research
for learning visual similarities with many proposed approaches every year. Although the field …

Submodularity in data subset selection and active learning

K Wei, R Iyer, J Bilmes - International conference on …, 2015 - proceedings.mlr.press
We study the problem of selecting a subset of big data to train a classifier while incurring
minimal performance loss. We show the connection of submodularity to the data likelihood …

A deep active learning system for species identification and counting in camera trap images

MS Norouzzadeh, D Morris, S Beery… - Methods in ecology …, 2021 - Wiley Online Library
A typical camera trap survey may produce millions of images that require slow, expensive
manual review. Consequently, critical conservation questions may be answered too slowly …

A hybrid machine learning model for intrusion detection in VANET

H Bangui, M Ge, B Buhnova - Computing, 2022 - Springer
Abstract While Vehicular Ad-hoc Network (VANET) is developed to enable effective vehicle
communication and traffic information exchange, VANET is also vulnerable to different …

Data streams: Algorithms and applications

S Muthukrishnan - Foundations and Trends® in Theoretical …, 2005 - nowpublishers.com
In the data stream scenario, input arrives very rapidly and there is limited memory to store
the input. Algorithms have to work with one or few passes over the data, space less than …

Coresets for scalable Bayesian logistic regression

J Huggins, T Campbell… - Advances in neural …, 2016 - proceedings.neurips.cc
The use of Bayesian methods in large-scale data settings is attractive because of the rich
hierarchical models, uncertainty quantification, and prior specification they provide …

A unified framework for approximating and clustering data

D Feldman, M Langberg - Proceedings of the forty-third annual ACM …, 2011 - dl.acm.org
Given a set F of n positive functions over a ground set X, we consider the problem of
computing x* that minimizes the expression∑ f∈ Ff (x), over x∈ X. A typical application is …

Submodularity in machine learning and artificial intelligence

J Bilmes - arXiv preprint arXiv:2202.00132, 2022 - arxiv.org
In this manuscript, we offer a gentle review of submodularity and supermodularity and their
properties. We offer a plethora of submodular definitions; a full description of a number of …