Passive-aggressive bounds in bandit feedback classification

Z Kang, M Nielsen, B Yang, MM Ghazi - Information Fusion, 2023 - Elsevier

Online machine learning is an effective way for observation-based learning when a static
dataset is not available. However, it can be challenging in real-world applications, especially …

被引用次数：11 相关文章所有 3 个版本

Online transfer learning with partial feedback

Z Kang, M Nielsen, B Yang, L Deng… - Expert Systems with …, 2023 - Elsevier

Online learning for multi-class classification is a well-studied topic in machine learning. The
standard multi-class classification online learning setting assumes continuous availability of …

被引用次数：5 相关文章所有 3 个版本

[PDF] arxiv.org

Online multiclass classification based on prediction margin for partial feedback

T Kaneko, I Sato, M Sugiyama - arXiv preprint arXiv:1902.01056, 2019 - arxiv.org

We consider the problem of online multiclass classification with partial feedback, where an
algorithm predicts a class for a new instance in each round and only receives its …

被引用次数：16 相关文章所有 2 个版本

[PDF] mlr.press

Exact passive-aggressive algorithms for multiclass classification using bandit feedbacks

M Arora, N Manwani - Asian Conference on Machine …, 2020 - proceedings.mlr.press

In many real-life classification problems, we may not get exact class labels for training
samples. One such example is bandit feedback in multiclass classification. In this setting, we …

被引用次数：7 相关文章所有 5 个版本

Online Learning With Incremental Feature Space and Bandit Feedback

S Gu, T Luo, M He, C Hou - IEEE Transactions on Knowledge …, 2023 - ieeexplore.ieee.org

Online learning is a fundamental paradigm for learning from continuous data stream.
Tradition online learning approaches usually assume that the feature space of data stream …

被引用次数：1 相关文章所有 3 个版本

[PDF] arxiv.org

Learning multiclass classifier under noisy bandit feedback

M Agarwal, N Manwani - Pacific-Asia Conference on Knowledge Discovery …, 2021 - Springer

This paper addresses the problem of multiclass classification with corrupted or noisy bandit
feedback. In this setting, the learner may not receive true feedback. Instead, it receives …

被引用次数：4 相关文章所有 6 个版本

ALBIF: Active Learning with BandIt Feedbacks

M Agarwal, N Manwani - Pacific-Asia Conference on Knowledge Discovery …, 2022 - Springer

Online active learning algorithms reduce human labeling costs by querying only a subset of
informative incoming instances from the data stream to update the classification model …

[PDF][PDF] Learning With Bandit Feedback

M AGARWAL - 2022 - web2py.iiit.ac.in

Have you ever wondered why your feed gets filled with movies or similar web series after
watching a movie on Netflix? This happens because Netflix uses a recommendation system …

[PDF][PDF] Learning multiclass classifier under uncertain/incomplete supervision

M Arora - 2021 - cdn.iiit.ac.in

Online learning is a very well worked learning paradigm that has both theoretical as well as
practical appeal. Online learning aims to make a sequence of accurate predictions given the …

Bandit feedback in Classification and Multi-objective Optimization

H Zhong - 2016 - theses.hal.science

Bandit problems constitute a sequential dynamic allocation problem. The pulling agent has
to explore its environment (ie the arms) to gather information on the one hand, and it has to …

被引用次数：1 相关文章所有 3 个版本