Partial feedback online transfer learning with multi-source domains

Z Kang, M Nielsen, B Yang, MM Ghazi - Information Fusion, 2023 - Elsevier
Online machine learning is an effective way for observation-based learning when a static
dataset is not available. However, it can be challenging in real-world applications, especially …

Online transfer learning with partial feedback

Z Kang, M Nielsen, B Yang, L Deng… - Expert Systems with …, 2023 - Elsevier
Online learning for multi-class classification is a well-studied topic in machine learning. The
standard multi-class classification online learning setting assumes continuous availability of …

Online multiclass classification based on prediction margin for partial feedback

T Kaneko, I Sato, M Sugiyama - arXiv preprint arXiv:1902.01056, 2019 - arxiv.org
We consider the problem of online multiclass classification with partial feedback, where an
algorithm predicts a class for a new instance in each round and only receives its …

Exact passive-aggressive algorithms for multiclass classification using bandit feedbacks

M Arora, N Manwani - Asian Conference on Machine …, 2020 - proceedings.mlr.press
In many real-life classification problems, we may not get exact class labels for training
samples. One such example is bandit feedback in multiclass classification. In this setting, we …

Online Learning With Incremental Feature Space and Bandit Feedback

S Gu, T Luo, M He, C Hou - IEEE Transactions on Knowledge …, 2023 - ieeexplore.ieee.org
Online learning is a fundamental paradigm for learning from continuous data stream.
Tradition online learning approaches usually assume that the feature space of data stream …

Learning multiclass classifier under noisy bandit feedback

M Agarwal, N Manwani - Pacific-Asia Conference on Knowledge Discovery …, 2021 - Springer
This paper addresses the problem of multiclass classification with corrupted or noisy bandit
feedback. In this setting, the learner may not receive true feedback. Instead, it receives …

ALBIF: Active Learning with BandIt Feedbacks

M Agarwal, N Manwani - Pacific-Asia Conference on Knowledge Discovery …, 2022 - Springer
Online active learning algorithms reduce human labeling costs by querying only a subset of
informative incoming instances from the data stream to update the classification model …

[PDF][PDF] Learning With Bandit Feedback

M AGARWAL - 2022 - web2py.iiit.ac.in
Have you ever wondered why your feed gets filled with movies or similar web series after
watching a movie on Netflix? This happens because Netflix uses a recommendation system …

[PDF][PDF] Learning multiclass classifier under uncertain/incomplete supervision

M Arora - 2021 - cdn.iiit.ac.in
Online learning is a very well worked learning paradigm that has both theoretical as well as
practical appeal. Online learning aims to make a sequence of accurate predictions given the …

Bandit feedback in Classification and Multi-objective Optimization

H Zhong - 2016 - theses.hal.science
Bandit problems constitute a sequential dynamic allocation problem. The pulling agent has
to explore its environment (ie the arms) to gather information on the one hand, and it has to …