Task Selection and Assignment for Multi-Modal Multi-Task Dialogue Act Classification with Non-Stationary Multi-Armed Bandits
Multi-task learning (MTL) aims to improve the performance of a primary task by jointly
learning with related auxiliary tasks. Traditional MTL methods select tasks randomly during …
learning with related auxiliary tasks. Traditional MTL methods select tasks randomly during …
Sliding-Window Thompson Sampling for Non-Stationary Settings
$\textit {Restless Bandits} $ describe sequential decision-making problems in which the
rewards evolve with time independently from the actions taken by the policy-maker. It has …
rewards evolve with time independently from the actions taken by the policy-maker. It has …