LLM-informed multi-armed bandit strategies for non-stationary environments

J De Curtò, I de Zarzà, G Roig, JC Cano, P Manzoni… - Electronics, 2023 - mdpi.com
In this paper, we introduce an innovative approach to handling the multi-armed bandit (MAB)
problem in non-stationary environments, harnessing the predictive power of large language …

Preference-learning emitters for mixed-initiative quality-diversity algorithms

R Gallotta, K Arulkumaran… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
In mixed-initiative co-creation tasks, wherein a human and a machine jointly create items, it
is important to provide multiple relevant suggestions to the designer. Quality-diversity …

Kadabra: adapting Kademlia for the decentralized web

Y Zhang, S Bojja Venkatakrishnan - International Conference on Financial …, 2023 - Springer
Blockchains have become the catalyst for a growing movement to create a more
decentralized Internet. A fundamental operation of applications in a decentralized Internet is …

Towards a causal decision-making framework for recommender systems

E Cavenaghi, A Zanga, F Stella, M Zanker - ACM Transactions on …, 2024 - dl.acm.org
Causality is gaining more and more attention in the machine learning community and
consequently also in recommender systems research. The limitations of learning offline from …

A systematic literature review of solutions for cold start problem

N Singh, SK Singh - … Journal of System Assurance Engineering and …, 2024 - Springer
Insufficient knowledge about a new bug or a new developer, in the context of
recommendations done in software bug repositories (SBR) mining, impacts the …

Assistance in Teleoperation of Redundant Robots through Predictive Joint Maneuvering

C Brooks, W Rees, D Szafir - ACM Transactions on Human-Robot …, 2024 - dl.acm.org
In teleoperation of redundant robotic manipulators, translating an operator's end effector
motion command to joint space can be a tool for maintaining feasible and precise robot …

Beyond ads: sequential decision-making algorithms in law and public policy

P Henderson, B Chugg, B Anderson… - Proceedings of the 2022 …, 2022 - dl.acm.org
We explore the promises and challenges of employing sequential decision-making
algorithms--such as bandits, reinforcement learning, and active learning--in law and public …

Multi‐armed bandits, Thomson sampling and unsupervised machine learning in phylogenetic graph search

WC Wheeler - Cladistics, 2024 - Wiley Online Library
A phylogenetic graph search relies on a large number of highly parameterized search
procedures (eg branch‐swapping, perturbation, simulated annealing, genetic algorithm) …

Metamorphic testing for recommender systems

S Iakusheva, A Khritankov - … Conference on Analysis of Images, Social …, 2023 - Springer
Recommender systems are commonly based on a multi-armed bandit model. This model
should be carefully tested because it affects the users, but it is technically complicated …

A fine-grain batching-based task allocation algorithm for spatial crowdsourcing

Y Jiao, Z Lin, L Yu, X Wu - ISPRS International Journal of Geo-Information, 2022 - mdpi.com
Task allocation is a critical issue of spatial crowdsourcing. Although the batching strategy
performs better than the real-time matching mode, it still has the following two drawbacks:(1) …