LLM-informed multi-armed bandit strategies for non-stationary environments
In this paper, we introduce an innovative approach to handling the multi-armed bandit (MAB)
problem in non-stationary environments, harnessing the predictive power of large language …
problem in non-stationary environments, harnessing the predictive power of large language …
Preference-learning emitters for mixed-initiative quality-diversity algorithms
R Gallotta, K Arulkumaran… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
In mixed-initiative co-creation tasks, wherein a human and a machine jointly create items, it
is important to provide multiple relevant suggestions to the designer. Quality-diversity …
is important to provide multiple relevant suggestions to the designer. Quality-diversity …
Kadabra: adapting Kademlia for the decentralized web
Y Zhang, S Bojja Venkatakrishnan - International Conference on Financial …, 2023 - Springer
Blockchains have become the catalyst for a growing movement to create a more
decentralized Internet. A fundamental operation of applications in a decentralized Internet is …
decentralized Internet. A fundamental operation of applications in a decentralized Internet is …
Towards a causal decision-making framework for recommender systems
Causality is gaining more and more attention in the machine learning community and
consequently also in recommender systems research. The limitations of learning offline from …
consequently also in recommender systems research. The limitations of learning offline from …
A systematic literature review of solutions for cold start problem
N Singh, SK Singh - … Journal of System Assurance Engineering and …, 2024 - Springer
Insufficient knowledge about a new bug or a new developer, in the context of
recommendations done in software bug repositories (SBR) mining, impacts the …
recommendations done in software bug repositories (SBR) mining, impacts the …
Assistance in Teleoperation of Redundant Robots through Predictive Joint Maneuvering
C Brooks, W Rees, D Szafir - ACM Transactions on Human-Robot …, 2024 - dl.acm.org
In teleoperation of redundant robotic manipulators, translating an operator's end effector
motion command to joint space can be a tool for maintaining feasible and precise robot …
motion command to joint space can be a tool for maintaining feasible and precise robot …
Beyond ads: sequential decision-making algorithms in law and public policy
We explore the promises and challenges of employing sequential decision-making
algorithms--such as bandits, reinforcement learning, and active learning--in law and public …
algorithms--such as bandits, reinforcement learning, and active learning--in law and public …
Multi‐armed bandits, Thomson sampling and unsupervised machine learning in phylogenetic graph search
WC Wheeler - Cladistics, 2024 - Wiley Online Library
A phylogenetic graph search relies on a large number of highly parameterized search
procedures (eg branch‐swapping, perturbation, simulated annealing, genetic algorithm) …
procedures (eg branch‐swapping, perturbation, simulated annealing, genetic algorithm) …
Metamorphic testing for recommender systems
S Iakusheva, A Khritankov - … Conference on Analysis of Images, Social …, 2023 - Springer
Recommender systems are commonly based on a multi-armed bandit model. This model
should be carefully tested because it affects the users, but it is technically complicated …
should be carefully tested because it affects the users, but it is technically complicated …
A fine-grain batching-based task allocation algorithm for spatial crowdsourcing
Y Jiao, Z Lin, L Yu, X Wu - ISPRS International Journal of Geo-Information, 2022 - mdpi.com
Task allocation is a critical issue of spatial crowdsourcing. Although the batching strategy
performs better than the real-time matching mode, it still has the following two drawbacks:(1) …
performs better than the real-time matching mode, it still has the following two drawbacks:(1) …