A survey of reinforcement learning from human feedback

T Kaufmann, P Weng, V Bengs… - arXiv preprint arXiv …, 2023 - arxiv.org
Reinforcement learning from human feedback (RLHF) is a variant of reinforcement learning
(RL) that learns from human feedback instead of relying on an engineered reward function …

Methods and Mechanisms for Interactive Novelty Handling in Adversarial Environments

T Thai, M Shen, M Garg, A Kalani, N Vaidya… - arXiv preprint arXiv …, 2023 - arxiv.org
Learning to detect, characterize and accommodate novelties is a challenge that agents
operating in open-world domains need to address to be able to guarantee satisfactory task …

Augmenting Content Retrieval Through Machine Learning

PS Pavan, T Sripriya, B Vikas, Y Parmar… - … on Smart Generation …, 2023 - ieeexplore.ieee.org
Machine learning is a subset of artificial intelligence that focuses on making
predictions by analyzing and learning patterns from data. In recent years, machine …