Exploiting Unlabeled Data for Feedback Efficient Human Preference based Reinforcement Learning

M Verma, S Bhambri, S Kambhampati - arXiv preprint arXiv:2302.08738, 2023 - arxiv.org
Preference Based Reinforcement Learning has shown much promise for utilizing human
binary feedback on queried trajectory pairs to recover the underlying reward model of the …

Data Driven Reward Initialization for Preference based Reinforcement Learning

M Verma, S Kambhampati - arXiv preprint arXiv:2302.08733, 2023 - arxiv.org
Preference-based Reinforcement Learning (PbRL) methods utilize binary feedback from the
human in the loop (HiL) over queried trajectory pairs to learn a reward model in an attempt …

A State Augmentation based approach to Reinforcement Learning from Human Preferences

M Verma, S Kambhampati - arXiv preprint arXiv:2302.08734, 2023 - arxiv.org
Reinforcement Learning has suffered from poor reward specification, and issues for reward
hacking even in simple enough domains. Preference Based Reinforcement Learning …

[HTML][HTML] A multi-step electricity prediction model for residential buildings based on ensemble Empirical Mode Decomposition technique

S Kaur, A Bala, A Parashar - Science and Technology for Energy …, 2024 - stet-review.org
Residential electricity demand is increasing rapidly, constituting about a quarter of total
energy consumption. Electricity demand prediction is one of the sustainable solutions to …

Construction Scheme of Smart City Based on Internet of Things

Y Guo, Y Yang, Y Tao - 2021 6th International Conference on …, 2021 - ieeexplore.ieee.org
In order to overcome a series of problems existing in the process of smart city construction,
such as user information privacy leakage, data processing difficulties, data storage …

Diseño y desarrollo de un laboratorio de pruebas basados en Smart Home aplicando protocolo de comunicación Z-Wave y estándar 802.11.

CM Serrano, JO Mata, VR Sánchez - Ecuadorian Science …, 2021 - journals.gdeon.org
El desarrollo tecnológico actual, con equipos capaces de enviar y recibir datos, junto con
protocolos de comunicación avanzados, hace posible la implementación de casas …

[PDF][PDF] Hybrid Intelligence. The strength and challenges of getting the human in the AI Loop, a literature study

MB CHIVAPONG - 2020 - documentserver.uhasselt.be
Hybrid Intelligence is the exploitation of Human Intelligence and Artificial Intelligence
combination. The emergence of Hybrid Intelligence allows both entities to overcome their …