A survey on optimization techniques for edge artificial intelligence (ai)

C Surianarayanan, JJ Lawrence, PR Chelliah… - Sensors, 2023 - mdpi.com
Artificial Intelligence (Al) models are being produced and used to solve a variety of current
and future business and technical problems. Therefore, AI model engineering processes …

Where reinforcement learning meets process control: Review and guidelines

RR Faria, BDO Capron, AR Secchi, MB de Souza Jr - Processes, 2022 - mdpi.com
This paper presents a literature review of reinforcement learning (RL) and its applications to
process control and optimization. These applications were evaluated from a new …

V-mpo: On-policy maximum a posteriori policy optimization for discrete and continuous control

HF Song, A Abdolmaleki, JT Springenberg… - arXiv preprint arXiv …, 2019 - arxiv.org
Some of the most successful applications of deep reinforcement learning to challenging
domains in discrete and continuous control have used policy gradient methods in the on …

QDax: A library for quality-diversity and population-based algorithms with hardware acceleration

F Chalumeau, B Lim, R Boige, M Allard… - Journal of Machine …, 2024 - jmlr.org
QDax is an open-source library with a streamlined and modular API for Quality-Diversity
(QD) optimisation algorithms in Jax. The library serves as a versatile tool for optimisation …

[PDF][PDF] Is multiagent deep reinforcement learning the answer or the question? A brief survey

P Hernandez-Leal, B Kartal, ME Taylor - learning, 2018 - researchgate.net
Deep reinforcement learning (RL) has achieved outstanding results in recent years. This has
led to a dramatic increase in the number of applications and methods. Recent works have …

Iteratively learn diverse strategies with state distance information

W Fu, W Du, J Li, S Chen… - Advances in Neural …, 2024 - proceedings.neurips.cc
In complex reinforcement learning (RL) problems, policies with similar rewards may have
substantially different behaviors. It remains a fundamental challenge to optimize rewards …

[HTML][HTML] Automatic data augmentation to improve generalization of deep learning in H&E stained histopathology

K Faryna, J van der Laak, G Litjens - Computers in Biology and Medicine, 2024 - Elsevier
In histopathology practice, scanners, tissue processing, staining, and image acquisition
protocols vary from center to center, resulting in subtle variations in images. Vanilla …

Path-specific objectives for safer agent incentives

S Farquhar, R Carey, T Everitt - … of the AAAI Conference on Artificial …, 2022 - ojs.aaai.org
We present a general framework for training safe agents whose naive incentives are unsafe.
As an example, manipulative or deceptive behaviour can improve rewards but should be …

Improving the robustness and quality of biomedical cnn models through adaptive hyperparameter tuning

S Iqbal, AN Qureshi, A Ullah, J Li, T Mahmood - Applied Sciences, 2022 - mdpi.com
Deep learning is an obvious method for the detection of disease, analyzing medical images
and many researchers have looked into it. However, the performance of deep learning …

Automl with bayesian optimizations for big data management

A Karras, C Karras, N Schizas, M Avlonitis, S Sioutas - Information, 2023 - mdpi.com
The field of automated machine learning (AutoML) has gained significant attention in recent
years due to its ability to automate the process of building and optimizing machine learning …