A survey on optimization techniques for edge artificial intelligence (ai)
C Surianarayanan, JJ Lawrence, PR Chelliah… - Sensors, 2023 - mdpi.com
Artificial Intelligence (Al) models are being produced and used to solve a variety of current
and future business and technical problems. Therefore, AI model engineering processes …
and future business and technical problems. Therefore, AI model engineering processes …
Where reinforcement learning meets process control: Review and guidelines
RR Faria, BDO Capron, AR Secchi, MB de Souza Jr - Processes, 2022 - mdpi.com
This paper presents a literature review of reinforcement learning (RL) and its applications to
process control and optimization. These applications were evaluated from a new …
process control and optimization. These applications were evaluated from a new …
V-mpo: On-policy maximum a posteriori policy optimization for discrete and continuous control
HF Song, A Abdolmaleki, JT Springenberg… - arXiv preprint arXiv …, 2019 - arxiv.org
Some of the most successful applications of deep reinforcement learning to challenging
domains in discrete and continuous control have used policy gradient methods in the on …
domains in discrete and continuous control have used policy gradient methods in the on …
QDax: A library for quality-diversity and population-based algorithms with hardware acceleration
QDax is an open-source library with a streamlined and modular API for Quality-Diversity
(QD) optimisation algorithms in Jax. The library serves as a versatile tool for optimisation …
(QD) optimisation algorithms in Jax. The library serves as a versatile tool for optimisation …
[PDF][PDF] Is multiagent deep reinforcement learning the answer or the question? A brief survey
Deep reinforcement learning (RL) has achieved outstanding results in recent years. This has
led to a dramatic increase in the number of applications and methods. Recent works have …
led to a dramatic increase in the number of applications and methods. Recent works have …
Iteratively learn diverse strategies with state distance information
In complex reinforcement learning (RL) problems, policies with similar rewards may have
substantially different behaviors. It remains a fundamental challenge to optimize rewards …
substantially different behaviors. It remains a fundamental challenge to optimize rewards …
[HTML][HTML] Automatic data augmentation to improve generalization of deep learning in H&E stained histopathology
In histopathology practice, scanners, tissue processing, staining, and image acquisition
protocols vary from center to center, resulting in subtle variations in images. Vanilla …
protocols vary from center to center, resulting in subtle variations in images. Vanilla …
Path-specific objectives for safer agent incentives
We present a general framework for training safe agents whose naive incentives are unsafe.
As an example, manipulative or deceptive behaviour can improve rewards but should be …
As an example, manipulative or deceptive behaviour can improve rewards but should be …
Improving the robustness and quality of biomedical cnn models through adaptive hyperparameter tuning
Deep learning is an obvious method for the detection of disease, analyzing medical images
and many researchers have looked into it. However, the performance of deep learning …
and many researchers have looked into it. However, the performance of deep learning …
Automl with bayesian optimizations for big data management
The field of automated machine learning (AutoML) has gained significant attention in recent
years due to its ability to automate the process of building and optimizing machine learning …
years due to its ability to automate the process of building and optimizing machine learning …