- 学术资源搜索

A survey on optimization techniques for edge artificial intelligence (ai)

C Surianarayanan, JJ Lawrence, PR Chelliah… - Sensors, 2023 - mdpi.com

Artificial Intelligence (Al) models are being produced and used to solve a variety of current
and future business and technical problems. Therefore, AI model engineering processes …

被引用次数：24 相关文章所有 12 个版本

[PDF] mdpi.com

Where reinforcement learning meets process control: Review and guidelines

RR Faria, BDO Capron, AR Secchi, MB de Souza Jr - Processes, 2022 - mdpi.com

This paper presents a literature review of reinforcement learning (RL) and its applications to
process control and optimization. These applications were evaluated from a new …

被引用次数：25 相关文章所有 6 个版本

[PDF] arxiv.org

V-mpo: On-policy maximum a posteriori policy optimization for discrete and continuous control

HF Song, A Abdolmaleki, JT Springenberg… - arXiv preprint arXiv …, 2019 - arxiv.org

Some of the most successful applications of deep reinforcement learning to challenging
domains in discrete and continuous control have used policy gradient methods in the on …

被引用次数：114 相关文章所有 3 个版本

[PDF] jmlr.org

QDax: A library for quality-diversity and population-based algorithms with hardware acceleration

F Chalumeau, B Lim, R Boige, M Allard… - Journal of Machine …, 2024 - jmlr.org

QDax is an open-source library with a streamlined and modular API for Quality-Diversity
(QD) optimisation algorithms in Jax. The library serves as a versatile tool for optimisation …

被引用次数：11 相关文章所有 3 个版本

[PDF] researchgate.net

[PDF][PDF] Is multiagent deep reinforcement learning the answer or the question? A brief survey

P Hernandez-Leal, B Kartal, ME Taylor - learning, 2018 - researchgate.net

Deep reinforcement learning (RL) has achieved outstanding results in recent years. This has
led to a dramatic increase in the number of applications and methods. Recent works have …

被引用次数：97 相关文章所有 2 个版本

[PDF] neurips.cc

Iteratively learn diverse strategies with state distance information

W Fu, W Du, J Li, S Chen… - Advances in Neural …, 2024 - proceedings.neurips.cc

In complex reinforcement learning (RL) problems, policies with similar rewards may have
substantially different behaviors. It remains a fundamental challenge to optimize rewards …

被引用次数：2 相关文章所有 9 个版本

[HTML] sciencedirect.com

[HTML][HTML] Automatic data augmentation to improve generalization of deep learning in H&E stained histopathology

K Faryna, J van der Laak, G Litjens - Computers in Biology and Medicine, 2024 - Elsevier

In histopathology practice, scanners, tissue processing, staining, and image acquisition
protocols vary from center to center, resulting in subtle variations in images. Vanilla …

被引用次数：5 相关文章所有 6 个版本

[PDF] aaai.org

Path-specific objectives for safer agent incentives

S Farquhar, R Carey, T Everitt - … of the AAAI Conference on Artificial …, 2022 - ojs.aaai.org

We present a general framework for training safe agents whose naive incentives are unsafe.
As an example, manipulative or deceptive behaviour can improve rewards but should be …

被引用次数：23 相关文章所有 11 个版本

[PDF] mdpi.com

Improving the robustness and quality of biomedical cnn models through adaptive hyperparameter tuning

S Iqbal, AN Qureshi, A Ullah, J Li, T Mahmood - Applied Sciences, 2022 - mdpi.com

Deep learning is an obvious method for the detection of disease, analyzing medical images
and many researchers have looked into it. However, the performance of deep learning …

被引用次数：14 相关文章所有 6 个版本

[PDF] mdpi.com

Automl with bayesian optimizations for big data management

A Karras, C Karras, N Schizas, M Avlonitis, S Sioutas - Information, 2023 - mdpi.com

The field of automated machine learning (AutoML) has gained significant attention in recent
years due to its ability to automate the process of building and optimizing machine learning …

被引用次数：11 相关文章所有 4 个版本