[HTML][HTML] A survey of safety and trustworthiness of large language models through the lens of verification and validation

X Huang, W Ruan, W Huang, G Jin, Y Dong… - Artificial Intelligence …, 2024 - Springer
Large language models (LLMs) have exploded a new heatwave of AI for their ability to
engage end-users in human-level conversations with detailed and articulate answers across …

Certified policy smoothing for cooperative multi-agent reinforcement learning

R Mu, W Ruan, LS Marcolino, G Jin, Q Ni - Proceedings of the AAAI …, 2023 - ojs.aaai.org
Cooperative multi-agent reinforcement learning (c-MARL) is widely applied in safety-critical
scenarios, thus the analysis of robustness for c-MARL models is profoundly important …

Reward Certification for Policy Smoothed Reinforcement Learning

R Mu, LS Marcolino, Y Zhang, T Zhang… - Proceedings of the …, 2024 - ojs.aaai.org
Reinforcement Learning (RL) has achieved remarkable success in safety-critical areas, but it
can be weakened by adversarial attacks. Recent studies have introduced``smoothed …

[HTML][HTML] Bridging formal methods and machine learning with model checking and global optimisation

S Bensalem, X Huang, W Ruan, Q Tang, C Wu… - Journal of Logical and …, 2024 - Elsevier
Formal methods and machine learning are two research fields with drastically different
foundations and philosophies. Formal methods utilise mathematically rigorous techniques …

Enhancing robustness in video recognition models: Sparse adversarial attacks and beyond

R Mu, L Marcolino, Q Ni, W Ruan - Neural Networks, 2024 - Elsevier
Recent years have witnessed increasing interest in adversarial attacks on images, while
adversarial video attacks have seldom been explored. In this paper, we propose a sparse …

Model-agnostic reachability analysis on deep neural networks

C Zhang, W Ruan, F Wang, P Xu, G Min… - Pacific-Asia Conference …, 2023 - Springer
Verification plays an essential role in the formal analysis of safety-critical systems. Most
current verification methods have specific requirements when working on Deep Neural …

DIRECT Optimisation with Bayesian Insights: Assessing Reliability Under Fixed Computational Budgets

F Wang, Z Fu, X Huang, W Ruan - OPT 2023: Optimization for Machine … - openreview.net
We introduce a method for probabilistically evaluating the reliability of Lipschitzian global
optimisation under a constrained computational budget, a context frequently encountered in …