Machine learning applications in the resilience of interdependent critical infrastructure systems—A systematic literature review
BA Alkhaleel - International Journal of Critical Infrastructure Protection, 2023 - Elsevier
The resilience of interdependent critical infrastructure systems (ICISs) is critical for the
functioning of society and the economy. ICISs such as power grids and telecommunication …
functioning of society and the economy. ICISs such as power grids and telecommunication …
[HTML][HTML] GOPS: A general optimal control problem solver for autonomous driving and industrial control applications
Solving optimal control problems serves as the basic demand of industrial control tasks.
Existing methods like model predictive control often suffer from heavy online computational …
Existing methods like model predictive control often suffer from heavy online computational …
Integrated decision and control: Toward interpretable and computationally efficient driving intelligence
Decision and control are core functionalities of high-level automated vehicles. Current
mainstream methods, such as functional decomposition and end-to-end reinforcement …
mainstream methods, such as functional decomposition and end-to-end reinforcement …
Aligning language models with human preferences via a bayesian approach
In the quest to advance human-centric natural language generation (NLG) systems,
ensuring alignment between NLG models and human preferences is crucial. For this …
ensuring alignment between NLG models and human preferences is crucial. For this …
Policy iteration based approximate dynamic programming toward autonomous driving in constrained dynamic environment
In the area of autonomous driving, it typically brings great difficulty in solving the motion
planning problem since the vehicle model is nonlinear and the driving scenarios are …
planning problem since the vehicle model is nonlinear and the driving scenarios are …
Stochastic second-order methods improve best-known sample complexity of SGD for gradient-dominated functions
We study the performance of Stochastic Cubic Regularized Newton (SCRN) on a class of
functions satisfying gradient dominance property with $1\le\alpha\le2 $ which holds in a …
functions satisfying gradient dominance property with $1\le\alpha\le2 $ which holds in a …
Learn Zero-Constraint-Violation Safe Policy in Model-Free Constrained Reinforcement Learning
We focus on learning the zero-constraint-violation safe policy in model-free reinforcement
learning (RL). Existing model-free RL studies mostly use the posterior penalty to penalize …
learning (RL). Existing model-free RL studies mostly use the posterior penalty to penalize …
Beyond exact gradients: Convergence of stochastic soft-max policy gradient methods with entropy regularization
Entropy regularization is an efficient technique for encouraging exploration and preventing a
premature convergence of (vanilla) policy gradient methods in reinforcement learning (RL) …
premature convergence of (vanilla) policy gradient methods in reinforcement learning (RL) …
Improve generalization of driving policy at signalized intersections with adversarial learning
Intersections are quite challenging among various driving scenes wherein the interaction of
signal lights and distinct traffic actors poses great difficulty to learn a wise and robust driving …
signal lights and distinct traffic actors poses great difficulty to learn a wise and robust driving …