Gaussian Stochastic Weight Averaging for Bayesian Low-Rank Adaptation of Large Language Models

E Onal, K Flöge, E Caldwell, A Sheverdin… - arXiv preprint arXiv …, 2024 - arxiv.org
Fine-tuned Large Language Models (LLMs) often suffer from overconfidence and poor
calibration, particularly when fine-tuned on small datasets. To address these challenges, we …

Predictive Coding Networks and Inference Learning: Tutorial and Survey

B van Zwol, R Jefferson, EL Broek - arXiv preprint arXiv:2407.04117, 2024 - arxiv.org
Recent years have witnessed a growing call for renewed emphasis on neuroscience-
inspired approaches in artificial intelligence research, under the banner of $\textit {NeuroAI} …

Sparse bayesian neural networks: Bridging model and parameter uncertainty through scalable variational inference

A Hubin, G Storvik - Mathematics, 2024 - mdpi.com
Bayesian neural networks (BNNs) have recently regained a significant amount of attention in
the deep learning community due to the development of scalable approximate Bayesian …

The Empirical Impact of Neural Parameter Symmetries, or Lack Thereof

D Lim, M Putterman, R Walters, H Maron… - arXiv preprint arXiv …, 2024 - arxiv.org
Many algorithms and observed phenomena in deep learning appear to be affected by
parameter symmetries--transformations of neural network parameters that do not change the …

Gradient-free variational learning with conditional mixture networks

C Heins, H Wu, D Markovic, A Tschantz, J Beck… - arXiv preprint arXiv …, 2024 - arxiv.org
Balancing computational efficiency with robust predictive performance is crucial in
supervised learning, especially for critical applications. Standard deep learning models …

Linearization Turns Neural Operators into Function-Valued Gaussian Processes

E Magnani, M Pförtner, T Weber, P Hennig - arXiv preprint arXiv …, 2024 - arxiv.org
Modeling dynamical systems, eg in climate and engineering sciences, often necessitates
solving partial differential equations. Neural operators are deep neural networks designed to …

Thermodynamic Bayesian Inference

M Aifer, S Duffield, K Donatella, D Melanson… - arXiv preprint arXiv …, 2024 - arxiv.org
A fully Bayesian treatment of complicated predictive models (such as deep neural networks)
would enable rigorous uncertainty quantification and the automation of higher-level tasks …

Reparameterization invariance in approximate Bayesian inference

H Roy, M Miani, CH Ek, P Hennig, M Pförtner… - arXiv preprint arXiv …, 2024 - arxiv.org
Current approximate posteriors in Bayesian neural networks (BNNs) exhibit a crucial
limitation: they fail to maintain invariance under reparameterization, ie BNNs assign different …

Low-Budget Simulation-Based Inference with Bayesian Neural Networks

A Delaunoy, MB Bonardeaux, S Mishra-Sharma… - arXiv preprint arXiv …, 2024 - arxiv.org
Simulation-based inference methods have been shown to be inaccurate in the data-poor
regime, when training simulations are limited or expensive. Under these circumstances, the …

Bayesian sparsification for deep neural networks with Bayesian model reduction

D Marković, KJ Friston, SJ Kiebel - IEEE Access, 2024 - ieeexplore.ieee.org
Deep learning's immense capabilities are often constrained by the complexity of its models,
leading to an increasing demand for effective sparsification techniques. Bayesian …