Foundational challenges in assuring alignment and safety of large language models

U Anwar, A Saparov, J Rando, D Paleka… - arXiv preprint arXiv …, 2024 - arxiv.org
This work identifies 18 foundational challenges in assuring the alignment and safety of large
language models (LLMs). These challenges are organized into three different categories …

[PDF][PDF] Managing ai risks in an era of rapid progress

Y Bengio, G Hinton, A Yao, D Song… - arXiv preprint arXiv …, 2023 - blog.biocomm.ai
In this short consensus paper, we outline risks from upcoming, advanced AI systems. We
examine large-scale social harms and malicious uses, as well as an irreversible loss of …

Managing extreme AI risks amid rapid progress

Y Bengio, G Hinton, A Yao, D Song, P Abbeel, T Darrell… - Science, 2024 - science.org
Artificial intelligence (AI) is progressing rapidly, and companies are shifting their focus to
developing generalist AI systems that can autonomously act and pursue goals. Increases in …

Explosive growth from AI automation: A review of the arguments

E Erdil, T Besiroglu - arXiv preprint arXiv:2309.11690, 2023 - arxiv.org
We examine whether substantial AI automation could accelerate global economic growth by
about an order of magnitude, akin to the economic growth effects of the Industrial …

Algorithmic progress in language models

A Ho, T Besiroglu, E Erdil, D Owen, R Rahman… - arXiv preprint arXiv …, 2024 - arxiv.org
We investigate the rate at which algorithms for pre-training language models have improved
since the advent of deep learning. Using a dataset of over 200 language model evaluations …

Computing Power and the Governance of Artificial Intelligence

G Sastry, L Heim, H Belfield, M Anderljung… - arXiv preprint arXiv …, 2024 - arxiv.org
Computing power, or" compute," is crucial for the development and deployment of artificial
intelligence (AI) capabilities. As a result, governments and companies have started to …

Training Compute Thresholds: Features and Functions in AI Regulation

L Heim, L Koessler - arXiv preprint arXiv:2405.10799, 2024 - arxiv.org
Regulators in the US and EU are using thresholds based on training compute--the number
of computational operations used in training--to identify general-purpose artificial …

Beyond AI Exposure: Which Tasks are Cost-Effective to Automate with Computer Vision?

M Svanberg, W Li, M Fleming, B Goehring… - Available at SSRN …, 2024 - papers.ssrn.com
The faster AI automation spreads through the economy, the more profound its potential
impacts, both positive (improved productivity) and negative (worker displacement). The …

International governance of advancing artificial intelligence

N Emery-Xu, R Jordan, R Trager - AI & SOCIETY, 2024 - Springer
New technologies with military applications may demand new modes of governance. In this
article, we develop a taxonomy of technology governance forms, outline their strengths, and …

A Causal Framework for AI Regulation and Auditing

L Sharkey, CN Ghuidhir, D Braun, J Scheurer… - 2024 - preprints.org
Artificial intelligence (AI) systems are poised to become deeply integrated into society. If
developed responsibly, AI has potential to benefit humanity immensely. However, it also …