Holistic Evaluation of Language Models

P Liang, R Bommasani, T Lee, D Tsipras… - arXiv preprint arXiv …, 2022 - arxiv.org
Language models (LMs) are becoming the foundation for almost all major language
technologies, but their capabilities, limitations, and risks are not well understood. We present …

Holistic Evaluation of Language Models

P Liang, R Bommasani, T Lee, D Tsipras… - … on Machine Learning … - openreview.net
Language models (LMs) are becoming the foundation for almost all major language
technologies, but their capabilities, limitations, and risks are not well understood. We present …

[PDF] Holistic Evaluation of Language Models

P Liang, R Bommasani, T Lee, D Tsipras… - arXiv preprint arXiv …, 2022 - friedeggs.github.io
(1) Broad coverage and recognition of incompleteness. Given language models' vast
surface of capabilities and risks, we need to evaluate language models over a broad range …

Holistic Evaluation of Language Models

P Liang, R Bommasani, T Lee, D Tsipras… - arXiv e …, 2022 - ui.adsabs.harvard.edu
Language models (LMs) are becoming the foundation for almost all major language
technologies, but their capabilities, limitations, and risks are not well understood. We present …

[PDF] Holistic Evaluation of Language Models

P Liang, R Bommasani, T Lee… - arXiv preprint arXiv …, 2022 - thetalkingmachines.com
(1) Broad coverage and recognition of incompleteness. Given language models' vast
surface of capabilities and risks, we need to evaluate language models over a broad range …