Software testing with large language models: Survey, landscape, and vision

J Wang, Y Huang, C Chen, Z Liu… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Pre-trained large language models (LLMs) have recently emerged as a breakthrough
technology in natural language processing and artificial intelligence, with the ability to …

Large language models and political science

M Linegar, R Kocielnik, RM Alvarez - Frontiers in Political Science, 2023 - frontiersin.org
Large Language Models (LLMs) are a type of artificial intelligence that uses information from
very large datasets to model the use of language and generate content. While LLMs like …

Supporting human-ai collaboration in auditing llms with llms

C Rastogi, M Tulio Ribeiro, N King, H Nori… - Proceedings of the 2023 …, 2023 - dl.acm.org
Large language models (LLMs) are increasingly becoming all-powerful and pervasive via
deployment in sociotechnical systems. Yet these language models, be it for classification or …

ROBBIE: Robust bias evaluation of large generative language models

D Esiobu, X Tan, S Hosseini, M Ung, Y Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org
As generative large language models (LLMs) grow more performant and prevalent, we must
develop comprehensive enough tools to measure and improve their fairness. Different …

Measuring and improving chain-of-thought reasoning in vision-language models

Y Chen, K Sikka, M Cogswell, H Ji… - arXiv preprint arXiv …, 2023 - arxiv.org
Vision-language models (VLMs) have recently demonstrated strong efficacy as visual
assistants that can parse natural queries about the visual content and generate human-like …

Llm-assist: Enhancing closed-loop planning with language-based reasoning

SP Sharan, F Pittaluga, M Chandraker - arXiv preprint arXiv:2401.00125, 2023 - arxiv.org
Although planning is a crucial component of the autonomous driving stack, researchers
have yet to develop robust planning algorithms that are capable of safely handling the …

Language models can be logical solvers

J Feng, R Xu, J Hao, H Sharma, Y Shen, D Zhao… - arXiv preprint arXiv …, 2023 - arxiv.org
Logical reasoning is a fundamental aspect of human intelligence and a key component of
tasks like problem-solving and decision-making. Recent advancements have enabled Large …

Self-taught optimizer (stop): Recursively self-improving code generation

E Zelikman, E Lorch, L Mackey, AT Kalai - arXiv preprint arXiv:2310.02304, 2023 - arxiv.org
Several recent advances in AI systems (eg, Tree-of-Thoughts and Program-Aided Language
Models) solve problems by providing a" scaffolding" program that structures multiple calls to …

Quiet-star: Language models can teach themselves to think before speaking

E Zelikman, G Harik, Y Shao, V Jayasiri… - arXiv preprint arXiv …, 2024 - arxiv.org
When writing and talking, people sometimes pause to think. Although reasoning-focused
works have often framed reasoning as a method of answering questions or completing …

Denevil: Towards deciphering and navigating the ethical values of large language models via instruction learning

S Duan, X Yi, P Zhang, T Lu, X Xie, N Gu - arXiv preprint arXiv:2310.11053, 2023 - arxiv.org
Large Language Models (LLMs) have made unprecedented breakthroughs, yet their
increasing integration into everyday life might raise societal risks due to generated unethical …