- 学术资源搜索

Software testing with large language models: Survey, landscape, and vision

J Wang, Y Huang, C Chen, Z Liu… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

Pre-trained large language models (LLMs) have recently emerged as a breakthrough
technology in natural language processing and artificial intelligence, with the ability to …

被引用次数：179 相关文章所有 7 个版本

[PDF] frontiersin.org

Large language models and political science

M Linegar, R Kocielnik, RM Alvarez - Frontiers in Political Science, 2023 - frontiersin.org

Large Language Models (LLMs) are a type of artificial intelligence that uses information from
very large datasets to model the use of language and generate content. While LLMs like …

被引用次数：21 相关文章

[PDF] acm.org

Supporting human-ai collaboration in auditing llms with llms

C Rastogi, M Tulio Ribeiro, N King, H Nori… - Proceedings of the 2023 …, 2023 - dl.acm.org

Large language models (LLMs) are increasingly becoming all-powerful and pervasive via
deployment in sociotechnical systems. Yet these language models, be it for classification or …

被引用次数：69 相关文章所有 4 个版本

[PDF] arxiv.org

ROBBIE: Robust bias evaluation of large generative language models

D Esiobu, X Tan, S Hosseini, M Ung, Y Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org

As generative large language models (LLMs) grow more performant and prevalent, we must
develop comprehensive enough tools to measure and improve their fairness. Different …

被引用次数：26 相关文章所有 4 个版本

[PDF] arxiv.org

Measuring and improving chain-of-thought reasoning in vision-language models

Y Chen, K Sikka, M Cogswell, H Ji… - arXiv preprint arXiv …, 2023 - arxiv.org

Vision-language models (VLMs) have recently demonstrated strong efficacy as visual
assistants that can parse natural queries about the visual content and generate human-like …

被引用次数：20 相关文章所有 3 个版本

[PDF] arxiv.org

Llm-assist: Enhancing closed-loop planning with language-based reasoning

SP Sharan, F Pittaluga, M Chandraker - arXiv preprint arXiv:2401.00125, 2023 - arxiv.org

Although planning is a crucial component of the autonomous driving stack, researchers
have yet to develop robust planning algorithms that are capable of safely handling the …

被引用次数：31 相关文章所有 2 个版本

[PDF] arxiv.org

Language models can be logical solvers

J Feng, R Xu, J Hao, H Sharma, Y Shen, D Zhao… - arXiv preprint arXiv …, 2023 - arxiv.org

Logical reasoning is a fundamental aspect of human intelligence and a key component of
tasks like problem-solving and decision-making. Recent advancements have enabled Large …

被引用次数：10 相关文章所有 2 个版本

[PDF] arxiv.org

Self-taught optimizer (stop): Recursively self-improving code generation

E Zelikman, E Lorch, L Mackey, AT Kalai - arXiv preprint arXiv:2310.02304, 2023 - arxiv.org

Several recent advances in AI systems (eg, Tree-of-Thoughts and Program-Aided Language
Models) solve problems by providing a" scaffolding" program that structures multiple calls to …

被引用次数：29 相关文章所有 6 个版本

[PDF] arxiv.org

Quiet-star: Language models can teach themselves to think before speaking

E Zelikman, G Harik, Y Shao, V Jayasiri… - arXiv preprint arXiv …, 2024 - arxiv.org

When writing and talking, people sometimes pause to think. Although reasoning-focused
works have often framed reasoning as a method of answering questions or completing …

被引用次数：33 相关文章所有 2 个版本

[PDF] arxiv.org

Denevil: Towards deciphering and navigating the ethical values of large language models via instruction learning

S Duan, X Yi, P Zhang, T Lu, X Xie, N Gu - arXiv preprint arXiv:2310.11053, 2023 - arxiv.org

Large Language Models (LLMs) have made unprecedented breakthroughs, yet their
increasing integration into everyday life might raise societal risks due to generated unethical …

被引用次数：4 相关文章所有 3 个版本