Convfinqa: Exploring the chain of numerical reasoning in conversational finance question answering

P Lu, L Qiu, W Yu, S Welleck, KW Chang - arXiv preprint arXiv:2212.10535, 2022 - arxiv.org

Mathematical reasoning is a fundamental aspect of human intelligence and is applicable in
various fields, including science, engineering, finance, and everyday life. The development …

被引用次数：91 相关文章所有 6 个版本

[PDF] arxiv.org

Program of thoughts prompting: Disentangling computation from reasoning for numerical reasoning tasks

W Chen, X Ma, X Wang, WW Cohen - arXiv preprint arXiv:2211.12588, 2022 - arxiv.org

Recently, there has been significant progress in teaching language models to perform step-
by-step reasoning to solve complex numerical reasoning tasks. Chain-of-thoughts prompting …

被引用次数：391 相关文章所有 4 个版本

[PDF] arxiv.org

Revolutionizing finance with llms: An overview of applications and insights

H Zhao, Z Liu, Z Wu, Y Li, T Yang, P Shu, S Xu… - arXiv preprint arXiv …, 2024 - arxiv.org

In recent years, Large Language Models (LLMs) like ChatGPT have seen considerable
advancements and have been applied in diverse fields. Built on the Transformer …

被引用次数：21 相关文章所有 2 个版本

[PDF] arxiv.org

Pixiu: A large language model, instruction data and evaluation benchmark for finance

Q Xie, W Han, X Zhang, Y Lai, M Peng… - arXiv preprint arXiv …, 2023 - arxiv.org

Although large language models (LLMs) has shown great performance on natural language
processing (NLP) in the financial domain, there are no publicly available financial tailtored …

被引用次数：74 相关文章所有 3 个版本

[PDF] arxiv.org

Adapting large language models via reading comprehension

D Cheng, S Huang, F Wei - arXiv preprint arXiv:2309.09530, 2023 - arxiv.org

We explore how continued pre-training on domain-specific corpora influences large
language models, revealing that training on the raw corpora endows the model with domain …

被引用次数：36 相关文章所有 3 个版本

[PDF] arxiv.org

Financebench: A new benchmark for financial question answering

P Islam, A Kannappan, D Kiela, R Qian… - arXiv preprint arXiv …, 2023 - arxiv.org

FinanceBench is a first-of-its-kind test suite for evaluating the performance of LLMs on open
book financial question answering (QA). It comprises 10,231 questions about publicly traded …

被引用次数：18 相关文章所有 2 个版本

[PDF] arxiv.org

AI for social science and social science of AI: A survey

R Xu, Y Sun, M Ren, S Guo, R Pan, H Lin, L Sun… - Information Processing …, 2024 - Elsevier

Recent advancements in artificial intelligence, particularly with the emergence of large
language models (LLMs), have sparked a rethinking of artificial general intelligence …

被引用次数：8 相关文章所有 3 个版本

[PDF] neurips.cc

Pixiu: A comprehensive benchmark, instruction dataset and large language model for finance

Q Xie, W Han, X Zhang, Y Lai, M Peng… - Advances in …, 2024 - proceedings.neurips.cc

Although large language models (LLMs) have shown great performance in natural language
processing (NLP) in the financial domain, there are no publicly available financially tailored …

被引用次数：3 相关文章所有 2 个版本

[PDF] arxiv.org

Fineval: A chinese financial domain knowledge evaluation benchmark for large language models

L Zhang, W Cai, Z Liu, Z Yang, W Dai, Y Liao… - arXiv preprint arXiv …, 2023 - arxiv.org

Large language models (LLMs) have demonstrated exceptional performance in various
natural language processing tasks, yet their efficacy in more challenging and domain …

被引用次数：25 相关文章所有 2 个版本

[PDF] arxiv.org

Chatqa: Building gpt-4 level conversational qa models

Z Liu, W Ping, R Roy, P Xu, M Shoeybi… - arXiv preprint arXiv …, 2024 - arxiv.org

In this work, we introduce ChatQA, a family of conversational question answering (QA)
models that obtain GPT-4 level accuracies. Specifically, we propose a two-stage instruction …

被引用次数：11 相关文章所有 2 个版本