A survey of deep learning for mathematical reasoning

P Lu, L Qiu, W Yu, S Welleck, KW Chang - arXiv preprint arXiv:2212.10535, 2022 - arxiv.org
Mathematical reasoning is a fundamental aspect of human intelligence and is applicable in
various fields, including science, engineering, finance, and everyday life. The development …

Program of thoughts prompting: Disentangling computation from reasoning for numerical reasoning tasks

W Chen, X Ma, X Wang, WW Cohen - arXiv preprint arXiv:2211.12588, 2022 - arxiv.org
Recently, there has been significant progress in teaching language models to perform step-
by-step reasoning to solve complex numerical reasoning tasks. Chain-of-thoughts prompting …

Revolutionizing finance with llms: An overview of applications and insights

H Zhao, Z Liu, Z Wu, Y Li, T Yang, P Shu, S Xu… - arXiv preprint arXiv …, 2024 - arxiv.org
In recent years, Large Language Models (LLMs) like ChatGPT have seen considerable
advancements and have been applied in diverse fields. Built on the Transformer …

Pixiu: A large language model, instruction data and evaluation benchmark for finance

Q Xie, W Han, X Zhang, Y Lai, M Peng… - arXiv preprint arXiv …, 2023 - arxiv.org
Although large language models (LLMs) has shown great performance on natural language
processing (NLP) in the financial domain, there are no publicly available financial tailtored …

Adapting large language models via reading comprehension

D Cheng, S Huang, F Wei - arXiv preprint arXiv:2309.09530, 2023 - arxiv.org
We explore how continued pre-training on domain-specific corpora influences large
language models, revealing that training on the raw corpora endows the model with domain …

Financebench: A new benchmark for financial question answering

P Islam, A Kannappan, D Kiela, R Qian… - arXiv preprint arXiv …, 2023 - arxiv.org
FinanceBench is a first-of-its-kind test suite for evaluating the performance of LLMs on open
book financial question answering (QA). It comprises 10,231 questions about publicly traded …

AI for social science and social science of AI: A survey

R Xu, Y Sun, M Ren, S Guo, R Pan, H Lin, L Sun… - Information Processing …, 2024 - Elsevier
Recent advancements in artificial intelligence, particularly with the emergence of large
language models (LLMs), have sparked a rethinking of artificial general intelligence …

Pixiu: A comprehensive benchmark, instruction dataset and large language model for finance

Q Xie, W Han, X Zhang, Y Lai, M Peng… - Advances in …, 2024 - proceedings.neurips.cc
Although large language models (LLMs) have shown great performance in natural language
processing (NLP) in the financial domain, there are no publicly available financially tailored …

Fineval: A chinese financial domain knowledge evaluation benchmark for large language models

L Zhang, W Cai, Z Liu, Z Yang, W Dai, Y Liao… - arXiv preprint arXiv …, 2023 - arxiv.org
Large language models (LLMs) have demonstrated exceptional performance in various
natural language processing tasks, yet their efficacy in more challenging and domain …

Chatqa: Building gpt-4 level conversational qa models

Z Liu, W Ping, R Roy, P Xu, M Shoeybi… - arXiv preprint arXiv …, 2024 - arxiv.org
In this work, we introduce ChatQA, a family of conversational question answering (QA)
models that obtain GPT-4 level accuracies. Specifically, we propose a two-stage instruction …