A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery

Y Zhang, X Chen, B Jin, S Wang, S Ji, W Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
In many scientific fields, large language models (LLMs) have revolutionized the way
text and other modalities of data (e.g., molecules and proteins) are handled, achieving …

Newclid: A user-friendly replacement for AlphaGeometry

V Sicca, T Xia, M Fédérico, PJ Gorinski… - arXiv preprint arXiv …, 2024 - arxiv.org
We introduce a new symbolic solver for geometry, called Newclid, which is based on
AlphaGeometry. Newclid contains a symbolic solver called DDARN (derived from DDAR …

A Survey of Mathematical Reasoning in the Era of Multimodal Large Language Model: Benchmark, Method & Challenges

Y Yan, J Su, J He, F Fu, X Zheng, Y Lyu… - arXiv preprint arXiv …, 2024 - arxiv.org
Mathematical reasoning, a core aspect of human cognition, is vital across many domains,
from educational problem-solving to scientific advancements. As artificial general …

Symbolic Computation for All the Fun

CE Brown, M Janota, M Olšák - arXiv preprint arXiv:2404.12048, 2024 - arxiv.org
Motivated by the recent 10 million dollar AIMO challenge, this paper targets the problem of
finding all functions conforming to a given specification. This is a popular problem at …

OpenAI-o1 AB Testing: Does the o1 model really do good reasoning in math problem solving?

L Li, Y Luo, T Pan - arXiv preprint arXiv:2411.06198, 2024 - arxiv.org
The Orion-1 model by OpenAI is claimed to have more robust logical reasoning capabilities
than previous large language models. However, some suggest the excellence might be …