From google gemini to openai q*(q-star): A survey of reshaping the generative artificial intelligence (ai) research landscape

TR McIntosh, T Susnjak, T Liu, P Watters… - arXiv preprint arXiv …, 2023 - arxiv.org
This comprehensive survey explored the evolving landscape of generative Artificial
Intelligence (AI), with a specific focus on the transformative impacts of Mixture of Experts …

Mathematical language models: A survey

W Liu, H Hu, J Zhou, Y Ding, J Li, J Zeng, M He… - arXiv preprint arXiv …, 2023 - arxiv.org
In recent years, there has been remarkable progress in leveraging Language Models (LMs),
encompassing Pre-trained Language Models (PLMs) and Large-scale Language Models …

Generative verifiers: Reward modeling as next-token prediction

L Zhang, A Hosseini, H Bansal, M Kazemi… - arXiv preprint arXiv …, 2024 - arxiv.org
Verifiers or reward models are often used to enhance the reasoning performance of large
language models (LLMs). A common approach is the Best-of-N method, where N candidate …

Safe-clip: Removing nsfw concepts from vision-and-language models

S Poppi, T Poppi, F Cocchi, M Cornia, L Baraldi… - … on Computer Vision, 2025 - Springer
Large-scale vision-and-language models, such as CLIP, are typically trained on web-scale
data, which can introduce inappropriate content and lead to the development of unsafe and …

Large language models for mathematical reasoning: Progresses and challenges

J Ahn, R Verma, R Lou, D Liu, R Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
Mathematical reasoning serves as a cornerstone for assessing the fundamental cognitive
capabilities of human intelligence. In recent times, there has been a notable surge in the …

Step-dpo: Step-wise preference optimization for long-chain reasoning of llms

X Lai, Z Tian, Y Chen, S Yang, X Peng, J Jia - arXiv preprint arXiv …, 2024 - arxiv.org
Mathematical reasoning presents a significant challenge for Large Language Models
(LLMs) due to the extensive and precise chain of reasoning required for accuracy. Ensuring …

Predicting text preference via structured comparative reasoning

JN Yan, T Liu, J Chiu, J Shen, Z Qin, Y Yu… - Proceedings of the …, 2024 - aclanthology.org
Comparative reasoning plays a crucial role in predicting text preferences; however, large
language models (LLMs) often demonstrate inconsistencies in their reasoning, leading to …

Visual agents as fast and slow thinkers

G Sun, M Jin, Z Wang, CL Wang, S Ma, Q Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
Achieving human-level intelligence requires refining cognitive distinctions between System
1 and System 2 thinking. While contemporary AI, driven by large language models …

Fight back against jailbreaking via prompt adversarial tuning

Y Mo, Y Wang, Z Wei, Y Wang - The Thirty-eighth Annual …, 2024 - openreview.net
While Large Language Models (LLMs) have achieved tremendous success in various
applications, they are also susceptible to jailbreaking attacks. Several primary defense …

Embedding self-correction as an inherent ability in large language models for enhanced mathematical reasoning

K Gao, H Cai, Q Shuai, D Gong, Z Li - arXiv preprint arXiv:2410.10735, 2024 - arxiv.org
Accurate mathematical reasoning with Large Language Models (LLMs) is crucial in
revolutionizing domains that heavily rely on such reasoning. However, LLMs often …