LeCaRDv2: A Large-Scale Chinese Legal Case Retrieval Dataset

H Li, Y Shao, Y Wu, Q Ai, Y Ma, Y Liu - Proceedings of the 47th …, 2024 - dl.acm.org
As an important component of intelligent legal systems, legal case retrieval plays a critical
role in ensuring judicial justice and fairness. However, the development of legal case …

An intent taxonomy of legal case retrieval

Y Shao, H Li, Y Wu, Y Liu, Q Ai, J Mao, Y Ma… - ACM Transactions on …, 2023 - dl.acm.org
Legal case retrieval is a special Information Retrieval (IR) task focusing on legal case
documents. Depending on the downstream tasks of the retrieved case documents, users' …

Unsupervised real-time hallucination detection based on the internal states of large language models

W Su, C Wang, Q Ai, Y Hu, Z Wu, Y Zhou… - arXiv preprint arXiv …, 2024 - arxiv.org
Hallucinations in large language models (LLMs) refer to the phenomenon of LLMs
producing responses that are coherent yet factually inaccurate. This issue undermines the …

Dragin: Dynamic retrieval augmented generation based on the real-time information needs of large language models

W Su, Y Tang, Q Ai, Z Wu, Y Liu - arXiv preprint arXiv:2403.10081, 2024 - arxiv.org
Dynamic retrieval augmented generation (RAG) paradigm actively decides when and what
to retrieve during the text generation process of Large Language Models (LLMs). There are …

Leveraging Event Schema to Ask Clarifying Questions for Conversational Legal Case Retrieval

B Liu, Y Hu, Q Ai, Y Liu, Y Wu, C Li… - Proceedings of the 32nd …, 2023 - dl.acm.org
Legal case retrieval is a special IR task aiming to retrieve supporting cases for a given query
case. Existing works have shown that conversational search paradigm can improve users' …

DELTA: Pre-train a Discriminative Encoder for Legal Case Retrieval via Structural Word Alignment

H Li, Q Ai, X Han, J Chen, Q Dong, Y Liu… - arXiv preprint arXiv …, 2024 - arxiv.org
Recent research demonstrates the effectiveness of using pre-trained language models for
legal case retrieval. Most of the existing works focus on improving the representation ability …

Pushing the boundaries of legal information processing with integration of large language models

C Nguyen, T Tran, K Le, H Nguyen, T Do… - … Symposium on Artificial …, 2024 - Springer
The legal domain presents unique challenges in information processing, given the
complexity and specificity of legal texts. Addressing these challenges, this work leverages …

Mitigating Entity-Level Hallucination in Large Language Models

W Su, Y Tang, Q Ai, C Wang, Z Wu, Y Liu - arXiv preprint arXiv:2407.09417, 2024 - arxiv.org
The emergence of Large Language Models (LLMs) has revolutionized how users access
information, shifting from traditional search engines to direct question-and-answer …

MUSER: A Multi-View Similar Case Retrieval Dataset

Q Li, Y Hu, F Yao, C Xiao, Z Liu, M Sun… - Proceedings of the 32nd …, 2023 - dl.acm.org
Similar case retrieval (SCR) is a representative legal AI application that plays a pivotal role
in promoting judicial fairness. However, existing SCR datasets only focus on the fact …

LEEC: A Legal Element Extraction Dataset with an Extensive Domain-Specific Label System

X Zongyue, L Huanghai, H Yiran, K Kangle… - arXiv preprint arXiv …, 2023 - arxiv.org
As a pivotal task in natural language processing, element extraction has gained significance
in the legal domain. Extracting legal elements from judicial documents helps enhance …