On the (in)effectiveness of large language models for Chinese text correction

Y Li, H Huang, S Ma, Y Jiang, Y Li, F Zhou… - arXiv preprint arXiv …, 2023 - arxiv.org
Recently, the development and progress of Large Language Models (LLMs) have amazed
the entire Artificial Intelligence community. As an outstanding representative of LLMs and the …

MRRL: Modifying the reference via reinforcement learning for non-autoregressive joint multiple intent detection and slot filling

X Cheng, Z Zhu, B Cao, Q Ye, Y Zou - Findings of the Association …, 2023 - aclanthology.org
With the rise of the non-autoregressive approach, some non-autoregressive models for joint
multiple intent detection and slot filling have obtained promising inference speed …

MESED: A multi-modal entity set expansion dataset with fine-grained semantic classes and hard negative entities

Y Li, T Lu, HT Zheng, Y Li, S Huang, T Yu… - Proceedings of the …, 2024 - ojs.aaai.org
The Entity Set Expansion (ESE) task aims to expand a handful of seed entities with new
entities belonging to the same semantic class. Conventional ESE methods are based on …

CLEME: debiasing multi-reference evaluation for grammatical error correction

J Ye, Y Li, Q Zhou, Y Li, S Ma, HT Zheng… - arXiv preprint arXiv …, 2023 - arxiv.org
Evaluating the performance of Grammatical Error Correction (GEC) systems is a challenging
task due to its subjectivity. Designing an evaluation metric that is as objective as possible is …

Correct Like Humans: Progressive Learning Framework for Chinese Text Error Correction

Y Li, S Ma, S Chen, H Huang, S Huang… - Available at SSRN …, 2023 - papers.ssrn.com
Chinese Text Error Correction (CTEC) aims to detect and correct errors in the input
text, which benefits human daily life and various downstream tasks. Recent approaches …

Towards real-world writing assistance: A Chinese character checking benchmark with faked and misspelled characters

Y Li, Z Xu, S Chen, H Huang, Y Li, Y Jiang, Z Li… - arXiv preprint arXiv …, 2023 - arxiv.org
Writing assistance is an application closely related to human life and is also a fundamental
Natural Language Processing (NLP) research field. Its aim is to improve the correctness and …

A Frustratingly Easy Plug-and-Play Detection-and-Reasoning Module for Chinese Spelling Check

H Huang, J Ye, Q Zhou, Y Li, Y Li, F Zhou… - arXiv preprint arXiv …, 2023 - arxiv.org
In recent years, Chinese Spelling Check (CSC) has been greatly improved by designing
task-specific pre-training methods or introducing auxiliary tasks, which mostly solve this task …

MixEdit: Revisiting data augmentation and beyond for grammatical error correction

J Ye, Y Li, Y Li, HT Zheng - arXiv preprint arXiv:2310.11671, 2023 - arxiv.org
Data Augmentation through generating pseudo data has been proven effective in mitigating
the challenge of data scarcity in the field of Grammatical Error Correction (GEC). Various …

Contextual similarity is more valuable than character similarity: An empirical study for Chinese spell checking

D Zhang, Y Li, Q Zhou, S Ma, Y Li… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
The Chinese Spell Checking (CSC) task aims to detect and correct Chinese spelling errors.
Recently, related research has focused on introducing character similarity from confusion sets to …

Disentangled phonetic representation for Chinese spelling correction

Z Liang, X Quan, Q Wang - arXiv preprint arXiv:2305.14783, 2023 - arxiv.org
Chinese Spelling Correction (CSC) aims to detect and correct erroneous characters in
Chinese texts. Although efforts have been made to introduce phonetic information (Hanyu …