Improving readability for automatic speech recognition transcription

J Liao, S Eskimez, L Lu, Y Shi, M Gong… - ACM Transactions on …, 2023 - dl.acm.org
Modern Automatic Speech Recognition (ASR) systems can achieve high performance in
terms of recognition accuracy. However, a perfectly accurate transcript still can be …

Uncovering the Risks and Drawbacks Associated with the Use of Synthetic Data for Grammatical Error Correction

S Koo, C Park, S Lee, J Seo, S Eo, H Moon… - IEEE Access, 2023 - ieeexplore.ieee.org
In a Data-Centric AI paradigm, the model performance is enhanced without altering the
model architecture, as evidenced by real-world and benchmark dataset demonstrations …

Information Dropping Data Augmentation for Machine Translation Quality Estimation

S Li, X Bi, T Liu, Z Chen - IEEE/ACM Transactions on Audio …, 2024 - ieeexplore.ieee.org
Machine translation quality estimation (QE) refers to the quality assessment of machine
translations without a given reference translation. Supervised QE models based on neural …

BERTOEIC: Solving TOEIC problems using simple and efficient data augmentation techniques with pretrained transformer encoders

J Lee, H Moon, C Park, J Seo, S Eo, H Lim - Applied Sciences, 2022 - mdpi.com
Recent studies have attempted to understand natural language and infer answers. Machine
reading comprehension is one of the representatives, and several related datasets have …

Synthetic Alone: Exploring the Dark Side of Synthetic Data for Grammatical Error Correction

C Park, S Koo, S Lee, J Seo, S Eo, H Moon… - arXiv preprint arXiv …, 2023 - arxiv.org
Data-centric AI approach aims to enhance the model performance without modifying the
model and has been shown to impact model performance positively. While recent attention …

A Self-Supervised Automatic Post-Editing Data Generation Tool

H Moon, C Park, S Eo, J Seo, SJ Lee, H Lim - arXiv preprint arXiv …, 2021 - arxiv.org
Data building for automatic post-editing (APE) requires extensive and expert-level human
effort, as it contains an elaborate process that involves identifying errors in sentences and …

Advanced Techniques in Hindi Automatic Post-Editing: Neural Models and Data Augmentation

P Nair - 2024 - cdn.iiit.ac.in
Automatic post-editing (APE) is a crucial technique for enhancing the quality of machine
translations. In this thesis, we present an APE approach specifically for English-Hindi …

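Corrector ortográfico neuronal para errores ortográficos multilingües adversarios para lenguas amazónicas peruanas [Neural spelling corrector for adversarial multilingual spelling errors in Peruvian Amazonian languages]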

G Cardoso Yllanes - 2022 - tesis.pucp.edu.pe
To counter adversarial-example attacks, it was proposed to implement a word recognition
model and train it on sentences created through different …