BARTpho: pre-trained sequence-to-sequence models for Vietnamese

NL Tran, DM Le, DQ Nguyen - arXiv preprint arXiv:2109.09701, 2021 - arxiv.org
We present BARTpho with two versions, BARTpho-syllable and BARTpho-word, which are
the first public large-scale monolingual sequence-to-sequence models pre-trained for …

Vietnamese sentiment analysis: an overview and comparative study of fine-tuning pretrained language models

D Van Thin, DN Hao, NLT Nguyen - ACM Transactions on Asian and …, 2023 - dl.acm.org
Sentiment Analysis (SA) is one of the most active research areas in the Natural Language
Processing (NLP) field due to its potential for business and society. With the development of …

ViDeBERTa: A powerful pre-trained language model for Vietnamese

CD Tran, NH Pham, A Nguyen, TS Hy, T Vu - arXiv preprint arXiv …, 2023 - arxiv.org
This paper presents ViDeBERTa, a new pre-trained monolingual language model for
Vietnamese, with three versions: ViDeBERTa_xsmall, ViDeBERTa_base, and …

ViSoBERT: A pre-trained language model for Vietnamese social media text processing

QN Nguyen, TC Phan, DV Nguyen… - arXiv preprint arXiv …, 2023 - arxiv.org
English and Chinese, known as resource-rich languages, have witnessed the strong
development of transformer-based language models for natural language processing tasks …

A multiple choices reading comprehension corpus for Vietnamese language education

ST Luu, KT Hoang, TQ Pham, K Van Nguyen… - arXiv preprint arXiv …, 2023 - arxiv.org
Machine reading comprehension has been an interesting and challenging task in recent
years, with the purpose of extracting useful information from texts. To attain the computer …

ViHealthBERT: Pre-trained language models for Vietnamese in health text mining

N Minh, VH Tran, V Hoang, HD Ta, TH Bui… - Proceedings of the …, 2022 - aclanthology.org
Pre-trained language models have become crucial to achieving competitive results across
many Natural Language Processing (NLP) problems. For monolingual pre-trained models in …

ViCGCN: Graph Convolutional Network with Contextualized Language Models for Social Media Mining in Vietnamese

CT Phan, QN Nguyen, CT Dang, TH Do… - arXiv preprint arXiv …, 2023 - arxiv.org
Social media processing is a fundamental task in natural language processing with
numerous applications. As Vietnamese social media and information science have grown …

An approach of data augmentation to improve the performance of BERTology models for Vietnamese hate speech detection

ST Luu, K Van Nguyen, NLT Nguyen - Multimedia Tools and Applications, 2024 - Springer
Hate speech detection on social media networks is the classification task that automatically
detects harmful comments from users and prevents the appearance of those toxic comments …

Vietnamese capitalization and punctuation recovery models

HTT Uyen, NA Tu, TD Huy - arXiv preprint arXiv:2207.01312, 2022 - arxiv.org
Despite the rise of recent performant methods in Automatic Speech Recognition (ASR), such
methods do not ensure proper casing and punctuation for their outputs. This problem has a …

A semantics-aware approach for multilingual natural language inference

P Le-Hong, E Cambria - Language Resources and Evaluation, 2023 - Springer
This paper introduces a semantics-aware approach to natural language inference which
allows neural network models to perform better on natural language inference benchmarks …