PubMedQA: A dataset for biomedical research question answering

Q Jin, B Dhingra, Z Liu, WW Cohen, X Lu - arXiv preprint arXiv:1909.06146, 2019 - arxiv.org
We introduce PubMedQA, a novel biomedical question answering (QA) dataset collected
from PubMed abstracts. The task of PubMedQA is to answer research questions with …

SGM: sequence generation model for multi-label classification

P Yang, X Sun, W Li, S Ma, W Wu, H Wang - arXiv preprint arXiv …, 2018 - arxiv.org
Multi-label classification is an important yet challenging task in natural language processing.
It is more complex than single-label classification in that the labels tend to be correlated …

Glyce: Glyph-vectors for Chinese character representations

Y Meng, W Wu, F Wang, X Li, P Nie… - Advances in …, 2019 - proceedings.neurips.cc
It is intuitive that NLP tasks for logographic languages like Chinese should benefit from the
use of the glyph information in those languages. However, due to the lack of rich …

Order-agnostic cross entropy for non-autoregressive machine translation

C Du, Z Tu, J Jiang - International conference on machine …, 2021 - proceedings.mlr.press
We propose a new training objective named order-agnostic cross entropy (OaXE) for fully
non-autoregressive translation (NAT) models. OaXE improves the standard cross-entropy …

Diversity-promoting GAN: A cross-entropy based generative adversarial network for diversified text generation

J Xu, X Ren, J Lin, X Sun - Proceedings of the 2018 conference on …, 2018 - aclanthology.org
Existing text generation methods tend to produce repeated and "boring" expressions. To
tackle this problem, we propose a new text generation model, called Diversity-Promoting …

Is word segmentation necessary for deep learning of Chinese representations?

X Li, Y Meng, X Sun, Q Han, A Yuan, J Li - arXiv preprint arXiv:1905.05526, 2019 - arxiv.org
Segmenting a chunk of text into words is usually the first step of processing Chinese text, but
its necessity has rarely been explored. In this paper, we ask the fundamental question of …

Generating sentences from disentangled syntactic and semantic spaces

Y Bao, H Zhou, S Huang, L Li, L Mou… - arXiv preprint arXiv …, 2019 - arxiv.org
Variational auto-encoders (VAEs) are widely used in natural language generation due to the
regularization of the latent space. However, generating sentences from the continuous latent …

Paraphrase generation with latent bag of words

Y Fu, Y Feng, JP Cunningham - Advances in Neural …, 2019 - proceedings.neurips.cc
Paraphrase generation is a longstanding important problem in natural language processing.
Recent progress in deep generative models has shown promising results on discrete latent …

Machine translation and its evaluation: a study

SK Mondal, H Zhang, HMD Kabir, K Ni… - Artificial Intelligence …, 2023 - Springer
Machine translation (namely MT) has been one of the most popular fields in
computational linguistics and Artificial Intelligence (AI). As one of the most promising …

A skeleton-based model for promoting coherence among sentences in narrative story generation

J Xu, X Ren, Y Zhang, Q Zeng, X Cai, X Sun - arXiv preprint arXiv …, 2018 - arxiv.org
Narrative story generation is a challenging problem because it demands the generated
sentences with tight semantic connections, which has not been well studied by most existing …