Scene graph as pivoting: Inference-time image-free unsupervised multimodal machine translation with visual scene hallucination
In this work, we investigate a more realistic unsupervised multimodal machine translation
(UMMT) setup, inference-time image-free UMMT, where the model is trained with source-text …
(UMMT) setup, inference-time image-free UMMT, where the model is trained with source-text …
Proactive conversational agents
Conversational agents, or commonly known as dialogue systems, have gained escalating
popularity in recent years. Their widespread applications support conversational interactions …
popularity in recent years. Their widespread applications support conversational interactions …
Disentangling user conversations with voice assistants for online shopping
Conversation disentanglement aims to identify and group utterances from a conversation
into separate threads. Existing methods primarily focus on disentangling multi-party …
into separate threads. Existing methods primarily focus on disentangling multi-party …
A Bi-directional Multi-hop Inference Model for Joint Dialog Sentiment Classification and Act Recognition
The joint task of Dialog Sentiment Classification (DSC) and Act Recognition (DAR) aims to
predict the sentiment label and act label for each utterance in a dialog simultaneously …
predict the sentiment label and act label for each utterance in a dialog simultaneously …
Temporal Contrastive and Spatial Enhancement Coarse Grained Network for Weakly Supervised Group Activity Recognition
J Guo, Y Ge - Engineering Applications of Artificial Intelligence, 2024 - Elsevier
Group activity recognition (GAR) is an increasingly popular topic in the field of computer
vision. Numerous researchers have proposed a range of methods to achieve outstanding …
vision. Numerous researchers have proposed a range of methods to achieve outstanding …
Dramatic conversation disentanglement
We present a new dataset for studying conversation disentanglement in movies and TV
series. While previous work has focused on conversation disentanglement in IRC chatroom …
series. While previous work has focused on conversation disentanglement in IRC chatroom …
Revisiting conversation discourse for dialogue disentanglement
Dialogue disentanglement aims to detach the chronologically ordered utterances into
several independent sessions. Conversation utterances are essentially organized and …
several independent sessions. Conversation utterances are essentially organized and …
Beyond Language: Empowering Unsupervised Machine Translation with Cross-modal Alignment
Z Yang, Q Fang, Y Feng - openreview.net
Unsupervised machine translation (UMT) has achieved notable performance without any
parallel corpora in recent years. Nevertheless, aligning the source language with the target …
parallel corpora in recent years. Nevertheless, aligning the source language with the target …