Clustering and ranking: Diversity-preserved instruction selection through expert-aligned quality estimation
With contributions from the open-source community, a vast amount of instruction tuning (IT)
data has emerged. Given the significant resource allocation required for training and …
data has emerged. Given the significant resource allocation required for training and …
Know where to go: Make LLM a relevant, responsible, and trustworthy searchers
Abstract The advent of Large Language Models (LLMs) has shown the potential to improve
relevance and provide direct answers in web searches. However, challenges arise in …
relevance and provide direct answers in web searches. However, challenges arise in …
Conversational simulmt: Efficient simultaneous translation with large language models
Simultaneous machine translation (SimulMT) presents a challenging trade-off between
translation quality and latency. Recent studies have shown that LLMs can achieve good …
translation quality and latency. Recent studies have shown that LLMs can achieve good …
Emerging Opportunities of Using Large Language Language Models for Translation Between Drug Molecules and Indications
A drug molecule is a substance that changes the organism's mental or physical state. Every
approved drug has an indication, which refers to the therapeutic use of that drug for treating …
approved drug has an indication, which refers to the therapeutic use of that drug for treating …
Emerging opportunities of using large language models for translation between drug molecules and indications
A drug molecule is a substance that changes an organism's mental or physical state. Every
approved drug has an indication, which refers to the therapeutic use of that drug for treating …
approved drug has an indication, which refers to the therapeutic use of that drug for treating …
A Context-aware Framework for Translation-mediated Conversations
Effective communication is fundamental to any interaction, yet challenges arise when
participants do not share a common language. Automatic translation systems offer a …
participants do not share a common language. Automatic translation systems offer a …
Creative and Context-Aware Translation of East Asian Idioms with GPT-4
As a type of figurative language, an East Asian idiom condenses rich cultural background
into only a few characters. Translating such idioms is challenging for human translators, who …
into only a few characters. Translating such idioms is challenging for human translators, who …
Contextual Refinement of Translations: Large Language Models for Sentence and Document-Level Post-Editing
Large Language Models (LLM's) have demonstrated considerable success in various
Natural Language Processing tasks, but they have yet to attain state-of-the-art performance …
Natural Language Processing tasks, but they have yet to attain state-of-the-art performance …
Optimizing example selection for retrieval-augmented machine translation with translation memories
Retrieval-augmented machine translation leverages examples from a translation memory by
retrieving similar instances. These examples are used to condition the predictions of a …
retrieving similar instances. These examples are used to condition the predictions of a …
How Much Data is Enough Data? Fine-Tuning Large Language Models for In-House Translation: Performance Evaluation Across Multiple Dataset Sizes
Decoder-only LLMs have shown impressive performance in MT due to their ability to learn
from extensive datasets and generate high-quality translations. However, LLMs often …
from extensive datasets and generate high-quality translations. However, LLMs often …