A comprehensive overview of large language models

H Naveed, AU Khan, S Qiu, M Saqib, S Anwar… - arXiv preprint arXiv …, 2023 - arxiv.org
Large Language Models (LLMs) have recently demonstrated remarkable capabilities in
natural language processing tasks and beyond. This success of LLMs has led to a large …

Recent advances in deep learning based dialogue systems: A systematic survey

J Ni, T Young, V Pandelea, F Xue… - Artificial intelligence review, 2023 - Springer
Dialogue systems are a popular natural language processing (NLP) task as it is promising in
real-life applications. It is also a complicated task since many NLP tasks deserving study are …

Human–AI collaboration enables more empathic conversations in text-based peer-to-peer mental health support

A Sharma, IW Lin, AS Miner, DC Atkins… - Nature Machine …, 2023 - nature.com
Advances in artificial intelligence (AI) are enabling systems that augment and collaborate
with humans to perform simple, mechanistic tasks such as scheduling meetings and …

Sentiment analysis in the era of large language models: A reality check

W Zhang, Y Deng, B Liu, SJ Pan, L Bing - arXiv preprint arXiv:2305.15005, 2023 - arxiv.org
Sentiment analysis (SA) has been a long-standing research area in natural language
processing. It can offer rich insights into human sentiments and opinions and has thus seen …

Internet-augmented dialogue generation

M Komeili, K Shuster, J Weston - arXiv preprint arXiv:2107.07566, 2021 - arxiv.org
The largest store of continually updating knowledge on our planet can be accessed via
internet search. In this work we study giving access to this information to conversational …

Recipes for building an open-domain chatbot

S Roller, E Dinan, N Goyal, D Ju, M Williamson… - arXiv preprint arXiv …, 2020 - arxiv.org
Building open-domain chatbots is a challenging area for machine learning research. While
prior work has shown that scaling neural models in the number of parameters and the size of …

DExperts: Decoding-time controlled text generation with experts and anti-experts

A Liu, M Sap, X Lu, S Swayamdipta… - arXiv preprint arXiv …, 2021 - arxiv.org
Despite recent advances in natural language generation, it remains challenging to control
attributes of generated text. We propose DExperts: Decoding-time Experts, a decoding-time …

Beyond goldfish memory: Long-term open-domain conversation

J Xu, A Szlam, J Weston - arXiv preprint arXiv:2107.07567, 2021 - arxiv.org
Despite recent improvements in open-domain dialogue models, state of the art models are
trained and evaluated on short conversations with little context. In contrast, the long-term …

Aligning ai with shared human values

D Hendrycks, C Burns, S Basart, A Critch, J Li… - arXiv preprint arXiv …, 2020 - arxiv.org
We show how to assess a language model's knowledge of basic concepts of morality. We
introduce the ETHICS dataset, a new benchmark that spans concepts in justice, well-being …

Cosmic: Commonsense knowledge for emotion identification in conversations

D Ghosal, N Majumder, A Gelbukh, R Mihalcea… - arXiv preprint arXiv …, 2020 - arxiv.org
In this paper, we address the task of utterance level emotion recognition in conversations
using commonsense knowledge. We propose COSMIC, a new framework that incorporates …