Deep learning modelling techniques: current progress, applications, advantages, and challenges

SF Ahmed, MSB Alam, M Hassan, MR Rozbu… - Artificial Intelligence …, 2023 - Springer
Deep learning (DL) is revolutionizing evidence-based decision-making techniques that can
be applied across various sectors. Specifically, it can utilize two or more …

Large-scale multi-modal pre-trained models: A comprehensive survey

X Wang, G Chen, G Qian, P Gao, XY Wei… - Machine Intelligence …, 2023 - Springer
With the urgent demand for generalized deep models, many large pre-trained models have
been proposed, such as bidirectional encoder representations from transformers (BERT), vision transformer (ViT) …

LaMDA: Language models for dialog applications

R Thoppilan, D De Freitas, J Hall, N Shazeer… - arXiv preprint arXiv …, 2022 - arxiv.org
We present LaMDA: Language Models for Dialog Applications. LaMDA is a family of
Transformer-based neural language models specialized for dialog, which have up to 137B …

GLaM: Efficient scaling of language models with mixture-of-experts

N Du, Y Huang, AM Dai, S Tong… - International …, 2022 - proceedings.mlr.press
Scaling language models with more data, compute and parameters has driven significant
progress in natural language processing. For example, thanks to scaling, GPT-3 was able to …
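
The mixture-of-experts technique named in the title can be sketched as a routed feed-forward layer: a small gating network sends each token to one expert network, so total parameters grow with the number of experts while per-token compute stays roughly constant. The toy PyTorch layer below illustrates only that routing idea; GLaM itself uses top-2 gating inside a full Transformer, and every name and size here is invented.

    import torch
    import torch.nn.functional as F

    class Top1MoE(torch.nn.Module):
        def __init__(self, d=64, n_experts=4):
            super().__init__()
            self.gate = torch.nn.Linear(d, n_experts, bias=False)
            self.experts = torch.nn.ModuleList(
                torch.nn.Sequential(torch.nn.Linear(d, 4 * d),
                                    torch.nn.ReLU(),
                                    torch.nn.Linear(4 * d, d))
                for _ in range(n_experts))

        def forward(self, x):                        # x: (tokens, d)
            gates = F.softmax(self.gate(x), dim=-1)
            pick = gates.argmax(dim=-1)              # top-1 expert per token
            out = torch.zeros_like(x)
            for e, expert in enumerate(self.experts):
                idx = (pick == e).nonzero(as_tuple=True)[0]
                if idx.numel():
                    # scale by the gate value so routing carries gradient
                    out[idx] = expert(x[idx]) * gates[idx, e].unsqueeze(-1)
            return out

    moe = Top1MoE()
    y = moe(torch.randn(8, 64))   # each token is processed by exactly one expert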

Text and code embeddings by contrastive pre-training

A Neelakantan, T Xu, R Puri, A Radford, JM Han… - arXiv preprint arXiv …, 2022 - arxiv.org
Text embeddings are useful features in many applications such as semantic search and
computing text similarity. Previous work typically trains models customized for different use …
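
Contrastive pre-training of embeddings is commonly implemented with in-batch negatives: each text is scored against its paired positive and against every other example in the batch. The minimal PyTorch sketch below shows that general recipe, not the paper's exact training code; the temperature value is an arbitrary choice.

    import torch
    import torch.nn.functional as F

    def in_batch_contrastive_loss(q, p, temperature=0.05):
        # q, p: (batch, dim) embeddings of paired texts; q[i] matches p[i]
        q = F.normalize(q, dim=-1)
        p = F.normalize(p, dim=-1)
        logits = q @ p.T / temperature     # scaled cosine similarities
        labels = torch.arange(q.size(0), device=q.device)
        # the diagonal holds the positives; all off-diagonal pairs are negatives
        return F.cross_entropy(logits, labels)

    q = torch.randn(16, 128)               # stand-in query embeddings
    p = q + 0.1 * torch.randn(16, 128)     # noisy positives for the demo
    loss = in_batch_contrastive_loss(q, p)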

One embedder, any task: Instruction-finetuned text embeddings

H Su, W Shi, J Kasai, Y Wang, Y Hu… - arXiv preprint arXiv …, 2022 - arxiv.org
We introduce INSTRUCTOR, a new method for computing text embeddings given task
instructions: every text input is embedded together with instructions explaining the use case …
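
The mechanism the snippet describes, embedding the input together with an instruction, amounts to prepending a task description to the text before encoding. The sketch below shows that idea with a generic Hugging Face encoder and mean pooling; bert-base-uncased is only a stand-in, the actual INSTRUCTOR checkpoints and pooling differ, and the prompts are invented.

    import torch
    from transformers import AutoTokenizer, AutoModel

    tok = AutoTokenizer.from_pretrained("bert-base-uncased")  # stand-in encoder
    enc = AutoModel.from_pretrained("bert-base-uncased")

    def embed(instruction, text):
        # core idea: instruction and input are encoded as one sequence
        batch = tok(instruction + " " + text, return_tensors="pt", truncation=True)
        with torch.no_grad():
            hidden = enc(**batch).last_hidden_state      # (1, seq, dim)
        mask = batch["attention_mask"].unsqueeze(-1)     # (1, seq, 1)
        return (hidden * mask).sum(1) / mask.sum(1)      # mean pooling

    q = embed("Represent the question for retrieval:", "what is a mixture of experts?")
    d = embed("Represent the document for retrieval:", "MoE layers route tokens to experts.")
    sim = torch.cosine_similarity(q, d)   # task-aware similarity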

SimCSE: Simple contrastive learning of sentence embeddings

T Gao, X Yao, D Chen - arXiv preprint arXiv:2104.08821, 2021 - arxiv.org
This paper presents SimCSE, a simple contrastive learning framework that greatly advances
state-of-the-art sentence embeddings. We first describe an unsupervised approach, which …
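
The unsupervised approach the snippet alludes to is, in the published method, strikingly simple: the same sentence is encoded twice, the two different dropout masks act as the data augmentation, and the two encodings form a positive pair under an in-batch contrastive loss. A minimal sketch follows; the encoder is a toy stand-in for BERT plus pooling, and only the dropout-twice trick matters here.

    import torch
    import torch.nn.functional as F

    def unsup_simcse_loss(encoder, x, temperature=0.05):
        # encoder must be in train mode so dropout is active; two forward
        # passes over the same inputs yield two different "views"
        z1 = F.normalize(encoder(x), dim=-1)
        z2 = F.normalize(encoder(x), dim=-1)   # different dropout mask
        logits = z1 @ z2.T / temperature
        labels = torch.arange(z1.size(0))
        return F.cross_entropy(logits, labels)

    encoder = torch.nn.Sequential(torch.nn.Linear(32, 128),
                                  torch.nn.Dropout(0.1)).train()
    x = torch.randn(8, 32)                 # stand-in for encoded sentences
    loss = unsup_simcse_loss(encoder, x)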

ConSERT: A contrastive framework for self-supervised sentence representation transfer

Y Yan, R Li, S Wang, F Zhang, W Wu, W Xu - arXiv preprint arXiv …, 2021 - arxiv.org
Learning high-quality sentence representations benefits a wide range of natural language
processing tasks. Though BERT-based pre-trained language models achieve high …

An introduction to deep learning in natural language processing: Models, techniques, and tools

I Lauriola, A Lavelli, F Aiolli - Neurocomputing, 2022 - Elsevier
Natural Language Processing (NLP) is a branch of artificial intelligence that
involves the design and implementation of systems and algorithms able to interact through …

Learning transferable visual models from natural language supervision

A Radford, JW Kim, C Hallacy… - International …, 2021 - proceedings.mlr.press
State-of-the-art computer vision systems are trained to predict a fixed set of predetermined
object categories. This restricted form of supervision limits their generality and usability since …
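
At inference time, models trained from natural language supervision in this way (CLIP being the best-known example) can classify images zero-shot: class names are turned into text prompts, embedded, and matched to the image embedding by cosine similarity. The toy sketch below shows that inference step only; both encoders are stubbed out with random stand-ins, and the prompt template is just the conventional one.

    import torch
    import torch.nn.functional as F

    def zero_shot_classify(image_emb, class_names, text_encoder):
        # class names become prompts, prompts become embeddings,
        # and the best cosine match to the image wins
        prompts = [f"a photo of a {c}" for c in class_names]
        txt = F.normalize(text_encoder(prompts), dim=-1)   # (n_classes, dim)
        img = F.normalize(image_emb, dim=-1)               # (dim,)
        scores = txt @ img                                 # cosine similarities
        return class_names[int(scores.argmax())]

    def toy_text_encoder(prompts, dim=64):
        # deterministic random vectors keyed by prompt text (demo only)
        gens = [torch.Generator().manual_seed(abs(hash(p)) % 2**31) for p in prompts]
        return torch.stack([torch.randn(dim, generator=g) for g in gens])

    image_emb = torch.randn(64)            # stand-in image-encoder output
    print(zero_shot_classify(image_emb, ["cat", "dog", "car"], toy_text_encoder))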