Domain specialization as the key to make large language models disruptive: A comprehensive survey

C Ling, X Zhao, J Lu, C Deng, C Zheng, J Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
Large language models (LLMs) have significantly advanced the field of natural language
processing (NLP), providing a highly useful, task-agnostic foundation for a wide range of …

Parameter-efficient fine-tuning methods for pretrained language models: A critical review and assessment

L Xu, H Xie, SZJ Qin, X Tao, FL Wang - arXiv preprint arXiv:2312.12148, 2023 - arxiv.org
With the continuous growth in the number of parameters of transformer-based pretrained
language models (PLMs), particularly the emergence of large language models (LLMs) with …

VL-Adapter: Parameter-efficient transfer learning for vision-and-language tasks

YL Sung, J Cho, M Bansal - … of the IEEE/CVF conference on …, 2022 - openaccess.thecvf.com
Recently, fine-tuning language models pre-trained on large text corpora has provided huge
improvements on vision-and-language (V&L) tasks as well as on pure language tasks …

Compacter: Efficient low-rank hypercomplex adapter layers

R Karimi Mahabadi, J Henderson… - Advances in Neural …, 2021 - proceedings.neurips.cc
Adapting large-scale pretrained language models to downstream tasks via fine-tuning is the
standard method for achieving state-of-the-art performance on NLP benchmarks. However …
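
As a rough, illustrative sketch of the parameter-efficient direction this entry points to (not the authors' code), the snippet below implements a residual bottleneck adapter whose projections are parameterized-hypercomplex layers, i.e. weights built as sums of Kronecker products. It assumes PyTorch; the class names, dimensions, and initialization are my own choices, and the published Compacter additionally shares the small "rule" matrices across layers and factorizes the per-layer blocks into low-rank products.

    import torch
    import torch.nn as nn

    class PHMLinear(nn.Module):
        """Parameterized hypercomplex multiplication layer: the weight is a sum of
        Kronecker products W = sum_i kron(A_i, B_i), cutting the parameter count
        roughly by a factor of n versus a dense in_features x out_features matrix."""
        def __init__(self, in_features: int, out_features: int, n: int = 4):
            super().__init__()
            assert in_features % n == 0 and out_features % n == 0
            self.n = n
            self.A = nn.Parameter(torch.randn(n, n, n) * 0.01)  # n small "rule" matrices
            self.B = nn.Parameter(torch.randn(n, in_features // n, out_features // n) * 0.01)
            self.bias = nn.Parameter(torch.zeros(out_features))

        def forward(self, x: torch.Tensor) -> torch.Tensor:
            # Assemble the full (in_features x out_features) weight from Kronecker factors.
            W = sum(torch.kron(self.A[i], self.B[i]) for i in range(self.n))
            return x @ W + self.bias

    class CompacterStyleAdapter(nn.Module):
        """Residual bottleneck adapter with PHM projections; in adapter tuning only
        these parameters are updated while the pretrained backbone stays frozen."""
        def __init__(self, d_model: int = 768, bottleneck: int = 48, n: int = 4):
            super().__init__()
            self.down = PHMLinear(d_model, bottleneck, n)
            self.up = PHMLinear(bottleneck, d_model, n)
            self.act = nn.GELU()

        def forward(self, hidden_states: torch.Tensor) -> torch.Tensor:
            return hidden_states + self.up(self.act(self.down(hidden_states)))

    # Hypothetical usage: adapt 768-d hidden states with a small trainable module.
    adapter = CompacterStyleAdapter()
    out = adapter(torch.randn(2, 16, 768))  # (batch, seq_len, d_model)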

Dive into deep learning

A Zhang, ZC Lipton, M Li, AJ Smola - arXiv preprint arXiv:2106.11342, 2021 - arxiv.org
This open-source book represents our attempt to make deep learning approachable,
teaching readers the concepts, the context, and the code. The entire book is drafted in …

Scale efficiently: Insights from pre-training and fine-tuning transformers

Y Tay, M Dehghani, J Rao, W Fedus, S Abnar… - arXiv preprint arXiv …, 2021 - arxiv.org
There remain many open questions pertaining to the scaling behaviour of Transformer
architectures. These scaling decisions and findings can be critical, as training runs often …

Interactive natural language processing

Z Wang, G Zhang, K Yang, N Shi, W Zhou… - arXiv preprint arXiv …, 2023 - arxiv.org
Interactive Natural Language Processing (iNLP) has emerged as a novel paradigm within
the field of NLP, aimed at addressing limitations in existing frameworks while aligning with …

Visual tuning

BXB Yu, J Chang, H Wang, L Liu, S Wang… - ACM Computing …, 2024 - dl.acm.org
Fine-tuning visual models has been widely shown to deliver promising performance on many
downstream visual tasks. With the rapid development of pre-trained visual foundation …

Polyhistor: Parameter-efficient multi-task adaptation for dense vision tasks

YC Liu, CY Ma, J Tian, Z He… - Advances in Neural …, 2022 - proceedings.neurips.cc
Adapting large-scale pretrained models to various downstream tasks via fine-tuning is a
standard method in machine learning. Recently, parameter-efficient fine-tuning methods …

PERFECT: Prompt-free and efficient few-shot learning with language models

RK Mahabadi, L Zettlemoyer, J Henderson… - arXiv preprint arXiv …, 2022 - arxiv.org
Current methods for few-shot fine-tuning of pretrained masked language models (PLMs)
require carefully engineered prompts and verbalizers for each new task to convert examples …
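
For context on the prompt-and-verbalizer pattern this abstract refers to, the toy sketch below shows how a hand-written template plus a label-to-token verbalizer turn a raw example into a masked-LM query. The template and label words here are hypothetical examples of that engineering effort; PERFECT's stated goal is precisely to remove the need for them.

    # Toy illustration of prompt-based few-shot classification with a masked LM.
    # The template and verbalizer are hand-engineered, task-specific choices
    # (hypothetical here), which is the overhead PERFECT aims to eliminate.
    template = "{text} It was [MASK]."
    verbalizer = {"positive": "great", "negative": "terrible"}  # label -> answer token

    def to_mlm_query(text: str) -> str:
        """Wrap a raw input in the prompt so the model can 'answer' at the [MASK] slot."""
        return template.format(text=text)

    query = to_mlm_query("The movie was a blast.")
    print(query)  # "The movie was a blast. It was [MASK]."
    # A pretrained masked LM then scores the verbalizer tokens at [MASK]; the
    # higher-probability label word ("great" vs. "terrible") decides the class.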