Beimingwu: A learnware dock system

ZH Tan, JD Liu, XD Bi, P Tan, QC Zheng… - Proceedings of the 30th …, 2024 - dl.acm.org
The learnware paradigm proposed by Zhou (2016) aims to enable users to leverage
numerous existing high-performing models instead of building machine learning models …

Selecting large language model to fine-tune via rectified scaling law

H Lin, B Huang, H Ye, Q Chen, Z Wang, S Li… - arXiv preprint arXiv …, 2024 - arxiv.org
The ever-growing ecosystem of LLMs has posed a challenge in selecting the most
appropriate pre-trained model to fine-tune amidst a sea of options. Given constrained …

Foundation Model is Efficient Multimodal Multitask Model Selector

W Shao, C Jiang, K Zhang, Y Qiao… - Advances in Neural …, 2024 - proceedings.neurips.cc
This paper investigates an under-explored but important problem: given a collection of pre-
trained neural networks, predicting their performance on each multi-modal task without fine …

Parrot: Multilingual Visual Instruction Tuning

HL Sun, DW Zhou, Y Li, S Lu, C Yi, QG Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
The rapid development of Multimodal Large Language Models (MLLMs) like GPT-4V has
marked a significant step towards artificial general intelligence. Existing methods mainly …

One size does not fit all in evaluating model selection scores for image classification

N Abou Baker, U Handmann - Scientific Reports, 2024 - nature.com
Selecting pretrained models for image classification often involves computationally intensive
finetuning. This study addresses a research gap in the standardized evaluation of …

Universal embedding for pre-trained models and data bench

N Cho, T Cho, J Shin, E Jeon, T Lee - Neurocomputing, 2024 - Elsevier
The transformer architecture has shown significant improvements in the performance of
various natural language processing (NLP) tasks. One of the great advantages of …

Wings: Learning Multimodal LLMs without Text-only Forgetting

YK Zhang, S Lu, Y Li, Y Ma, QG Chen, Z Xu… - arXiv preprint arXiv …, 2024 - arxiv.org
Multimodal large language models (MLLMs), initiated with a trained LLM, first align images
with text and then fine-tune on multimodal mixed inputs. However, the MLLM …

OmniEvalKit: A Modular, Lightweight Toolbox for Evaluating Large Language Model and its Omni-Extensions

YK Zhang, XX Zhong, S Lu, QG Chen, DC Zhan… - arXiv preprint arXiv …, 2024 - arxiv.org
The rapid advancements in Large Language Models (LLMs) have significantly expanded
their applications, ranging from multilingual support to domain-specific tasks and multimodal …

Improving LLMs for Recommendation with Out-Of-Vocabulary Tokens

TJ Huang, JQ Yang, C Shen, KQ Liu, DC Zhan… - arXiv preprint arXiv …, 2024 - arxiv.org
Characterizing users and items through vector representations is crucial for various tasks in
recommender systems. Recent approaches attempt to apply Large Language Models …