Ties-merging: Resolving interference when merging models

P Yadav, D Tam, L Choshen… - Advances in Neural …, 2024 - proceedings.neurips.cc
Transfer learning–ie, further fine-tuning a pre-trained model on a downstream task–can
confer significant advantages, including improved downstream performance, faster …

[PDF][PDF] Resolving interference when merging models

P Yadav, D Tam, L Choshen, C Raffel… - arXiv preprint arXiv …, 2023 - researchgate.net
Transfer learning–ie, further fine-tuning a pre-trained model on a downstream task–can
confer significant advantages, including improved downstream performance, faster …

Exploring the use of personalized AI for identifying misinformation on social media

F Jahanbakhsh, Y Katsis, D Wang, L Popa… - Proceedings of the 2023 …, 2023 - dl.acm.org
This work aims to explore how human assessments and AI predictions can be combined to
identify misinformation on social media. To do so, we design a personalized AI which …

Detectors for safe and reliable llms: Implementations, uses, and limitations

S Achintalwar, AA Garcia, A Anaby-Tavor… - arXiv preprint arXiv …, 2024 - arxiv.org
Large language models (LLMs) are susceptible to a variety of risks, from non-faithful output
to biased and toxic generations. Due to several limiting factors surrounding LLMs (training …

PEANUT: A Human-AI Collaborative Tool for Annotating Audio-Visual Data

Z Zhang, Z Ning, C Xu, Y Tian, TJJ Li - Proceedings of the 36th Annual …, 2023 - dl.acm.org
Audio-visual learning seeks to enhance the computer's multi-modal perception leveraging
the correlation between the auditory and visual modalities. Despite their many useful …

Parameter competition balancing for model merging

G Du, J Lee, J Li, R Jiang, Y Guo, S Yu, H Liu… - arXiv preprint arXiv …, 2024 - arxiv.org
While fine-tuning pretrained models has become common practice, these models often
underperform outside their specific domains. Recently developed model merging …

Text augmentation using dataset reconstruction for low-resource classification

A Rahamim, G Uziel, E Goldbraich… - Findings of the …, 2023 - aclanthology.org
In the deployment of real-world text classification models, label scarcity is a common
problem and as the number of classes increases, this problem becomes even more …

A study of deep active learning methods to reduce labelling efforts in biomedical relation extraction

C Nachtegael, J De Stefani, T Lenaerts - PloS one, 2023 - journals.plos.org
Automatic biomedical relation extraction (bioRE) is an essential task in biomedical research
in order to generate high-quality labelled data that can be used for the development of …

DUVEL: an active-learning annotated biomedical corpus for the recognition of oligogenic combinations

C Nachtegael, J De Stefani, A Cnudde, T Lenaerts - Database, 2024 - academic.oup.com
While biomedical relation extraction (bioRE) datasets have been instrumental in the
development of methods to support biocuration of single variants from texts, no datasets are …

Zero-shot Topical Text Classification with LLMs-an Experimental Study

S Gretz, A Halfon, I Shnayderman… - Findings of the …, 2023 - aclanthology.org
Abstract Topical Text Classification (TTC) is an ancient, yet timely research area in natural
language processing, with many practical applications. The recent dramatic advancements …