Large language models as annotators: Enhancing generalization of nlp models at minimal cost

P Bansal, A Sharma - arXiv preprint arXiv:2306.15766, 2023 - arxiv.org
State-of-the-art supervised NLP models achieve high accuracy but are also susceptible to
failures on inputs from low-data regimes, such as domains that are not represented in …

Deep encoders with auxiliary parameters for extreme classification

K Dahiya, S Yadav, S Sondhi, D Saini… - Proceedings of the 29th …, 2023 - dl.acm.org
The task of annotating a data point with labels most relevant to it from a large universe of
labels is referred to as Extreme Classification (XC). State-of-the-art XC methods have …

Multi-modal extreme classification

A Mittal, K Dahiya, S Malani… - Proceedings of the …, 2022 - openaccess.thecvf.com
This paper develops the MUFIN technique for extreme classification (XC) tasks with millions
of labels where datapoints and labels are endowed with visual and textual descriptors …

ICXML: An in-context learning framework for zero-shot extreme multi-label classification

Y Zhu, H Zamani - arXiv preprint arXiv:2311.09649, 2023 - arxiv.org
This paper focuses on the task of Extreme Multi-Label Classification (XMC) whose goal is to
predict multiple labels for each instance from an extremely large label space. While existing …

Renee: End-to-end training of extreme classification models

V Jain, J Prakash, D Saini, J Jiao… - Proceedings of …, 2023 - proceedings.mlsys.org
Abstract The goal of Extreme Multi-label Classification (XC) is to learn representations that
enable mapping input texts to the most relevant subset of labels selected from an extremely …

Meta-classifier free negative sampling for extreme multilabel classification

M Qaraei, R Babbar - Machine Learning, 2024 - Springer
Negative sampling is a common approach for making the training of deep models in
classification problems with very large output spaces, known as extreme multilabel …

UniDEC: Unified Dual Encoder and Classifier Training for Extreme Multi-Label Classification

S Kharbanda, D Gupta, P Malhotra, CJ Hsieh… - arXiv preprint arXiv …, 2024 - arxiv.org
Extreme Multi-label Classification (XMC) involves predicting a subset of relevant labels from
an extremely large label space, given an input query and labels with textual features …

Personalized Retrieval over Millions of Items

H Vemuri, S Agrawal, S Mittal, D Saini, A Soni… - Proceedings of the 46th …, 2023 - dl.acm.org
Personalized retrieval seeks to retrieve items relevant to a user event (eg a page visit or a
query) that are adapted to the user's personal preferences. For example, two users who …

Extreme Meta-Classification for Large-Scale Zero-Shot Retrieval

S Yadav, D Saini, A Buvanesh, B Paliwal… - Proceedings of the 30th …, 2024 - dl.acm.org
We develop accurate and efficient solutions for large-scale retrieval tasks where novel (zero-
shot) items can arrive continuously at a rapid pace. Conventional Siamese-style approaches …

Zero-Shot Learning Over Large Output Spaces: Utilizing Indirect Knowledge Extraction from Large Language Models

J Zhang, N Ullah, R Babbar - arXiv preprint arXiv:2406.09288, 2024 - arxiv.org
Extreme Multi-label Learning (XMC) is a task that allocates the most relevant labels for an
instance from a predefined label set. Extreme Zero-shot XMC (EZ-XMC) is a special setting …