Large language models as annotators: Enhancing generalization of nlp models at minimal cost
State-of-the-art supervised NLP models achieve high accuracy but are also susceptible to
failures on inputs from low-data regimes, such as domains that are not represented in …
failures on inputs from low-data regimes, such as domains that are not represented in …
Deep encoders with auxiliary parameters for extreme classification
The task of annotating a data point with labels most relevant to it from a large universe of
labels is referred to as Extreme Classification (XC). State-of-the-art XC methods have …
labels is referred to as Extreme Classification (XC). State-of-the-art XC methods have …
Multi-modal extreme classification
This paper develops the MUFIN technique for extreme classification (XC) tasks with millions
of labels where datapoints and labels are endowed with visual and textual descriptors …
of labels where datapoints and labels are endowed with visual and textual descriptors …
ICXML: An in-context learning framework for zero-shot extreme multi-label classification
This paper focuses on the task of Extreme Multi-Label Classification (XMC) whose goal is to
predict multiple labels for each instance from an extremely large label space. While existing …
predict multiple labels for each instance from an extremely large label space. While existing …
Renee: End-to-end training of extreme classification models
Abstract The goal of Extreme Multi-label Classification (XC) is to learn representations that
enable mapping input texts to the most relevant subset of labels selected from an extremely …
enable mapping input texts to the most relevant subset of labels selected from an extremely …
Meta-classifier free negative sampling for extreme multilabel classification
Negative sampling is a common approach for making the training of deep models in
classification problems with very large output spaces, known as extreme multilabel …
classification problems with very large output spaces, known as extreme multilabel …
UniDEC: Unified Dual Encoder and Classifier Training for Extreme Multi-Label Classification
Extreme Multi-label Classification (XMC) involves predicting a subset of relevant labels from
an extremely large label space, given an input query and labels with textual features …
an extremely large label space, given an input query and labels with textual features …
Personalized Retrieval over Millions of Items
Personalized retrieval seeks to retrieve items relevant to a user event (eg a page visit or a
query) that are adapted to the user's personal preferences. For example, two users who …
query) that are adapted to the user's personal preferences. For example, two users who …
Extreme Meta-Classification for Large-Scale Zero-Shot Retrieval
We develop accurate and efficient solutions for large-scale retrieval tasks where novel (zero-
shot) items can arrive continuously at a rapid pace. Conventional Siamese-style approaches …
shot) items can arrive continuously at a rapid pace. Conventional Siamese-style approaches …
Zero-Shot Learning Over Large Output Spaces: Utilizing Indirect Knowledge Extraction from Large Language Models
Extreme Multi-label Learning (XMC) is a task that allocates the most relevant labels for an
instance from a predefined label set. Extreme Zero-shot XMC (EZ-XMC) is a special setting …
instance from a predefined label set. Extreme Zero-shot XMC (EZ-XMC) is a special setting …