CLAIR: Evaluating image captions with large language models
The evaluation of machine-generated image captions poses an interesting yet persistent
challenge. Effective evaluation measures must consider numerous dimensions of similarity …
challenge. Effective evaluation measures must consider numerous dimensions of similarity …
Mauve scores for generative models: Theory and practice
Generative artificial intelligence has made significant strides, producing text
indistinguishable from human prose and remarkably photorealistic images. Automatically …
indistinguishable from human prose and remarkably photorealistic images. Automatically …
Ic3: Image captioning by committee consensus
If you ask a human to describe an image, they might do so in a thousand different ways.
Traditionally, image captioning models are trained to generate a single" best"(most like a …
Traditionally, image captioning models are trained to generate a single" best"(most like a …
Scalable and accurate self-supervised multimodal representation learning without aligned video and text data
Scaling up weakly-supervised datasets has shown to be highly effective in the image-text
domain and has contributed to most of the recent state-of-the-art computer vision and …
domain and has contributed to most of the recent state-of-the-art computer vision and …
Mallows-DPO: Fine-Tune Your LLM with Preference Dispersions
Direct Preference Optimization (DPO) has recently emerged as a popular approach to
improve reinforcement learning with human feedback (RLHF), leading to better techniques …
improve reinforcement learning with human feedback (RLHF), leading to better techniques …
Talking Machines: Philosophical Essays on Language Models
N Osman Attah - 2024 - d-scholarship.pitt.edu
This dissertation is a collection of three independent but thematically connected papers on
language models. The first assesses a currently popular account of the nature of the …
language models. The first assesses a currently popular account of the nature of the …
Understanding, Building, and Evaluating Models for Context Aware Conditional Natural Language Generation
DM Chan - 2024 - search.proquest.com
If you ask a human to describe an image, they might do so in a thousand different ways.
Each of these descriptions depends not only on the image but also on a rich tapestry of …
Each of these descriptions depends not only on the image but also on a rich tapestry of …