[HTML][HTML] A survey on dataset quality in machine learning

Y Gong, G Liu, Y Xue, R Li, L Meng - Information and Software Technology, 2023 - Elsevier
With the rise of big data, the quality of datasets has become a crucial factor affecting the
performance of machine learning models. High-quality datasets are essential for the …

Representation bias in data: A survey on identification and resolution techniques

N Shahbazi, Y Lin, A Asudeh, HV Jagadish - ACM Computing Surveys, 2023 - dl.acm.org
Data-driven algorithms are only as good as the data they work with, while datasets,
especially social data, often fail to represent minorities adequately. Representation Bias in …

Towards unbounded machine unlearning

M Kurmanji, P Triantafillou, J Hayes… - Advances in neural …, 2024 - proceedings.neurips.cc
Deep machine unlearning is the problem of'removing'from a trained neural network a subset
of its training set. This problem is very timely and has many applications, including the key …

REVISE: A tool for measuring and mitigating bias in visual datasets

A Wang, A Liu, R Zhang, A Kleiman, L Kim… - International Journal of …, 2022 - Springer
Abstract Machine learning models are known to perpetuate and even amplify the biases
present in the data. However, these data biases frequently do not become apparent until …

X-instructblip: A framework for aligning x-modal instruction-aware representations to llms and emergent cross-modal reasoning

A Panagopoulou, L Xue, N Yu, J Li, D Li, S Joty… - arXiv preprint arXiv …, 2023 - arxiv.org
Vision-language pre-training and instruction tuning have demonstrated general-purpose
capabilities in 2D visual reasoning tasks by aligning visual encoders with state-of-the-art …

Algorithmic fairness datasets: the story so far

A Fabris, S Messina, G Silvello, GA Susto - Data Mining and Knowledge …, 2022 - Springer
Data-driven algorithms are studied and deployed in diverse domains to support critical
decisions, directly impacting people's well-being. As a result, a growing community of …

Vision-language models performing zero-shot tasks exhibit disparities between gender groups

M Hall, L Gustafson, A Adcock… - Proceedings of the …, 2023 - openaccess.thecvf.com
We explore the extent to which zero-shot vision-language models exhibit gender bias for
different vision tasks. Vision models traditionally required task-specific labels for …

[HTML][HTML] Computational pathology: a survey review and the way forward

MS Hosseini, BE Bejnordi, VQH Trinh, L Chan… - Journal of Pathology …, 2024 - Elsevier
Abstract Computational Pathology (CPath) is an interdisciplinary science that augments
developments of computational approaches to analyze and model medical histopathology …

Vision-language models performing zero-shot tasks exhibit gender-based disparities

M Hall, L Gustafson, A Adcock, I Misra… - arXiv preprint arXiv …, 2023 - arxiv.org
We explore the extent to which zero-shot vision-language models exhibit gender bias for
different vision tasks. Vision models traditionally required task-specific labels for …

Meerkat: Audio-visual large language model for grounding in space and time

S Chowdhury, S Nag, S Dasgupta, J Chen… - … on Computer Vision, 2025 - Springer
Abstract Leveraging Large Language Models' remarkable proficiency in text-based tasks,
recent works on Multi-modal LLMs (MLLMs) extend them to other modalities like vision and …