[HTML][HTML] A survey on dataset quality in machine learning
Y Gong, G Liu, Y Xue, R Li, L Meng - Information and Software Technology, 2023 - Elsevier
With the rise of big data, the quality of datasets has become a crucial factor affecting the
performance of machine learning models. High-quality datasets are essential for the …
performance of machine learning models. High-quality datasets are essential for the …
Representation bias in data: A survey on identification and resolution techniques
Data-driven algorithms are only as good as the data they work with, while datasets,
especially social data, often fail to represent minorities adequately. Representation Bias in …
especially social data, often fail to represent minorities adequately. Representation Bias in …
Towards unbounded machine unlearning
Deep machine unlearning is the problem of'removing'from a trained neural network a subset
of its training set. This problem is very timely and has many applications, including the key …
of its training set. This problem is very timely and has many applications, including the key …
REVISE: A tool for measuring and mitigating bias in visual datasets
Abstract Machine learning models are known to perpetuate and even amplify the biases
present in the data. However, these data biases frequently do not become apparent until …
present in the data. However, these data biases frequently do not become apparent until …
X-instructblip: A framework for aligning x-modal instruction-aware representations to llms and emergent cross-modal reasoning
Vision-language pre-training and instruction tuning have demonstrated general-purpose
capabilities in 2D visual reasoning tasks by aligning visual encoders with state-of-the-art …
capabilities in 2D visual reasoning tasks by aligning visual encoders with state-of-the-art …
Algorithmic fairness datasets: the story so far
Data-driven algorithms are studied and deployed in diverse domains to support critical
decisions, directly impacting people's well-being. As a result, a growing community of …
decisions, directly impacting people's well-being. As a result, a growing community of …
Vision-language models performing zero-shot tasks exhibit disparities between gender groups
We explore the extent to which zero-shot vision-language models exhibit gender bias for
different vision tasks. Vision models traditionally required task-specific labels for …
different vision tasks. Vision models traditionally required task-specific labels for …
[HTML][HTML] Computational pathology: a survey review and the way forward
Abstract Computational Pathology (CPath) is an interdisciplinary science that augments
developments of computational approaches to analyze and model medical histopathology …
developments of computational approaches to analyze and model medical histopathology …
Vision-language models performing zero-shot tasks exhibit gender-based disparities
We explore the extent to which zero-shot vision-language models exhibit gender bias for
different vision tasks. Vision models traditionally required task-specific labels for …
different vision tasks. Vision models traditionally required task-specific labels for …
Meerkat: Audio-visual large language model for grounding in space and time
Abstract Leveraging Large Language Models' remarkable proficiency in text-based tasks,
recent works on Multi-modal LLMs (MLLMs) extend them to other modalities like vision and …
recent works on Multi-modal LLMs (MLLMs) extend them to other modalities like vision and …