Data and its (dis)contents: A survey of dataset development and use in machine learning research

A Paullada, ID Raji, EM Bender, E Denton, A Hanna - Patterns, 2021 - cell.com
In this work, we survey a breadth of literature that has revealed the limitations of
predominant practices for dataset collection and use in the field of machine learning. We …

Problematic machine behavior: A systematic literature review of algorithm audits

J Bandy - Proceedings of the ACM on Human-Computer …, 2021 - dl.acm.org
While algorithm audits are growing rapidly in commonality and public importance, relatively
little scholarly work has gone toward synthesizing prior work and strategizing future research …

The dimensions of data labor: A road map for researchers, activists, and policymakers to empower data producers

H Li, N Vincent, S Chancellor, B Hecht - … of the 2023 ACM conference on …, 2023 - dl.acm.org
Many recent technological advances (e.g., ChatGPT and search engines) are possible only
because of massive amounts of user-generated data produced through user interactions …

Algorithmic collective action in machine learning

M Hardt, E Mazumdar… - International …, 2023 - proceedings.mlr.press
We initiate a principled study of algorithmic collective action on digital platforms that deploy
machine learning algorithms. We propose a simple theoretical model of a collective …

Data leverage: A framework for empowering the public in its relationship with technology companies

N Vincent, H Li, N Tilly, S Chancellor… - Proceedings of the 2021 …, 2021 - dl.acm.org
Many powerful computing technologies rely on implicit and explicit data contributions from
the public. This dependency suggests a potential source of leverage for the public in its …

Bargaining with the black-box: Designing and deploying worker-centric tools to audit algorithmic management

D Calacci, A Pentland - Proceedings of the ACM on Human-Computer …, 2022 - dl.acm.org
The increasing prevalence of large-scale labor aggregation platforms, worker analytics, and
algorithmic decision-making by management raises the question of whether workers can …

Human and technological infrastructures of fact-checking

P Juneja, T Mitra - Proceedings of the ACM on Human-Computer …, 2022 - dl.acm.org
Increasing demands for fact-checking have led to a growing interest in developing systems
and tools to automate the fact-checking process. However, such systems are limited in …

Operationalizing the legal principle of data minimization for personalization

AJ Biega, P Potash, H Daumé, F Diaz… - Proceedings of the 43rd …, 2020 - dl.acm.org
Article 5(1)(c) of the European Union's General Data Protection Regulation (GDPR)
requires that "personal data shall be [...] adequate, relevant, and limited to what is necessary …

A deeper investigation of the importance of Wikipedia links to search engine results

N Vincent, B Hecht - Proceedings of the ACM on Human-Computer …, 2021 - dl.acm.org
A growing body of work has highlighted the important role that Wikipedia's volunteer-created
content plays in helping search engines achieve their core goal of addressing the …

Building, Shifting, & Employing Power: A Taxonomy of Responses From Below to Algorithmic Harm

A DeVrio, M Eslami, K Holstein - The 2024 ACM Conference on …, 2024 - dl.acm.org
A large body of research has attempted to ensure that algorithmic systems adhere to notions
of fairness and transparency. Increasingly, researchers have highlighted that mitigating …