Data and its (dis) contents: A survey of dataset development and use in machine learning research

A Paullada, ID Raji, EM Bender, E Denton, A Hanna - Patterns, 2021 - cell.com
In this work, we survey a breadth of literature that has revealed the limitations of
predominant practices for dataset collection and use in the field of machine learning. We …

[HTML][HTML] Human evaluation of automatically generated text: Current trends and best practice guidelines

C van der Lee, A Gatt, E van Miltenburg… - Computer Speech & …, 2021 - Elsevier
Currently, there is little agreement as to how Natural Language Generation (NLG) systems
should be evaluated, with a particularly high degree of variation in the way that human …

[引用][C] The Atlas of AI: Power, Politics, and the Planetary Costs of Artificial Intelligence

K Crawford - 2021 - books.google.com
The hidden costs of artificial intelligence, from natural resources and labor to privacy and
freedom What happens when artificial intelligence saturates political life and depletes the …

Toward verifiable and reproducible human evaluation for text-to-image generation

M Otani, R Togashi, Y Sawai… - Proceedings of the …, 2023 - openaccess.thecvf.com
Human evaluation is critical for validating the performance of text-to-image generative
models, as this highly cognitive process requires deep comprehension of text and images …

The ethics of AI ethics: An evaluation of guidelines

T Hagendorff - Minds and machines, 2020 - Springer
Current advances in research, development and application of artificial intelligence (AI)
systems have yielded a far-reaching discourse on AI ethics. In consequence, a number of …

[图书][B] AI now report 2018

M Whittaker, K Crawford, R Dobbe, G Fried, E Kaziunas… - 2018 - stc.org
The AI Now Institute at New York University is an interdisciplinary research institute
dedicated to understanding the social implications of AI technologies. It is the first university …

Vistext: A benchmark for semantically rich chart captioning

BJ Tang, A Boggust, A Satyanarayan - arXiv preprint arXiv:2307.05356, 2023 - arxiv.org
Captions that describe or explain charts help improve recall and comprehension of the
depicted data and provide a more accessible medium for people with visual disabilities …

Quantifying the invisible labor in crowd work

C Toxtli, S Suri, S Savage - Proceedings of the ACM on human-computer …, 2021 - dl.acm.org
Crowdsourcing markets provide workers with a centralized place to find paid work. What
may not be obvious at first glance is that, in addition to the work they do for pay, crowd …

Understanding machine learning practitioners' data documentation perceptions, needs, challenges, and desiderata

AK Heger, LB Marquis, M Vorvoreanu… - Proceedings of the …, 2022 - dl.acm.org
Data is central to the development and evaluation of machine learning (ML) models.
However, the use of problematic or inappropriate datasets can result in harms when the …

Evaluation gaps in machine learning practice

B Hutchinson, N Rostamzadeh, C Greer… - Proceedings of the …, 2022 - dl.acm.org
Forming a reliable judgement of a machine learning (ML) model's appropriateness for an
application ecosystem is critical for its responsible use, and requires considering a broad …