Crowdspeech and voxdiy: Benchmark datasets for crowdsourced audio transcription
Domain-specific data is the crux of the successful transfer of machine learning systems from
benchmarks to real life. In simple problems such as image classification, crowdsourcing has …
benchmarks to real life. In simple problems such as image classification, crowdsourcing has …
Federated iot interaction vulnerability analysis
IoT devices provide users with great convenience in smart homes. However, the
interdependent behaviors across devices may yield unexpected interactions. To analyze the …
interdependent behaviors across devices may yield unexpected interactions. To analyze the …
Demystifying Artificial Intelligence for Data Preparation
Data preparation--the process of discovering, integrating, transforming, cleaning, and
annotating data--is one of the oldest, hardest, yet inevitable data management problems …
annotating data--is one of the oldest, hardest, yet inevitable data management problems …
Quality of sentiment analysis tools: The reasons of inconsistency
WM Kouadri, M Ouziri, S Benbernou… - Proceedings of the …, 2020 - dl.acm.org
In this paper, we present a comprehensive study that evaluates six state-of-the-art sentiment
analysis tools on five public datasets, based on the quality of predictive results in the …
analysis tools on five public datasets, based on the quality of predictive results in the …
Coca: Cost-effective collaborative annotation system by combining experts and amateurs
Data annotation has been a key boost for the artificial intelligence. However, difficult tasks
such as fine-grained classification need lots of labeled data to train a feasible model. On the …
such as fine-grained classification need lots of labeled data to train a feasible model. On the …
Type diversity maximization aware coursewares crowdcollection with limited budget in MOOCs
Massive open online courses (MOOCs) require coursewares with different types of course
resources recommended to learners based on their learning situations to meet personalized …
resources recommended to learners based on their learning situations to meet personalized …
REGROW: Reimagining global crowdsourcing for better human-AI collaboration
Crowdworkers silently enable much of today's AI-based products, with several online
platforms offering a myriad of data labelling and content moderation tasks through …
platforms offering a myriad of data labelling and content moderation tasks through …
Lessons Learned from a Citizen Science Project for Natural Language Processing
Many Natural Language Processing (NLP) systems use annotated corpora for training and
evaluation. However, labeled data is often costly to obtain and scaling annotation projects is …
evaluation. However, labeled data is often costly to obtain and scaling annotation projects is …
Efficient Online Crowdsourcing with Complex Annotations
Crowdsourcing platforms use various truth discovery algorithms to aggregate annotations
from multiple labelers. In an online setting, however, the main challenge is to decide …
from multiple labelers. In an online setting, however, the main challenge is to decide …
[PDF][PDF] MACRO: Incentivizing Multi-leader Game-based Pareto-efficient Crowdsourcing for Video Analytics
In recent years, many crowdsourcing platforms have emerged, using the resources of
recruited workers to perform diverse outsourcing tasks, where the video analytics attracts …
recruited workers to perform diverse outsourcing tasks, where the video analytics attracts …