SNAPS: sensor analytics point solutions for detection and decision support systems

ES McLamore, S Palit Austin Datta, V Morgan… - Sensors, 2019 - mdpi.com
In this review, we discuss the role of sensor analytics point solutions (SNAPS), a reduced
complexity machine-assisted decision support tool. We summarize the approaches used for …

Democratizing data science through interactive curation of ml pipelines

Z Shang, E Zgraggen, B Buratti, F Kossmann… - Proceedings of the …, 2019 - dl.acm.org
Statistical knowledge and domain expertise are key to extract actionable insights out of data,
yet such skills rarely coexist together. In Machine Learning, high-quality results are only …

Northstar: An interactive data science system

T Kraska - 2021 - dspace.mit.edu
© 2018 VLDB Endowment. In order to democratize data science, we need to fundamentally
rethink the current analytics stack, from the user interface to the “guts.“Most importantly …

[PDF][PDF] A Cost-Effective Analysis of Machine Learning Workloads in Public Clouds: Is AutoML Always Worth Using

MMT Ayyalasomayajula, SK Chintala… - … Journal of Computer …, 2019 - researchgate.net
Machine learning (ML) has become integral to fields like healthcare, finance, and
autonomous systems, but developing robust models requires significant computational …

Haipipe: Combining human-generated and machine-generated pipelines for data preparation

S Chen, N Tang, J Fan, X Yan, C Chai, G Li… - Proceedings of the ACM …, 2023 - dl.acm.org
Data preparation is crucial in achieving optimized results for machine learning (ML).
However, having a good data preparation pipeline is highly non-trivial for ML practitioners …

Autood: Automatic outlier detection

L Cao, Y Yan, Y Wang, S Madden… - Proceedings of the ACM …, 2023 - dl.acm.org
Outlier detection is critical in real world. Due to the existence of many outlier detection
techniques which often return different results for the same data set, the users have to …

Unveiling practices of customer service content curators of conversational agents

H Candello, C Pinhanez, M Muller… - Proceedings of the ACM on …, 2022 - dl.acm.org
Conversational interfaces require two types of curation: data curation by data science
workers and content curation by domain experts. Recent years have seen the possibilities …

[PDF][PDF] Towards Quantifying Uncertainty in Data Analysis & Exploration.

Y Chung, S Servan-Schreiber… - IEEE Data Eng …, 2018 - sachaservanschreiber.com
In the age big data, uncertainty in data constantly grows with its volume, variety, and velocity.
Data is noisy, biased and error-prone; blindly applying even the most advanced data …

Democratisation of usable machine learning in computer vision

R Bond, A Koene, A Dix, J Boger, MD Mulvenna… - arXiv preprint arXiv …, 2019 - arxiv.org
Many industries are now investing heavily in data science and automation to replace
manual tasks and/or to help with decision making, especially in the realm of leveraging …

[图书][B] Evaluating performance variability of data pipelines for binary classification with applications to predictive learning analytics

R Bertolini - 2021 - search.proquest.com
Monte Carlo simulation studies are used to examine how eight factors impact predictions of
a binary target outcome in data science pipelines:(1) the choice of four DMMs [Logistic …