Understanding Data Understanding: A Framework to Navigate the Intricacies of Data Analytics

J Holstein, P Spitzer, M Hoell, M Vössing… - arXiv preprint arXiv …, 2024 - arxiv.org
As organizations face the challenges of processing exponentially growing data volumes,
their reliance on analytics to unlock value from this data has intensified. However, the …

Mining Data Wrangling Workflows for Design Patterns Discovery and Specification

A AlMasaud, S Sampaio, P Sampaio - Information Systems Frontiers, 2024 - Springer
In this paper, we investigate Data Wrangling (DW) pipelines in the form of workflows devised
by data analysts with varying levels of experience to find commonalities or patterns. We …

Whither Problem-Solving Environments?

M Dinmore - Proceedings of the 2023 ACM SIGPLAN International …, 2023 - dl.acm.org
During the 1990s and first decade of the 2000s, problem-solving environments (PSEs) were
a topic of research among a community with the vision to create software systems “with all of …

Improving Efficiency in Data Wrangling With Semantic Type Detection

A Yu - 2023 - scholarspace.manoa.hawaii.edu
This thesis presents SLED (Semantic LLM Enrichment of Data), a Python library that
leverages Large Language Models (LLMs) to automate essential tasks in data wrangling …

Detecting CSV File Dialects by Table Uniformity Measurement and Data Type Inference

W García - 2024 - preprints.org
The human-readable simplicity with which the CSV format was devised, together with the
absence of a standard that strictly defines this format, has allowed the proliferation of several …

[CITATION][C] Towards Effective and Effortless Data Cleaning: From Automatic Approaches to User Involvement

JLM Pereira - 2023 - Ph.D. Dissertation. Instituto Superior …