Data lake management: challenges and opportunities
The ubiquity of data lakes has created fascinating new challenges for data management
research. In this tutorial, we review the state-of-the-art in data management for data lakes …
research. In this tutorial, we review the state-of-the-art in data management for data lakes …
Overview of data exploration techniques
Data exploration is about efficiently extracting knowledge from data even if we do not know
exactly what we are looking for. In this tutorial, we survey recent developments in the …
exactly what we are looking for. In this tutorial, we survey recent developments in the …
Reverse engineering database queries from examples: State-of-the-art, challenges, and research opportunities
DML Martins - Information Systems, 2019 - Elsevier
With the popularization of data access and usage, an increasing number of users without
expert knowledge of databases is required to perform data interactions. Often, these users …
expert knowledge of databases is required to perform data interactions. Often, these users …
Tailoring data source distributions for fairness-aware data integration
Data scientists often develop data sets for analysis by drawing upon sources of data
available to them. A major challenge is to ensure that the data set used for analysis has an …
available to them. A major challenge is to ensure that the data set used for analysis has an …
Programming by examples-and its applications in data wrangling
S Gulwani - Dependable Software Systems Engineering, 2016 - ebooks.iospress.nl
Programming by Examples (PBE) has the potential to revolutionize end-user programming
by enabling end users, most of whom are non-programmers, to create scripts for automating …
by enabling end users, most of whom are non-programmers, to create scripts for automating …
User interaction models for disambiguation in programming by example
Programming by Examples (PBE) has the potential to revolutionize end-user programming
by enabling end users, most of whom are non-programmers, to create small scripts for …
by enabling end users, most of whom are non-programmers, to create small scripts for …
AIDE: an active learning-based approach for interactive data exploration
K Dimitriadou, O Papaemmanouil… - IEEE Transactions on …, 2016 - ieeexplore.ieee.org
In this paper, we argue that database systems be augmented with an automated data
exploration service that methodically steers users through the data in a meaningful way …
exploration service that methodically steers users through the data in a meaningful way …
Interactive and deterministic data cleaning
We present Falcon, an interactive, deterministic, and declarative data cleaning system,
which uses SQL update queries as the language to repair data. Falcon does not rely on the …
which uses SQL update queries as the language to repair data. Falcon does not rely on the …
Dataxformer: A robust transformation discovery system
In data integration, data curation, and other data analysis tasks, users spend a considerable
amount of time converting data from one representation to another. For example US dates to …
amount of time converting data from one representation to another. For example US dates to …
Programming by examples: Applications, algorithms, and ambiguity resolution
S Gulwani - … Reasoning: 8th International Joint Conference, IJCAR …, 2016 - Springer
Abstract 99% of computer end users do not know programming, and struggle with repetitive
tasks. Programming by Examples (PBE) can revolutionize this landscape by enabling users …
tasks. Programming by Examples (PBE) can revolutionize this landscape by enabling users …