Automating exploratory data analysis via machine learning: An overview

T Milo, A Somech - Proceedings of the 2020 ACM SIGMOD international …, 2020 - dl.acm.org
Exploratory Data Analysis (EDA) is an important initial step for any knowledge discovery
process, in which data scientists interactively explore unfamiliar datasets by issuing a …

Auto-suggest: Learning-to-recommend data preparation steps using data science notebooks

C Yan, Y He - Proceedings of the 2020 ACM SIGMOD International …, 2020 - dl.acm.org
Data preparation is widely recognized as the most time-consuming process in modern
business intelligence (BI) and machine learning (ML) projects. Automating complex data …

Bridging the semantic gap with SQL query logs in natural language interfaces to databases

C Baik, HV Jagadish, Y Li - 2019 IEEE 35th International …, 2019 - ieeexplore.ieee.org
A critical challenge in constructing a natural language interface to database (NLIDB) is
bridging the semantic gap between a natural language query (NLQ) and the underlying …

Query from examples: An iterative, data-driven approach to query construction

H Li, CY Chan, D Maier - Proceedings of the VLDB Endowment, 2015 - dl.acm.org
In this paper, we propose a new approach, called Query from Examples (QFE), to help non-
expert database users construct SQL queries. Our approach, which is designed for users …

SODA: Generating SQL for business users

L Blunschi, C Jossen, D Kossmann, M Mori… - arXiv preprint arXiv …, 2012 - arxiv.org
The purpose of data warehouses is to enable business analysts to make better decisions.
Over the years the technology has matured and data warehouses have become extremely …

Next-step suggestions for modern interactive data analysis platforms

T Milo, A Somech - Proceedings of the 24th ACM SIGKDD International …, 2018 - dl.acm.org
Modern Interactive Data Analysis (IDA) platforms, such as Kibana, Splunk, and Tableau, are
gradually replacing traditional OLAP/SQL tools, as they allow for easy-to-use data …

Answering why-not questions on top-k queries

Z He, E Lo - IEEE Transactions on Knowledge and Data …, 2012 - ieeexplore.ieee.org
After decades of effort working on database performance, the quality and the usability of
database systems have received more attention in recent years. In particular, the feature of …

AutoMeTS: the autocomplete for medical text simplification

H Van, D Kauchak, G Leroy - arXiv preprint arXiv:2010.10573, 2020 - arxiv.org
The goal of text simplification (TS) is to transform difficult text into a version that is easier to
understand and more broadly accessible to a wide variety of readers. In some domains …

SQLShare: Results from a multi-year SQL-as-a-service experiment

S Jain, D Moritz, D Halperin, B Howe… - Proceedings of the 2016 …, 2016 - dl.acm.org
We analyze the workload from a multi-year deployment of a database-as-a-service platform
targeting scientists and data scientists with minimal database experience. Our hypothesis …

Automatically synthesizing SQL queries from input-output examples

S Zhang, Y Sun - … 28th IEEE/ACM International Conference on …, 2013 - ieeexplore.ieee.org
Many computer end-users, such as research scientists and business analysts, need to
frequently query a database, yet lack enough programming knowledge to write a correct …