Automating exploratory data analysis via machine learning: An overview
Exploratory Data Analysis (EDA) is an important initial step for any knowledge discovery
process, in which data scientists interactively explore unfamiliar datasets by issuing a …
process, in which data scientists interactively explore unfamiliar datasets by issuing a …
Auto-suggest: Learning-to-recommend data preparation steps using data science notebooks
C Yan, Y He - Proceedings of the 2020 ACM SIGMOD International …, 2020 - dl.acm.org
Data preparation is widely recognized as the most time-consuming process in modern
business intelligence (BI) and machine learning (ML) projects. Automating complex data …
business intelligence (BI) and machine learning (ML) projects. Automating complex data …
Bridging the semantic gap with SQL query logs in natural language interfaces to databases
A critical challenge in constructing a natural language interface to database (NLIDB) is
bridging the semantic gap between a natural language query (NLQ) and the underlying …
bridging the semantic gap between a natural language query (NLQ) and the underlying …
Query from examples: An iterative, data-driven approach to query construction
In this paper, we propose a new approach, called Query from Examples (QFE), to help non-
expert database users construct SQL queries. Our approach, which is designed for users …
expert database users construct SQL queries. Our approach, which is designed for users …
Soda: Generating sql for business users
L Blunschi, C Jossen, D Kossman, M Mori… - arXiv preprint arXiv …, 2012 - arxiv.org
The purpose of data warehouses is to enable business analysts to make better decisions.
Over the years the technology has matured and data warehouses have become extremely …
Over the years the technology has matured and data warehouses have become extremely …
Next-step suggestions for modern interactive data analysis platforms
Modern Interactive Data Analysis (IDA) platforms, such as Kibana, Splunk, and Tableau, are
gradually replacing traditional OLAP/SQL tools, as they allow for easy-to-use data …
gradually replacing traditional OLAP/SQL tools, as they allow for easy-to-use data …
Answering why-not questions on top-k queries
Z He, E Lo - IEEE Transactions on Knowledge and Data …, 2012 - ieeexplore.ieee.org
After decades of effort working on database performance, the quality and the usability of
database systems have received more attention in recent years. In particular, the feature of …
database systems have received more attention in recent years. In particular, the feature of …
AutoMeTS: the autocomplete for medical text simplification
The goal of text simplification (TS) is to transform difficult text into a version that is easier to
understand and more broadly accessible to a wide variety of readers. In some domains …
understand and more broadly accessible to a wide variety of readers. In some domains …
Sqlshare: Results from a multi-year sql-as-a-service experiment
We analyze the workload from a multi-year deployment of a database-as-a-service platform
targeting scientists and data scientists with minimal database experience. Our hypothesis …
targeting scientists and data scientists with minimal database experience. Our hypothesis …
Automatically synthesizing sql queries from input-output examples
Many computer end-users, such as research scientists and business analysts, need to
frequently query a database, yet lack enough programming knowledge to write a correct …
frequently query a database, yet lack enough programming knowledge to write a correct …