On the design of AI-powered code assistants for notebooks
AI-powered code assistants, such as Copilot, are quickly becoming a ubiquitous component
of contemporary coding contexts. Among these environments, computational notebooks …
Notable: On-the-fly assistant for data storytelling in computational notebooks
Computational notebooks are widely used for data analysis. Their interleaved displays of
code and execution results (e.g., visualizations) are welcomed since they enable iterative …
Causalvis: Visualizations for causal inference
Causal inference is a statistical paradigm for quantifying causal effects using observational
data. It is a complex process, requiring multiple steps, iterations, and collaborations with …
Dead or alive: Continuous data profiling for interactive data science
W Epperson, V Gorantla, D Moritz… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Profiling data by plotting distributions and analyzing summary statistics is a critical step
throughout data analysis. Currently, this process is manual and tedious since analysts must …
WaitGPT: Monitoring and steering conversational LLM agent in data analysis with on-the-fly code visualization
Large language models (LLMs) support data analysis through conversational user
interfaces, as exemplified in OpenAI's ChatGPT (formally known as Advanced Data Analysis …
How Do Analysts Understand and Verify AI-Assisted Data Analyses?
Data analysis is challenging as it requires synthesizing domain knowledge, statistical
expertise, and programming skills. Assistants powered by large language models (LLMs) …
ydata-profiling: Accelerating data-centric AI with high-quality data
F Clemente, GM Ribeiro, A Quemy, MS Santos… - Neurocomputing, 2023 - Elsevier
Abstract ydata-profiling is an open-source Python package for advanced exploratory data
analysis that enables users to generate data profiling reports in a simple, fast, and efficient …
Colaroid: A literate programming approach for authoring explorable multi-stage tutorials
Multi-stage programming tutorials are key learning resources for programmers, using
progressive incremental steps to teach them how to build larger software systems. A good …
Aspirations and practice of ML model documentation: Moving the needle with nudging and traceability
The documentation practice for machine-learned (ML) models often falls short of established
practices for traditional software, which impedes model accountability and inadvertently …
How data analysts use a visualization grammar in practice
Visualization grammars, often based on the Grammar of Graphics (GoG), have much
potential for augmenting data analysis in a programming environment. However, we do not …