ELII: A novel inverted index for fast temporal query, with application to a large Covid-19 EHR dataset

Y Huang, X Li, GQ Zhang - Journal of Biomedical Informatics, 2021 - Elsevier
Fast temporal query on large EHR-derived data sources presents an emerging big data
challenge, as this query modality is intractable using conventional strategies that have not …

Addressing the need for interactive, efficient, and reproducible data processing in ecology with the datacleanr R package

AG Hurley, RL Peters, C Pappas, DN Steger, I Heinrich - PloS one, 2022 - journals.plos.org
Ecological research, just as all Earth System Sciences, is becoming increasingly data-rich.
Tools for processing of “big data” are continuously developed to meet corresponding …

Should Drag-and-Drop Analytics Become Part of the Data Scientist Toolkit?

BJ Buratti, P Eichmann, Z Shang, E Zgraggen, J Blanc… - 2023 - researchgate.net
Computational Notebooks revolutionized how scientists, practitioners, and corporations
explore and communicate through data. Notebooks fostered a culture of transparency and …

Ordalia: Deep learning hyperparameter search via generalization error bounds extrapolation

BJ Buratti, E Upfal - 2019 IEEE International Conference on Big …, 2019 - ieeexplore.ieee.org
We introduce Ordalia, a novel approach for speeding up deep learning hyperparameter
optimization search through early-pruning of less promising configurations. Our method …

Novel concentration of measure bounds with applications to fairness in machine learning

C Cousins - Brown University, 2020 - cs.brown.edu
I introduce novel concentration-of-measure bounds for the supremum deviation, several
variance concepts, and a family of game-theoretic welfare functions. In particular, I introduce …

Rigorous Statistical Methods for Trustworthy Guarantees in Fair Machine Learning and Beyond

C Cousins - 2021 - cs.brown.edu
Introduction: My research has always been broadly interdisciplinary, with a focus on
introducing rigorous modern statistical techniques and finite-sample concentration-of …