[HTML][HTML] ELII: A novel inverted index for fast temporal query, with application to a large Covid-19 EHR dataset
Fast temporal query on large EHR-derived data sources presents an emerging big data
challenge, as this query modality is intractable using conventional strategies that have not …
challenge, as this query modality is intractable using conventional strategies that have not …
Addressing the need for interactive, efficient, and reproducible data processing in ecology with the datacleanr R package
Ecological research, just as all Earth System Sciences, is becoming increasingly data-rich.
Tools for processing of “big data” are continuously developed to meet corresponding …
Tools for processing of “big data” are continuously developed to meet corresponding …
[PDF][PDF] Should Drag-and-Drop Analytics Become Part of the Data Scientist Toolkit?
Computational Notebooks revolutionized how scientists, practitioners, and corporations
explore and communicate through data. Notebooks fostered a culture of transparency and …
explore and communicate through data. Notebooks fostered a culture of transparency and …
Ordalia: Deep learning hyperparameter search via generalization error bounds extrapolation
BJ Buratti, E Upfal - 2019 IEEE International Conference on Big …, 2019 - ieeexplore.ieee.org
We introduce Ordalia, a novel approach for speeding up deep learning hyperparameter
optimization search through early-pruning of less promising configurations. Our method …
optimization search through early-pruning of less promising configurations. Our method …
[PDF][PDF] Novel concentration of measure bounds with applications to fairness in machine learning
C Cousins - Brown University, 2020 - cs.brown.edu
I introduce novel concentration-of-measure bounds for the supremum deviation, several
variance concepts, and a family of game-theoretic welfare functions. In particular, I introduce …
variance concepts, and a family of game-theoretic welfare functions. In particular, I introduce …
[PDF][PDF] Rigorous Statistical Methods for Trustworthy Guarantees in Fair Machine Learning and Beyond
C Cousins - 2021 - cs.brown.edu
Introduction My research has always been broadly interdisciplinary, with a focus on
introducing rigorous modern statistical techniques and finite-sample concentration-of …
introducing rigorous modern statistical techniques and finite-sample concentration-of …