Synthetic data generation for tabular health records: A systematic review

M Hernandez, G Epelde, A Alberdi, R Cilla, D Rankin - Neurocomputing, 2022 - Elsevier
Synthetic data generation (SDG) research has been ongoing for some time with promising
results in different application domains, including healthcare, biometrics and energy …

Synthetic data in health care: A narrative review

A Gonzales, G Guruswamy, SR Smith - PLOS Digital Health, 2023 - journals.plos.org
Data are central to research, public health, and in developing health information technology
(IT) systems. Nevertheless, access to most data in health care is tightly controlled, which …

Advances and open problems in federated learning

P Kairouz, HB McMahan, B Avent… - … and trends® in …, 2021 - nowpublishers.com
Federated learning (FL) is a machine learning setting where many clients (eg, mobile
devices or whole organizations) collaboratively train a model under the orchestration of a …

Synthetic Data--what, why and how?

J Jordon, L Szpruch, F Houssiau, M Bottarelli… - arXiv preprint arXiv …, 2022 - arxiv.org
This explainer document aims to provide an overview of the current state of the rapidly
expanding work on synthetic data technologies, with a particular focus on privacy. The …

Machine learning for synthetic data generation: a review

Y Lu, M Shen, H Wang, X Wang, C van Rechem… - arXiv preprint arXiv …, 2023 - arxiv.org
Machine learning heavily relies on data, but real-world applications often encounter various
data-related issues. These include data of poor quality, insufficient data points leading to …

Synthetic data generation: State of the art in health care domain

H Murtaza, M Ahmed, NF Khan, G Murtaza… - Computer Science …, 2023 - Elsevier
Recent progress in artificial intelligence and machine learning has led to the growth of
research in every aspect of life including the health care domain. However, privacy risks and …

[HTML][HTML] Generating high-fidelity synthetic patient data for assessing machine learning healthcare software

A Tucker, Z Wang, Y Rotalinti, P Myles - NPJ digital medicine, 2020 - nature.com
There is a growing demand for the uptake of modern artificial intelligence technologies
within healthcare systems. Many of these technologies exploit historical patient health data …

Simulation intelligence: Towards a new generation of scientific methods

A Lavin, D Krakauer, H Zenil, J Gottschlich… - arXiv preprint arXiv …, 2021 - arxiv.org
The original" Seven Motifs" set forth a roadmap of essential methods for the field of scientific
computing, where a motif is an algorithmic method that captures a pattern of computation …

Generating synthetic data in finance: opportunities, challenges and pitfalls

SA Assefa, D Dervovic, M Mahfouz, RE Tillman… - Proceedings of the First …, 2020 - dl.acm.org
Financial services generate a huge volume of data that is extremely complex and varied.
These datasets are often stored in silos within organisations for various reasons, including …

Winning the NIST Contest: A scalable and general approach to differentially private synthetic data

R McKenna, G Miklau, D Sheldon - arXiv preprint arXiv:2108.04978, 2021 - arxiv.org
We propose a general approach for differentially private synthetic data generation, that
consists of three steps:(1) select a collection of low-dimensional marginals,(2) measure …