Machine learning for synthetic data generation: a review

Y Lu, M Shen, H Wang, X Wang, C van Rechem… - arXiv preprint arXiv …, 2023 - arxiv.org
Machine learning heavily relies on data, but real-world applications often encounter various
data-related issues. These include data of poor quality, insufficient data points leading to …

Survey on synthetic data generation, evaluation methods and GANs

A Figueira, B Vaz - Mathematics, 2022 - mdpi.com
Synthetic data consists of artificially generated data. When data are scarce, or of poor
quality, synthetic data can be used, for example, to improve the performance of machine …

Synthetic Data--what, why and how?

J Jordon, L Szpruch, F Houssiau, M Bottarelli… - arXiv preprint arXiv …, 2022 - arxiv.org
This explainer document aims to provide an overview of the current state of the rapidly
expanding work on synthetic data technologies, with a particular focus on privacy. The …

[HTML][HTML] Generative adversarial networks and synthetic patient data: current challenges and future perspectives

A Arora, A Arora - Future Healthcare Journal, 2022 - Elsevier
Artificial intelligence (AI) has been heralded as one of the key technological innovations of
the 21st century. Within healthcare, much attention has been placed upon the ability of …

Conditional sig-wasserstein gans for time series generation

S Liao, H Ni, L Szpruch, M Wiese… - arXiv preprint arXiv …, 2020 - arxiv.org
Generative adversarial networks (GANs) have been extremely successful in generating
samples, from seemingly high dimensional probability measures. However, these methods …

Can you rely on your model evaluation? improving model evaluation with synthetic test data

B van Breugel, N Seedat, F Imrie… - Advances in Neural …, 2024 - proceedings.neurips.cc
Evaluating the performance of machine learning models on diverse and underrepresented
subgroups is essential for ensuring fairness and reliability in real-world applications …

A survey on gan techniques for data augmentation to address the imbalanced data issues in credit card fraud detection

E Strelcenia, S Prakoonwit - Machine Learning and Knowledge Extraction, 2023 - mdpi.com
Data augmentation is an important procedure in deep learning. GAN-based data
augmentation can be utilized in many domains. For instance, in the credit card fraud domain …

Tabular transformers for modeling multivariate time series

I Padhi, Y Schiff, I Melnyk, M Rigotti… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
Tabular datasets are ubiquitous in data science applications. Given their importance, it
seems natural to apply state-of-the-art deep learning algorithms in order to fully unlock their …

Generative adversarial networks in finance: an overview

F Eckerli, J Osterrieder - arXiv preprint arXiv:2106.06364, 2021 - arxiv.org
Modelling in finance is a challenging task: the data often has complex statistical properties
and its inner workings are largely unknown. Deep learning algorithms are making progress …

Fin-gan: Forecasting and classifying financial time series via generative adversarial networks

M Vuletić, F Prenzel, M Cucuringu - Quantitative Finance, 2024 - Taylor & Francis
We investigate the use of Generative Adversarial Networks (GANs) for probabilistic
forecasting of financial time series. To this end, we introduce a novel economics-driven loss …