Survey on synthetic data generation, evaluation methods and GANs

A Figueira, B Vaz - Mathematics, 2022 - mdpi.com
Synthetic data consists of artificially generated data. When data are scarce, or of poor
quality, synthetic data can be used, for example, to improve the performance of machine …

Deep learning with small datasets: using autoencoders to address limited datasets in construction management

JMD Delgado, L Oyedele - Applied Soft Computing, 2021 - Elsevier
Large datasets are necessary for deep learning as the performance of the algorithms used
increases as the size of the dataset increases. Poor data management practices and the low …

Unraveling the Potential of Immersive Virtual Environments for Behavior Mapping in the Built Environment: A Mapping Review

R Kumar, D Dhar - Human Behavior and Emerging …, 2023 - Wiley Online Library
Introduction/Purpose. Behavior mapping is a crucial practice to capture precise data on
human activities. Over the years, technological advancements have improved reliable data …

Augmenting building performance predictions during design using generative adversarial networks and immersive virtual environments

C Chokwitthaya, Y Zhu, S Mukhopadhyay… - Automation in …, 2020 - Elsevier
Existing building performance models (existing BPMs) often lack the capability for
addressing human-building interactions in future buildings or buildings under design …

An empirical analysis of KDE-based generative models on small datasets

E Plesovskaya, S Ivanov - Procedia Computer Science, 2021 - Elsevier
One of the approaches to deal with the small dataset problem is synthetic data generation.
Kernel density estimation is a common method to approximate the underlying probability …

FLIGAN: Enhancing Federated Learning with Incomplete Data using GAN

PJ Maliakel, S Ilager, I Brandic - … of the 7th International Workshop on …, 2024 - dl.acm.org
Federated Learning (FL) provides a privacy-preserving mechanism for distributed training of
machine learning models on networked devices (eg, mobile devices, IoT edge nodes). It …

Improved learning performance for small datasets in high dimensions by new dual-net model for non-linear interpolation virtual sample generation

LS Lin, YS Lin, DC Li, YH Liu - Decision Support Systems, 2023 - Elsevier
The number of reliable samples obtained in early decision-making activity is usually
relatively small. Due to variable distribution and incomplete structure of tiny datasets, it is …

Controlling Bias Between Categorical Attributes in Datasets: A Two-Step Optimization Algorithm Leveraging Structural Equation Modeling

E Barbierato, A Pozzi, D Tessera - IEEE Access, 2023 - ieeexplore.ieee.org
In the realm of data-driven systems, understanding and controlling biases in datasets
emerges as a critical challenge. These biases, defined in this study as systematic …

QACDes: QoS-aware context-sensitive design of cyber-physical systems

S Sidhanta, C Chokwitthaya, Y Zhu… - Scientific Reports, 2024 - nature.com
There is a lot of confusion and ambiguity regarding the quantification of the Quality of
Service (QoS) of a system, especially for cyber-physical systems (CPS) involved in …

MOOP: An efficient utility-rich route planning framework over two-fold time-dependent road networks

L Gao, C Chen, F Chu, C Liao… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Utility-rich (eg, more attractive or safer) route planning on city-scale road networks is a
common yet time-consuming task. Although both travel time and utility on edges are time …