Applying the Gaussian mixture model to generate large synthetic data from a small data set

A Figueira, B Vaz - Mathematics, 2022 - mdpi.com

Synthetic data consists of artificially generated data. When data are scarce, or of poor
quality, synthetic data can be used, for example, to improve the performance of machine …

被引用次数：220 相关文章所有 7 个版本

[PDF] google.com

Deep learning with small datasets: using autoencoders to address limited datasets in construction management

JMD Delgado, L Oyedele - Applied Soft Computing, 2021 - Elsevier

Large datasets are necessary for deep learning as the performance of the algorithms used
increases as the size of the dataset increases. Poor data management practices and the low …

被引用次数：83 相关文章所有 3 个版本

[PDF] wiley.com Full View

Unraveling the Potential of Immersive Virtual Environments for Behavior Mapping in the Built Environment: A Mapping Review

R Kumar, D Dhar - Human Behavior and Emerging …, 2023 - Wiley Online Library

Introduction/Purpose. Behavior mapping is a crucial practice to capture precise data on
human activities. Over the years, technological advancements have improved reliable data …

被引用次数：2 相关文章所有 4 个版本

[PDF] nsf.gov

Augmenting building performance predictions during design using generative adversarial networks and immersive virtual environments

C Chokwitthaya, Y Zhu, S Mukhopadhyay… - Automation in …, 2020 - Elsevier

Existing building performance models (existing BPMs) often lack the capability for
addressing human-building interactions in future buildings or buildings under design …

被引用次数：34 相关文章所有 2 个版本

[PDF] sciencedirect.com

An empirical analysis of KDE-based generative models on small datasets

E Plesovskaya, S Ivanov - Procedia Computer Science, 2021 - Elsevier

One of the approaches to deal with the small dataset problem is synthetic data generation.
Kernel density estimation is a common method to approximate the underlying probability …

被引用次数：30 相关文章所有 3 个版本

[PDF] acm.org

FLIGAN: Enhancing Federated Learning with Incomplete Data using GAN

PJ Maliakel, S Ilager, I Brandic - … of the 7th International Workshop on …, 2024 - dl.acm.org

Federated Learning (FL) provides a privacy-preserving mechanism for distributed training of
machine learning models on networked devices (eg, mobile devices, IoT edge nodes). It …

被引用次数：1 相关文章所有 6 个版本

Improved learning performance for small datasets in high dimensions by new dual-net model for non-linear interpolation virtual sample generation

LS Lin, YS Lin, DC Li, YH Liu - Decision Support Systems, 2023 - Elsevier

The number of reliable samples obtained in early decision-making activity is usually
relatively small. Due to variable distribution and incomplete structure of tiny datasets, it is …

被引用次数：6 相关文章所有 4 个版本

[PDF] ieee.org

Controlling Bias Between Categorical Attributes in Datasets: A Two-Step Optimization Algorithm Leveraging Structural Equation Modeling

E Barbierato, A Pozzi, D Tessera - IEEE Access, 2023 - ieeexplore.ieee.org

In the realm of data-driven systems, understanding and controlling biases in datasets
emerges as a critical challenge. These biases, defined in this study as systematic …

被引用次数：2 相关文章所有 3 个版本

[PDF] nature.com

QACDes: QoS-aware context-sensitive design of cyber-physical systems

S Sidhanta, C Chokwitthaya, Y Zhu… - Scientific Reports, 2024 - nature.com

There is a lot of confusion and ambiguity regarding the quantification of the Quality of
Service (QoS) of a system, especially for cyber-physical systems (CPS) involved in …

MOOP: An efficient utility-rich route planning framework over two-fold time-dependent road networks

L Gao, C Chen, F Chu, C Liao… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

Utility-rich (eg, more attractive or safer) route planning on city-scale road networks is a
common yet time-consuming task. Although both travel time and utility on edges are time …

被引用次数：2 相关文章所有 5 个版本