The mastery of details in the workflow of materials machine learning

Y Ma, P Xu, M Li, X Ji, W Zhao, W Lu - npj Computational Materials, 2024 - nature.com
As machine learning (ML) continues to advance in the field of materials science, the
variation in strategies for the same steps of the ML workflow becomes increasingly …

A tensor decomposition method based on embedded geographic meta-knowledge for urban traffic flow imputation

X Luo, S Cheng, L Wang, Y Liang… - International Journal of …, 2024 - Taylor & Francis
Accurate and reliable traffic flow data are essential for intelligent transportation systems;
however, limitations arising from hardware and communication costs often lead to missing …

Blockwise principal component analysis for monotone missing data imputation and dimensionality reduction

TT Do, AM Vu, TL Vo, HT Ly, T Nguyen… - … Joint Conference on …, 2024 - ieeexplore.ieee.org
Monotone missing data is a common problem in data analysis. However, imputation
combined with dimensionality reduction can be computationally expensive, especially with …

Combining datasets to improve model fitting

T Nguyen, R Khadka, N Phan, A Yazidi… - … Joint Conference on …, 2023 - ieeexplore.ieee.org
For many use cases, combining information from different datasets can be of interest to
improve a machine learning model's performance, especially when the number of samples …

Imputation using training labels and classification via label imputation

T Nguyen, TL Vo, P Halvorsen, MA Riegler - arXiv preprint arXiv …, 2023 - arxiv.org
Missing data is a common problem in practical settings. Various imputation methods have
been developed to deal with missing data. However, even though the label is usually …

Data imputation for multivariate time-series data

P Le Lien, TT Do, T Nguyen - 2023 15th International …, 2023 - ieeexplore.ieee.org
Multivariate time-series data are abundant in many application areas, such as finance,
transportation, environment, and healthcare. However, for many reasons, missing data …

Multimedia datasets: challenges and future possibilities

T Nguyen, AM Storås, V Thambawita, SA Hicks… - … on Multimedia Modeling, 2023 - Springer
Public multimedia datasets can enhance knowledge discovery and model development as
more researchers have the opportunity to contribute to exploring them. However, as these …

Correlation visualization under missing values: a comparison between imputation and direct parameter estimation methods

NH Pham, KL Vo, MA Vu, T Nguyen, MA Riegler… - … on Multimedia Modeling, 2024 - Springer
Correlation matrix visualization is essential for understanding the relationships between
variables in a dataset, but missing data can seriously affect this important data visualization …

Oversampling and imputation for imbalanced missing data

HH Le, T Nguyen, N Chawla, MA Riegler, P Halvorsen… - 2024 - researchsquare.com
Oversampling and imputation for imbalanced missing data Imbalanced data is a widespread
issue that is naturally occurring. For instance, fraudulent banking transactions are less …

[PDF][PDF] Missing Values Imputation Using Principal Component Analysis Methods

RJD Moh - 2024 - math.montana.edu
Missing values are a common phenomenon encountered in datasets, posing challenges to
data analysis. Thus, it becomes important to employ effective methods for imputing missing …