The mastery of details in the workflow of materials machine learning
Y Ma, P Xu, M Li, X Ji, W Zhao, W Lu - npj Computational Materials, 2024 - nature.com
As machine learning (ML) continues to advance in the field of materials science, the
variation in strategies for the same steps of the ML workflow becomes increasingly …
variation in strategies for the same steps of the ML workflow becomes increasingly …
A tensor decomposition method based on embedded geographic meta-knowledge for urban traffic flow imputation
X Luo, S Cheng, L Wang, Y Liang… - International Journal of …, 2024 - Taylor & Francis
Accurate and reliable traffic flow data are essential for intelligent transportation systems;
however, limitations arising from hardware and communication costs often lead to missing …
however, limitations arising from hardware and communication costs often lead to missing …
Blockwise principal component analysis for monotone missing data imputation and dimensionality reduction
Monotone missing data is a common problem in data analysis. However, imputation
combined with dimensionality reduction can be computationally expensive, especially with …
combined with dimensionality reduction can be computationally expensive, especially with …
Combining datasets to improve model fitting
For many use cases, combining information from different datasets can be of interest to
improve a machine learning model's performance, especially when the number of samples …
improve a machine learning model's performance, especially when the number of samples …
Imputation using training labels and classification via label imputation
Missing data is a common problem in practical settings. Various imputation methods have
been developed to deal with missing data. However, even though the label is usually …
been developed to deal with missing data. However, even though the label is usually …
Data imputation for multivariate time-series data
Multivariate time-series data are abundant in many application areas, such as finance,
transportation, environment, and healthcare. However, for many reasons, missing data …
transportation, environment, and healthcare. However, for many reasons, missing data …
Multimedia datasets: challenges and future possibilities
Public multimedia datasets can enhance knowledge discovery and model development as
more researchers have the opportunity to contribute to exploring them. However, as these …
more researchers have the opportunity to contribute to exploring them. However, as these …
Correlation visualization under missing values: a comparison between imputation and direct parameter estimation methods
Correlation matrix visualization is essential for understanding the relationships between
variables in a dataset, but missing data can seriously affect this important data visualization …
variables in a dataset, but missing data can seriously affect this important data visualization …
Oversampling and imputation for imbalanced missing data
Oversampling and imputation for imbalanced missing data Imbalanced data is a widespread
issue that is naturally occurring. For instance, fraudulent banking transactions are less …
issue that is naturally occurring. For instance, fraudulent banking transactions are less …
[PDF][PDF] Missing Values Imputation Using Principal Component Analysis Methods
RJD Moh - 2024 - math.montana.edu
Missing values are a common phenomenon encountered in datasets, posing challenges to
data analysis. Thus, it becomes important to employ effective methods for imputing missing …
data analysis. Thus, it becomes important to employ effective methods for imputing missing …