Mitigating the multicollinearity problem and its machine learning approach: a review

JYL Chan, SMH Leow, KT Bea, WK Cheng… - Mathematics, 2022 - mdpi.com
Technologies have driven big data collection across many fields, such as genomics and
business intelligence. This results in a significant increase in variables and data points …

Recent advances and applications of machine learning in solid-state materials science

J Schmidt, MRG Marques, S Botti… - npj computational …, 2019 - nature.com
One of the most exciting tools that have entered the material science toolbox in recent years
is machine learning. This collection of statistical methods has already proved to be capable …

Machine learning advances for time series forecasting

RP Masini, MC Medeiros… - Journal of economic …, 2023 - Wiley Online Library
In this paper, we survey the most recent advances in supervised machine learning (ML) and
high‐dimensional models for time‐series forecasting. We consider both linear and nonlinear …

Machine learning for microbiologists

F Asnicar, AM Thomas, A Passerini… - Nature Reviews …, 2024 - nature.com
Abstract Machine learning is increasingly important in microbiology where it is used for tasks
such as predicting antibiotic resistance and associating human microbiome features with …

Data‐Driven Machine Learning for Understanding Surface Structures of Heterogeneous Catalysts

H Li, Y Jiao, K Davey, SZ Qiao - … Chemie International Edition, 2023 - Wiley Online Library
The design of heterogeneous catalysts is necessarily surface‐focused, generally achieved
via optimization of adsorption energy and microkinetic modelling. A prerequisite is to ensure …

[HTML][HTML] Machine learning and data mining methods in diabetes research

I Kavakiotis, O Tsave, A Salifoglou… - Computational and …, 2017 - Elsevier
The remarkable advances in biotechnology and health sciences have led to a significant
production of data, such as high throughput genetic data and clinical information, generated …

Advances in surrogate based modeling, feasibility analysis, and optimization: A review

A Bhosekar, M Ierapetritou - Computers & Chemical Engineering, 2018 - Elsevier
The idea of using a simpler surrogate to represent a complex phenomenon has gained
increasing popularity over past three decades. Due to their ability to exploit the black-box …

SISSO: A compressed-sensing method for identifying the best low-dimensional descriptor in an immensity of offered candidates

R Ouyang, S Curtarolo, E Ahmetcik, M Scheffler… - Physical Review …, 2018 - APS
The lack of reliable methods for identifying descriptors—the sets of parameters capturing the
underlying mechanisms of a material's property—is one of the key factors hindering efficient …

Spectral methods for data science: A statistical perspective

Y Chen, Y Chi, J Fan, C Ma - Foundations and Trends® in …, 2021 - nowpublishers.com
Spectral methods have emerged as a simple yet surprisingly effective approach for
extracting information from massive, noisy and incomplete data. In a nutshell, spectral …

[HTML][HTML] Beyond the hype: Big data concepts, methods, and analytics

A Gandomi, M Haider - International journal of information management, 2015 - Elsevier
Size is the first, and at times, the only dimension that leaps out at the mention of big data.
This paper attempts to offer a broader definition of big data that captures its other unique and …