Machine learning in environmental research: common pitfalls and best practices
Machine learning (ML) is increasingly used in environmental research to process large data
sets and decipher complex relationships between system variables. However, due to the …
sets and decipher complex relationships between system variables. However, due to the …
Review of ML and AutoML solutions to forecast time-series data
A Alsharef, K Aggarwal, Sonia, M Kumar… - … Methods in Engineering, 2022 - Springer
Time-series forecasting is a significant discipline of data modeling where past observations
of the same variable are analyzed to predict the future values of the time series. Its …
of the same variable are analyzed to predict the future values of the time series. Its …
Why do tree-based models still outperform deep learning on typical tabular data?
L Grinsztajn, E Oyallon… - Advances in neural …, 2022 - proceedings.neurips.cc
While deep learning has enabled tremendous progress on text and image datasets, its
superiority on tabular data is not clear. We contribute extensive benchmarks of standard and …
superiority on tabular data is not clear. We contribute extensive benchmarks of standard and …
Autokeras: An automl library for deep learning
To use deep learning, one needs to be familiar with various software tools like TensorFlow
or Keras, as well as various model architecture and optimization best practices. Despite …
or Keras, as well as various model architecture and optimization best practices. Despite …
When do neural nets outperform boosted trees on tabular data?
D McElfresh, S Khandagale… - Advances in …, 2024 - proceedings.neurips.cc
Tabular data is one of the most commonly used types of data in machine learning. Despite
recent advances in neural nets (NNs) for tabular data, there is still an active discussion on …
recent advances in neural nets (NNs) for tabular data, there is still an active discussion on …
Tabpfn: A transformer that solves small tabular classification problems in a second
We present TabPFN, a trained Transformer that can do supervised classification for small
tabular datasets in less than a second, needs no hyperparameter tuning and is competitive …
tabular datasets in less than a second, needs no hyperparameter tuning and is competitive …
Lift: Language-interfaced fine-tuning for non-language machine learning tasks
Fine-tuning pretrained language models (LMs) without making any architectural changes
has become a norm for learning various language downstream tasks. However, for non …
has become a norm for learning various language downstream tasks. However, for non …
Efficient methods for natural language processing: A survey
Recent work in natural language processing (NLP) has yielded appealing results from
scaling model parameters and training data; however, using only scale to improve …
scaling model parameters and training data; however, using only scale to improve …
Toward autonomous laboratories: Convergence of artificial intelligence and experimental automation
The ever-increasing demand for novel materials with superior properties inspires retrofitting
traditional research paradigms in the era of artificial intelligence and automation. An …
traditional research paradigms in the era of artificial intelligence and automation. An …
Data-centric ai: Perspectives and challenges
The role of data in building AI systems has recently been significantly magnified by the
emerging concept of data-centric AI (DCAI), which advocates a fundamental shift from model …
emerging concept of data-centric AI (DCAI), which advocates a fundamental shift from model …