Uncertainty quantification in drug design

LH Mervin, S Johansson, E Semenova, KA Giblin… - Drug discovery today, 2021 - Elsevier
Highlights•Review of the state-of-the-art in uncertainty quantification in drug
design.•Examples from drug-design settings are provided.•Impact on decision making is …

Machine learning for enzyme engineering, selection and design

R Feehan, D Montezano… - … Engineering, Design and …, 2021 - academic.oup.com
Abstract Machine learning is a useful computational tool for large and complex tasks such as
those in the field of enzyme engineering, selection and design. In this review, we examine …

Learning the regulatory code of gene expression

J Zrimec, F Buric, M Kokina, V Garcia… - Frontiers in Molecular …, 2021 - frontiersin.org
Data-driven machine learning is the method of choice for predicting molecular phenotypes
from nucleotide sequence, modeling gene expression events including protein-DNA …

G4Boost: a machine learning-based tool for quadruplex identification and stability prediction

HB Cagirici, H Budak, TZ Sen - BMC bioinformatics, 2022 - Springer
Background G-quadruplexes (G4s), formed within guanine-rich nucleic acids, are secondary
structures involved in important biological processes. Although every G4 motif has the …

Improving enzyme optimum temperature prediction with resampling strategies and ensemble learning

JE Gado, GT Beckham, CM Payne - Journal of Chemical …, 2020 - ACS Publications
Accurate prediction of the optimal catalytic temperature (T opt) of enzymes is vital in
biotechnology, as enzymes with high T opt values are desired for enhanced reaction rates …

Enhancing Sustainable Urban Energy Management through Short-Term Wind Power Forecasting Using LSTM Neural Network

K Kanagarathinam, SK Aruna, S Ravivarman, M Safran… - Sustainability, 2023 - mdpi.com
Integrating wind energy forecasting into urban city energy management systems offers
significant potential for optimizing energy usage, reducing the carbon footprint, and …

Predicting gestational diabetes mellitus risk at 11–13 weeks' gestation: the role of extrachromosomal circular DNA

J Wang, P Huang, F Hou, D Hao, W Li, H Jin - Cardiovascular Diabetology, 2024 - Springer
Background Gestational diabetes mellitus (GDM) significantly impacts maternal and infant
health both immediately and over the long term, yet effective early diagnostic biomarkers are …

The statistical advantage of automatic NLG metrics at the system level

JTZ Wei, R Jia - arXiv preprint arXiv:2105.12437, 2021 - arxiv.org
Estimating the expected output quality of generation systems is central to NLG. This paper
qualifies the notion that automatic metrics are not as good as humans in estimating system …

Is the performance of my deep network too good to be true? A direct approach to estimating the Bayes error in binary classification

T Ishida, I Yamane, N Charoenphakdee, G Niu… - arXiv preprint arXiv …, 2022 - arxiv.org
There is a fundamental limitation in the prediction performance that a machine learning
model can achieve due to the inevitable uncertainty of the prediction target. In classification …

Probabilistic Random Forest improves bioactivity predictions close to the classification threshold by taking into account experimental uncertainty

LH Mervin, MA Trapotsi, AM Afzal, IP Barrett… - Journal of …, 2021 - Springer
Measurements of protein–ligand interactions have reproducibility limits due to experimental
errors. Any model based on such assays will consequentially have such unavoidable errors …