Experimental characterization of de novo proteins and their unevolved random-sequence counterparts

B Heames, F Buchel, M Aubel, V Tretyachenko… - Nature ecology & …, 2023 - nature.com
De novo gene emergence provides a route for new proteins to be formed from previously
non-coding DNA. Proteins born in this way are considered random sequences and typically …

Serverless prediction of peptide properties with recurrent neural networks

M Ansari, AD White - Journal of Chemical Information and …, 2023 - ACS Publications
We present three deep learning sequence-based prediction models for peptide properties
including hemolysis, solubility, and resistance to nonspecific interactions that achieve …

[HTML][HTML] Evaluation of protein descriptors in computer-aided rational protein engineering tasks and its application in property prediction in SARS-CoV-2 spike …

H Lim, HN Jeon, S Lim, Y Jang, T Kim, H Cho… - Computational and …, 2022 - Elsevier
The importance of protein engineering in the research and development of
biopharmaceuticals and biomaterials has increased. Machine learning in computer-aided …

Evotuning protocols for Transformer-based variant effect prediction on multi-domain proteins

H Yamaguchi, Y Saito - Briefings in Bioinformatics, 2021 - academic.oup.com
Accurate variant effect prediction has broad impacts on protein engineering. Recent
machine learning approaches toward this end are based on representation learning, by …

Now what sequence? Pre-trained ensembles for Bayesian optimization of protein sequences

Z Yang, KA Milas, AD White - bioRxiv, 2022 - biorxiv.org
Pre-trained models have been transformative in natural language, computer vision, and now
protein sequences by enabling accuracy with few training examples. We show how to use …

Disease diagnostics using machine learning of immune receptors

ME Zaslavsky, E Craig, JK Michuda, N Sehgal… - Biorxiv, 2022 - biorxiv.org
Clinical diagnoses rely on a wide variety of laboratory tests and imaging studies, interpreted
alongside physical examination findings and the patient's history and symptoms. Currently …

Toward generalizable prediction of antibody thermostability using machine learning on sequence and structure features

A Harmalkar, R Rao, Y Richard Xie, J Honer… - MAbs, 2023 - Taylor & Francis
Over the last three decades, the appeal for monoclonal antibodies (mAbs) as therapeutics
has been steadily increasing as evident with FDA's recent landmark approval of the 100th …

Predicting and interpreting protein Developability via transfer of convolutional sequence representation

AW Golinski, ZD Schmitz, GH Nielsen… - ACS Synthetic …, 2023 - ACS Publications
Engineered proteins have emerged as novel diagnostics, therapeutics, and catalysts. Often,
poor protein developability─ quantified by expression, solubility, and stability─ hinders …

Strategies to identify and edit improvements in synthetic genome segments episomally

A Rudolph, A Nyerges, A Chiappino-Pepe… - Nucleic Acids …, 2023 - academic.oup.com
Genome engineering projects often utilize bacterial artificial chromosomes (BACs) to carry
multi-kilobase DNA segments at low copy number. However, all stages of whole-genome …

Prediction of Adeno-Associated Virus Fitness with a Protein Language Based Machine Learning Model

J Wu, Y Qiu, E Lyashenko, C Mueller, SR Choudhury - bioRxiv, 2024 - biorxiv.org
Abstract Adeno-associated viral (AAV)-based therapeutics have the potential to transform
the lives of patients by delivering one-time treatments for a variety of diseases. However, a …