MuLan-Methyl—multiple transformer-based language models for accurate DNA methylation prediction
Transformer-based language models are successfully used to address massive text-related
tasks. DNA methylation is an important epigenetic mechanism, and its analysis provides …
tasks. DNA methylation is an important epigenetic mechanism, and its analysis provides …
Machine learning in biological physics: From biomolecular prediction to design
J Martin, M Lequerica Mateos, JN Onuchic… - Proceedings of the …, 2024 - pnas.org
Machine learning has been proposed as an alternative to theoretical modeling when
dealing with complex problems in biological physics. However, in this perspective, we argue …
dealing with complex problems in biological physics. However, in this perspective, we argue …
Deep learning in modeling protein complex structures: From contact prediction to end-to-end approaches
Protein–protein interactions play crucial roles in many biological processes. Traditionally,
protein complex structures are normally built by protein–protein docking. With the rapid …
protein complex structures are normally built by protein–protein docking. With the rapid …
[HTML][HTML] Computational tools to predict context-specific protein complexes
Interactions between thousands of proteins define cells' protein–protein interaction (PPI)
network. Some of these interactions lead to the formation of protein complexes. It is …
network. Some of these interactions lead to the formation of protein complexes. It is …
Protein fitness prediction is impacted by the interplay of language models, ensemble learning, and sampling methods
M Mardikoraem, D Woldring - Pharmaceutics, 2023 - mdpi.com
Advances in machine learning (ML) and the availability of protein sequences via high-
throughput sequencing techniques have transformed the ability to design novel diagnostic …
throughput sequencing techniques have transformed the ability to design novel diagnostic …
Applications of machine learning in phylogenetics
Abstract Machine learning has increasingly been applied to a wide range of questions in
phylogenetic inference. Supervised machine learning approaches that rely on simulated …
phylogenetic inference. Supervised machine learning approaches that rely on simulated …
Generative power of a protein language model trained on multiple sequence alignments
Computational models starting from large ensembles of evolutionarily related protein
sequences capture a representation of protein families and learn constraints associated to …
sequences capture a representation of protein families and learn constraints associated to …
DR-BERT: a protein language model to annotate disordered regions
Despite their lack of a rigid structure, intrinsically disordered regions (IDRs) in proteins play
important roles in cellular functions, including mediating protein-protein interactions …
important roles in cellular functions, including mediating protein-protein interactions …
Playing the System: Can Puzzle Players Teach us How to Solve Hard Problems?
R Mutalova, R Sarrazin-Gendron, E Cai… - Proceedings of the …, 2023 - dl.acm.org
With nearly three billion players, video games are more popular than ever. Casual puzzle
games are among the most played categories. These games capitalize on the players' …
games are among the most played categories. These games capitalize on the players' …
Pairing interacting protein sequences using masked language modeling
Predicting which proteins interact together from amino acid sequences is an important task.
We develop a method to pair interacting protein sequences which leverages the power of …
We develop a method to pair interacting protein sequences which leverages the power of …