Leveraging transformers‐based language models in proteome bioinformatics

NQK Le - Proteomics, 2023 - Wiley Online Library
In recent years, the rapid growth of biological data has increased interest in using
bioinformatics to analyze and interpret this data. Proteomics, which studies the structure …

From intuition to AI: evolution of small molecule representations in drug discovery

M McGibbon, S Shave, J Dong, Y Gao… - Briefings in …, 2024 - academic.oup.com
Within drug discovery, the goal of AI scientists and cheminformaticians is to help identify
molecular starting points that will develop into safe and efficacious drugs while reducing …

What can ribo-seq, immunopeptidomics, and proteomics tell us about the noncanonical proteome?

JR Prensner, JG Abelin, LW Kok, KR Clauser… - Molecular & Cellular …, 2023 - ASBMB
Abstract Ribosome profiling (Ribo-Seq) has proven transformative for our understanding of
the human genome and proteome by illuminating thousands of noncanonical sites of …

Discovering microproteins: making the most of ribosome profiling data

S Chothani, L Ho, S Schafer, O Rackham - RNA biology, 2023 - Taylor & Francis
Building a reference set of protein-coding open reading frames (ORFs) has revolutionized
biological process discovery and understanding. Traditionally, gene models have been …

Shining a light on the dark proteome: Non‐canonical open reading frames and their encoded miniproteins as a new frontier in cancer biology

Z Posner, I Yannuzzi, JR Prensner - Protein Science, 2023 - Wiley Online Library
In the decades following the discovery that genes encode proteins, scientists have tried to
exhaustively and comprehensively characterize the human genome. Recent advances in …

Characterization of shared neoantigens landscape in Mismatch Repair Deficient Endometrial Cancer

E De Paolis, C Nero, E Micarelli, G Leoni… - NPJ Precision …, 2024 - nature.com
Endometrial cancer (EC) with Mismatch Repair deficiency (MMRd) is characterized by the
accumulation of insertions/deletions at microsatellite sites. These mutations lead to the …

Untranslated regions (UTRs) are a potential novel source of neoantigens for personalised immunotherapy

CCT Sng, AA Kallor, BS Simpson, G Bedran… - Frontiers in …, 2024 - frontiersin.org
Background Neoantigens, mutated tumour-specific antigens, are key targets of anti-tumour
immunity during checkpoint inhibitor (CPI) treatment. Their identification is fundamental to …

Transfer learning enables predictions in soil-borne diseases

L Xin, P Xie, T Wen, G Niu, J Yuan - Soil Ecology Letters, 2024 - Springer
The Transformer model precisely predicts soil health status from high-throughput
sequencing data. The SMOTE algorithm addresses data imbalance issues, improving model …

[HTML][HTML] What can Ribo-seq and proteomics tell us about the non-canonical proteome?

JR Prensner, JG Abelin, LW Kok, KR Clauser… - Biorxiv, 2023 - ncbi.nlm.nih.gov
Ribosome profiling (Ribo-seq) has proven transformative for our understanding of the
human genome and proteome by illuminating thousands of non-canonical sites of ribosome …

A Review on the Applications of Transformer-based language models for Nucleotide Sequence Analysis

N Ghosh, D Santoni, I Saha, G Felici - arXiv preprint arXiv:2412.07201, 2024 - arxiv.org
In recent times, Transformer-based language models are making quite an impact in the field
of natural language processing. As relevant parallels can be drawn between biological …