[PDF][PDF] A simple guide to de novo transcriptome assembly and annotation

V Raghavan, L Kraft, F Mesny… - Briefings in …, 2022 - academic.oup.com
A transcriptome constructed from short-read RNA sequencing (RNA-seq) is an easily
attainable proxy catalog of protein-coding genes when genome assembly is unnecessary …

Accurate and complete genomes from metagenomes

LX Chen, K Anantharaman, A Shaiber… - Genome …, 2020 - genome.cshlp.org
Genomes are an integral component of the biological information about an organism; thus,
the more complete the genome, the more informative it is. Historically, bacterial and …

[HTML][HTML] Highly accurate protein structure prediction with AlphaFold

J Jumper, R Evans, A Pritzel, T Green, M Figurnov… - nature, 2021 - nature.com
Proteins are essential to life, and understanding their structure can facilitate a mechanistic
understanding of their function. Through an enormous experimental effort 1, 2, 3, 4, the …

Pfam: The protein families database in 2021

J Mistry, S Chuguransky, L Williams… - Nucleic acids …, 2021 - academic.oup.com
The Pfam database is a widely used resource for classifying protein sequences into families
and domains. Since Pfam was last described in this journal, over 350 new families have …

A genomic catalog of Earth's microbiomes

S Nayfach, S Roux, R Seshadri, D Udwary… - Nature …, 2021 - nature.com
The reconstruction of bacterial and archaeal genomes from shotgun metagenomes has
enabled insights into the ecology and evolution of environmental and host-associated …

New insights from uncultivated genomes of the global human gut microbiome

S Nayfach, ZJ Shi, R Seshadri, KS Pollard… - Nature, 2019 - nature.com
The genome sequences of many species of the human gut microbiome remain unknown,
largely owing to challenges in cultivating microorganisms under laboratory conditions. Here …

Clustering huge protein sequence sets in linear time

M Steinegger, J Söding - Nature communications, 2018 - nature.com
Metagenomic datasets contain billions of protein sequences that could greatly enhance
large-scale functional annotation and structure prediction. Utilizing this enormous resource …

Evolutionary trajectory of pattern recognition receptors in plants

BPM Ngou, M Wyler, MW Schmid, Y Kadota… - Nature …, 2024 - nature.com
Cell-surface receptors play pivotal roles in many biological processes, including immunity,
development, and reproduction, across diverse organisms. How cell-surface receptors …

Exploring evolution-aware &-free protein language models as protein function predictors

M Hu, F Yuan, K Yang, F Ju, J Su… - Advances in …, 2022 - proceedings.neurips.cc
Abstract Large-scale Protein Language Models (PLMs) have improved performance in
protein prediction tasks, ranging from 3D structure prediction to various function predictions …

Genetic determinants of endophytism in the Arabidopsis root mycobiome

F Mesny, S Miyauchi, T Thiergart, B Pickel… - Nature …, 2021 - nature.com
The roots of Arabidopsis thaliana host diverse fungal communities that affect plant health
and disease states. Here, we sequence the genomes of 41 fungal isolates representative of …