A simple guide to de novo transcriptome assembly and annotation
A transcriptome constructed from short-read RNA sequencing (RNA-seq) is an easily
attainable proxy catalog of protein-coding genes when genome assembly is unnecessary …
Accurate and complete genomes from metagenomes
Genomes are an integral component of the biological information about an organism; thus,
the more complete the genome, the more informative it is. Historically, bacterial and …
Highly accurate protein structure prediction with AlphaFold
Proteins are essential to life, and understanding their structure can facilitate a mechanistic
understanding of their function. Through an enormous experimental effort [1-4], the …
Pfam: The protein families database in 2021
J Mistry, S Chuguransky, L Williams… - Nucleic acids …, 2021 - academic.oup.com
The Pfam database is a widely used resource for classifying protein sequences into families
and domains. Since Pfam was last described in this journal, over 350 new families have …
A genomic catalog of Earth's microbiomes
The reconstruction of bacterial and archaeal genomes from shotgun metagenomes has
enabled insights into the ecology and evolution of environmental and host-associated …
New insights from uncultivated genomes of the global human gut microbiome
The genome sequences of many species of the human gut microbiome remain unknown,
largely owing to challenges in cultivating microorganisms under laboratory conditions. Here …
Clustering huge protein sequence sets in linear time
M Steinegger, J Söding - Nature communications, 2018 - nature.com
Metagenomic datasets contain billions of protein sequences that could greatly enhance
large-scale functional annotation and structure prediction. Utilizing this enormous resource …
Evolutionary trajectory of pattern recognition receptors in plants
Cell-surface receptors play pivotal roles in many biological processes, including immunity,
development, and reproduction, across diverse organisms. How cell-surface receptors …
Exploring evolution-aware & -free protein language models as protein function predictors
Large-scale Protein Language Models (PLMs) have improved performance in
protein prediction tasks, ranging from 3D structure prediction to various function predictions …
Genetic determinants of endophytism in the Arabidopsis root mycobiome
F Mesny, S Miyauchi, T Thiergart, B Pickel… - Nature …, 2021 - nature.com
The roots of Arabidopsis thaliana host diverse fungal communities that affect plant health
and disease states. Here, we sequence the genomes of 41 fungal isolates representative of …