The status of the human gene catalogue

P Amaral, S Carbonell-Sala, FM De La Vega, T Faial… - Nature, 2023 - nature.com
Scientists have been trying to identify every gene in the human genome since the initial draft
was published in 2001. In the years since, much progress has been made in identifying …

Pan-genomics in the human genome era

RM Sherman, SL Salzberg - Nature Reviews Genetics, 2020 - nature.com
Since the early days of the genome era, the scientific community has relied on a single
'reference'genome for each species, which is used as the basis for a wide range of genetic …

A complete reference genome improves analysis of human genetic variation

S Aganezov, SM Yan, DC Soto, M Kirsche, S Zarate… - Science, 2022 - science.org
Compared to its predecessors, the Telomere-to-Telomere CHM13 genome adds nearly 200
million base pairs of sequence, corrects thousands of structural errors, and unlocks the most …

[HTML][HTML] GFF utilities: GffRead and GffCompare

G Pertea, M Pertea - F1000Research, 2020 - ncbi.nlm.nih.gov
GTF (Gene Transfer Format) and GFF (General Feature Format) are popular file formats
used by bioinformatics programs to represent and exchange information about various …

GENCODE 2021

A Frankish, M Diekhans, I Jungreis… - Nucleic acids …, 2021 - academic.oup.com
The GENCODE project annotates human and mouse genes and transcripts supported by
experimental data with high accuracy, providing a foundational resource that supports …

The structure, function and evolution of a complete human chromosome 8

GA Logsdon, MR Vollger, PH Hsieh, Y Mao… - Nature, 2021 - nature.com
The complete assembly of each human chromosome is essential for understanding human
biology and evolution,. Here we use complementary long-read sequencing technologies to …

Improved transcriptome assembly using a hybrid of long and short reads with StringTie

A Shumate, B Wong, G Pertea… - PLoS computational …, 2022 - journals.plos.org
Short-read RNA sequencing and long-read RNA sequencing each have their strengths and
weaknesses for transcriptome assembly. While short reads are highly accurate, they are …

Transcriptome assembly from long-read RNA-seq alignments with StringTie2

S Kovaka, AV Zimin, GM Pertea, R Razaghi… - Genome biology, 2019 - Springer
RNA sequencing using the latest single-molecule sequencing instruments produces reads
that are thousands of nucleotides long. The ability to assemble these long reads can greatly …

recount3: summaries and queries for large-scale RNA-seq expression and splicing

C Wilks, SC Zheng, FY Chen, R Charles, B Solomon… - Genome biology, 2021 - Springer
We present recount3, a resource consisting of over 750,000 publicly available human and
mouse RNA sequencing (RNA-seq) samples uniformly processed by our new Monorail …

Transcriptome variation in human tissues revealed by long-read sequencing

DA Glinos, G Garborcauskas, P Hoffman, N Ehsan… - Nature, 2022 - nature.com
Regulation of transcript structure generates transcript diversity and plays an important role in
human disease,,,,,–. The advent of long-read sequencing technologies offers the opportunity …