作者
Sávio Souza Costa, Luís Carlos Guimarães, Artur Silva, Siomar Castro Soares, Rafael Azevedo Baraúna
发表日期
2020/8
来源
Bioinformatics and Biology Insights
卷号
14
页码范围
1177932220938064
出版商
SAGE Publications
简介
Pan-genome is defined as the set of orthologous and unique genes of a specific group of organisms. The pan-genome is composed by the core genome, accessory genome, and species- or strain-specific genes. The pan-genome is considered open or closed based on the alpha value of the Heap law. In an open pan-genome, the number of gene families will continuously increase with the addition of new genomes to the analysis, while in a closed pan-genome, the number of gene families will not increase considerably. The first step of a pan-genome analysis is the homogenization of genome annotation. The same software should be used to annotate genomes, such as GeneMark or RAST. Subsequently, several software are used to calculate the pan-genome such as BPGA, GET_HOMOLOGUES, PGAP, among others. This review presents all these initial steps for those who want to perform a pan-genome …
引用总数
学术搜索中的文章
SS Costa, LC Guimarães, A Silva, SC Soares… - Bioinformatics and Biology Insights, 2020