Population genomic SNPs from epigenetic RADs: Gaining genetic and epigenetic data from a single established next‐generation sequencing approach
Epigenetics is increasingly recognized as an important molecular mechanism underlying
phenotypic variation. To study DNA methylation in ecological and evolutionary contexts,
epiRADseq is a cost‐effective next‐generation sequencing (NGS) technique based on
reduced representation sequencing of genomic regions surrounding non‐/methylated sites.
EpiRADseq for genome‐wide methylation abundance and ddRADseq for genome‐wide
single‐nucleotide polymorphism (SNP) genotyping follow very similar library and …
phenotypic variation. To study DNA methylation in ecological and evolutionary contexts,
epiRADseq is a cost‐effective next‐generation sequencing (NGS) technique based on
reduced representation sequencing of genomic regions surrounding non‐/methylated sites.
EpiRADseq for genome‐wide methylation abundance and ddRADseq for genome‐wide
single‐nucleotide polymorphism (SNP) genotyping follow very similar library and …
Abstract
- Epigenetics is increasingly recognized as an important molecular mechanism underlying phenotypic variation. To study DNA methylation in ecological and evolutionary contexts, epiRADseq is a cost‐effective next‐generation sequencing (NGS) technique based on reduced representation sequencing of genomic regions surrounding non‐/methylated sites. EpiRADseq for genome‐wide methylation abundance and ddRADseq for genome‐wide single‐nucleotide polymorphism (SNP) genotyping follow very similar library and sequencing protocols, but to date these two types of dataset have been handled separately. Here we test the performance of using epiRADseq data to generate SNPs for population genomic analyses.
- We tested the robustness of using epiRADseq data for population genomics with two independent datasets: a newly generated single‐end dataset for the European whitefish Coregonus lavaretus, and a re‐analysis of publicly available, previously published paired‐end data on corals. Using standard bioinformatic pipelines with a reference genome and without (i.e. de novo catalogue loci), we compared the number of SNPs retained, population genetic summary statistics and population genetic structure between data drawn from ddRADseq and epiRADseq library preparations.
- We found that SNPs drawn from epiRADseq are similar in number to those drawn from ddRADseq, with 55%–83% of SNPs being identified by both methods. Genotyping error rate was <5% in both approaches. EpiRADseq‐specific allele dropout was low (~1%). For summary statistics, such as heterozygosity and nucleotide diversity, there is a strong correlation between methods (Spearman's rho > 0.88). Furthermore, identical patterns of population genetic structure were recovered using SNPs from epiRADseq and ddRADseq approaches.
- We show that SNPs obtained from epiRADseq are highly similar to those from ddRADseq and are equivalent for estimating genetic diversity and population structure. This finding is particularly relevant to researchers interested in genetics and epigenetics on the same individuals because using a single epigenomic approach to generate two datasets greatly reduces the time and financial costs compared to using these techniques separately. It also efficiently enables correction of epigenetic estimates with population genetic data. Many studies will benefit from a combinatorial approach with genetic and epigenetic markers and this demonstrates a single, efficient method to do so.
Wiley Online Library
以上显示的是最相近的搜索结果。 查看全部搜索结果