Bayesian haplotype inference via the Dirichlet process
Proceedings of the Twenty-First International Conference on Machine Learning, 2004 • dl.acm.org
The problem of inferring haplotypes from genotypes of single nucleotide polymorphisms (SNPs) is essential for the understanding of genetic variation within and among populations, with important applications to the genetic analysis of disease propensities and other complex traits. The problem can be formulated as a mixture model, where the mixture components correspond to the pool of haplotypes in the population. The size of this pool is unknown; indeed, knowing the size of the pool would correspond to knowing something significant about the genome and its history. Thus methods for fitting the genotype mixture must crucially address the problem of estimating a mixture with an unknown number of mixture components. In this paper we present a Bayesian approach to this problem based on a nonparametric prior known as the Dirichlet process. The model also incorporates a likelihood that captures statistical errors in the haplotype/genotype relationship. We apply our approach to the analysis of both simulated and real genotype data, and compare to extant methods.
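To illustrate how a Dirichlet process prior leaves the number of mixture components open, here is a minimal sketch of the Chinese restaurant process, the standard sequential view of the partition a Dirichlet process induces. It is not the paper's sampler: the function name `crp_assignments`, the concentration value `alpha=1.0`, and the sample size of 200 are assumptions chosen for illustration, and the likelihood linking haplotypes to observed genotypes is omitted entirely.

```python
import random


def crp_assignments(n, alpha, seed=0):
    """Sample component labels for n items from a Chinese restaurant process.

    Hypothetical helper for illustration only. Each item stands in for one
    haplotype in the sample; each component stands in for one distinct
    haplotype in the population pool.
    """
    rng = random.Random(seed)
    counts = []  # counts[k] = number of items already assigned to component k
    labels = []
    for _ in range(n):
        # An existing component k is chosen with weight counts[k];
        # a brand-new component is opened with weight alpha.
        weights = counts + [alpha]
        k = rng.choices(range(len(weights)), weights=weights)[0]
        if k == len(counts):
            counts.append(1)  # open a new component
        else:
            counts[k] += 1
        labels.append(k)
    return labels, counts


labels, counts = crp_assignments(n=200, alpha=1.0)
print(len(counts), "components used by", len(labels), "sampled items")
```

The number of components is never fixed in advance: it grows with the data at a rate governed by `alpha`, which is the property that makes this kind of prior suited to a haplotype pool of unknown size.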