Statistics and Its Interface

Volume 8 (2015)

Number 4

Single-sample SNP detection by empirical Bayes method using next-generation sequencing data

Pages: 457 – 462

DOI: https://dx.doi.org/10.4310/SII.2015.v8.n4.a5

Authors

Weijie Ding

Qiang Kou

Xueqin Wang (School of Mathematics and Computational Science, and South China Research Center of Statistics, Sun Yat-Sen University, Guangzhou, China)

Qiuya Xu

Na You (School of Mathematics and Computational Science, and South China Research Center of Statistics, Sun Yat-Sen University, Guangzhou, China)

Abstract

The rapid development of next generation sequencing technology is changing the way of biological research in many aspects, which has become the most popular platform for the genomic structural variation detection. In this paper, we focus on the single-sample next generation sequencing data analysis, and propose a hierarchical structure to model the dispersion of minor allele frequency in the genome scale. The empirical Bayes method is employed to estimate the hyper-parameters, and the minor allele is identified as a sequencing error or heterozygous allele according to the posterior probabilities. We suggest to leave the ambiguous positions with moderate posterior probabilities ungenotyped for better genotype-call error control. The performances of our proposed method are investigated by simulations and a real dataset.

Keywords

next-generation sequencing, single-sample, genotyping, SNP detection, empirical Bayes method

2010 Mathematics Subject Classification

62P10

Published 19 October 2015