Assessment of mapping and SNP-detection algorithms for next-generation sequencing data in cancer genomics

Weixin Wang, Feng Xu, Junwen Wang

Research output: Chapter in Book/Report/Conference proceedingChapter

1 Scopus citations

Abstract

​​​​The rapid development of next-generation sequencing (NGS) technology provides a new chance to extend the scale and resolution of genomic research. How to efficiently map millions of short reads to the reference genome and how to make accurate variant call are two major challenges in NGS analysis. In this chapter, we review current software for aligning short reads and detecting single-nucleotide polymorphisms (SNPs) and extensively evaluate their performance on normal and cancer samples from the Cancer Genome Atlas project and trio’s data from the 1000 Genomes Project. We find that Burrows-Wheeler transform-based aligners are proven to be the most suitable for Illumina platform, and NovoalignCS shows the best overall performance for SOLiD data. We also demonstrate FaSD as the most reliable SNP caller compared with several state-of-the-art programs. Furthermore, NGS shows significantly lower coverage and poorer SNP-calling performance in the CpG island, promoter, and 5’UTR regions of the human genome. We show that both high GC-content and low repetitive elements are the causes of lower coverage in the promoter regions.

Original languageEnglish (US)
Title of host publicationNext Generation Sequencing in Cancer Research: Volume 1: Decoding the Cancer Genome
PublisherSpringer New York
Pages301-317
Number of pages17
ISBN (Electronic)9781461476450
ISBN (Print)9781461476443
DOIs
StatePublished - Jan 1 2013
Externally publishedYes

    Fingerprint

Keywords

  • Alignment
  • Cancer
  • Genotype
  • Next-generation sequencing
  • SNP

ASJC Scopus subject areas

  • Medicine(all)

Cite this

Wang, W., Xu, F., & Wang, J. (2013). Assessment of mapping and SNP-detection algorithms for next-generation sequencing data in cancer genomics. In Next Generation Sequencing in Cancer Research: Volume 1: Decoding the Cancer Genome (pp. 301-317). Springer New York. https://doi.org/10.1007/978-1-4614-7645-0_15