Copy number variant analysis using genome-wide mate-pair sequencing

James Smadbeck, Sarah H. Johnson, Stephanie A. Smoley, Athanasios Gaitatzes, Travis M. Drucker, Roman M. Zenka, Farhad Kosari, Stephen J. Murphy, Nicole Hoppman, Umut Aypar, William R. Sukov, Robert Brian Jenkins, Hutton M. Kearney, Andrew L Feldman, George Vasmatzis

Research output: Contribution to journal (Article)

Abstract

Copy number variation (CNV) is a common form of structural variation detected in human genomes, occurring as both constitutional and somatic events. Cytogenetic techniques such as chromosomal microarray (CMA) are widely used to analyze CNVs. However, CMA techniques cannot resolve the full nature of these structural variations (i.e., the orientation and location of the associated breakpoint junctions) and must be combined with other cytogenetic techniques, such as karyotyping or FISH, to do so. A next-generation sequencing (NGS) approach capable of resolving both CNVs and breakpoint junctions is therefore desirable. Mate-pair sequencing (MPseq) is an NGS technology designed to find large structural rearrangements across the entire genome. Here we present an algorithm that performs copy number analysis from mate-pair sequencing data. The algorithm uses a step-wise procedure involving normalization, segmentation, and classification of the sequencing data. The segmentation technique combines read depth with discordant mate-pair reads to increase the sensitivity and resolution of CNV calls. The method is particularly suited to MPseq, which is designed to detect breakpoint junctions at high resolution. This allows the classification step to calculate copy number levels accurately at the relatively low read depth of MPseq. We compare results for a series of hematological cancer samples that were tested with both CMA and MPseq, and demonstrate sensitivity comparable to state-of-the-art CMA technology, with the benefit of improved breakpoint resolution. The algorithm provides a powerful tool for the analysis of MPseq results in cancer.
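As a rough illustration of the normalization, segmentation, and classification steps named in the abstract, the following Python sketch works on per-window read counts. It is not the authors' algorithm: the function names, the median-based normalization, the use of junction positions as fixed segment boundaries, and the rounding rule for copy number are all simplifying assumptions made here for illustration.

```python
from statistics import median


def normalize(counts, reference):
    """Divide each window's read count by a reference count,
    then rescale so that the median ratio equals 1 (assumed scheme)."""
    ratios = [c / r for c, r in zip(counts, reference)]
    m = median(ratios)
    return [x / m for x in ratios]


def segment(ratios, breakpoints):
    """Split the windows at candidate breakpoint indices, for example
    positions supported by discordant mate pairs, and return
    (start, end, mean_ratio) for each segment."""
    inner = sorted({b for b in breakpoints if 0 < b < len(ratios)})
    edges = [0] + inner + [len(ratios)]
    return [(s, e, sum(ratios[s:e]) / (e - s)) for s, e in zip(edges, edges[1:])]


def classify(segments, ploidy=2):
    """Convert each segment's mean ratio to an integer copy number
    and label it relative to the assumed baseline ploidy."""
    calls = []
    for start, end, mean_ratio in segments:
        copies = round(mean_ratio * ploidy)
        if copies > ploidy:
            label = "gain"
        elif copies < ploidy:
            label = "loss"
        else:
            label = "normal"
        calls.append((start, end, copies, label))
    return calls


# Hypothetical input: ten windows, with a one-copy loss in windows 4 to 6.
counts = [100, 100, 100, 100, 50, 50, 50, 100, 100, 100]
reference = [100] * 10
calls = classify(segment(normalize(counts, reference), breakpoints=[4, 7]))
for call in calls:
    print(call)
```

In this toy version the junction positions fix the segment boundaries directly, which is one simple way to let breakpoint evidence sharpen read-depth calls at low coverage. A realistic implementation would also need to detect boundaries from read depth alone and to handle noise, tumor purity, and ploidy.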

Original language: English (US)
Pages (from-to): 459-470
Number of pages: 12
Journal: Genes Chromosomes and Cancer
Volume: 57
Issue number: 9
State: Published - Sep 1 2018

Keywords

  • bioinformatics
  • cancer genetics
  • chromosomal rearrangements
  • copy number variant analysis
  • next-generation sequencing

ASJC Scopus subject areas

  • Genetics
  • Cancer Research

Cite this

Smadbeck, J., Johnson, S. H., Smoley, S. A., Gaitatzes, A., Drucker, T. M., Zenka, R. M., Kosari, F., Murphy, S. J., Hoppman, N., Aypar, U., Sukov, W. R., Jenkins, R. B., Kearney, H. M., Feldman, A. L., & Vasmatzis, G. (2018). Copy number variant analysis using genome-wide mate-pair sequencing. Genes Chromosomes and Cancer, 57(9), 459-470. https://doi.org/10.1002/gcc.5