Copy number variant analysis using genome-wide mate-pair sequencing

James Smadbeck, Sarah H. Johnson, Stephanie A. Smoley, Athanasios Gaitatzes, Travis M. Drucker, Roman M. Zenka, Farhad Kosari, Stephen J. Murphy, Nicole Hoppman, Umut Aypar, William R. Sukov, Robert Brian Jenkins, Hutton M. Kearney, Andrew L Feldman, George Vasmatzis

Research output: Contribution to journal, Article

12 Citations (Scopus)

Abstract

Copy number variation (CNV) is a common form of structural variation detected in human genomes, occurring as both constitutional and somatic events. Cytogenetic techniques such as chromosomal microarray (CMA) are widely used to analyze CNVs. However, CMA cannot resolve the full nature of these structural variations (i.e., the orientation and location of the associated breakpoint junctions) and must be combined with other cytogenetic techniques, such as karyotyping or FISH, to do so. A next-generation sequencing (NGS) approach capable of resolving both CNVs and breakpoint junctions is therefore desirable. Mate-pair sequencing (MPseq) is an NGS technology designed to find large structural rearrangements across the entire genome. Here we present an algorithm that performs copy number analysis from MPseq data. The algorithm uses a step-wise procedure involving normalization, segmentation, and classification of the sequencing data. The segmentation technique combines read depth with discordant mate-pair reads to increase the sensitivity and resolution of CNV calls. The method is particularly suited to MPseq, which is designed to detect breakpoint junctions at high resolution; this allows the classification step to accurately calculate copy number levels at the relatively low read depth of MPseq. We compare results for a series of hematological cancer samples that were tested with both CMA and MPseq, and demonstrate sensitivity comparable to state-of-the-art CMA technology, with the benefit of improved breakpoint resolution. The algorithm provides a powerful analytical tool for the analysis of MPseq results in cancer.
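
The three steps named in the abstract (normalization, segmentation, classification) can be illustrated with a minimal sketch. This is not the authors' algorithm, whose details are not given here; it is a toy version under stated assumptions: reads are already counted in fixed-size bins, a matched reference profile is available, bins containing a breakpoint junction supported by discordant mate pairs are supplied as `junction_bins`, and the sample is diploid with no tumor-purity correction. All function names, parameters, and thresholds are hypothetical.

```python
from statistics import median


def normalize(counts, reference):
    """Divide each bin's read count by a reference count, then rescale
    so that the median ratio is 1.0 (assumed to be the diploid level)."""
    ratios = [c / r for c, r in zip(counts, reference)]
    mid = median(ratios)
    return [x / mid for x in ratios]


def segment(ratios, junction_bins, min_jump=0.25):
    """Split bins into half-open (start, end) segments.

    A new segment starts at bin i when either
      * i is in junction_bins (assumed to come from discordant
        mate-pair evidence), or
      * the ratio at i differs from the running mean of the current
        segment by more than min_jump (an arbitrary toy threshold).
    """
    boundaries = [0]
    seg_sum, seg_n = ratios[0], 1
    for i in range(1, len(ratios)):
        seg_mean = seg_sum / seg_n
        if i in junction_bins or abs(ratios[i] - seg_mean) > min_jump:
            boundaries.append(i)
            seg_sum, seg_n = 0.0, 0
        seg_sum += ratios[i]
        seg_n += 1
    boundaries.append(len(ratios))
    return list(zip(boundaries[:-1], boundaries[1:]))


def classify(ratios, segments, ploidy=2):
    """Assign an integer copy number to each segment by rounding
    mean_ratio * ploidy (assumes a pure, diploid-baseline sample)."""
    calls = []
    for start, end in segments:
        mean_ratio = sum(ratios[start:end]) / (end - start)
        calls.append((start, end, round(mean_ratio * ploidy)))
    return calls


if __name__ == "__main__":
    # Toy data: a single-copy gain over bins 4-7, with a junction at bin 4.
    counts = [100, 100, 100, 100, 150, 150, 150, 150, 100, 100]
    reference = [100] * 10
    ratios = normalize(counts, reference)
    segments = segment(ratios, junction_bins={4})
    for start, end, copies in classify(ratios, segments):
        print(f"bins {start}-{end - 1}: copy number {copies}")
```

In this sketch a junction always forces a segment boundary, which is the simplest way to show how breakpoint evidence can place a boundary precisely even when the depth signal alone is weak; a realistic implementation would more likely weigh the two kinds of evidence statistically and merge adjacent segments that receive the same call.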

Original language: English (US)
Pages (from-to): 459-470
Number of pages: 12
Journal: Genes Chromosomes and Cancer
Volume: 57
Issue number: 9
DOI: https://doi.org/10.1002/gcc.5
State: Published, Sep 1 2018

Fingerprint

Cytogenetic Analysis
Genome
Technology
Karyotyping
Human Genome
Neoplasms

Keywords

  • bioinformatics
  • cancer genetics
  • chromosomal rearrangements
  • copy number variant analysis
  • next-generation sequencing

ASJC Scopus subject areas

  • Genetics
  • Cancer Research

Cite this

Smadbeck, J., Johnson, S. H., Smoley, S. A., Gaitatzes, A., Drucker, T. M., Zenka, R. M., ... Vasmatzis, G. (2018). Copy number variant analysis using genome-wide mate-pair sequencing. Genes Chromosomes and Cancer, 57(9), 459-470. https://doi.org/10.1002/gcc.5

@article{abdd65fd829346f8853ff36d76f21390,
title = "Copy number variant analysis using genome-wide mate-pair sequencing",
keywords = "bioinformatics, cancer genetics, chromosomal rearrangements, copy number variant analysis, next-generation sequencing",
author = "James Smadbeck and Johnson, {Sarah H.} and Smoley, {Stephanie A.} and Athanasios Gaitatzes and Drucker, {Travis M.} and Zenka, {Roman M.} and Farhad Kosari and Murphy, {Stephen J.} and Nicole Hoppman and Umut Aypar and Sukov, {William R.} and Jenkins, {Robert Brian} and Kearney, {Hutton M.} and Feldman, {Andrew L} and George Vasmatzis",
year = "2018",
month = "9",
day = "1",
doi = "10.1002/gcc.5",
language = "English (US)",
volume = "57",
pages = "459--470",
journal = "Genes Chromosomes and Cancer",
issn = "1045-2257",
publisher = "Wiley-Liss Inc.",
number = "9",

}

TY - JOUR

T1 - Copy number variant analysis using genome-wide mate-pair sequencing

AU - Smadbeck, James

AU - Johnson, Sarah H.

AU - Smoley, Stephanie A.

AU - Gaitatzes, Athanasios

AU - Drucker, Travis M.

AU - Zenka, Roman M.

AU - Kosari, Farhad

AU - Murphy, Stephen J.

AU - Hoppman, Nicole

AU - Aypar, Umut

AU - Sukov, William R.

AU - Jenkins, Robert Brian

AU - Kearney, Hutton M.

AU - Feldman, Andrew L

AU - Vasmatzis, George

PY - 2018/9/1

Y1 - 2018/9/1

KW - bioinformatics

KW - cancer genetics

KW - chromosomal rearrangements

KW - copy number variant analysis

KW - next-generation sequencing

UR - http://www.scopus.com/inward/record.url?scp=85050017442&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85050017442&partnerID=8YFLogxK

U2 - 10.1002/gcc.5

DO - 10.1002/gcc.5

M3 - Article

C2 - 29726617

AN - SCOPUS:85050017442

VL - 57

SP - 459

EP - 470

JO - Genes Chromosomes and Cancer

JF - Genes Chromosomes and Cancer

SN - 1045-2257

IS - 9

ER -