Leveraging spatial variation in tumor purity for improved somatic variant calling of archival tumor only samples

Rebecca F. Halperin, Winnie S. Liang, Sidharth Kulkarni, Erica E. Tassone, Jonathan Adkins, Daniel Enriquez, Nhan Tran, Nicole C. Hank, James Newell, Chinnappa Kodira, Ronald Korn, Michael E. Berens, Seungchan Kim, Sara A. Byron

Research output: Contribution to journalArticle

Abstract

Archival tumor samples represent a rich resource of annotated specimens for translational genomics research. However, standard variant calling approaches require a matched normal sample from the same individual, which is often not available in the retrospective setting, making it difficult to distinguish between true somatic variants and individual-specific germline variants. Archival sections often contain adjacent normal tissue, but this tissue can include infiltrating tumor cells. As existing comparative somatic variant callers are designed to exclude variants present in the normal sample, a novel approach is required to leverage adjacent normal tissue with infiltrating tumor cells for somatic variant calling. Here we present lumosVar 2.0, a software package designed to jointly analyze multiple samples from the same patient, built upon our previous single sample tumor only variant caller lumosVar 1.0. The approach assumes that the allelic fraction of somatic variants and germline variants follow different patterns as tumor content and copy number state change. lumosVar 2.0 estimates allele specific copy number and tumor sample fractions from the data, and uses a to model to determine expected allelic fractions for somatic and germline variants and to classify variants accordingly. To evaluate the utility of lumosVar 2.0 to jointly call somatic variants with tumor and adjacent normal samples, we used a glioblastoma dataset with matched high and low tumor content and germline whole exome sequencing data (for true somatic variants) available for each patient. Both sensitivity and positive predictive value were improved when analyzing the high tumor and low tumor samples jointly compared to analyzing the samples individually or in-silico pooling of the two samples. Finally, we applied this approach to a set of breast and prostate archival tumor samples for which tumor blocks containing adjacent normal tissue were available for sequencing. Joint analysis using lumosVar 2.0 detected several variants, including known cancer hotspot mutations that were not detected by standard somatic variant calling tools using the adjacent tissue as presumed normal reference. Together, these results demonstrate the utility of leveraging paired tissue samples to improve somatic variant calling when a constitutional sample is not available.

Original languageEnglish (US)
JournalFrontiers in Oncology
Volume9
Issue numberMAR
DOIs
StatePublished - Jan 1 2019

Fingerprint

Neoplasms
Exome
Translational Medical Research
Glioblastoma
Genomics
Computer Simulation
Prostate
Breast
Software
Alleles
Mutation

Keywords

  • Cancer genomics
  • Cancer hotspot mutations
  • Next generation sequencing
  • Somatic variant calling
  • Tumor exome sequencing
  • Tumor-only sequencing

ASJC Scopus subject areas

  • Oncology
  • Cancer Research

Cite this

Halperin, R. F., Liang, W. S., Kulkarni, S., Tassone, E. E., Adkins, J., Enriquez, D., ... Byron, S. A. (2019). Leveraging spatial variation in tumor purity for improved somatic variant calling of archival tumor only samples. Frontiers in Oncology, 9(MAR). https://doi.org/10.3389/fonc.2019.00119

Leveraging spatial variation in tumor purity for improved somatic variant calling of archival tumor only samples. / Halperin, Rebecca F.; Liang, Winnie S.; Kulkarni, Sidharth; Tassone, Erica E.; Adkins, Jonathan; Enriquez, Daniel; Tran, Nhan; Hank, Nicole C.; Newell, James; Kodira, Chinnappa; Korn, Ronald; Berens, Michael E.; Kim, Seungchan; Byron, Sara A.

In: Frontiers in Oncology, Vol. 9, No. MAR, 01.01.2019.

Research output: Contribution to journalArticle

Halperin, RF, Liang, WS, Kulkarni, S, Tassone, EE, Adkins, J, Enriquez, D, Tran, N, Hank, NC, Newell, J, Kodira, C, Korn, R, Berens, ME, Kim, S & Byron, SA 2019, 'Leveraging spatial variation in tumor purity for improved somatic variant calling of archival tumor only samples', Frontiers in Oncology, vol. 9, no. MAR. https://doi.org/10.3389/fonc.2019.00119
Halperin, Rebecca F. ; Liang, Winnie S. ; Kulkarni, Sidharth ; Tassone, Erica E. ; Adkins, Jonathan ; Enriquez, Daniel ; Tran, Nhan ; Hank, Nicole C. ; Newell, James ; Kodira, Chinnappa ; Korn, Ronald ; Berens, Michael E. ; Kim, Seungchan ; Byron, Sara A. / Leveraging spatial variation in tumor purity for improved somatic variant calling of archival tumor only samples. In: Frontiers in Oncology. 2019 ; Vol. 9, No. MAR.
@article{6f93cfd164b949c0a4d8b379ea0ebf14,
title = "Leveraging spatial variation in tumor purity for improved somatic variant calling of archival tumor only samples",
abstract = "Archival tumor samples represent a rich resource of annotated specimens for translational genomics research. However, standard variant calling approaches require a matched normal sample from the same individual, which is often not available in the retrospective setting, making it difficult to distinguish between true somatic variants and individual-specific germline variants. Archival sections often contain adjacent normal tissue, but this tissue can include infiltrating tumor cells. As existing comparative somatic variant callers are designed to exclude variants present in the normal sample, a novel approach is required to leverage adjacent normal tissue with infiltrating tumor cells for somatic variant calling. Here we present lumosVar 2.0, a software package designed to jointly analyze multiple samples from the same patient, built upon our previous single sample tumor only variant caller lumosVar 1.0. The approach assumes that the allelic fraction of somatic variants and germline variants follow different patterns as tumor content and copy number state change. lumosVar 2.0 estimates allele specific copy number and tumor sample fractions from the data, and uses a to model to determine expected allelic fractions for somatic and germline variants and to classify variants accordingly. To evaluate the utility of lumosVar 2.0 to jointly call somatic variants with tumor and adjacent normal samples, we used a glioblastoma dataset with matched high and low tumor content and germline whole exome sequencing data (for true somatic variants) available for each patient. Both sensitivity and positive predictive value were improved when analyzing the high tumor and low tumor samples jointly compared to analyzing the samples individually or in-silico pooling of the two samples. Finally, we applied this approach to a set of breast and prostate archival tumor samples for which tumor blocks containing adjacent normal tissue were available for sequencing. Joint analysis using lumosVar 2.0 detected several variants, including known cancer hotspot mutations that were not detected by standard somatic variant calling tools using the adjacent tissue as presumed normal reference. Together, these results demonstrate the utility of leveraging paired tissue samples to improve somatic variant calling when a constitutional sample is not available.",
keywords = "Cancer genomics, Cancer hotspot mutations, Next generation sequencing, Somatic variant calling, Tumor exome sequencing, Tumor-only sequencing",
author = "Halperin, {Rebecca F.} and Liang, {Winnie S.} and Sidharth Kulkarni and Tassone, {Erica E.} and Jonathan Adkins and Daniel Enriquez and Nhan Tran and Hank, {Nicole C.} and James Newell and Chinnappa Kodira and Ronald Korn and Berens, {Michael E.} and Seungchan Kim and Byron, {Sara A.}",
year = "2019",
month = "1",
day = "1",
doi = "10.3389/fonc.2019.00119",
language = "English (US)",
volume = "9",
journal = "Frontiers in Oncology",
issn = "2234-943X",
publisher = "Frontiers Media S. A.",
number = "MAR",

}

TY - JOUR

T1 - Leveraging spatial variation in tumor purity for improved somatic variant calling of archival tumor only samples

AU - Halperin, Rebecca F.

AU - Liang, Winnie S.

AU - Kulkarni, Sidharth

AU - Tassone, Erica E.

AU - Adkins, Jonathan

AU - Enriquez, Daniel

AU - Tran, Nhan

AU - Hank, Nicole C.

AU - Newell, James

AU - Kodira, Chinnappa

AU - Korn, Ronald

AU - Berens, Michael E.

AU - Kim, Seungchan

AU - Byron, Sara A.

PY - 2019/1/1

Y1 - 2019/1/1

N2 - Archival tumor samples represent a rich resource of annotated specimens for translational genomics research. However, standard variant calling approaches require a matched normal sample from the same individual, which is often not available in the retrospective setting, making it difficult to distinguish between true somatic variants and individual-specific germline variants. Archival sections often contain adjacent normal tissue, but this tissue can include infiltrating tumor cells. As existing comparative somatic variant callers are designed to exclude variants present in the normal sample, a novel approach is required to leverage adjacent normal tissue with infiltrating tumor cells for somatic variant calling. Here we present lumosVar 2.0, a software package designed to jointly analyze multiple samples from the same patient, built upon our previous single sample tumor only variant caller lumosVar 1.0. The approach assumes that the allelic fraction of somatic variants and germline variants follow different patterns as tumor content and copy number state change. lumosVar 2.0 estimates allele specific copy number and tumor sample fractions from the data, and uses a to model to determine expected allelic fractions for somatic and germline variants and to classify variants accordingly. To evaluate the utility of lumosVar 2.0 to jointly call somatic variants with tumor and adjacent normal samples, we used a glioblastoma dataset with matched high and low tumor content and germline whole exome sequencing data (for true somatic variants) available for each patient. Both sensitivity and positive predictive value were improved when analyzing the high tumor and low tumor samples jointly compared to analyzing the samples individually or in-silico pooling of the two samples. Finally, we applied this approach to a set of breast and prostate archival tumor samples for which tumor blocks containing adjacent normal tissue were available for sequencing. Joint analysis using lumosVar 2.0 detected several variants, including known cancer hotspot mutations that were not detected by standard somatic variant calling tools using the adjacent tissue as presumed normal reference. Together, these results demonstrate the utility of leveraging paired tissue samples to improve somatic variant calling when a constitutional sample is not available.

AB - Archival tumor samples represent a rich resource of annotated specimens for translational genomics research. However, standard variant calling approaches require a matched normal sample from the same individual, which is often not available in the retrospective setting, making it difficult to distinguish between true somatic variants and individual-specific germline variants. Archival sections often contain adjacent normal tissue, but this tissue can include infiltrating tumor cells. As existing comparative somatic variant callers are designed to exclude variants present in the normal sample, a novel approach is required to leverage adjacent normal tissue with infiltrating tumor cells for somatic variant calling. Here we present lumosVar 2.0, a software package designed to jointly analyze multiple samples from the same patient, built upon our previous single sample tumor only variant caller lumosVar 1.0. The approach assumes that the allelic fraction of somatic variants and germline variants follow different patterns as tumor content and copy number state change. lumosVar 2.0 estimates allele specific copy number and tumor sample fractions from the data, and uses a to model to determine expected allelic fractions for somatic and germline variants and to classify variants accordingly. To evaluate the utility of lumosVar 2.0 to jointly call somatic variants with tumor and adjacent normal samples, we used a glioblastoma dataset with matched high and low tumor content and germline whole exome sequencing data (for true somatic variants) available for each patient. Both sensitivity and positive predictive value were improved when analyzing the high tumor and low tumor samples jointly compared to analyzing the samples individually or in-silico pooling of the two samples. Finally, we applied this approach to a set of breast and prostate archival tumor samples for which tumor blocks containing adjacent normal tissue were available for sequencing. Joint analysis using lumosVar 2.0 detected several variants, including known cancer hotspot mutations that were not detected by standard somatic variant calling tools using the adjacent tissue as presumed normal reference. Together, these results demonstrate the utility of leveraging paired tissue samples to improve somatic variant calling when a constitutional sample is not available.

KW - Cancer genomics

KW - Cancer hotspot mutations

KW - Next generation sequencing

KW - Somatic variant calling

KW - Tumor exome sequencing

KW - Tumor-only sequencing

UR - http://www.scopus.com/inward/record.url?scp=85063346553&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85063346553&partnerID=8YFLogxK

U2 - 10.3389/fonc.2019.00119

DO - 10.3389/fonc.2019.00119

M3 - Article

VL - 9

JO - Frontiers in Oncology

JF - Frontiers in Oncology

SN - 2234-943X

IS - MAR

ER -