Estrogen receptor alpha positive breast tumors and breast cancer cell lines share similarities in their transcriptome data structures

Yuelin Zhu, Antai Wang, Minetta C Liu, Alan Zwart, Richard Y. Lee, Ann Gallagher, Yue Wang, William R. Miller, J. Michael Dixon, Robert Clarke

Research output: Contribution to journalArticle

34 Citations (Scopus)

Abstract

Established human breast cancer cell lines are widely used as experimental models in breast cancer research. While these cell lines and their variants share many phenotypic characteristics with human breast tumors, the extent to which they reflect the underlying molecular biology of breast cancer remains controversial. We explored this issue using a probabilistic rather than heuristic approach. Data from gene expression microarrays were used to compare the global structures of the transcriptomes of three estrogen receptor alpha positive (ER+) human breast cancer cell lines (MCF-7, T47D, ZR-75-1) and 13 human breast tumors (11 ER+; 2 ER-). Linear representations of the respective data structures were obtained by deriving those top principal components (PCs) required to capture ≥80% of the cumulative variance for each data set (M PCs). We then identified those genes most highly correlated with the M PCs (Pearson's correlation coefficient r≥0.800) and identified a group of 36 genes commonly correlated with both the cell line (M = 5 PCs) and tumor (M = 6 PCs) data structures. All 36 common genes were correlated with PC1 from the breast tumor data: 21/36 genes were correlated with PC1, 14/36 genes correlated with PC2, and 1/36 genes correlated with PC3 from the cell line data. Genes important in defining the data structures include NFκB p65, IGFBP-6, ornithine decarboxylase-1, and paxillin. When data from MDA-MB-435 xenografts (ER-) were included in the analysis, we were unable to find any common genes between these xenografts and the breast tumors. These data clearly imply that MCF-7, T47D, and ZR-75-1 cells and ER+ breast tumors share substantial global similarities in the structures of their respective transcriptomes, and that these cell lines are good models in which to identify molecular events that are likely to be important in some ER+ human breast cancers.

Original languageEnglish (US)
Pages (from-to)1581-1589
Number of pages9
JournalInternational Journal of Oncology
Volume29
Issue number6
StatePublished - Dec 2006
Externally publishedYes

Fingerprint

Estrogen Receptor alpha
Transcriptome
Breast Neoplasms
Cell Line
Myeloma Proteins
Genes
Heterografts
Insulin-Like Growth Factor Binding Protein 6
Paxillin
Ornithine Decarboxylase
Molecular Biology
Theoretical Models
Gene Expression

Keywords

  • Bioinformatics
  • Breast cancer
  • Cell lines
  • Data structure
  • Genomics
  • Microarray analysis
  • Principal component analysis

ASJC Scopus subject areas

  • Cancer Research
  • Oncology

Cite this

Estrogen receptor alpha positive breast tumors and breast cancer cell lines share similarities in their transcriptome data structures. / Zhu, Yuelin; Wang, Antai; Liu, Minetta C; Zwart, Alan; Lee, Richard Y.; Gallagher, Ann; Wang, Yue; Miller, William R.; Dixon, J. Michael; Clarke, Robert.

In: International Journal of Oncology, Vol. 29, No. 6, 12.2006, p. 1581-1589.

Research output: Contribution to journalArticle

Zhu, Y, Wang, A, Liu, MC, Zwart, A, Lee, RY, Gallagher, A, Wang, Y, Miller, WR, Dixon, JM & Clarke, R 2006, 'Estrogen receptor alpha positive breast tumors and breast cancer cell lines share similarities in their transcriptome data structures', International Journal of Oncology, vol. 29, no. 6, pp. 1581-1589.
Zhu, Yuelin ; Wang, Antai ; Liu, Minetta C ; Zwart, Alan ; Lee, Richard Y. ; Gallagher, Ann ; Wang, Yue ; Miller, William R. ; Dixon, J. Michael ; Clarke, Robert. / Estrogen receptor alpha positive breast tumors and breast cancer cell lines share similarities in their transcriptome data structures. In: International Journal of Oncology. 2006 ; Vol. 29, No. 6. pp. 1581-1589.
@article{1c2e2719f7d6404390432aca74811a79,
title = "Estrogen receptor alpha positive breast tumors and breast cancer cell lines share similarities in their transcriptome data structures",
abstract = "Established human breast cancer cell lines are widely used as experimental models in breast cancer research. While these cell lines and their variants share many phenotypic characteristics with human breast tumors, the extent to which they reflect the underlying molecular biology of breast cancer remains controversial. We explored this issue using a probabilistic rather than heuristic approach. Data from gene expression microarrays were used to compare the global structures of the transcriptomes of three estrogen receptor alpha positive (ER+) human breast cancer cell lines (MCF-7, T47D, ZR-75-1) and 13 human breast tumors (11 ER+; 2 ER-). Linear representations of the respective data structures were obtained by deriving those top principal components (PCs) required to capture ≥80{\%} of the cumulative variance for each data set (M PCs). We then identified those genes most highly correlated with the M PCs (Pearson's correlation coefficient r≥0.800) and identified a group of 36 genes commonly correlated with both the cell line (M = 5 PCs) and tumor (M = 6 PCs) data structures. All 36 common genes were correlated with PC1 from the breast tumor data: 21/36 genes were correlated with PC1, 14/36 genes correlated with PC2, and 1/36 genes correlated with PC3 from the cell line data. Genes important in defining the data structures include NFκB p65, IGFBP-6, ornithine decarboxylase-1, and paxillin. When data from MDA-MB-435 xenografts (ER-) were included in the analysis, we were unable to find any common genes between these xenografts and the breast tumors. These data clearly imply that MCF-7, T47D, and ZR-75-1 cells and ER+ breast tumors share substantial global similarities in the structures of their respective transcriptomes, and that these cell lines are good models in which to identify molecular events that are likely to be important in some ER+ human breast cancers.",
keywords = "Bioinformatics, Breast cancer, Cell lines, Data structure, Genomics, Microarray analysis, Principal component analysis",
author = "Yuelin Zhu and Antai Wang and Liu, {Minetta C} and Alan Zwart and Lee, {Richard Y.} and Ann Gallagher and Yue Wang and Miller, {William R.} and Dixon, {J. Michael} and Robert Clarke",
year = "2006",
month = "12",
language = "English (US)",
volume = "29",
pages = "1581--1589",
journal = "International Journal of Oncology",
issn = "1019-6439",
publisher = "Spandidos Publications",
number = "6",

}

TY - JOUR

T1 - Estrogen receptor alpha positive breast tumors and breast cancer cell lines share similarities in their transcriptome data structures

AU - Zhu, Yuelin

AU - Wang, Antai

AU - Liu, Minetta C

AU - Zwart, Alan

AU - Lee, Richard Y.

AU - Gallagher, Ann

AU - Wang, Yue

AU - Miller, William R.

AU - Dixon, J. Michael

AU - Clarke, Robert

PY - 2006/12

Y1 - 2006/12

N2 - Established human breast cancer cell lines are widely used as experimental models in breast cancer research. While these cell lines and their variants share many phenotypic characteristics with human breast tumors, the extent to which they reflect the underlying molecular biology of breast cancer remains controversial. We explored this issue using a probabilistic rather than heuristic approach. Data from gene expression microarrays were used to compare the global structures of the transcriptomes of three estrogen receptor alpha positive (ER+) human breast cancer cell lines (MCF-7, T47D, ZR-75-1) and 13 human breast tumors (11 ER+; 2 ER-). Linear representations of the respective data structures were obtained by deriving those top principal components (PCs) required to capture ≥80% of the cumulative variance for each data set (M PCs). We then identified those genes most highly correlated with the M PCs (Pearson's correlation coefficient r≥0.800) and identified a group of 36 genes commonly correlated with both the cell line (M = 5 PCs) and tumor (M = 6 PCs) data structures. All 36 common genes were correlated with PC1 from the breast tumor data: 21/36 genes were correlated with PC1, 14/36 genes correlated with PC2, and 1/36 genes correlated with PC3 from the cell line data. Genes important in defining the data structures include NFκB p65, IGFBP-6, ornithine decarboxylase-1, and paxillin. When data from MDA-MB-435 xenografts (ER-) were included in the analysis, we were unable to find any common genes between these xenografts and the breast tumors. These data clearly imply that MCF-7, T47D, and ZR-75-1 cells and ER+ breast tumors share substantial global similarities in the structures of their respective transcriptomes, and that these cell lines are good models in which to identify molecular events that are likely to be important in some ER+ human breast cancers.

AB - Established human breast cancer cell lines are widely used as experimental models in breast cancer research. While these cell lines and their variants share many phenotypic characteristics with human breast tumors, the extent to which they reflect the underlying molecular biology of breast cancer remains controversial. We explored this issue using a probabilistic rather than heuristic approach. Data from gene expression microarrays were used to compare the global structures of the transcriptomes of three estrogen receptor alpha positive (ER+) human breast cancer cell lines (MCF-7, T47D, ZR-75-1) and 13 human breast tumors (11 ER+; 2 ER-). Linear representations of the respective data structures were obtained by deriving those top principal components (PCs) required to capture ≥80% of the cumulative variance for each data set (M PCs). We then identified those genes most highly correlated with the M PCs (Pearson's correlation coefficient r≥0.800) and identified a group of 36 genes commonly correlated with both the cell line (M = 5 PCs) and tumor (M = 6 PCs) data structures. All 36 common genes were correlated with PC1 from the breast tumor data: 21/36 genes were correlated with PC1, 14/36 genes correlated with PC2, and 1/36 genes correlated with PC3 from the cell line data. Genes important in defining the data structures include NFκB p65, IGFBP-6, ornithine decarboxylase-1, and paxillin. When data from MDA-MB-435 xenografts (ER-) were included in the analysis, we were unable to find any common genes between these xenografts and the breast tumors. These data clearly imply that MCF-7, T47D, and ZR-75-1 cells and ER+ breast tumors share substantial global similarities in the structures of their respective transcriptomes, and that these cell lines are good models in which to identify molecular events that are likely to be important in some ER+ human breast cancers.

KW - Bioinformatics

KW - Breast cancer

KW - Cell lines

KW - Data structure

KW - Genomics

KW - Microarray analysis

KW - Principal component analysis

UR - http://www.scopus.com/inward/record.url?scp=35348985494&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=35348985494&partnerID=8YFLogxK

M3 - Article

C2 - 17089000

AN - SCOPUS:35348985494

VL - 29

SP - 1581

EP - 1589

JO - International Journal of Oncology

JF - International Journal of Oncology

SN - 1019-6439

IS - 6

ER -