Symbolic covariance principal component analysis and visualization for interval-valued data

Jennifer Le-Rademacher, Lynne Billard

Research output: Contribution to journalArticle

26 Citations (Scopus)

Abstract

This article proposes a new approach to principal component analysis (PCA) for interval-valued data. Unlike classical observations, which are represented by single points in p-dimensional space R p interval-valued observations are represented by hyperrectangles in R p, and as such, have an internal structure that does not exist in classical observations. As a consequence, statistical methods for classical data must be modified to account for the structure of the hyper-rectangles before they can be applied to intervalvalued data. This article extends the classical PCA method to interval-valued data by using the so-called symbolic covariance to determine the principal component (PC) space to reflect the total variation of interval-valued data. The article also provides a new approach to constructing the observations in a PC space for better visualization. This new representation of the observations reflects their true structure in the PC space. Supplementary materials for this article are available online.

Original languageEnglish (US)
Pages (from-to)413-432
Number of pages20
JournalJournal of Computational and Graphical Statistics
Volume21
Issue number2
DOIs
StatePublished - Jun 1 2012
Externally publishedYes

Fingerprint

Principal Component Analysis
Visualization
Principal Components
Interval
Total Variation
Rectangle
Statistical method
Observation
Interval data
Principal component analysis
Internal
Principal components

Keywords

  • Convex hull
  • Linear transformation
  • Polytopes
  • Symbolic data analysis

ASJC Scopus subject areas

  • Statistics and Probability
  • Discrete Mathematics and Combinatorics
  • Statistics, Probability and Uncertainty

Cite this

Symbolic covariance principal component analysis and visualization for interval-valued data. / Le-Rademacher, Jennifer; Billard, Lynne.

In: Journal of Computational and Graphical Statistics, Vol. 21, No. 2, 01.06.2012, p. 413-432.

Research output: Contribution to journalArticle

@article{dd6e330fd37341c187139e03a6e77169,
title = "Symbolic covariance principal component analysis and visualization for interval-valued data",
abstract = "This article proposes a new approach to principal component analysis (PCA) for interval-valued data. Unlike classical observations, which are represented by single points in p-dimensional space R p interval-valued observations are represented by hyperrectangles in R p, and as such, have an internal structure that does not exist in classical observations. As a consequence, statistical methods for classical data must be modified to account for the structure of the hyper-rectangles before they can be applied to intervalvalued data. This article extends the classical PCA method to interval-valued data by using the so-called symbolic covariance to determine the principal component (PC) space to reflect the total variation of interval-valued data. The article also provides a new approach to constructing the observations in a PC space for better visualization. This new representation of the observations reflects their true structure in the PC space. Supplementary materials for this article are available online.",
keywords = "Convex hull, Linear transformation, Polytopes, Symbolic data analysis",
author = "Jennifer Le-Rademacher and Lynne Billard",
year = "2012",
month = "6",
day = "1",
doi = "10.1080/10618600.2012.679895",
language = "English (US)",
volume = "21",
pages = "413--432",
journal = "Journal of Computational and Graphical Statistics",
issn = "1061-8600",
publisher = "American Statistical Association",
number = "2",

}

TY - JOUR

T1 - Symbolic covariance principal component analysis and visualization for interval-valued data

AU - Le-Rademacher, Jennifer

AU - Billard, Lynne

PY - 2012/6/1

Y1 - 2012/6/1

N2 - This article proposes a new approach to principal component analysis (PCA) for interval-valued data. Unlike classical observations, which are represented by single points in p-dimensional space R p interval-valued observations are represented by hyperrectangles in R p, and as such, have an internal structure that does not exist in classical observations. As a consequence, statistical methods for classical data must be modified to account for the structure of the hyper-rectangles before they can be applied to intervalvalued data. This article extends the classical PCA method to interval-valued data by using the so-called symbolic covariance to determine the principal component (PC) space to reflect the total variation of interval-valued data. The article also provides a new approach to constructing the observations in a PC space for better visualization. This new representation of the observations reflects their true structure in the PC space. Supplementary materials for this article are available online.

AB - This article proposes a new approach to principal component analysis (PCA) for interval-valued data. Unlike classical observations, which are represented by single points in p-dimensional space R p interval-valued observations are represented by hyperrectangles in R p, and as such, have an internal structure that does not exist in classical observations. As a consequence, statistical methods for classical data must be modified to account for the structure of the hyper-rectangles before they can be applied to intervalvalued data. This article extends the classical PCA method to interval-valued data by using the so-called symbolic covariance to determine the principal component (PC) space to reflect the total variation of interval-valued data. The article also provides a new approach to constructing the observations in a PC space for better visualization. This new representation of the observations reflects their true structure in the PC space. Supplementary materials for this article are available online.

KW - Convex hull

KW - Linear transformation

KW - Polytopes

KW - Symbolic data analysis

UR - http://www.scopus.com/inward/record.url?scp=84862569554&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84862569554&partnerID=8YFLogxK

U2 - 10.1080/10618600.2012.679895

DO - 10.1080/10618600.2012.679895

M3 - Article

VL - 21

SP - 413

EP - 432

JO - Journal of Computational and Graphical Statistics

JF - Journal of Computational and Graphical Statistics

SN - 1061-8600

IS - 2

ER -