Demonstration of a software design and statistical analysis methodology with application to patient outcomes data sets

Charles Mayo, Steve Conners, Christopher Warren, Robert Miller, Laurence Court, Richard Popple

Research output: Contribution to journal > Article

3 Citations (Scopus)

Abstract

Purpose: With the emergence of clinical outcomes databases as tools used routinely within institutions comes the need for software tools that support automated statistical analysis of these large data sets and intrainstitutional exchange from independent federated databases to support data pooling. In this paper, the authors present a design approach and analysis methodology that addresses both issues.

Methods: A software application was constructed to automate analysis of patient outcomes data using a wide range of statistical metrics by combining C#.Net and R code. The accuracy and speed of the code were evaluated using benchmark data sets.

Results: The approach provides the data needed to evaluate combinations of statistical measurements for their ability to identify patterns of interest in the data. Through application of the tools to a benchmark data set for dose-response threshold and to SBRT lung data sets, an algorithm was developed that uses receiver operating characteristic curves to identify a threshold value and combines contingency tables, Fisher exact tests, Welch t-tests, and Kolmogorov-Smirnov tests to filter the large data set and identify values demonstrating dose-response. Kullback-Leibler divergences were used to provide additional confirmation.

Conclusions: The work demonstrates the viability of the design approach and the software tool for analysis of large data sets.
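The filtering steps named in the abstract can be illustrated with a minimal sketch. This is not the authors' code: their tool combines C#.Net and R, while the sketch below uses plain Python so it runs without extra packages. The abstract does not say how the threshold is read off the ROC curve, so maximizing Youden's J (true-positive rate minus false-positive rate) is an assumption here. All function names and the toy dose and outcome values are hypothetical, and the Kullback-Leibler confirmation step is left out.

```python
from math import comb, sqrt
from statistics import mean, variance


def youden_threshold(values, events):
    """Scan candidate cuts; return (cut, J) maximizing Youden's J = TPR - FPR.
    Assumed criterion: the source only says ROC curves identify a threshold."""
    n_pos = sum(events)
    n_neg = len(events) - n_pos
    best_cut, best_j = None, float("-inf")
    for cut in sorted(set(values)):
        tp = sum(1 for v, e in zip(values, events) if e and v >= cut)
        fp = sum(1 for v, e in zip(values, events) if not e and v >= cut)
        j = tp / n_pos - fp / n_neg
        if j > best_j:
            best_cut, best_j = cut, j
    return best_cut, best_j


def contingency_table(values, events, cut):
    """2x2 counts: (event & >=cut, no event & >=cut, event & <cut, no event & <cut)."""
    a = sum(1 for v, e in zip(values, events) if e and v >= cut)
    b = sum(1 for v, e in zip(values, events) if not e and v >= cut)
    c = sum(1 for v, e in zip(values, events) if e and v < cut)
    d = sum(1 for v, e in zip(values, events) if not e and v < cut)
    return a, b, c, d


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher exact p-value: sum the hypergeometric probabilities of all
    tables with the same margins that are no more likely than the observed one."""
    row1, row2, col1, n = a + b, c + d, a + c, a + b + c + d

    def prob(x):
        return comb(row1, x) * comb(row2, col1 - x) / comb(n, col1)

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # Small relative tolerance guards against floating-point ties.
    return sum(prob(x) for x in range(lo, hi + 1) if prob(x) <= p_obs * (1 + 1e-9))


def welch_t(x, y):
    """Welch t statistic and Welch-Satterthwaite degrees of freedom."""
    sx, sy = variance(x) / len(x), variance(y) / len(y)
    t = (mean(x) - mean(y)) / sqrt(sx + sy)
    df = (sx + sy) ** 2 / (sx ** 2 / (len(x) - 1) + sy ** 2 / (len(y) - 1))
    return t, df


def ks_statistic(x, y):
    """Two-sample Kolmogorov-Smirnov statistic: largest gap between empirical CDFs."""
    def ecdf(sample, point):
        return sum(1 for v in sample if v <= point) / len(sample)
    return max(abs(ecdf(x, p) - ecdf(y, p)) for p in set(x) | set(y))


# Invented toy data: one dose metric per patient and a binary outcome flag.
doses = [5, 8, 10, 12, 15, 18, 20, 22, 25, 30]
events = [0, 0, 0, 1, 0, 0, 1, 1, 1, 1]

cut, j = youden_threshold(doses, events)
table = contingency_table(doses, events, cut)
p_fisher = fisher_exact_two_sided(*table)
with_event = [v for v, e in zip(doses, events) if e]
without_event = [v for v, e in zip(doses, events) if not e]
t_stat, dof = welch_t(with_event, without_event)
d_ks = ks_statistic(with_event, without_event)
print(cut, table, round(p_fisher, 4), round(t_stat, 2), round(d_ks, 2))
```

In a screening setting, a dose metric would presumably be kept as a dose-response candidate only when several of these statistics agree. The specific pass/fail rules used in the paper are not given in the abstract, so none are encoded here.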

Original language: English (US)
Article number: 111718
Journal: Medical Physics
Volume: 40
Issue number: 11
DOI: 10.1118/1.4824917
State: Published - Nov 2013

Fingerprint

Software Design
Benchmarking
Software
Databases
Nonparametric Statistics
Meta-Analysis
Datasets
Lung

Keywords

  • outcomes database statistics programming

ASJC Scopus subject areas

  • Biophysics
  • Radiology, Nuclear Medicine and Imaging

Cite this

Demonstration of a software design and statistical analysis methodology with application to patient outcomes data sets. / Mayo, Charles; Conners, Steve; Warren, Christopher; Miller, Robert; Court, Laurence; Popple, Richard.

In: Medical Physics, Vol. 40, No. 11, 111718, 11.2013.

Mayo, Charles ; Conners, Steve ; Warren, Christopher ; Miller, Robert ; Court, Laurence ; Popple, Richard. / Demonstration of a software design and statistical analysis methodology with application to patient outcomes data sets. In: Medical Physics. 2013 ; Vol. 40, No. 11.
@article{0e83b49533694c6db026316f2d9ca331,
title = "Demonstration of a software design and statistical analysis methodology with application to patient outcomes data sets",
keywords = "outcomes database statistics programming",
author = "Charles Mayo and Steve Conners and Christopher Warren and Robert Miller and Laurence Court and Richard Popple",
year = "2013",
month = "11",
doi = "10.1118/1.4824917",
language = "English (US)",
volume = "40",
journal = "Medical Physics",
issn = "0094-2405",
publisher = "AAPM - American Association of Physicists in Medicine",
number = "11",

}

TY - JOUR

T1 - Demonstration of a software design and statistical analysis methodology with application to patient outcomes data sets

AU - Mayo, Charles

AU - Conners, Steve

AU - Warren, Christopher

AU - Miller, Robert

AU - Court, Laurence

AU - Popple, Richard

PY - 2013/11

Y1 - 2013/11

KW - outcomes database statistics programming

UR - http://www.scopus.com/inward/record.url?scp=84889657468&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84889657468&partnerID=8YFLogxK

U2 - 10.1118/1.4824917

DO - 10.1118/1.4824917

M3 - Article

C2 - 24320426

AN - SCOPUS:84889657468

VL - 40

JO - Medical Physics

JF - Medical Physics

SN - 0094-2405

IS - 11

M1 - 111718

ER -