DICOM Data Warehouse: Part 2

Steve G. Langer

Research output: Contribution to journalArticle

3 Citations (Scopus)

Abstract

In 2010, the DICOM Data Warehouse (DDW) was launched as a data warehouse for DICOM meta-data. Its chief design goals were to have a flexible database schema that enabled it to index standard patient and study information, modality specific tags (public and private), and create a framework to derive computable information (derived tags) from the former items. Furthermore, it was to map the above information to an internally standard lexicon that enables a non-DICOM savvy programmer to write standard SQL queries and retrieve the equivalent data from a cohort of scanners, regardless of what tag that data element was found in over the changing epochs of DICOM and ensuing migration of elements from private to public tags. After 5 years, the original design has scaled astonishingly well. Very little has changed in the database schema. The knowledge base is now fluent in over 90 device types. Also, additional stored procedures have been written to compute data that is derivable from standard or mapped tags. Finally, an early concern is that the system would not be able to address the variability DICOM-SR objects has been addressed. As of this writing the system is indexing 300 MR, 600 CT, and 2000 other (XA, DR, CR, MG) imaging studies per day. The only remaining issue to be solved is the case for tags that were not prospectively indexed—and indeed, this final challenge may lead to a noSQL, big data, approach in a subsequent version.

Original languageEnglish (US)
JournalJournal of Digital Imaging
DOIs
StateAccepted/In press - Oct 30 2015

Fingerprint

Digital Imaging and Communications in Medicine (DICOM)
Data warehouses
Databases
Knowledge Bases
Metadata
Imaging techniques
Equipment and Supplies

Keywords

  • Computer systems
  • Data mining
  • Databases

ASJC Scopus subject areas

  • Radiology Nuclear Medicine and imaging
  • Radiological and Ultrasound Technology
  • Computer Science Applications

Cite this

DICOM Data Warehouse : Part 2. / Langer, Steve G.

In: Journal of Digital Imaging, 30.10.2015.

Research output: Contribution to journalArticle

@article{a4769a90609549c0b7fb0d06e5928cd9,
title = "DICOM Data Warehouse: Part 2",
abstract = "In 2010, the DICOM Data Warehouse (DDW) was launched as a data warehouse for DICOM meta-data. Its chief design goals were to have a flexible database schema that enabled it to index standard patient and study information, modality specific tags (public and private), and create a framework to derive computable information (derived tags) from the former items. Furthermore, it was to map the above information to an internally standard lexicon that enables a non-DICOM savvy programmer to write standard SQL queries and retrieve the equivalent data from a cohort of scanners, regardless of what tag that data element was found in over the changing epochs of DICOM and ensuing migration of elements from private to public tags. After 5 years, the original design has scaled astonishingly well. Very little has changed in the database schema. The knowledge base is now fluent in over 90 device types. Also, additional stored procedures have been written to compute data that is derivable from standard or mapped tags. Finally, an early concern is that the system would not be able to address the variability DICOM-SR objects has been addressed. As of this writing the system is indexing 300 MR, 600 CT, and 2000 other (XA, DR, CR, MG) imaging studies per day. The only remaining issue to be solved is the case for tags that were not prospectively indexed—and indeed, this final challenge may lead to a noSQL, big data, approach in a subsequent version.",
keywords = "Computer systems, Data mining, Databases",
author = "Langer, {Steve G.}",
year = "2015",
month = "10",
day = "30",
doi = "10.1007/s10278-015-9830-4",
language = "English (US)",
journal = "Journal of Digital Imaging",
issn = "0897-1889",
publisher = "Springer New York",

}

TY - JOUR

T1 - DICOM Data Warehouse

T2 - Part 2

AU - Langer, Steve G.

PY - 2015/10/30

Y1 - 2015/10/30

N2 - In 2010, the DICOM Data Warehouse (DDW) was launched as a data warehouse for DICOM meta-data. Its chief design goals were to have a flexible database schema that enabled it to index standard patient and study information, modality specific tags (public and private), and create a framework to derive computable information (derived tags) from the former items. Furthermore, it was to map the above information to an internally standard lexicon that enables a non-DICOM savvy programmer to write standard SQL queries and retrieve the equivalent data from a cohort of scanners, regardless of what tag that data element was found in over the changing epochs of DICOM and ensuing migration of elements from private to public tags. After 5 years, the original design has scaled astonishingly well. Very little has changed in the database schema. The knowledge base is now fluent in over 90 device types. Also, additional stored procedures have been written to compute data that is derivable from standard or mapped tags. Finally, an early concern is that the system would not be able to address the variability DICOM-SR objects has been addressed. As of this writing the system is indexing 300 MR, 600 CT, and 2000 other (XA, DR, CR, MG) imaging studies per day. The only remaining issue to be solved is the case for tags that were not prospectively indexed—and indeed, this final challenge may lead to a noSQL, big data, approach in a subsequent version.

AB - In 2010, the DICOM Data Warehouse (DDW) was launched as a data warehouse for DICOM meta-data. Its chief design goals were to have a flexible database schema that enabled it to index standard patient and study information, modality specific tags (public and private), and create a framework to derive computable information (derived tags) from the former items. Furthermore, it was to map the above information to an internally standard lexicon that enables a non-DICOM savvy programmer to write standard SQL queries and retrieve the equivalent data from a cohort of scanners, regardless of what tag that data element was found in over the changing epochs of DICOM and ensuing migration of elements from private to public tags. After 5 years, the original design has scaled astonishingly well. Very little has changed in the database schema. The knowledge base is now fluent in over 90 device types. Also, additional stored procedures have been written to compute data that is derivable from standard or mapped tags. Finally, an early concern is that the system would not be able to address the variability DICOM-SR objects has been addressed. As of this writing the system is indexing 300 MR, 600 CT, and 2000 other (XA, DR, CR, MG) imaging studies per day. The only remaining issue to be solved is the case for tags that were not prospectively indexed—and indeed, this final challenge may lead to a noSQL, big data, approach in a subsequent version.

KW - Computer systems

KW - Data mining

KW - Databases

UR - http://www.scopus.com/inward/record.url?scp=84945543778&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84945543778&partnerID=8YFLogxK

U2 - 10.1007/s10278-015-9830-4

DO - 10.1007/s10278-015-9830-4

M3 - Article

C2 - 26518194

AN - SCOPUS:84945543778

JO - Journal of Digital Imaging

JF - Journal of Digital Imaging

SN - 0897-1889

ER -