Assessing Metadata Quality of a Federally Sponsored Health Data Repository

David T. Marc, James Beattie, Vitaly D Herasevich, Laël Gatewood, Rui Zhang

Research output: Contribution to journal › Article

3 Citations (Scopus)

Abstract

The U.S. Federal Government developed HealthData.gov to disseminate healthcare datasets to the public. Metadata is provided for each dataset and is the sole source of information to find and retrieve data. This study employed automated quality assessments of the HealthData.gov metadata published from 2012 to 2014 to measure completeness, accuracy, and consistency of applying standards. The results demonstrated that metadata published in earlier years had lower completeness, accuracy, and consistency. Also, metadata that underwent modifications following their original creation were of higher quality. HealthData.gov did not uniformly apply the Dublin Core Metadata Initiative, a widely accepted metadata standard, to its metadata. These findings suggested that the HealthData.gov metadata suffered from quality issues, particularly in information that was not frequently updated. The results supported the need for policies to standardize metadata and contributed to the development of automated measures of metadata quality.

Original language: English (US)
Pages (from-to): 864-873
Number of pages: 10
Journal: AMIA ... Annual Symposium proceedings. AMIA Symposium
Volume: 2016
State: Published - 2016

Fingerprint

Health
Metadata
Federal Government
Delivery of Health Care
Datasets

ASJC Scopus subject areas

  • Medicine(all)

Cite this

Assessing Metadata Quality of a Federally Sponsored Health Data Repository. / Marc, David T.; Beattie, James; Herasevich, Vitaly D; Gatewood, Laël; Zhang, Rui.

In: AMIA ... Annual Symposium proceedings. AMIA Symposium, Vol. 2016, 2016, p. 864-873.

@article{9376ed5252d74802affbe61f820a3de6,
title = "Assessing Metadata Quality of a Federally Sponsored Health Data Repository",
abstract = "The U.S. Federal Government developed HealthData.gov to disseminate healthcare datasets to the public. Metadata is provided for each dataset and is the sole source of information to find and retrieve data. This study employed automated quality assessments of the HealthData.gov metadata published from 2012 to 2014 to measure completeness, accuracy, and consistency of applying standards. The results demonstrated that metadata published in earlier years had lower completeness, accuracy, and consistency. Also, metadata that underwent modifications following their original creation were of higher quality. HealthData.gov did not uniformly apply the Dublin Core Metadata Initiative, a widely accepted metadata standard, to its metadata. These findings suggested that the HealthData.gov metadata suffered from quality issues, particularly in information that was not frequently updated. The results supported the need for policies to standardize metadata and contributed to the development of automated measures of metadata quality.",
author = "Marc, {David T.} and James Beattie and Herasevich, {Vitaly D} and La{\"e}l Gatewood and Rui Zhang",
year = "2016",
language = "English (US)",
volume = "2016",
pages = "864--873",
journal = "AMIA ... Annual Symposium proceedings / AMIA Symposium. AMIA Symposium",
issn = "1559-4076",
publisher = "American Medical Informatics Association",

}

TY - JOUR

T1 - Assessing Metadata Quality of a Federally Sponsored Health Data Repository

AU - Marc, David T.

AU - Beattie, James

AU - Herasevich, Vitaly D

AU - Gatewood, Laël

AU - Zhang, Rui

PY - 2016

Y1 - 2016

N2 - The U.S. Federal Government developed HealthData.gov to disseminate healthcare datasets to the public. Metadata is provided for each dataset and is the sole source of information to find and retrieve data. This study employed automated quality assessments of the HealthData.gov metadata published from 2012 to 2014 to measure completeness, accuracy, and consistency of applying standards. The results demonstrated that metadata published in earlier years had lower completeness, accuracy, and consistency. Also, metadata that underwent modifications following their original creation were of higher quality. HealthData.gov did not uniformly apply the Dublin Core Metadata Initiative, a widely accepted metadata standard, to its metadata. These findings suggested that the HealthData.gov metadata suffered from quality issues, particularly in information that was not frequently updated. The results supported the need for policies to standardize metadata and contributed to the development of automated measures of metadata quality.

UR - http://www.scopus.com/inward/record.url?scp=85027533087&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85027533087&partnerID=8YFLogxK

M3 - Article

VL - 2016

SP - 864

EP - 873

JO - AMIA ... Annual Symposium proceedings / AMIA Symposium. AMIA Symposium

JF - AMIA ... Annual Symposium proceedings / AMIA Symposium. AMIA Symposium

SN - 1559-4076

ER -