A study of abbreviations in the UMLS.

H. Liu, Y. A. Lussier, C. Friedman

Research output: Contribution to journalArticlepeer-review

42 Scopus citations

Abstract

Abbreviations are widely used in medicine. The understanding of abbreviations is important for medical language processing and information retrieval systems. The Unified Medical Language System (UMLS) contains a large number of abbreviations. We hypothesized that extracting and studying the UMLS abbreviations can be helpful for understanding the characteristics of abbreviations in medicine. In this paper, we describe a method for extracting abbreviations from the UMLS. We evaluated the method and studied the ambiguous nature of the abbreviations. In addition, the coverage of the UMLS abbreviations in medical reports was studied. Using our method, we extracted 163,666 unique (abbreviation, full form) pairs from the UMLS with a precision of 97.5%, and a recall of 96%. The UMLS abbreviations were highly ambiguous: 33.1% of abbreviations with six characters or less had multiple meanings; the average number of different full forms for all abbreviations with six characters or less was 2.28. The coverage of the UMLS abbreviations in medical reports was over 66%.

Original languageEnglish (US)
Pages (from-to)393-397
Number of pages5
JournalProceedings / AMIA ... Annual Symposium. AMIA Symposium
StatePublished - 2001

ASJC Scopus subject areas

  • General Medicine

Fingerprint

Dive into the research topics of 'A study of abbreviations in the UMLS.'. Together they form a unique fingerprint.

Cite this