Comparing the effects of two semantic terminology models on classification of clinical notes: A study of heart murmur findings

Guoqian Jiang; Christopher G. Chute

Artificial Intelligence and Informatics

Research output: Contribution to journal › Conference article › peer-review

Abstract

Objectives: We compared the effects of two semantic terminology models on classification of clinical notes through a study in the domain of heart murmur findings. Methods: One schema was established from the existing SNOMED CT model (S-Model) and the other was from a template model (T-Model) which uses base concepts and non-hierarchical relationships to characterize the murmurs. A corpus of clinical notes (n=309) was collected and annotated using the two schemas. The annotations were coded for a decision tree classifier for a text classification task. The standard information retrieval measures of precision, recall, f-score and accuracy and the paired t-test were used for evaluation. Results: The performance of the S-Model was better than that of the original T-Model (p<0.05 for recall and f-score). A revised T-Model, produced by extending its structure and corresponding values, performed better than the S-Model (p<0.05 for recall and accuracy). Conclusion: We discovered that content coverage is a more important factor than terminology model for classification; however, a template style facilitates content gap discovery and completion.
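The evaluation measures named in the abstract can be illustrated with a minimal sketch. It uses the standard definitions of precision, recall, f-score and accuracy from confusion-matrix counts, and the standard paired t statistic over matched scores. The function names, the balanced F1 form of the f-score, the idea of pairing scores per evaluation fold, and the toy numbers are all assumptions made for illustration; none of them are taken from the paper.

```python
import math
from statistics import mean, stdev


def prf_accuracy(tp, fp, fn, tn):
    """Return (precision, recall, f_score, accuracy) from confusion counts.

    Hypothetical helper; f_score is the balanced F1 (an assumption).
    """
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f_score = 2 * precision * recall / denom if denom else 0.0
    accuracy = (tp + tn) / (tp + fp + fn + tn)
    return precision, recall, f_score, accuracy


def paired_t_statistic(scores_a, scores_b):
    """Return (t, degrees_of_freedom) for two matched score lists.

    Hypothetical helper; assumes one score per shared evaluation fold.
    """
    if len(scores_a) != len(scores_b) or len(scores_a) < 2:
        raise ValueError("need two equal-length lists with at least 2 pairs")
    diffs = [a - b for a, b in zip(scores_a, scores_b)]
    sd = stdev(diffs)  # sample standard deviation of the differences
    if sd == 0:
        raise ValueError("all paired differences are identical")
    t = mean(diffs) / (sd / math.sqrt(len(diffs)))
    return t, len(diffs) - 1


# Toy numbers for illustration only; they are not results from the study.
print(prf_accuracy(tp=8, fp=2, fn=2, tn=8))
print(paired_t_statistic([0.80, 0.82, 0.78, 0.85], [0.75, 0.80, 0.70, 0.80]))
```

The resulting t value would be compared against the t distribution with the returned degrees of freedom to obtain a p-value such as the p<0.05 thresholds reported above.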

Original language	English (US)
Pages (from-to)	59-65
Number of pages	7
Journal	CEUR Workshop Proceedings
Volume	410
State	Published - 2008
Event	3rd International Conference on Formal Biomedical Knowledge Representation in Medicine: Representing and Sharing Knowledge Using SNOMED, KR-MED 2008 - Phoenix, AZ, United States; Duration: May 31 2008 → Jun 2 2008

ASJC Scopus subject areas

General Computer Science

Cite this

@article{8e64f5ebf4c8439f86029fdc801e45d3,

title = "Comparing the effects of two semantic terminology models on classification of clinical notes: A study of heart murmur findings",

abstract = "Objectives: We compared the effects of two semantic terminology models on classification of clinical notes through a study in the domain of heart murmur findings. Methods: One schema was established from the existing SNOMED CT model (S-Model) and the other was from a template model (T-Model) which uses base concepts and non-hierarchical relationships to characterize the murmurs. A corpus of clinical notes (n=309) was collected and annotated using the two schemas. The annotations were coded for a decision tree classifier for a text classification task. The standard information retrieval measures of precision, recall, f-score and accuracy and the paired t-test were used for evaluation. Results: The performance of the S-Model was better than that of the original T-Model (p<0.05 for recall and f-score). A revised T-Model, produced by extending its structure and corresponding values, performed better than the S-Model (p<0.05 for recall and accuracy). Conclusion: We discovered that content coverage is a more important factor than terminology model for classification; however, a template style facilitates content gap discovery and completion.",

author = "Jiang, Guoqian and Chute, {Christopher G.}",

year = "2008",

language = "English (US)",

volume = "410",

pages = "59--65",

journal = "CEUR Workshop Proceedings",

issn = "1613-0073",

publisher = "CEUR-WS",

note = "3rd International Conference on Formal Biomedical Knowledge Representation in Medicine: Representing and Sharing Knowledge Using SNOMED, KR-MED 2008 ; Conference date: 31-05-2008 Through 02-06-2008",

}

TY - JOUR

T1 - Comparing the effects of two semantic terminology models on classification of clinical notes: A study of heart murmur findings

T2 - 3rd International Conference on Formal Biomedical Knowledge Representation in Medicine: Representing and Sharing Knowledge Using SNOMED, KR-MED 2008

AU - Jiang, Guoqian

AU - Chute, Christopher G.

PY - 2008

Y1 - 2008

N2 - Objectives: We compared the effects of two semantic terminology models on classification of clinical notes through a study in the domain of heart murmur findings. Methods: One schema was established from the existing SNOMED CT model (S-Model) and the other was from a template model (T-Model) which uses base concepts and non-hierarchical relationships to characterize the murmurs. A corpus of clinical notes (n=309) was collected and annotated using the two schemas. The annotations were coded for a decision tree classifier for a text classification task. The standard information retrieval measures of precision, recall, f-score and accuracy and the paired t-test were used for evaluation. Results: The performance of the S-Model was better than that of the original T-Model (p<0.05 for recall and f-score). A revised T-Model, produced by extending its structure and corresponding values, performed better than the S-Model (p<0.05 for recall and accuracy). Conclusion: We discovered that content coverage is a more important factor than terminology model for classification; however, a template style facilitates content gap discovery and completion.

AB - Objectives: We compared the effects of two semantic terminology models on classification of clinical notes through a study in the domain of heart murmur findings. Methods: One schema was established from the existing SNOMED CT model (S-Model) and the other was from a template model (T-Model) which uses base concepts and non-hierarchical relationships to characterize the murmurs. A corpus of clinical notes (n=309) was collected and annotated using the two schemas. The annotations were coded for a decision tree classifier for a text classification task. The standard information retrieval measures of precision, recall, f-score and accuracy and the paired t-test were used for evaluation. Results: The performance of the S-Model was better than that of the original T-Model (p<0.05 for recall and f-score). A revised T-Model, produced by extending its structure and corresponding values, performed better than the S-Model (p<0.05 for recall and accuracy). Conclusion: We discovered that content coverage is a more important factor than terminology model for classification; however, a template style facilitates content gap discovery and completion.

UR - http://www.scopus.com/inward/record.url?scp=84872831165&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84872831165&partnerID=8YFLogxK

M3 - Conference article

AN - SCOPUS:84872831165

SN - 1613-0073

VL - 410

SP - 59

EP - 65

JO - CEUR Workshop Proceedings

JF - CEUR Workshop Proceedings

Y2 - 31 May 2008 through 2 June 2008

ER -
