Integrated cTAKES for concept mention detection and normalization

Hongfang D Liu, Kavishwar Wagholikar, Siddhartha Jonnalagadda, Sunghwan Sohn

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Scopus citations

Abstract

We participated Task 1 using an existing system MedTagger implemented in inte-grated cTAKES (icTAKES). The concept mention detection is based on Conditional Random Fields (CRF) and the concept mention normalization is based on a greedy dictionary lookup algorithm. A distinctive feature in MedTagger compared to other concept mention detection systems is the incorporation of dictionary lookup results into a machine learning framework for sequential labeling. Dictionary lookup results of MedLex and semantic vectors representing distributed semantics were used as features. Overall, the precision, recall, and F-measure of our best run for concept mention are 0.8, 0.573, and 0.668 respectively for strict evaluation and 0.939, 0.766, and 0.844 for relaxed evaluation. The accuracy of our best run for concept men-tion normalization is 54.6% and 87.0% for strict and relaxed mapping, respectively.

Original languageEnglish (US)
Title of host publicationCEUR Workshop Proceedings
PublisherCEUR-WS
Volume1179
StatePublished - 2013
Event2013 Working Notes for CLEF Conference, CLEF 2013 - Valencia, Spain
Duration: Sep 23 2013Sep 26 2013

Other

Other2013 Working Notes for CLEF Conference, CLEF 2013
CountrySpain
CityValencia
Period9/23/139/26/13

    Fingerprint

Keywords

  • Conditional random fields
  • Dictionary lookup
  • Distributed semantics
  • Named entity recognition
  • Normalization

ASJC Scopus subject areas

  • Computer Science(all)

Cite this

Liu, H. D., Wagholikar, K., Jonnalagadda, S., & Sohn, S. (2013). Integrated cTAKES for concept mention detection and normalization. In CEUR Workshop Proceedings (Vol. 1179). CEUR-WS.