TY - GEN
T1 - A Natural Language Processing Approach to Acquire Accurate Health Provider Directory Information
AU - Cook, Matthew
AU - Yao, Lixia
AU - Wang, Xiaoyan
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2018/7/16
Y1 - 2018/7/16
N2 - Introduction Accurate information in provider directories are vital in health care including health information exchange, health benefits exchange, quality reporting, and in the reimbursement and delivery of care. Maintaining provider directory data and keeping it up to date is challenging. The objective of this study is to determine the feasibility of using NLP techniques to combine disparate resources and acquire accurate information on health providers. Methods Publically available state licensure lists in Connecticut were obtained along with National Plan and Provider Enumeration System (NPPES) public use files. Connecticut licensure lists textual information of each health professional who is licensed to practice within the state. A NLP-based system was developed based on Healthcare Provider Taxonomy code, location, and name and address information to identify textual data within the state and federal records. Qualitative and quantitative evaluation were performed, and the recall and precision were calculated. Results We identified nurse midwives, nurse practitioners, and dentists in the State of Connecticut. The recall and precision were 0.95 and 0.93 respectively. Using the system, we were able to accurately acquire 6,849 of the 7,177 records of health provider directory information. Conclusion The authors demonstrated that the NLP based approach was effective at acquiring health provider information. Furthermore, the NLP-based system can be re-Applied on the updated information further reducing processing burdens as data changes.
AB - Introduction Accurate information in provider directories are vital in health care including health information exchange, health benefits exchange, quality reporting, and in the reimbursement and delivery of care. Maintaining provider directory data and keeping it up to date is challenging. The objective of this study is to determine the feasibility of using NLP techniques to combine disparate resources and acquire accurate information on health providers. Methods Publically available state licensure lists in Connecticut were obtained along with National Plan and Provider Enumeration System (NPPES) public use files. Connecticut licensure lists textual information of each health professional who is licensed to practice within the state. A NLP-based system was developed based on Healthcare Provider Taxonomy code, location, and name and address information to identify textual data within the state and federal records. Qualitative and quantitative evaluation were performed, and the recall and precision were calculated. Results We identified nurse midwives, nurse practitioners, and dentists in the State of Connecticut. The recall and precision were 0.95 and 0.93 respectively. Using the system, we were able to accurately acquire 6,849 of the 7,177 records of health provider directory information. Conclusion The authors demonstrated that the NLP based approach was effective at acquiring health provider information. Furthermore, the NLP-based system can be re-Applied on the updated information further reducing processing burdens as data changes.
KW - health provider directory information
KW - natural language processing
UR - http://www.scopus.com/inward/record.url?scp=85051039903&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85051039903&partnerID=8YFLogxK
U2 - 10.1109/ICHI-W.2018.00027
DO - 10.1109/ICHI-W.2018.00027
M3 - Conference contribution
AN - SCOPUS:85051039903
T3 - Proceedings - 2018 IEEE International Conference on Healthcare Informatics Workshops, ICHI-W 2018
SP - 76
EP - 77
BT - Proceedings - 2018 IEEE International Conference on Healthcare Informatics Workshops, ICHI-W 2018
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 6th IEEE International Conference on Healthcare Informatics Workshops, ICHI-W 2018
Y2 - 4 June 2018 through 7 June 2018
ER -