Leveraging Collaborative Filtering to Accelerate Rare Disease Diagnosis

Feichen Shen, Sijia Liu, Yanshan Wang, Liwei Wang, Naveed Afzal, Hongfang D Liu

Research output: Contribution to journalArticle

8 Scopus citations


In the USA, rare diseases are defined as those affecting fewer than 200,000 patients at any given time. Patients with rare diseases are frequently misdiagnosed or undiagnosed which may due to the lack of knowledge and experience of care providers. We hypothesize that patients' phenotypic information available in electronic medical records (EMR) can be leveraged to accelerate disease diagnosis based on the intuition that providers need to document associated phenotypic information to support the diagnosis decision, especially for rare diseases. In this study, we proposed a collaborative filtering system enriched with natural language processing and semantic techniques to assist rare disease diagnosis based on phenotypic characterization. Specifically, we leveraged four similarity measurements with two neighborhood algorithms on 2010-2015 Mayo Clinic unstructured large patient cohort and evaluated different approaches. Preliminary results demonstrated that the use of collaborative filtering with phenotypic information is able to stratify patients with relatively similar rare diseases.

Original languageEnglish (US)
Pages (from-to)1554-1563
Number of pages10
JournalAMIA ... Annual Symposium proceedings. AMIA Symposium
StatePublished - Jan 1 2017


ASJC Scopus subject areas

  • Medicine(all)

Cite this