Bayesian Change Point Detection for Mixed Data with Missing Values

Alexander C. Murph, Curtis B. Storlie

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

When a predictive model is in production, it must be monitored over time to ensure that its performance does not suffer from drift or abrupt changes to data. Typically this is done by evaluating the algorithm's predictions to outcome data and ensuring that the algorithm maintains an acceptable level of accuracy over time. However, it is far preferable to learn about major changes in the input data that could affect the models performance in real-time, long before learning that the performance of the model itself has dropped by monitoring outcome data. Thus, there is large need for robust, real-time monitoring of high dimensional input data over time. Here we consider the problem of change point detection on high-dimensional longitudinal data with mixed variable types and missing values. We do this by fitting an array of Mixture Gaussian Graphical Models to groupings of homogeneous data in time, called regimes, which we model as the observed states of a Markov process with unknown transition probabilities. The primary goal of this model is to identify when there is a regime change, as this indicates a significant change in the input data distribution. To handle the messy nature of real-world data which has mixed continuous/discrete variable types, missing data, etc., we take a Bayesian latent variable approach. This affords us flexibility to handle missing values in a principled manner, while simultaneously providing a way to encode discrete and censored values into a continuous framework. We take this approach a step further by encoding the missingness structure, which allows our model to then detect major changes in the patterns of missingness, in addition to the structure of the data distributions themselves.

Original languageEnglish (US)
Title of host publicationProceedings - 2022 IEEE 10th International Conference on Healthcare Informatics, ICHI 2022
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages499-501
Number of pages3
ISBN (Electronic)9781665468459
DOIs
StatePublished - 2022
Event10th IEEE International Conference on Healthcare Informatics, ICHI 2022 - Rochester, United States
Duration: Jun 11 2022Jun 14 2022

Publication series

NameProceedings - 2022 IEEE 10th International Conference on Healthcare Informatics, ICHI 2022

Conference

Conference10th IEEE International Conference on Healthcare Informatics, ICHI 2022
Country/TerritoryUnited States
CityRochester
Period6/11/226/14/22

Keywords

  • Bayesian Latent Variables
  • Change Points
  • Gaussian Graphical Models
  • Gaussian Mixture Models

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Science Applications
  • Information Systems and Management
  • Safety, Risk, Reliability and Quality
  • Health Informatics

Fingerprint

Dive into the research topics of 'Bayesian Change Point Detection for Mixed Data with Missing Values'. Together they form a unique fingerprint.

Cite this