Domain Knowledge Augmented Contrastive Learning on Dynamic Hypergraphs for Improved Health Risk Prediction

Abstract

Accurate health risk prediction is crucial for making informed clinical decisions and assessing the appropriate allocation of medical resources. While recent deep learning based approaches have shown great promise in risk prediction, they primarily focus on modeling the sequential information in Electronic Health Records (EHRs) and fail to leverage the rich mobility interactions among health entities. As a result, the existing approaches yield unsatisfactory performance in downstream risk prediction tasks, especially tasks such as Clostridioides difficile Infection (CDI) incidence prediction that are primarily spread through mobility interactions. To address this issue, we propose a new approach that leverages Hypergraphs to explicitly model mobility interactions to improve predictive performance in health risk prediction tasks. Unlike regular graphs that are limited to modeling pairwise relationships, hypergraphs can effectively characterize the complex high-order semantic relationships between health entities. Moreover, we introduce a new contrastive learning strategy that exploits the domain knowledge to generate semantically meaningful positive (homologous) and negative (heterologous) pairs needed for contrastive learning. This unique contrastive pair augmentation strategy boosts the power of contrastive learning by generating feature representations that are both robust and well-aligned with the domain knowledge. Experiments on two real-world datasets demonstrate the advantage of our approach in both short-term and long-term risk prediction tasks, such as Clostridioides difficile infection incidence prediction and MICU transfer prediction. Our framework obtains gains in performance up to 29.49 % for PHOP, 30.64 % for MIMIC-IV for MICU transfer prediction, 13.17 % for PHOP, and 4.45 % for MIMIC-IV for CDI Incidence Prediction.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

This project is partially funded by: 1. The CDC MInD Healthcare Network grant U01CK000594 and the associated COVID-19 supplemental funding. 2. NSF SCH Grant 2306331. 3. US National Institute of Health (NIH) under grant R01LM014012 and the National Science Foundation (NSF) under grant ITE-2333740.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

The PHOP Data used in this work was approved by the University of Iowa Institutional Review Board with IRB #202007450.

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.

Yes

留言 (0)

沒有登入
gif