Background: Genome-wide association studies have been crucial in gaining insights into the genetics of cardiometabolic diseases. However, little is known about the genetics of cardiometabolic disease progression which may have both a different genetic architecture and significant implications for treatment decisions. Disease progression can be ascertained by the time from the first disease diagnosis to a second qualifying event (e.g. diagnostic lab, code or procedure). While data of this nature have been available in large repositories such as the UK Biobank, large-scale genome-wide screens in a time-to-event setting have been extremely challenging due to various computational and statistical challenges. Methods and Results: We applied our method, snpnet-Cox, that has proven to be an effective method for simultaneous variable selection and estimation in high-dimensional settings, to examine the genetic contributions to cardiometabolic disease progression, measured by time from disease diagnosis to time of complication/comorbidity diagnosed or procedure in the UK Biobank. We apply a Cox regression model in a time-to-event setting to compute polygenic hazard scores (PHS). We identified ten new PHS that significantly predict disease progression. One example is the PHS that significantly predicts the time from hyperlipidemia diagnosis to having coronary artery bypass graft (CABG) surgery performed (Hazards Ratio 1.3 per PHS standard deviation: p=4.5x10-9). In this PHS, we identified a common variant, rs11041816 (downstream of LMO1), which protects against this disease progression (beta = -0.05). Conclusion: snpnet-Cox is a fast and reliable tool to compute PHS capturing genetics in the time-to-event setting. The computed PHS can be used to stratify individuals with an underlying diagnosis (e.g. hyperlipidemia) into different trajectories disease progression (e.g CABG) thereby identifying potential points of intervention. With more time-to-event data to be released, this approach can provide great insight into disease progression at the fraction of computational cost necessary. We make available ten polygenic hazard scores that we find to be significant predictors of cardiometabolic disease progression. Key words: Genetics, cardiometabolic diseases, polygenic hazards score, Lasso, prediction, hyperlipidemia, operation
Competing Interest StatementMAR is a CoFounder of Broadwing Bio
Funding StatementJ.M.J. is supported by grant NNF17OC0025806 from the Novo Nordisk Foundation and the Stanford Bio-X Program. J.W.K is supported by NIH grants: U41HG009649, R01 DK116750, R01 DK120565, P30DK116074. M.A.R. is supported by Stanford University and a National Institute of Health center for Multi- and Trans-ethnic Mapping of Mendelian and Complex Diseases grant (5U01 HG009080).
Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The data was obtained from UK Biobank application 24983.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Data AvailabilityAll data is available from UK Biobank. The results are publicly available at https://biobankengine.shinyapps.io/disease_progression/
留言 (0)