Development and Validation of Machine Learning Algorithms to Classify Lower Urinary Tract Symptoms

Abstract

Background: Lower urinary tract urinary symptoms (LUTS), such as urinary urgency, frequency, and incontinence, affect the majority of the population at some point in the lifepspan, causing substantial morbidity, yet few receive effective care. Acccurate diagnosis and treatment is usually dictated by the dominant symptom, however, the sizeable symptomatic overlap between disease categories and subjectivity of language used to describe symptoms leads to high rates of misdiagnosis. We hypothesized that more specific and homogeneous LUTS diagnoses are characterized not by specific pathognomic features, but by patterns of existing symptoms indicative of unique causes of convergent symptomatologies. To improve care and diagnostic acuracy, we sought to employ a data-driven approach to LUTS categorization using machine learning to generate diagnostic groupings based on patient-reported clinical data, creating a novel tool for diagnosis for patients with voiding complaints. Methods and Findings: Questionnaire responses in a development cohort of 514 female subjects was used for model development, identifying 4 major categories and 9 specific phenotypes of LUTS using agglomerative hierarchical clustering. The dominant features for each cluster and phenotype were examined independently by two urologic specialists and assigned a clinical identity consistent with recognized causes of voiding dysfunction. Then, a supervised machine learning model was trained to assign unseen patients into these phenotypes. The model was then applied to a validation population of 571 additional subjects to validate the diagnostic machine learning algorithm, demonstrating good reproducibillty of the phenotypes and their symptomatic patterns in an independent cohort. This data-driven, hierarchical clustering approach captured overlapping symptoms inherent in typical patients, recognizing common uncomplicated diagnoses (e.g. overactive bladder) and several novel diagnostic categories (e.g. myofascial pelvic pain). The assigned clusters and phenotypes were consistent with coded primary diagnoses with 70% accuracy; however, physician primary diagnosis may not reflect the complete range of patient symptoms as patients often present with multiple urinary symptoms that do not fit pre-established diagnoses. Although this analysis reflects care-seeking populations from a single center, further studies will need to determine if the model is scalable to the population at large. Conclusions: We describe the generation of a machine learning algorithm relying only on validated patient-reported symptoms for accurate diagnostic classification. Given a growing physician shortage and increasing challenges for patients accessing specialist care, this type of digital technology holds great potential to improve the recognition, diagnosis, and treatment of functional urologic conditions. While future prospective work with larger, multi-institutional cohorts is needed, with refinement, this approach is capable of increasing both the equity and rapidity of access to effective urologic care.

Competing Interest Statement

A.L. Ackerman receives grant funding from Medtronic, Inc. and MicrogenDx, is an advisor for Abbvie and Watershed Medical.

Funding Statement

This study was supported by NIH K08DK118176.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

The Cedars-Sinai Medical Center Institutional Review Board provided approval for the study (IRB#00040261).

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Data Availability

All data produced in the present study are available upon reasonable request to the authors.

留言 (0)

沒有登入
gif