In the past 30 years, the incidence of kidney stones has significantly increased due to environmental changes, including imbalanced diet and restricted physical activity (Wigner et al., 2021). During this period, the incidence of kidney stones has nearly doubled, with a particularly notable increase among adolescents and young women (Hill et al., 2022; Tasian et al., 2016). Kidney stones, as a common disease of the urinary system, with calcium oxalate (CaOx) stones accounting for the highest proportion, approximately 75%–85% of all urinary stones (Sakhaee et al., 2012; Thongprayoon et al., 2020). The adherence of calcium oxalate (CaOx) crystals to renal tubular epithelial cells serves as a critical contributor in the development of CaOx stone formation (Qi et al., 2020). To further explain this phenomenon, we introduced the concept of Randall’s Plaque (RP). This is a widely accepted theory aimed at explaining the formation mechanism of calcium oxalate stones (Khan et al., 2021). Randall’s plaque was initially described by Randall in 1937 as a pre-stone lesion of kidney stones, and the vast majority of kidney stones belong to the calcium oxalate stone type (Randall, 1937).
In order to further explore the biological characteristics and formation mechanism of kidney stones, we first obtained gene expression profile datasets covering normal renal papillary tissue and Randall’s Plaque (RP) tissue from the GEO database. Subsequently, we rigorously screened differentially expressed genes (DEGs) using the limma algorithm and visualized these differences through volcano plots and heatmaps. To further analyze the functional characteristics and key functional modules of these differentially expressed genes, we utilized the ClusterProfiler and WGCNA tools. Through these analyses, we not only understood the role of these genes in the formation of kidney stones but also identified potential key regulatory pathways. Through the combined application of various machine learning algorithms, we successfully screened out the optimal feature genes KLK1 and MMP10. To gain deeper insights into the roles of these two genes in kidney stones, we compared their expression levels and explored their biological significance. In addition, we used GSEA and CIBERSORT algorithms to further evaluate the relevance of these two optimal feature genes to the formation of kidney stones and their impact on immune cell infiltration. These analyses provide us with a deeper understanding of the biological characteristics and potential mechanisms of kidney stones, and offer new directions for future research.
2 Methods2.1 Data collection and processingWe downloaded the GSE73680 dataset from the GEO database, which includes gene expression profiles of 33 normal renal papillary tissues and 29 Randall’s Plaque (RP) tissues.
2.2 Identification of differentially expressed genes (DEGs) and gene set enrichment analysisDifferentially expressed genes (DEGs) between Randall’s Plaque (RP) tissues and normal renal papillary tissues were discerned utilizing the “limma” R package. During the screening process, we applied strict criteria of |log2 FC| > 0.5 and p-value <0.05. To visually display these differential genes, volcano plots and heatmaps were generated using R software version 4.3.0. Subsequently, to explore the functional characteristics of these differential genes, gene set enrichment analysis (GSEA) was performed using the “ClusterProfiler” R package (Wu et al., 2021). Enrichment significance was determined at a set q value (False Discovery Rate, FDR) < 0.05 and p. adjust <0.05, with normalized enrichment scores (NES) serving as assessment metrics. A positive NES signifies upregulation, while a negative value indicates downregulation.
2.3 Weighted gene co-expression network analysis (WGCNA)Weighted gene co-expression network analysis was performed utilizing the “WGCNA” R package to detect essential functional modules that could elucidate the biological traits of kidney stone samples (Langfelder and Horvath, 2008). Before constructing the network, we carefully examined the merged gene matrix to ensure no abnormal samples were erroneously included, thus ensuring the accuracy of subsequent sample clustering. Genes displaying comparable expression profiles were grouped into coherent co-expression modules following the analysis of the weighted correlation adjacency matrix and cluster analysis. Subsequently, the topological overlap matrix (TOM) was constructed from the adjacency matrix to partition genes into modules based on dissimilarities. A cutting height of 0.25, a minimum module size of 50, and a soft threshold power of 24 (achieving a scale-free R2 of 0.9) were chosen during this process. Following module partitioning, gene significance (GS) and module membership (MM) values were computed for each gene. To explore the relationship between functional modules and kidney stones, Spearman correlation coefficients and corresponding p values were computed between control groups, kidney stone groups, and various functional modules. Finally, key modules most relevant to kidney stones were determined, and core genes were extracted for further analysis.
2.4 Functional enrichment analysisFollowing the analysis, we identified overlapping candidate genes from differentially expressed genes (DEGs) and module genes. A Venn diagram was generated using appropriate software to visually represent the gene overlaps. Enrichment analysis of the Kyoto Encyclopedia of Genes and Genomes (KEGG) and Disease Ontology (DO) was performed using the “clusterProfiler” and “DOSE” R packages to explore the functions and pathways linked to the common candidate genes (Wu et al., 2021).
2.5 Screening of optimal feature genesA variety of machine learning algorithms, including LASSO, SVM-REF, RF, Boruta, XGBOOST, GBM, and decision trees, were employed for disease status prediction and identification of key prognostic factors. Specifically, LASSO regression was utilized to select and regularize variables to improve the predictive accuracy of the model (Sepulveda, 2020). Additionally, we employed widely-used supervised learning methods such as SVM and RF to prevent overfitting and produce interpretable results (Kanehisa and Goto, 2000; Yu et al., 2015). Furthermore, we used the SVM-RFE algorithm to precisely locate the most discriminative gene set, laying the foundation for determining the most appropriate feature genes (Li et al., 2020). In the context of predictive problems, decision trees, as the cornerstone of random forests (RF), play an important role, and we determined the optimal number of trees through testing. Following the construction of the Random Forest (RF) model, the backward elimination technique was applied to identify significant genes, with a gene importance threshold of greater than 2 being a pivotal selection criterion in the RF algorithm (Tian et al., 2020). Additionally, the Boruta algorithm significantly influenced the feature selection process by creating shadow features and comparing Z-scores of real features with shadow features (Degenhardt et al., 2019). XGBOOST (XGB), similar to random forests, also originates from decision tree-based ensemble learning (Bentéjac et al., 2021). However, XGB further improves the predictive ability and robustness of the model through its unique sequential learning strategy and embedded gradient descent algorithm (Chen and Guestrin, 2016; Shwartz-Ziv and Armon, 2022). Finally, decision trees excel in handling nonlinear relationships between variables and provide excellent data visualization (Quinlan, 1986). Through the comprehensive application of these machine learning algorithms, we successfully determined the optimal feature genes.
2.6 Expression and diagnostic significance of optimal feature genesWilcoxon rank-sum tests were performed to evaluate the expression levels of optimal feature genes in kidney stone samples compared to control samples. Additionally, to further validate the predictive efficacy of these feature genes, receiver operating characteristic (ROC) curves were used for evaluation.
2.7 GSEA and correlation analysis of optimal feature genesMoreover, the biological significance of optimal feature genes was explored using gene set enrichment analysis (GSEA) with the “c2. cp.kegg.v11.0. symbols” gene set as a reference. To ensure result accuracy, gene set permutation was iterated 1,000 times for each analysis to derive normalized enrichment scores. Enrichment of gene sets with q values below 0.05 was deemed significant. Furthermore, Pearson correlation analysis was performed to evaluate the relationships between expression levels of optimal feature genes and identify correlations among them.
2.8 Evaluation of marker gene sets and immune cell infiltrationThe CIBERSORT algorithm, as a gene expression-based deconvolution method, was used to assess the variation of the internal genome of samples and compare it with the changes in other genes (Newman et al., 2019)In this study, the CIBERSORT algorithm was employed to ascertain the infiltration status of 22 immune cells in both normal and kidney stone samples. Boxplots were utilized for a visual representation of the variances in immune cell composition among patients exhibiting diverse immune patterns (Leek et al., 2012). Through Wilcoxon rank-sum tests, we conducted differential analysis of the proportions of these immune cells, considering p values less than 0.05 as statistically significant differences.
Moreover, to assess the relative levels of 50 marker gene sets (h.all.v7.5.1. symbols.gmt) in the combined dataset, the single-sample gene set enrichment analysis (ssGSEA) algorithm was utilized. Additionally, Spearman correlations were computed to investigate the relationship between these marker gene sets and the optimal feature genes.
3 Results3.1 Identification of DEGs and gene set enrichment analysis in the kidney stone datasetFollowing a thorough analysis, a detailed list of 412 Differentially Expressed Genes (DEGs) was compiled. Among these DEGs, 194 genes were observed to be downregulated while 218 genes were upregulated in the disease state, as illustrated in volcano plots and heatmaps (Figure 1A). Additionally, a heatmap focusing on the top 60 genes exhibiting the highest variance was generated to investigate prominent differences between the disease and control groups (Figure 1B). Furthermore, to deepen the analysis, Gene Set Enrichment Analysis (GSEA) was utilized to explore the dataset.
Figure 1. Identification of DEGs and gene set enrichment analysis in the kidney stone dataset.
Based on the Normalized Enrichment Score (NES), we categorized enriched pathways into downregulated and upregulated groups. In the downregulated group, several significantly enriched pathways included taurine and hypotaurine metabolism, ascorbate and aldarate metabolism, porphyrin metabolism, PPAR signaling pathway, drug metabolism-cytochrome P450, ribosome, fat digestion and absorption, glycine, serine and threonine metabolism, and cocaine addiction, among others (Figure 1C). In the upregulated group, significantly enriched pathways included systemic lupus erythematosus, arginine biosynthesis, Vibrio cholerae infection, nitrogen metabolism, rheumatoid arthritis, intestinal IgA production immune network, collecting duct acid secretion, Staphylococcus aureus infection, and asthma, among others (Figure 1D).
3.2 Selection of WGCNA modules and hub genesA co-expression network was established through Weighted gene co-expression network analysis (WGCNA) with 32,077 genes, encompassing 33 control samples and 29 kidney stone samples. Initial steps involved sample clustering and outlier specimen exclusion using a predetermined threshold, as illustrated in Figure 2A. To ensure the network’s scale-free attribute and optimal average connectivity, a soft power threshold of 24 was set, resulting in a scale-free R^2 value of 0.9, as shown in Figure 2B.
Figure 2. Selection of WGCNA modules and identification of hub genes.
Subsequently, under a clustering height cut-off of 0.25, we merged highly correlated modules and ultimately identified 5 modules for in-depth study. The initiation and merging processes of these modules were clearly displayed in the clustering tree (Figure 2C). To assess the correlation between these modules, we conducted correlation analysis, which revealed no significant associations between modules, demonstrating the rationality and independence of our module division (Figure 2D).
To validate the reliability of module characterization, we also performed intra-modular transcriptional correlation analysis, revealing no substantive relationships between modules, further confirming the accuracy of module division (Figure 2E). Additionally, we used module eigengenes (ME) to examine the correlation between module features and clinical traits, revealing a strong correlation between the yellow module and kidney stones (R = 0.27, P = 8e-08) (Figures 2F, G).
Based on this finding, we included 326 candidate genes from the yellow module in subsequent analyses. These genes may play important roles in the pathogenesis of kidney stones, providing new clues for our in-depth study of the biological mechanisms underlying kidney stones.
3.3 Functional enrichment analysis of overlapping DEGsFrom the previously identified DEGs and genes in the yellow module, we successfully screened out 62 overlapping genes, named candidate feature genes, which were visually displayed in the figure (Figure 3A). To further elucidate the biological functions and enriched pathways that these candidate feature genes may be involved in, we performed KEGG (Kyoto Encyclopedia of Genes and Genomes) and DO (Disease Ontology) analyses.
Figure 3. Functional enrichment analysis of overlapping differentially expressed genes (DEGs).
KEGG enrichment analysis revealed signaling pathways that these candidate feature genes may participate in. These pathways include the proximal tubule bicarbonate reclamation pathway in the kidney, crucial for maintaining acid-base balance; protein digestion and absorption in the digestive system, providing amino acids to support energy production and tissue repair; pathways involving glucose and galactose metabolism in carbohydrate metabolism, critical for energy metabolism and biosynthesis processes; as well as multiple processes related to endocrine regulation, such as calcium reabsorption, mineral absorption, steroid hormone biosynthesis, and insulin secretion. Furthermore, these genes are also involved in chemical carcinogen metabolism, bile secretion, IL-17 signaling pathway, and other important biological processes (Figures 3B, D–F).
DO analysis showed that these candidate feature genes are mainly associated with various disease states. For example, they are significantly associated with metabolic abnormalities such as glucose intolerance and hyperglycemia, obesity, and nutrient excess states; as well as digestive system disorders such as constipation and intestinal dysmotility, and urinary system diseases such as renal failure and glomerulonephritis. Additionally, these genes are associated with immune system defects, cardiovascular diseases, various types of cancers, and other disease states such as arthritis, anemia, varicose veins, and connective tissue diseases. In summary, candidate feature genes play important roles in the pathogenesis of kidney stones (Figure 3C).
3.4 Identification of optimal feature genes through integration of multiple machine learning algorithmsTo identify putative feature genes, we employed seven different machine learning algorithms. Firstly, through LASSO analysis, we screened out 9 genes from the 62 candidate feature genes as diagnostic markers for kidney stones (Figure 4A). Simultaneously, we utilized the SVM-REF algorithm to cross-validate all candidate genes, demonstrating that all 62 genes have distinctive features (Figure 4B). Additionally, we used random forest (RF) algorithm to identify 36 feature genes with importance scores greater than 0 (Figure 4C), 10 key genes were identified through decision tree (DT) algorithm (Figure 4D), 51 feature genes were selected through gradient boosting machine (GBM) algorithm (Figure 4E), 46 important genes were determined using XgBOOST algorithm (Figure 4F), and 6 feature genes were confirmed by the Boruta algorithm (Figure 4G). Upon intersecting the feature genes identified by the seven aforementioned algorithms, two optimal feature genes, namely, KLK1 and MMP10, were successfully pinpointed (Figure 5). These two genes not only serve as potential diagnostic markers for kidney stones but also may be key genes in the progression of kidney stones.
Figure 4. Identification of optimal feature genes through integration of multiple machine learning algorithms.
Figure 5. Venn diagram showing KLK1 and MMP10 as key feature genes identified across seven machine learning algorithms.
3.5 Evaluation of expression and diagnostic significance of optimal feature genesThe expression levels of the two optimal feature genes, KLK1 and MMP10, were validated in 29 kidney stone samples and 33 normal samples. The results indicated a significant upregulation of KLK1 and MMP10 genes in kidney stone samples, suggesting their pivotal roles in kidney stone progression (p < 0.01) (Figures 6A, B). To further assess the diagnostic and predictive potential of these genes quantitatively, ROC curve analysis was conducted. The analysis revealed an AUC value of 0.724 for KLK1 and 0.737 for MMP10, highlighting their substantial diagnostic value in evaluating kidney stone progression (Figures 6C, D).
Figure 6. Evaluation of expression and diagnostic significance of optimal feature genes.
3.6 Functional identification of 2 feature genesGiven the important significance of these two feature genes in prognosis, we further conducted Gene Set Enrichment Analysis (GSEA) to explore their potential biological functions. We stratified kidney stone samples into two subgroups based on the median expression levels of these two optimal feature genes.
For the high KLK1 expression subgroup, we found significantly enriched pathways related to asthma, allograft rejection, systemic lupus erythematosus, type I diabetes, and the impact on intestinal immune networks for IgA production, among others (Figure 7A). In contrast, in the low KLK1 expression subgroup, significantly enriched pathways included nicotine addiction, adolescent-onset type diabetes, linoleic acid metabolism, steroid hormone (Figure 7B).
Figure 7. Functional identification of 2 feature genes.
In the high MMP10 expression subgroup, pathways significantly enriched included those associated with allograft rejection, systemic lupus erythematosus, type I diabetes, mucin-type O-glycan biosynthesis, and graft-versus-host disease (see relevant figures for details). Conversely, in the low MMP10 expression subgroup, pathways significantly enriched included adolescent-onset type diabetes, linoleic acid metabolism, drug metabolism-cytochrome P450, protein digestion and absorption, and dilated cardiomyopathy (Figure 7C).
It is noteworthy that, regardless of whether in the high expression group of KLK1 or MMP10, the type I diabetes pathway was significantly enriched. Whereas, in the low expression groups of KLK1 and MMP10, pathways related to adolescent-onset type diabetes and linoleic acid metabolism were significantly enriched (Figure 7D).
3.7 Signature gene sets and immune cell infiltrationTo assess variances in immune cell infiltration and signature gene sets between kidney stone patients and control samples, we utilized the CIBERSORT algorithm. Evaluation of differential immune cell infiltration revealed no substantial variances between the two groups at the immune cell level (Figures 8A, B). Furthermore, to explore enrichment variations in signature gene sets between the kidney stone and control groups, the ssGSEA algorithm was employed to analyze significant differences in 50 signature gene sets based on enrichment scores.
Figure 8. Signature gene sets and immune cell infiltration.
The analysis results revealed significant differences in the distribution of 50 signature gene sets between kidney stone patients and control samples. Specifically, several characteristic gene sets exhibited significant differences, including downregulation of typical KRAS signaling transduction, markers of spermatogenesis, and differences in the spermatogenesis gene set characteristics. Consequently, it can be reasonably inferred that these signature gene sets are overactivated in the kidney stone group compared to the normal group (Figure 8C).
Furthermore, we also found that the performance of the two optimal feature genes (KLK1 and MMP10) was generally consistent in most signature gene sets. For example, both of these feature genes showed positive correlations with the interferon gamma and alpha response signature gene sets. Furthermore, KLK1 was positively correlated with the apical surface signature gene set and negatively correlated with the pancreatic beta cell signature gene set, while MMP10 exhibited a positive correlation with the allograft rejection signature gene set (Figure 8D).
These findings provide important clues about the various roles that these two optimal feature genes may play in the pathogenesis of kidney stones, warranting further comprehensive investigation.
4 DiscussionIn our study, we utilized a range of methods to comprehensively investigate the biological characteristics and potential mechanisms underlying kidney stones. Initially, we obtained the GSE73680 dataset from the GEO database, which includes gene expression profiles of normal renal papillary tissues and Randall’s plaques (RP) tissues. After rigorous screening using the R package “limma”, we identified 194 downregulated and 218 upregulated differentially expressed genes (DEGs). Subsequently, through gene set enrichment analysis using “ClusterProfiler”, we revealed key biological pathways closely associated with kidney stones. Additionally, we conducted weighted gene co-expression network analysis (WGCNA) to identify core functional modules during the pathogenesis of kidney stones.
In the subsequent integration of machine learning algorithms, we identified two optimal feature genes—KLK1 and MMP10, and confirmed their significant upregulation in kidney stone samples. Within proteases, renal KLK1 and matrix metalloproteinase-10 (MMP-10) have emerged as key players in renal function and blood pressure regulation (Sonkar et al., 2022; Golias et al., 2007). KLK1, initially discovered in human urine, is expressed in renal tubular epithelial cells and involved in electrolyte and water homeostasis, blood pressure regulation, and inflammation (Naicker et al., 1999). Studies have shown that renal KLK1 deficiency is associated with sodium retention and hypertension (Devetzi et al., 2018). MMP-10, also known as Stromelysin-2 or tansin-2, is a zinc-dependent endopeptidase (Wozniak et al., 2021; Tan and Liu, 2012). Although MMP-10 expression is barely detectable in development or normal adult tissues, it significantly increases after tissue injury (Zuo et al., 2021; McMahan et al., 2016; Koller et al., 2012; Lv et al., 2019). These findings not only provide new insights into the pathogenesis of kidney stones but also offer potential biomarkers for subsequent diagnostic and therapeutic research.
Given the potential importance of KLK1 and MMP10 in kidney stones, we further conducted gene set enrichment analysis (GSEA). The results showed significant enrichment of the type I diabetes pathway in both the high expression groups of KLK1 and MMP10, suggesting their association with the progression of diabetic nephropathy. Particularly in tubulointerstitial inflammation, KLK1 acts by activating PAR-4, providing a potential target for future diabetic nephropathy treatment (Yiu et al., 2014). The high expression of MMP-10 in chronic kidney disease (CKD), diabetes, and atherosclerosis also indicates its role as an important regulator of inflammation in these diseases (Coll et al., 2010; Toni et al., 2013; Mora-Gutiérrez et al., 2020). In the low expression groups of KLK1 and MMP10, pathways related to adolescent-onset type diabetes and linoleic acid metabolism were significantly enriched.
For a more precise evaluation of the diagnostic and predictive efficacy of these two genes in kidney stones, ROC curve analysis was performed. The results showed that the AUC (area under the curve) values of KLK1 and MMP10 were 0.724 and 0.737, respectively, both exceeding 0.7, indicating their high diagnostic value in evaluating the progression of kidney stones. This finding not only provides new insights into the pathogenesis of kidney stones but also offers potential biomarkers for future diagnostic and therapeutic strategies.
We conducted enrichment analysis of 50 signature gene sets using the ssGSEA algorithm, revealing significant differences in several signature gene sets between the kidney stone group and the control group, including downregulation of KRAS signaling transduction and markers of spermatogenesis. Particularly, KLK1 and MMP10 exhibited similar expression patterns in most signature gene sets, positively correlated with interferon gamma and alpha response, and associated with apical surface signature, pancreatic beta cell signature, and allograft rejection signature, suggesting their multifaceted roles in the pathogenesis of kidney stones.
The analysis highlighted several biases and limitations inherent in the employed machine learning algorithms. LASSO may exclude relevant features due to coefficient shrinkage and struggles with correlated variables. SVM-RFE’s sensitivity to kernel and regularization parameters, along with its computational demands, complicates its use with large datasets. Random Forest can overfit and shows bias towards features with more levels, while the Boruta algorithm may prioritize high-variance features, limiting its applicability due to computational constraints.
Ensemble methods like XGBOOST and GBM are prone to overfitting without careful hyperparameter tuning, complicating interpretability and making them sensitive to outliers. Decision Trees, although interpretable, are biased towards features with more levels and are prone to overfitting and instability. Data imbalance poses a risk of biased predictions across these models. The computational cost of ensemble methods impacts scalability, emphasizing the need for addressing these biases to ensure reliable and reproducible outcomes in future research.
The study’s findings have several clinical implications that could inform therapeutic strategies. By identifying algorithmic biases and limitations, clinicians and researchers can better select and fine-tune models to enhance predictive accuracy and interpretability. For instance, understanding LASSO’s tendency to exclude correlated features can guide the integration of complementary models to ensure comprehensive feature selection, crucial for personalized medicine approaches.
SVM-RFE and ensemble methods like Random Forest and XGBOOST, despite their challenges, offer robust frameworks for identifying key biomarkers, aiding in the development of targeted therapies. By addressing overfitting and interpretability issues, these models can be optimized for clinical decision-making, improving treatment efficacy and patient outcomes.
Moreover, recognizing the impact of data imbalance allows for the development of strategies to ensure fair and unbiased predictions, crucial for equitable healthcare delivery. Overall, the study underscores the importance of tailoring machine learning applications to clinical contexts, ultimately enhancing therapeutic interventions and advancing precision medicine.
In summary, our study not only provides new insights into the pathogenesis of kidney stones but also lays a solid foundation for future diagnostic and therapeutic research.
5 LimitationsFirst, limited sample size, the study’s findings are based on a relatively small sample size of 33 normal tissues and 29 RP tissues. This limitation may impact the generalizability and robustness of the results. In future research, we plan to incorporate a larger number of samples to enhance the reliability of our conclusions. Second, risk of Overfitting, due to the small dataset, there is a potential risk of overfitting, which could affect the accuracy and applicability of our findings. We aim to address this by employing more advanced statistical methods and incorporating a larger sample size in subsequent studies. Third, need for External Validation, our current study lacks validation with external datasets, which is crucial for confirming the diagnostic value of the markers identified. Future research will focus on validating our results using diverse external datasets to ensure the findings are applicable across different populations and settings.
6 ConclusionOverall, our findings contribute to a better understanding of the molecular mechanisms underlying kidney stone formation and progression, laying a foundation for the development of novel diagnostic and therapeutic strategies. Further research is warranted to validate the clinical utility of identified biomarkers and explore their potential applications in personalized medicine for kidney stone patients.
Data availability statementThe raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Author contributionsKA: Conceptualization, Data curation, Writing–original draft. AiA: Conceptualization, Investigation, Writing–original draft. AsA: Formal Analysis, Methodology, Project administration, Writing–original draft. GS: Funding acquisition, Supervision, Writing–review and editing.
FundingThe author(s) declare that financial support was received for the research, authorship, and/or publication of this article. Fund Natural Science Foundation of Xinjiang Uygur Autonomous Region (2023D01C107) which the first author KA received.
Conflict of interestThe authors declare that the study was carried out without any commercial or financial relationships that may be served as a potential conflict of interest.
Publisher’s noteAll claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
ReferencesBentéjac, C., Csörgő, A., and Martínez-Muñoz, G. (2021). A comparative analysis of gradient boosting algorithms. Artif. Intell. Rev. 54, 1937–1967. doi:10.1007/s10462-020-09896-5
CrossRef Full Text | Google Scholar
Chen, T., and Guestrin, C. (2016). “Xgboost: a scalable tree boosting system,” in Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining.
Coll, B., Rodríguez, J. A., Craver, L., Orbe, J., Martínez-Alonso, M., Ortiz, A., et al. (2010). Serum levels of matrix metalloproteinase-10 are associated with the severity of atherosclerosis in patients with chronic kidney disease. Kidney Int. 78 (12), 1275–1280. doi:10.1038/ki.2010.329
PubMed Abstract | CrossRef Full Text | Google Scholar
Degenhardt, F., Seifert, S., and Szymczak, S. (2019). Evaluation of variable selection methods for random forests and omics data sets. Brief. Bioinform 20 (2), 492–503. doi:10.1093/bib/bbx124
PubMed Abstract | CrossRef Full Text | Google Scholar
Devetzi, M., Goulielmaki, M., Khoury, N., Spandidos, D. A., Sotiropoulou, G., Christodoulou, I., et al. (2018). Genetically-modified stem cells in treatment of human diseases: tissue kallikrein (KLK1)-based targeted therapy (Review). Int. J. Mol. Med. 41 (3), 1177–1186. doi:10.3892/ijmm.2018.3361
PubMed Abstract | CrossRef Full Text | Google Scholar
Golias, C., Charalabopoulos, A., Stagikas, D., Charalabopoulos, K., and Batistatou, A. (2007). The kinin system--bradykinin: biological effects and clinical implications. Multiple role of the kinin system--bradykinin. Hippokratia 11 (3), 124–128.
PubMed Abstract | Google Scholar
Hill, A. J., Basourakos, S. P., Lewicki, P., Wu, X., Arenas-Gallo, C., Chuang, D., et al. (2022). Incidence of kidney stones in the United States: the continuous national health and nutrition examination survey. J. Urol. 207 (4), 851–856. doi:10.1097/JU.0000000000002331
PubMed Abstract | CrossRef Full Text | Google Scholar
Khan, S. R., Canales, B. K., and Dominguez-Gutierrez, P. R. (2021). Randall's plaque and calcium oxalate stone formation: role for immunity and inflammation. Nat. Rev. Nephrol. 17 (6), 417–433. doi:10.1038/s41581-020-00392-1
PubMed Abstract | CrossRef Full Text | Google Scholar
Koller, F. L., Dozier, E. A., Nam, K. T., Swee, M., Birkland, T. P., Parks, W. C., et al. (2012). Lack of MMP10 exacerbates experimental colitis and promotes development of inflammation-associated colonic dysplasia. Lab. Invest. 92 (12), 1749–1759. doi:10.1038/labinvest.2012.141
PubMed Abstract | CrossRef Full Text | Google Scholar
Leek, J. T., Johnson, W. E., Parker, H. S., Jaffe, A. E., and Storey, J. D. (2012). The sva package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinformatics 28 (6), 882–883. doi:10.1093/bioinformatics/bts034
PubMed Abstract | CrossRef Full Text | Google Scholar
Li, H., Lai, L., and Shen, J. (2020). Development of a susceptibility gene based novel predictive model for the diagnosis of ulcerative colitis using random forest and artificial neural network. Aging (Albany NY) 12 (20), 20471–20482. doi:10.18632/aging.103861
PubMed Abstract | CrossRef Full Text | Google Scholar
Lv, Y. P., Cheng, P., Zhang, J. Y., Mao, F. Y., Teng, Y. S., Liu, Y. G., et al. (2019). Helicobacter pylori-induced matrix metallopeptidase-10 promotes gastric bacterial colonization and gastritis. Sci. Adv. 5 (4), eaau6547. doi:10.1126/sciadv.aau6547
PubMed Abstract | CrossRef Full Text | Google Scholar
McMahan, R. S., Birkland, T. P., Smigiel, K. S., Vandivort, T. C., Rohani, M. G., Manicone, A. M., et al. (2016). Stromelysin-2 (MMP10) moderates inflammation by controlling macrophage activation. J. Immunol. 197 (3), 899–909. doi:10.4049/jimmunol.1600502
PubMed Abstract | CrossRef Full Text | Google Scholar
Mora-Gutiérrez, J. M., Rodríguez, J. A., Fernández-Seara, M. A., Orbe, J., Escalada, F. J., Soler, M. J., et al. (2020). MMP-10 is increased in early stage diabetic kidney disease and can be reduced by renin-angiotensin system blockade. Sci. Rep. 10 (1), 26. doi:10.1038/s41598-019-56856-3
PubMed Abstract | CrossRef Full Text | Google Scholar
Naicker, S., Naidoo, S., Ramsaroop, R., Moodley, D., and Bhoola, K. (1999). Tissue kallikrein and kinins in renal disease. Immunopharmacology 44 (1-2), 183–192. doi:10.1016/s0162-3109(99)00089-2
PubMed Abstract | CrossRef Full Text | Google Scholar
Newman, A. M., Steen, C. B., Liu, C. L., Gentles, A. J., Chaudhuri, A. A., Scherer, F., et al. (2019). Determining cell type abundance and expression from bulk tissues with digital cytometry. Nat. Biotechnol. 37 (7), 773–782. doi:10.1038/s41587-019-0114-2
PubMed Abstract | CrossRef Full Text | Google Scholar
Qi, S., Wang, Q., Xie, B., Chen, Y., Zhang, Z., and Xu, Y. (2020). P38 MAPK signaling pathway mediates COM crystal-induced crystal adhesion change in rat renal tubular epithelial cells. Urolithiasis 48 (1), 9–18. doi:10.1007/s00240-019-01143-z
PubMed Abstract | CrossRef Full Text | Google Scholar
Sakhaee, K., Maalouf, N. M., and Sinnott, B. (2012). Clinical review. Kidney stones 2012: pathogenesis, diagnosis, and management. J. Clin. Endocrinol. Metab. 97 (6), 1847–1860. doi:10.1210/jc.2011-3492
PubMed Abstract | CrossRef Full Text | Google Scholar
Shwartz-Ziv, R., and Armon, A. (2022). Tabular data: deep learning is not all you need. Inf. Fusion 81, 84–90. doi:10.1016/j.inffus.2021.11.011
CrossRef Full Text | Google Scholar
Sonkar, S. K., Singh, P. K., Chandra, S., Sonkar, G. K., Bhosale, V., and Sharma, S. (2022). Cathepsin S as an early biomarker for cardiovascular disease in chronic kidney disease patients. J. Bras. Nefrol. 44 (3), 329–335. doi:10.1590/2175-8239-JBN-2021-0135
PubMed Abstract | CrossRef Full Text | Google Scholar
Tasian, G. E., Ross, M. E., Song, L., Sas, D. J., Keren, R., Denburg, M. R., et al. (2016). Annual incidence of nephrolithiasis among children and adults in South Carolina from 1997 to 2012. Clin. J. Am. Soc. Nephrol. 11 (3), 488–496. doi:10.2215/CJN.07610715
PubMed Abstract | CrossRef Full Text | Google Scholar
Tian, Y., Yang, J., Lan, M., and Zou, T. (2020). Construction and analysis of a joint diagnosis model of random forest and artificial neural network for heart failure. Aging (Albany NY) 12 (24), 26221–26235. doi:10.18632/aging.202405
PubMed Abstract | CrossRef Full Text | Google Scholar
Toni, M., Hermida, J., Goñi, M. J., Fernández, P., Parks, W. C., Toledo, E., et al. (2013). Matrix metalloproteinase-10 plays an active role in microvascular complications in type 1 diabetic patients. Diabetologia 56 (12), 2743–2752. doi:10.1007/s00125-013-3052-4
PubMed Abstract | CrossRef Full Text | Google Scholar
Wigner, P., Grębowski, R., Bijak, M., Szemraj, J., and Saluk-Bijak, J. (2021). The molecular aspect of nephrolithiasis development. Cells 10 (8), 1926. doi:10.3390/cells10081926
PubMed Abstract | CrossRef Full Text | Google Scholar
Wozniak, J., Floege, J., Ostendorf, T., and Ludwig, A. (2021). Key metalloproteinase-mediated pathways in the kidney. Nat. Rev. Nephrol. 17 (8), 513–527. doi:10.1038/s41581-021-00415-5
PubMed Abstract | CrossRef Full Text | Google Scholar
Wu, T., Hu, E., Xu, S., Chen, M., Guo, P., Dai, Z., et al. (2021). clusterProfiler 4.0: a universal enrichment tool for interpreting omics data. Innov. (Camb) 2 (3), 100141. doi:10.1016/j.xinn.2021.100141
CrossRef Full Text | Google Scholar
Yiu, W. H., Wong, D. W. L., Chan, L. Y. Y., Leung, J. C. K., Chan, K. W., Lan, H. Y., et al. (2014). Tissue kallikrein mediates pro-inflammatory pathways and activation of protease-activated receptor-4 in proximal tubular epithelial cells. Plos One 9 (2), e88894. doi:10.1371/journal.pone.0088894
留言 (0)