Intestinal microbiota as biomarkers for different colorectal lesions based on colorectal cancer screening participants in community

Introduction

Colorectal cancer (CRC) is the third most common cancer and the second leading cause of cancer-associated death worldwide (Eng et al., 2024; Morgan et al., 2023). The incidence of CRC has been on the decline in affluent regions, largely attributed to the implementation of robust screening initiatives. However, this trend has not been uniform across all regions, many low- and middle-income countries, including China, are experiencing an upward trend of CRC incidence (Krul et al., 2023; Wong et al., 2021; Arnold et al., 2017). In addition, there is a notably increase in the mortality and burden of CRC in China, as evidenced by studies indicating a significant rise in cases among individuals in urban regions (Cao et al., 2021; Yu et al., 2018). This epidemiological trend underscores the critical need for targeted screening and early detection strategies to reduce the burden of CRC.

Among CRC screening populations, inflammation, polyps, adenomas, and CRC represent the most prevalent types of colorectal lesions (Gupta, 2022; Chen et al., 2019). A comprehensive meta-analysis has revealed that, among individuals at average risk, the prevalence rates for polyps, non-advanced adenomas, advanced adenomas, and CRC are 30.2, 17.7, 5.7, and 0.3%, respectively (Heitman et al., 2009). Several studies indicate that approximately 50–70% of CRC cases are linked to the progression of adenomas (Morgan et al., 2023; Elsayed et al., 2021; Zauber et al., 2012), and the transformation of an adenoma into CRC typically spans several years to decades (Keum and Giovannucci, 2019; Stryker et al., 1987). Therefore, timely intervention for inflammatory conditions and early removal of polyps and adenomas can prevent disease progression and reduce the incidence and mortality of CRC (He et al., 2020).

The composition of the microbiota has been extensively investigated for its role in various diseases (Sun et al., 2024; Gomaa, 2020). Notably, the gut microbiota has received considerable attention in the context of CRC diagnosis due to its complex involvement in carcinogenesis and tumor progression (Coker et al., 2022; Rezasoltani et al., 2018). Researchers have found that CRC patients have a reduced diversity and richness of gut microbiota compared to healthy individuals (Li et al., 2022; Mo et al., 2020; Cheng et al., 2020). Moreover, fecal microbiota profiles have been explored as potential screening tools for the early diagnosis of CRC, with particular emphasis on candidate pathogens such as Fusobacterium, Parvimonas, Gemella, Leptotrichia, and numerous other microbial taxa. However, the effectiveness of these biomarkers vary significantly depending on the different stage of colorectal lesions, as indicated by studies that highlight the differences in microbial profiles between healthy individuals, adenoma patients, and those with cancer (Tito et al., 2024; Li et al., 2022; Mizutani et al., 2020). Moreover, while some studies have reported promising results in differentiating colorectal adenoma from CRC using microbial biomarkers, further validation is necessary to ensure their reliability and specificity in clinical settings (Guo et al., 2024). Due to the complexity of intestinal microbes, stable early-stage biomarkers are lacking, studies show inconsistent results, and much work remains for their clinical use in CRC screening (Wang et al., 2023; Hua et al., 2022; Olovo et al., 2021; Hale et al., 2017; Zackular et al., 2013). Therefore, their accuracy needs validation across different precancerous stages, and their efficiency should be assessed.

In this community-based real-world study, we aim to understand the microbiota signatures in CRC carcinogenesis. Specifically, we systematically evaluated the microbiota characteristics in individuals at various lesion stages participating in CRC screening programs using 16S rRNA sequencing. By doing so, we aim to not only gain insight into the role of microbiota in colorectal carcinogenesis, but also to identify potential microbiota biomarkers for screening lesions for the development of CRC at different stages. The findings of this study will serve as a valuable resource for guiding future non-invasive screening techniques and preventive measures for CRC.

Materials and methods Study population

This study population consisted of community-based individuals enrolled in the CRC screening program administered by the Chronic Disease Prevention and Control Institute of Nanshan District. All participants were residents of Nanshan District, Shenzhen City, Guangdong Province, China, age 45–74 years old. Each participant provided informed consent prior to participating in the project. Trained medical personnel or investigators utilized a pre-established questionnaire assessment system to conduct risk evaluations (National Cancer Center, China, 2021; Chinese Medical Association, 2015). During the assessment, demographic data such as age, gender, height, weight, presence of chronic diseases, alcohol drinking history were gathered from the participants (more details are provided in the Supplementary material). Subsequently, a fecal immunochemical test (FIT) test was administered, and the stool samples were collected and stored. We excluded individuals with a prior history of CRC, younger than 45 or older than 75, pregnant women, and those who had taken antibiotics or probiotics within 2 months prior to stool collection. CRC risk was determined by testing positive based on the questionnaire assessment system or FIT, and participants were advised to schedule a colonoscopy within 90 days of the risk assessment.

A cohort of 20,729 participants was recruited for CRC screening program between May 2017 and December 2019. Following risk assessment, 5,600 individuals were classified as high risk for CRC. Of these, 1,266 participants provided stool samples before electronic colonoscopy. After excluding 506 samples due to insufficient quantity and 146 samples failing DNA quality control, 614 stool samples were ultimately utilized for 16S rRNA testing. After excluding 7 with melanosis coli, 9 with diverticulum, 9 had other malignant neoplasms (diagnosed within 5 years after the colonoscopy diagnosis), 12 lack questionnaire information, 14 were overlapping and 34 had inconsistent basic information compared to colonoscopy information, a total of 529 samples were analyzed in this study. Among these, 224 subjects were classified as normal, 99 had inflammatory disease, 129 had colorectal polyps, 59 had non-advanced adenomas, and 18 had higher colorectal lesions (13 advanced adenoma, 5 CRC). The flowchart of the cohort enrollment are presented in Figure 1. All participants provided informed consent, and the project was approved by the Ethics Committee of the Shenzhen Nanshan Center for Chronic Disease Control.

www.frontiersin.org

Figure 1. Flowchart of CRC screening cohort and enrollment.

Stool sample collection and storage

Stool sample collection involved two distinct ways. The first involved participants depositing stool sample into collection tubes, which were then stored in a preservation bag with ice packs. These samples were expeditiously transported to a collection site within two-hour, where they were collected by a designated collector, maintained at 2–8°C, and subsequently deposited into a −80°C freezer within four-hour. The alternative way entailed the collection of stool specimens directly into tubes pre-filled with preservation medium. Samples were collected at room temperature by the collector from the designated site where participants had placed them. These were then stored in a −80°C freezer within 24-h. For more detailed information, please refer to the Supplementary Figure 1.

Colonoscopy and clinical diagnosis

The electronic colonoscopy examination was performed by three tertiary public hospitals in Nanshan District. The normal control (Nor) group was characterized by the absence of inflammation, polyps, or any other bowel diseases observed during colonoscopy. Polyps were confirmed through histopathological if diameter exceeded 0.2 mm. The polyp group (Pol) was these identified as hyperplastic polyp or polyps with diameter less than 0.2 mm. Non-advanced adenoma (Nade) was defined as tubular adenoma with diameter less than 1 cm and lacking villous tissue. Advanced adenoma (Aade) was characterized by villous or tubulovillous features, or serrated adenomas, or adenomas larger than 1 cm, or the identification of high-grade dysplasia via histopathological examination. CRC patients were diagnosed through colonoscopy and histopathological examination. In certain analyses conducted within this study, we categorized the CRC and Aade subgroups as higher lesions (HLes). Based on the location of the polyps, participants were stratified into proximal and distal cases. Proximal polyps were defined as those in the cecum, ascending colon, hepatic flexure, and transverse colon. In contrast, distal polyps are found in the descending colon, the sigmoid colon, the rectum, or in the spleen. Those who exhibited proximal as well as distal polyps being categorized as proximal + distal.

DNA extraction, library construction and sequencing

Genomic DNA was extracted from stool samples utilizing the Apostle MiniGenomics Genomic DNA Extraction according to the manufacturer’s instructions. The quality and concentration of the DNA were evaluated through the use of 2% agarose gel (Tanon, China) and NanoDrop spectrophotometer (Thermo Fisher Scientific, USA). 16S rRNA gene sequencing was conducted using PCR libraries derived from the bacterial V4 region. The 16S V4 amplicon library using the Acegen 16S V4 Amplicon-Seq Kit. Genomic DNA (1–50 ng) was amplified in the first PCR round (10 cycles) with 16S V4 primers, followed by purification with Agencourt AMPure XP magnetic beads. The purified products underwent a second PCR (15–25 cycles) with index tag primers. The final library was purified again, quantified using Qubit 3.0 and Agilent 2100 Bioanalyzer, and sequenced on the Illumina platform (MiSeq sequencer) with a double-end index sequencing strategy and a read length of PE250. The primers of V4 were as follows:

16S V4 Forward: GACGCTCTTCCGATCTTATGGTAATTG TGTGCCAGCMGCCGCGGTAA.

16S V4 Reverse: TGTGCTCTTCCGATCTAGTCAGTCAGCCGGACTACH VGGGTWTCTAAT.

PCR Forward: AATGATACGGCGACCACCGAGATCTACACTCTTTCC CTACACGACGCT CTTCCGATCT.

PCR Reverse: CAAGCAGAAGACGGCATACGAGATXXXXXXXXXXXXGTGACTGGAGTTCAGACGTGTGCTCTTCCGATCT.

Sequencing data filtering

In this research, the trimming technique was employed to eliminate low-quality sequencing data and sequencing adapters, yielding refined data for further analysis. Trimmomatic software was utilized for the initial data trimming process, with the procedures of data processing involving the removal of low-quality sequences through a sliding window method using a window size of 4 bases. If the mean base quality falls below 15, the sequence is truncated at that particular position. Eliminate sequences with quality scores below 3 at the start and end, as well as those containing undetermined bases. Additionally, remove sequences contaminated with adapters through two methods: trimming the initial portion if the alignment score with the adapter sequence exceeds 7 (approximately 12 bp) and ensuring a base score above 30 in the overlap region between two sequences. Exclude sequences trimmed to less than 36 nucleotides and those unable to form pairs.

Data clustering and annotation

Data clustering and annotation were performed using the QIIME 2 software package. Double-ended sequencing data was imported into QIIME 2 to form.qza format that QIIME 2 can recognize. Dada2 was used for denoising and merging of paired-end sequences. FeatureTable and FeatureData summaries in QIIME 2 were used to generate representative sequence lists and perform statistical analysis. Sequences were clustered based on 97% similarity to create multiple taxonomic units, with each unit referred to as an Operational Taxonomic Unit (OTU). The representative sequence list generated by QIIME 2 was converted to tsv format, i.e., OTU table, using the biom software package; OTU table was annotated with species using the feature-classifier function in QIIME 2 to obtain taxonomic levels (phylum, class, order, family, genus, species) for each OTU. The dilution curves suggested the sufficiency of the sequencing data volume (Supplementary Figure 2).

Microbiota health character index and community structure difference

By comparing the relative abundance of two groups of microbial species associated with good and poor health conditions, the Gut Microbiota Health Index (GMHI) was employed to evaluate the propensity for disease; a lower GMHI value corresponds to a diminished health index (Gupta et al., 2020). The bacterial community stability was evaluated by the average variation degree (AVD), which is mainly based on the deviation of the mean OTU relative abundance. The lower AVD values indicates higher microbiome stability (Xun et al., 2021). The two-dimensional Principal Coordinates Analysis (PCoA) scatter plot was used to illustrate the similarities and differences between the normal control and lesion groups. The Bray–Curtis dissimilarity method was utilized to compute the distances between samples, thereby reflecting the degree of aggregation and dispersion within the sample communities. The intergroup differences in Beta diversity are analyzed using within-group intersample distance matrix data, which assesses the variation in community structure between groups. The Kruskal-Wallis test was applied to statistically evaluate differences among multiple groups.

The TwoGroup Welch’s t-test (Zhang et al., 2018; Ye et al., 2017), linear discriminant analysis (LDA) effect size (LEfSe), and Random Forest analysis at the genus level were employed to identify key microorganisms biomarkers. Taxa identified as ‘unclassified’ at the genus level were excluded. These methods were utilized to determine significant differences in abundance between groups and to develop predictive models. Data analysis of the bacterial composition and species differences between groups were conducted using Majorbio Cloud platform (Han et al., 2024).

Functional prediction analysis

Based on OTUs of the 16S rRNA sequences data, PICRUSt was used to estimate the abundance of functional categories (KEGG/COG analysis). Subsequently, a differential abundance analysis between different groups was conducted by utilizing LEfSe software with LDA score of 2.

Statistical analysis

Chi-square or Fisher’s Exact tests were employed for qualitative comparisons, while ANOVA was utilized for quantitative analyses, to evaluate demographic characteristics across different groups. For evaluating accuracy of selected differential related OTUs in predicting various stages of colorectal lesions, receiver operating characteristic curves (ROC) and areas under the curve (AUC) were used. R software (Version 4.2.1) was used to generate the figures. Statistical significance was determined by a p-value of 0.05.

Results Basic characteristics of study subjects

Table 1 displays the demographic information of the 529 participants included in the study, consisting of 196 men and 333 women with age ranging from 45 to 75 years. 48.14% of the participants were under 60 years old. Significant differences among the five groups of participants in terms of gender and age were observed (p < 0.05).

www.frontiersin.org

Table 1. Basic demographic and clinical characteristics of the participants.

Taxonomic classification of microbial communities

Based on the initial analysis, 28,952 de novo OTUs were categorized into 19 phyla, 33 classes, 95 orders, 175 families, and 503 genera. By including only taxa with at least two sequence reads in at least five participants, 14 phyla, 21 classes, 58 orders, 112 families, 347 genera, and 3,560 OTUs remain. And there were 1,046 OTUs shared among the Les, Nade, Inf, Pol, and Nor groups in addition to group-specific unique ones. (The distribution of OTUs among different groups is illustrated in Supplementary Figure 3).

Intestinal microbiota health and diversity analysis

In contrast to HLes, Nade, Inf, and Pol groups, the Nor control group had significantly higher GMHIs (Wilcoxon rank sum test, p < 0.05; Figure 2A). The HLes group had the highest AVD of 0.527, and the control group had the lowest AVD of 0.346 (Figure 2B). The PCoA scatter plot indicates no significant differences in the overall community structure of beta diversity among the five groups (Bray-Curtis; p = 0.477) (Figure 2C). Furthermore, the analysis of intergroup differences in beta diversity based on within-group intersample distance matrix data shows significant differences in community structure among the five groups (Bray-Curtis; Kruskal-Wallis H test, p = 2.583e-61) (Figure 2D).

www.frontiersin.org

Figure 2. Microbial diversity and health index of the five groups. (A) Comparing of GMHI between Nor group with the other four lesion groups. (A-1) GMHI of HLes and Nor groups. (A-2) GMHI of Nade and Nor groups. (A-3) GMHI of Pol and Nor groups. (A-4) GMHI of Inf and Nor groups. (B) AVD of these five groups. (C) PCoA plot of these five groups. (D) Intergroup difference in beta diversity.

Community composition of microbiota in different groups

Figure 3A heatmap illustrates the bacterial relative abundances which were log10 transformed of the 30 most dominant bacterial communities at the genus level across the five groups. The predominant genera within the top 10 abundances collectively constitute over 50% of the total sequences in these groups. Among these, Bacteroides and Faecalibacterium emerge as the two most dominant genera. Additionally, Megamonas was identified as the third-ranked (4.98%) prevalent genus in the HLes group (Figure 3B and Supplementary Figure 5). Further analysis revealed that in the CRC group, Megamonas constituted 17.44% of the microbial community, ranking second after Bacteroides (23.72%) and followed by Prevotella_9 (7.24%). Conversely, in the Aade group, Prevotella_9 represented a relatively minor proportion at 1.29%, with the predominant taxa were Bacteroides (28.18%), Faecalibacterium (6.53%), and Ruminococcus (3.26%) (Figure 3C and Supplementary Figure 5).

www.frontiersin.org

Figure 3. Community heatmap and barplot analysis at genus level within the five groups. (A) Community heatmap of these five groups (top 50). (B,C) Community barplot of the different groups.

Species differences in different groups

The multiple groups’ significant difference test illustrated in Figure 4A reveals that Erysipelotrichaceae_UCG-003, Lachnospiraceae_UCG-004, Tyzzerella, and other genera, demonstrated statistically significant or highly significant differences in average relative abundance across the five groups (p < 0.05, p < 0.01, p < 0.0001). Differentially abundant fecal bacterial taxa between the disease and control groups were further identified by LEfSe analysis (Figure 4B). The species Tyzzerella, Faecalitalea, et al. exhibit significant differences between the HLes and Nor groups. More specifically, Lactiplantibacillus, Tyzzerella, et al. show significant differences between the CRC and Nor groups. And Intestinimonas, Aeromonas, et al. are significantly different between the Aade and Nor groups. Furthermore, class of Negatwoutes and Vellondlales-Seenomonadales, et al. exhibit significant differences between the Nade and Nor groups. The species Ladobaallacese and Paraprevotela exhibit significant differences between the Pol and Nor groups. Aside from that, the species Eysipdatodotrdlaceae and Lschnospira show significant differences between the Inf and Nor groups.

www.frontiersin.org

Figure 4. (A) Multiple groups significant difference test shows that some species were significantly different at the genus level among the five groups of samples. (B) LEfSe analysis shows the differentially abundant bacteria between normal control and lesion groups.

Prediction model for distinguish different disease status

Through LEfSe, TwoGroups comparison and Random Forest analysis, we construct optimum OTU sets and develop corresponding predictive models to differentiate lesions from healthy controls. After adjusting for the covariates of age and gender and comparing the predictive models developed using these three methodologies, the TwoGroups method exhibit superior efficacy in screening for higher lesions. The AUC values were determined to be 0.81 for the comparison between HLes and Nor, 0.90 for HLes versus Nade, 0.90 for HLes versus Pol, and 0.93 for HLes versus Inf (Figure 5A-1 and Table 2). Following further stratification of the HLes group into the CRC and Aade subgroups, the AUC values for HLes versus Nor increased to 0.98 and 0.95, respectively (see Figure 5A-1). In contrast, when compared to the Nor group, the predictive efficacy of the AUC for the Nade, Inf, and Pol groups was 0.82, 0.71, and 0.53, respectively (refer to Figure 5A-3). Given the low efficiency to distinguish the Pol from the Nor group, individuals from the Pol group were combined with the Nor group to form a control group. Consequently, the AUC values for discriminating the HLes group, the Nade group, and the Inf group from the combined Nor and Pol group were 0.9 (with 0.98 for CRC and 0.98 for Aade), 0.7, and 0.72, respectively, as illustrated in Figures 5A-2, A-4. Table 2 illustrates the predictive performance of the various models in distinguishing between the lesion groups and the corresponding healthy control groups.

www.frontiersin.org

Figure 5. The ROC curves and their corresponding AUCs for microbiota OTU biomarkers among normal and disease groups. (A-1) The ROC curves between the HLes and Nor groups. (A-2) The ROC curves between the Les and Pol + Nor groups. (A-3) The ROC curves between the Nade, Pol, Inf, and Nor groups. (A-4) The ROC curves between the Nade, Inf, and Pol + Nor groups.

www.frontiersin.org

Table 2. Prediction models’ efficiency using microbiota biomarkers on genus level conferred by Random Forest, LefSe, and TwoGroups methods.

Functional analysis

We detected distinct enzymatic profiles in the groups using PICRUSt2 for functional prediction. Notably, 6-phospho-beta-glucosidase, along with other six enzymes, were more prevalent in individuals with Aade and CRC compared to those in the Inf, Pol, and Nor groups (refer to Figure 6A-1). The microbiota of the Aade group exhibited an elevated presence of polyamine-transporting ATPase, diglucosyl diacylglycerol synthase, exodeoxyribonuclease V, and 5′ to 3′ exodeoxyribonuclease. Conversely, the microbiota of the Nor group was predicted to have an increased biosynthesis of muramoyltetrapeptide carboxypeptidase and other enzymes (see Figure 6A-2 for more information). Additionally, the CRC group’s microbiota demonstrated an increased presence of mannose-6-phosphate isomerase, alpha-phosphotrehalase, histidinol-phosphatase, and pyridoxal 5′-phosphate synthase. In contrast, the microbiota of the Nor group is predicted to show increased biosynthetic activity of RNA helicase and additional enzymes (see Figure 6A-3).

www.frontiersin.org

Figure 6. Inferred functional differences between groups of KEGG enzymes using 16S amplicons. (A-1) The bars represent enzymes that are predicted to be significantly enriched in the microbiota of individuals with Aade and CRC compared to those in the Inf, Pol, and Nor groups. (A-2,A-3) Bule bars represent enzymes predicted to be more prevalent in individuals with Aade and CRC compared to the Nor group. In contrast, red bars denote enzymes that are more commonly found in the Nor group than in those with CRC and Aade.

Discussion

In our study, we observed that the health index of the intestinal microbiota in individuals with colorectal lesions was notably lower compared to that of the healthy control group. We identified potential microbial biomarkers that correlate with different stages of colorectal lesions who were undergoing CRC screening. In addition, a comparison of 16S sequences between disease and control groups revealed possible metabolic changes in microbes.

The examination of intestinal microbiota is increasingly acknowledged as a significant approach for identifying various tumor types (Liu et al., 2024; Amaro-da-Cruz et al., 2024). Previous research has underscored a robust association between gut microbiota and colorectal carcinogenesis, indicating that distinct microbial profiles could serve as potential biomarkers for screening and diagnosis (Tilg et al., 2018; Konstantinov et al., 2013). However, challenges remain in terms of variability in microbiota composition among different individuals and the need for robust methodologies to accurately assess microbial communities in different precancerous stages (Amaro-da-Cruz et al., 2024; Lam and Fong, 2024).

Microbiota dysbiosis has emerged as a significant factor in the development of colorectal carcinogenesis. Research indicates that dysbiosis of the gut microbiota may be implicated in the pathogenesis of CRC, particularly in its early stages (Yu et al., 2023; Kim et al., 2020; Sobhani et al., 2011). In peptic ulcer disease, dysbiosis of the gut microbiota is often observed, which can lead to an imbalance in the release of inflammatory factors such as interleukin-6 (IL-6), interleukin-1β (IL-1β), and tumor necrosis factor-α (TNF-α). These factors can impair the mucosal barrier function, promote ulcer formation, and impede healing (Garavaglia et al., 2024). Moreover, the presence of specific bacteria, such as Helicobacter pylori, can significantly alter the composition of the gut microbiota, this alteration can further contribute to the development and progression of gastrointestinal cancer (Chen et al., 2021; Dash et al., 2019). Research found that individuals with conventional adenomas exhibited lower species richness in their gut microbiota compared to polyp-free controls, suggesting that specific microbial profiles may be associated with early stages of colorectal carcinogenesis (Peters et al., 2016). Evidence points to a complex interplay between microbiota dysbiosis and colorectal carcinogenesis, where specific microbial changes may contribute to the initiation and progression of cancer (Kang et al., 2023; Pakbin et al., 2023; Coker et al., 2018). Our study revealed a decline in the health index during the early stages of colorectal lesions, with the stability of the microflora being inferior to that observed in the healthy intestinal population participating in community-based CRC screening.

Numerous bacterial species have been found to be associated with the pathogenesis of CRC, such as Fusobacterium nucleatum, Bacteroides fragilis, Escherichia coli, Enterococcus faecalis, Helicobacter hepaticus, Peptostreptococcus anaerobius, Helicobacter pylori, Streptococcus bovis, and Porphyromonas gingivalis (Senthakumaran et al., 2023; Kaźmierczak-Siedlecka et al., 2020; Wong and Yu, 2019; Yu et al., 2017). Our study identified Bacteroides as the predominant genus within our CRC screening cohort. Nonetheless, the variation in the mean relative abundance of this bacterium across different stages of the disease was not pronounced. This finding contrasts with previous research that has recognized Bacteroides fragilis as one of the seven bacteria enriched in CRC across diverse populations (Dai et al., 2018). It is possible that the methodological differences between our study and previous research contribute to the observed discrepancy. Research has demonstrated that the composition of the gut microbiota, including Bacteroides, Fusobacterium and others, is correlated with various factors such as transit time, fecal calprotectin levels, and BMI, which are significant covariates in the study (Tito et al., 2024; Baraibar et al., 2023; Hale et al., 2017).

At the genus level, among the top ten average relative abundances of microbial communities, significant alterations in Megamonas, Prevotella_9, Tyzzerella, Agathobacter and others were observed during the process of colorectal carcinogenesis. However, these strains, though relatively uncommon in earlier studies and documented in a limited number of investigations, have been noted in the literature. Notably, Megamonas has been reported to be associated with the progression of CRC, particularly showing elevated levels in moderately differentiated CRC cases (Qi et al., 2022; Han et al., 2020). This suggests a potential involvement of Megamonas in the progression and severity of CRC. Moreover, Prevotella, a bacterium influenced by dietary factors, has been identified as an oral pathogen linked to an increased risk of CRC (Yang et al., 2019). Additionally, research conducted by Zhang et al. (2023) also considers Prevotella as a potential biomarker for CRC diagnosis. Tyzzerella has been recognized as a predominant bacterial group capable of differentiating between individuals with CRC, adenomas, and those who are healthy. This implies that the presence of Tyzzerella in the gut microbiome could potentially serve as an indicator for the development of CRC (Senthakumaran et al., 2023; Kim et al., 2022). Our findings also underscore the important potential of these gut microbiome as biomarkers for assessing colorectal precancerous lesions.

Numerous studies have suggested that biomarkers related to the intestinal microbiota show promise for the early detection of CRC. However, a definitive biomarker for accurately predicting the presence of early colorectal cancerous lesions remains elusive, limiting the use of gut microbiota as a diagnostic tool in CRC screening populations (Olovo et al., 2021; Krigul et al., 2021; Zhou et al., 2020). A variety of populations, along with different sampling and detection methods, may contribute to this problem. This study focused on individuals who were undergoing CRC screening. We utilized three distinct methodologies to identify unique microbiota profiles that showed significant differences between the groups. The microbiota biomarkers identified in our research demonstrated the ability to distinguish advanced adenomas and CRC from healthy controls, with the predictive accuracy of the selected microbial markers achieving the AUC values of 0.98. The AUC values were minimally affected by combining samples from the Pol group with those from the normal group. The predictive performance observed in this study is comparable to the report which employed 11 metabolite biomarkers and six bacterial species to differentiate CRC from normal controls (Coker et al., 2022). Additionally, researchers achieved high AUC values using bacterial-related biomarkers in their analyses to differentiate CRC vs. CRA (AUC 0.994) (Gao et al., 2022) and CRC vs. NC (AUC 0.930) (Yang et al., 2020). Notably, the AUC value for differentiate CRA from NC derived from our study exceed those reported in several other studies, including an AUC value of 0.8 (Wu et al., 2021) and an AUC of 0.792 (Wei et al., 2020). These findings collectively indicate that the gut microbiota biomarkers hold promise for practical application in initiatives for CRC screening and early detection.

Although the bacterial-related biomarkers identified in this study have demonstrated satisfactory results as screening markers within a real-world CRC screening population, several limitations persist. Notably, challenges were encountered in obtaining stool samples from individuals undergoing CRC screening, as not all participants in the screening program provided fecal samples. Consequently, our study was limited to testing the gut microbiota of only those individuals who both underwent colonoscopy and provided fecal samples. Additionally, the current study is limited by a small sample size, particularly within the advanced adenoma and CRC populations. Further research is required to ascertain the applicability of these findings to populations undergoing CRC screening in diverse cultural and regional contexts.

Conclusion

This study has identified specific microbial biomarkers that can differentiate between colorectal or other site lesions and comparative healthy individuals, thereby advancing our understanding of the potential utility of gut microbiota in community-based CRC screening programs. And efforts will be concentrated on broadening the application of gut microbiota as CRC screening biomarkers across diverse populations.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding authors.

Ethics statement

The studies involving humans were approved by the Ethics Committee of the Shenzhen Nanshan Center for Chronic Disease Control. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation in this study was provided by the participants’ legal guardians/next of kin. Written informed consent was obtained from the individual(s), and minor(s)’ legal guardian/next of kin, for the publication of any potentially identifiable images or data included in this article.

Author contributions

GL: Data curation, Formal analysis, Methodology, Visualization, Writing – original draft. DZ: Funding acquisition, Supervision, Writing – review & editing. BO: Methodology, Writing – review & editing. YC: Resources, Writing – review & editing. YZ: Methodology, Writing – review & editing, Data curation.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This research was supported in part by Science and Technology Research Program of Shenzhen City (JCYJ20220531091804010).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The authors declare that no Gen AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmicb.2025.1529858/full#supplementary-material

Abbreviations

Aade, advanced adenoma; AUC, area under curve; AVD, average variation degree; BMI, body mass index; CI, confidence interval; CRC, colorectal cancer; FDR, false discovery rate; GMHI, Gut Microbiota Health Index; FIT, fecal immunochemical test; HLes, higher lesions; Inf, inflammatory disease; Nor/NC, normal control; Nade, non-advanced adenoma; OTU, operational taxonomic unit; Pol, polyps; ROC, receiver operating characteristic.

Footnotes References

Amaro-da-Cruz, A., Rubio-Tomás, T., and Álvarez-Mercado, A. I. (2024). Specific microbiome patterns and their association with breast cancer: the intestinal microbiota as a potential biomarker and therapeutic strategy. Clin. Transl. Oncol. 27, 15–41. doi: 10.1007/s12094-024-03554-w

PubMed Abstract | Crossref Full Text | Google Scholar

Arnold, M., Sierra, M. S., Laversanne, M., Soerjomataram, I., Jemal, A., and Bray, F. (2017). Global patterns and trends in colorectal cancer incidence and mortality. Gut 66, 683–691. doi: 10.1136/gutjnl-2015-310912

PubMed Abstract | Crossref Full Text | Google Scholar

Baraibar, I., Ros, J., Saoudi, N., Salvà, F., García, A., Castells, M. R., et al. (2023). Sex and gender perspectives in colorectal cancer. ESMO Open 8:101204. doi: 10.1016/j.esmoop.2023.101204

PubMed Abstract | Crossref Full Text | Google Scholar

Cao, W., Chen, H.-D., Yi-Wen, Y., Li, N., and Chen, W.-Q. (2021). Changing profiles of cancer burden worldwide and in China: a secondary analysis of the global cancer statistics 2020. Chin. Med. J. 134, 783–791. doi: 10.1097/CM9.0000000000001474

PubMed Abstract | Crossref Full Text | Google Scholar

Chen, H., Li, N., Ren, J., Feng, X., Lyu, Z., Wei, L., et al. (2019). Participation and yield of a population-based colorectal cancer screening programme in China. Gut 68, 1450–1457. doi: 10.1136/gutjnl-2018-317124

PubMed Abstract | Crossref Full Text | Google Scholar

Chen, C.-C., Liou, J.-M., Lee, Y.-C., Hong, T.-C., El-Omar, E. M., and Ming-Shiang, W. (2021). The interplay between helicobacter pylori and gastrointestinal microbiota. Gut Microbes 13, 1–22. doi: 10.1080/19490976.2021.1909459

PubMed Abstract | Crossref Full Text | Google Scholar

Chinese Medical Association (2015). 中国早期结直肠癌及癌前病变筛查与诊治共识 [Consensus on screening, diagnosis and treatment of early colorectal cancer and precancerous lesions in China]. Chinese J. Pract. Internal Med. 35, 211–227.

Google Scholar

Coker, O. O., Dai, Z., Nie, Y., Zhao, G., Cao, L., Nakatsu, G., et al. (2018). Mucosal microbiome dysbiosis in gastric carcinogenesis. Gut 67, 1024–1032. doi: 10.1136/gutjnl-2017-314281

PubMed Abstract | Crossref Full Text | Google Scholar

Coker, O. O., Liu, C., William Ka Kei, W., Wong, S. H., Jia, W., Sung, J. J. Y., et al. (2022). Altered gut metabolites and microbiota interactions are implicated in colorectal carcinogenesis and can be non-invasive diagnostic biomarkers. Microbiome 10:35. doi: 10.1186/s40168-021-01208-5

PubMed Abstract | Crossref Full Text | Google Scholar

Dai, Z., Coker, O. O., Nakatsu, G., Wu, W. K. K., Zhao, L., Chen, Z., et al. (2018). Multi-cohort analysis of colorectal cancer metagenome identified altered bacteria across populations and universal bacterial markers. Microbiome 6:70. doi: 10.1186/s40168-018-0451-2

PubMed Abstract | Crossref Full Text | Google Scholar

Dash, N. R., Khoder, G., Nada, A. M., and Al Bataineh, M. T. (2019). Exploring the impact of Helicobacter pylori on gut microbiome composition. PLoS One 14:e0218274. doi: 10.1371/journal.pone.0218274

PubMed Abstract | Crossref Full Text | Google Scholar

Elsayed, I., Li, L., Sheahan, K., Moran, B., Bakheit, S., and Wang, X. (2021). Adenoma to carcinoma: a portrait of molecular and immunological profiles of colorectal sporadic tumors. Int. Immunopharmacol. 100:108168. doi: 10.1016/j.intimp.2021.108168

PubMed Abstract | Crossref Full Text | Google Scholar

Eng, C., Yoshino, T., Ruíz-García, E., Mostafa, N., Cann, C. G., O’Brian, B., et al. (2024). Colorectal cancer. Lancet 404, 294–310. doi: 10.1016/s0140-6736(24)00360-x

PubMed Abstract | Crossref Full Text | Google Scholar

Gao, R., Chunyan, W., Zhu, Y., Kong, C., Zhu, Y., Gao, Y., et al. (2022). Integrated analysis of colorectal cancer reveals cross-cohort gut microbial signatures and associated serum metabolites. Gastroenterology 163, 1024–1037.e9. doi: 10.1053/j.gastro.2022.06.069

PubMed Abstract | Crossref Full Text | Google Scholar

Garavaglia, B., Vallino, L., Amoruso, A., Pane, M., Ferraresi, A., and Isidoro, C. (2024). The role of gut microbiota, immune system, and autophagy in the pathogenesis of inflammatory bowel disease: molecular mechanisms and therapeutic approaches. Aspects Mol. Med. 4:100056. doi: 10.1016/j.amolm.2024.100056

Crossref Full Text |

留言 (0)

沒有登入
gif