Deep immunophenotyping reveals that autoimmune and autoinflammatory disorders are spread along two immunological axes capturing disease inflammation levels and types

Introduction

Autoimmune diseases (AD) and autoinflammatory diseases (AIF) are pathological conditions arising from the imbalance between immune tolerance and activation.1 AD and AIF represent more than 80 different heterogeneous disorders affecting up to 8% of the world’s population.2 Despite the prevalence and heterogeneity of these diseases, the pathophysiology and the nosology of these diseases remain largely elusive, and despite numerous treatment option we currently lack curative ones.3

AD and AIF can be classified according to several criteria,4 and are mostly classified according to a combination of clinical and biological feature sets. A generally accepted notion is that AD predominantly stems from dysregulations of adaptive immunity and that AIF arises primarily from dysregulations of innate immunity. However, it has been proposed that AD and AIF could rather lie on a continuum ranging from autoimmune to autoinflammatory, with different contributions of both the innate and the adaptive immune responses.5–8 Additionally, these disorders can be classified based on their types/localisations (such as joint, blood vessel, bowel, metabolism, or muscle), which suggest a contribution of tissue-specific factors in the proposed continuum.4 7 9

Only few studies have evaluated the differences between AD and AIF in a systematic manner, limiting the validation of the continuum hypothesis and the identification of immunological components that could be responsible for their similarities or differences. To revisit the nosology of AD and AIF, we initiated the Transimmunom observational clinical trial involving 443 patients with one or more of 15 disorders ranging from pure AD to pure AIF.10 Patients’ medical history and status were recorded, and deep phenotyping performed. We report here the results of the deep cytometry immunophenotyping that analysed more than 600 innate and adaptive immune cell parameters (both absolute cell counts and percentages) from patient’s blood. Unsupervised and supervised analyses were conducted with the purpose of: (1) evaluating the hypothesis of an AD to AIF continuum, (2) identifying clusters of diseases along with their shared and specific cell parameters and (3) characterising potential biomarkers.

Materials and methodsStudy design and participants

Building on on the Transimmunom clinical trial,10 we collected peripheral blood from 443 patients (enrolled from 2015 to 2022) who were affected by 15 distinct ADs or AIFs or related conditions, as well as blood from 71 healthy volunteers (HV) to serve as a reference condition (figure 1). Our screened disorders ranged from pure AIF to pure AD and included different types of disorder activities without any threshold of disease activity. These disorders included arthritis disorders as Behçet’s disease (BD; n=38), knee osteoarthritis (OA; n=45), rheumatoid arthritis (RA; n=91), spondyloarthritis (SA; n=58) and systemic lupus erythematosus (SLE; n=33); blood vessel disorders as antiphospholipid syndrome (APLS; n=23), Churg-Strauss disease (CS; n=6), granulomatosis with polyangiitis (GPA; n=14) and Takayasu arteritis (TA; n=22); metabolic disorders as type 1 diabetes (T1D; n=60) and type 2 diabetes (T2D; n=27); muscle disorder as myositis (MY; n=4); inflammatory bowel diseases (IBD) as Crohn’s disease (CD; n=10) and ulcerative colitis (UC; n=5); and IBD-like diseases as familial Mediterranean fever (FMF; n=7). OA and T2D were primarily included in this disease spectrum as ‘benchmarks’ for RA and T1D, but also considering the recent observations that suggest that these diseases have autoimmune and autoinflammatory components.11–15 Criteria used for disease diagnostics are indicated in online supplemental table S1. Main treatments associated with diseases are indicated in online supplemental table S2. These treatments included classical therapies provided to patients with AD and AIF, such as insulin for patients with T1D, oral antidiabetics for patients with T2D, hydroxychloroquine for patients with SLE, non-steroidal anti-inflammatory drugs (NSAIDs) for patients with AS, glucocorticosteroids for patients with BD, CS, GPA and TA, along with synthetic disease-modifying antirheumatic drugs, biological disease-modifying antirheumatic drugs and NSAIDs.

Figure 1Figure 1Figure 1

Experimental design of the Transimmunom observational clinical trial. We standardly collected whole blood from 447 patients having 15 autoimmune or autoinflammatory diseases enrolled in the Transimmunom observational clinical trial, and from 71 healthy volunteers. Disorders ranged from pure autoinflammatory to pure autoimmune diseases. The abbreviation and the number of patients associated with each disease are indicated. After quality control, samples were profiled using 13 flow cytometry panels quantifying a total of 66 unique cell markers to characterise more than 600 cell parameters (ie, absolute cell counts and percentages of cell populations). AID, autoimmune disease; AIF, autoinflammatory disease.

Deep immunophenotyping

Cytometry profiling of patient blood samples was performed using 12 flow cytometry panels of 10 cell markers each, as previously described.16 An additional cytometry panel targeting innate lymphoid cells (ILC) was used. In detail, these panels were designed to perform advanced analysis of T cells, B cells, natural killer (NK) cells, mucosal-associated invariant T (MAIT) cells, myeloid cells, monocytes and dendritic cells (DC). A set of three panels was specifically created to analyse the activation, migration and memory phenotype of T cells. One panel was specifically designed for investigating CD4+ T cell polarisation, whereas two panels primarily concentrated on studying the phenotype of regulatory T cells. Other panels were designed to explore B cells, NK cells, monocytes, dendritic cells, MAIT cells and myeloid-derived suppressor cells. We also developed an extra panel to identify the main immune cell populations. This panel included numeration beads, allowing to determine the absolute counts of all populations, while serving as a reference tube that allows the calculations of absolute counts in all other panels by extrapolation from shared populations. All acquisitions were performed on a Gallios cytometer (Beckman Coulter) maintained daily according to the manufacturer’s recommendations with Flow Check Pro and Flow Set Pro fluorospheres. Cell parameters were defined as previously described,16 and were obtained using manual gating using Kaluza V.1.3 software (Beckman Coulter). Regulatory T lymphocytes (Tregs) were defined as CD25+/CD127− cells among T cells. The expression of FoxP3 was also measured in one cytometry panel. ILCs were defined as CD127+ cells and based on a negative linage comprising CD4, CD3, CD14, CD19, CD34, TCRγδ, CD1a, TCRαβ, CD11c, CD94, CD123, FcεR1a and CD303. ILC1s were defined as CD294−/CD117−, ILC2s were defined as CD294+ and ILC3s were defined as CD294−/CD117+. More than 600 cell parameters were quantified using such deep immunophenotyping of innate and adaptive cell populations.

As previously described, Duraclone technology was used to standardise the staining procedures, which provides the possibility to use custom-designed panels of antibodies that are dried and precoated in individual tubes for direct labelling of blood. Moreover, we evaluated the coefficient of variability of measurements of populations that are shared by different panels. For validation of the regulatory T cell measurements, we assessed the correlation between the values obtained with or without the FoxP3 marker.16

Univariate analyses

Due to the unbalanced nature of the number of patients per biological condition, both Cliff’s Delta effect size and two-way non-parametric Wilcoxon test were used to identify cell parameters that were significantly differentially abundant between conditions. Cliff’s Delta is a non-parametric measure of effect size that is used in statistics to quantify the magnitude of the difference between two groups or conditions.17 Cliff’s Delta ranges from −1 to +1, where values closer to −1 indicate a large effect size in favour of the first group, values closer to +1 indicate a large effect size in favour of the second group and values close to 0 indicate a small or negligible effect size. Cell parameters with an absolute value of Cliff’s Delta effect size higher than 0.33—corresponding to a medium magnitude change—and with a p value lower than 0.05 were considered statistically significant.

Multivariate analyses

Hierarchical agglomerative clustering and dendrogram representations represented with heatmaps were constructed based on the Euclidean distances and using the Ward’s linkage method. Principal component analyses (PCA) were generated using the FactoMineR R package using unscaled Cliff’s Delta effect size values of cell parameters in each disease relative to HV. Cell parameters with an eigenvalue lower threshold than −0.5 or higher than 0.5 in one axis were considered to be associated with PCA axes. The identification of disease clusters along PCA axes was performed using k-means clustering. The optimal number of clusters was determined using the NbClust R package. Multidimensional scaling (MDS) representation was generated using the MASS R package based on Cliff’s Delta effect size values of cell parameters in each disease relative to HV. The coexpression network was constructed using the Spearman coefficient of correlation using an absolute threshold of 0.6. Classification decision trees were generated using the partykit R package based on all available cell parameters, using a maximal depth parameter of 5 and a minimal bucket size parameter of 10.

Patient and public involvement

Patients or the public were not involved in the design, conduct, reporting or dissemination plans of our research.

Data availability

The relative percentage or absolute count of the 224 cell parameters differentially abundant in at least one disease relative to HV is available on the Zenodo open repository through DOI: 10.5281/zenodo.10364382.

ResultsExperimental design and patient demographic characteristics

We included in this study all the 443 patients and 71 HVs from the Transimmunom trial from which blood samples were available (figure 1). The list of diseases included in this study and patient demographics are presented in table 1. Diseases ranged from purely AD to purely AIF. We also included diseases such as OA and T2D that were primarily used as ‘benchmaks’ for RA and T1D, respectively, and for which accumulating observations suggest autoimmune or autoinflammatory components in their aetiologies.11–15 Across all diseases, the mean age was 44.42±15.30 years, the mean body mass index (BMI) was 25.82±9.68 and the sex ratio was 57.70% towards females. There were no differences in age, BMI or sex ratio of patients relative to HV, except when expected (female bias for RA, SLE and TA; high BMI for APLS, OA, TA, T2D and UC; age bias for APLS, CS, GPA, MY, TA, OA, RA and T2D). The average time from diagnosis was 5.11±6.91 years (table 1). More than 600 innate and adaptive immune cell populations from patient’s blood were characterised using multiple flow cytometry panels that were designed to perform advanced analysis of T cells, B cells, NK cells, MAIT cells, myeloid cells, monocytes, dendritic cells and ILCs.16

Table 1

Characteristics of groups of individuals included in the study

Identification of five clusters of AD or AIF

We first identified cell parameters (absolute cell counts or percentages) differentially abundant in each disease relative to HV (online supplemental figure S1A). Based on our panels, we observed that the diseases had a stronger impact on adaptive immunity than on innate immunity (online supplemental figure S1B,C).

Unsupervised hierarchical clustering classified the 15 diseases into five clusters based on the set of 224 cell parameters significantly different relative to HV (figure 2A). The first cluster (C1) included BD, OA, SA, RA, T1D and T2D. The second cluster (C2) consisted solely of MY. The third cluster (C3) encompassed APLS, CS, GPA, SLE and TA. The fourth cluster (C4) contained only FMF. Lastly, the fifth cluster (C5) included CD and UC.

Figure 2Figure 2Figure 2

Disorders are gathered into five distinct clusters having each shared and specific immunological contributions. (A) Heatmap representation of Cliff’s Delta effect size measurements of cell parameters statistically different in at least one disease relative to healthy volunteers. Cliff’s Delta is a non-parametric effect size measure that quantifies the difference between two groups, with a range from −1 (all values in one group are lower than those in the other group) to 1 (all values in one group are larger than those in the other group), and 0 indicating no difference between the groups. Effect sizes are represented using a coloured gradient scale ranging from −0.8 to 0.8. Cell parameters downregulated in disease are represented in orange, and cell parameters upregulated in diseases are represented in blue. Unsupervised hierarchical clustering was used to automatically gather diseases and cell populations. Diseases are coloured based on the five disease clusters identified by the hierarchical clustering. (B) Dot plot representation showing the cell parameters found to be significantly impacted in at least seven diseases relative to healthy volunteers. Cell parameters are gathered by main immunological families (ie, Teffs, Tregs, B cells and ILCs). Diseases are ordered and coloured based on the five identified disease clusters. Significantly downregulated cell parameters are indicated in orange, and significantly upregulated cell parameters are indicated in blue. (C) Principal component analysis based on Cliff’s Delta values of each group of patients relative to the healthy condition. The percentages of variance information captured by the two first components are indicated along each axis. Conditions are coloured based on the five disease clusters identified by the hierarchical clustering. (D) Correlation circle showing cell parameters associated with the two first principal components (PC1). Selected parameters are coloured based on their immunological families (ie, Teffs, Tregs, B cells and innate). In such representation, each variable is represented by a dot and an arrow, with its coordinates corresponding to its correlation with PC1 (x-axis) and PC2 (y-axis). The closer the point is to the edge of the circle, the stronger its contribution to the respective principal components. Variables with arrows pointing in the same direction (acute angle between them) have a positive correlation. The closer the angle is to 0 degree, the stronger the correlation is. APLS, antiphospholipid syndrome; BD, Behçet’s disease; CD, Crohn’s disease; CS, Churg-Strauss disease; FMF, familial Mediterranean fever; GPA, granulomatosis with polyangiitis; HV, healthy volunteer; ILC, innate lymphoid cell; MY, myositis; OA, osteoarthritis; PCA, principal component analysis; RA, rheumatoid arthritis; SA, spondyloarthritis; SLE, systemic lupus erythematosus; T1D, type 1 diabetes; T2D, type 2 diabetes; TA, Takayasu arteritis; UC, ulcerative colitis.

We identified a set of 23 cell populations that were impacted by at least seven diseases relative to HV (figure 2B). These cell populations were mainly associated with Tregs, effector T lymphocytes (Teffs), B cells and ILCs. Remarkably, Treg subsets were consistently upregulated in all diseases relative to HV, except for the percentages of Inducible T-cell Costimulator (ICOS)+ Tregs among Tregs that were downregulated in 10 diseases (APLS, CD, CS, FMF, GPA, OA, RA, SLE, TA, and UC). The absolute counts of ICOS+ Tregs were downregulated in eight diseases (APLS, CD, FMF, GPA, RA, SLE, TA, and UC). Teff subsets were mainly downregulated in almost all diseases relative to HV, except for Th17-associated Teff cell parameters that were upregulated in seven diseases (BD, FMF, GPA, T1D, T2D, TA, and SA). The percentages of CD32− cells and transitional cells among B cells were downregulated in seven diseases (CD, CS, GPA, OA, SLE, TA, and UC). The percentage of ILC3 among ILCs was found to be consistently downregulated in eight diseases (APLS, BD, GPA, OA, RA, SLE, T2D, and TA).

To deconvolute the identified disease clustering, and to understand its driving components, we used PCA. This unsupervised analysis captured around 34% of the variance information in its first component (PC1) and around 25% of the variance information in its second component (PC2). The PCA confirmed the hierarchical clustering (figure 2C). A set of 39 cell parameters, which mainly involved Teffs, Tregs, B cells with a few innate cell populations such as ILC3s, type 1 natural killer T (NKT) cells and DC, was captured by this analysis and explained the disease clustering (figure 2D). Noteworthy, multidimensional scaling representation also confirmed the good separation of the five identified clusters of diseases (online supplemental figure S2).

The ICOS+/LAG3+ Treg ratio clusters AD and AIF according to their inflammatory status

We determined that disorders were spread into three clusters along the first axis (PC1) of the PCA (figure 3A). This clustering appears to be associated with the disease inflammatory levels. The first cluster (PC1-C1) comprised HV, T1D and T2D that are associated with no inflammation. Additionally, this cluster contained BD, SA, OA, MY and RA diseases that are disorders with low inflammatory levels. The second cluster (PC1-C2) comprised APLS, GPA, CS and TA diseases that are disorders with moderate inflammatory levels. Finally, the third cluster (PC1-C3) comprised CD, FMF and UC that are disorders with high inflammatory levels. Of note, SLE was classified in the second cluster (PC1-C2) associated with moderate inflammatory levels.

Figure 3Figure 3Figure 3

Disorders are spread along a first immunological axis that is mainly driven by an LAG3/ICOS balance in Tregs. (A) Principal component analysis constructed based on Cliff’s Delta values of each group of patients relative to the healthy condition along with the correlation circle of cell parameters exclusively associated with the first principal component (PC1). Conditions are coloured based on the three clusters of diseases identified along the PC1 axis. Selected parameters are coloured based on their immunological families. (B) Bar chart representation showing the eigenvalues of cell parameters exclusively associated with the PC1. Cell parameters with negative eigenvalues are driving the disease on the left of the PCA representation. Cell parameters with positive eigenvalues are driving the disease on the right part of the PCA representation. (C and D) Heatmap representation showing the Cliff’s Delta measures in each disease relative to healthy volunteers for selected cell parameters at the disease or cluster levels. Diseases or clusters of diseases are ordered as projected on the PC1 axis. Parameters significantly dysregulated in one condition are indicated with a red dot. Cell parameters are ordered according to their eigenvalues. (E and F) Boxplot and jitter representations showing the log2 fold change between the percentage of ICOS+ cells within Tregs and the percentage of LAG3+ cells within Tregs at the disease or cluster disease levels. Significant comparisons to healthy volunteers are indicated with their p values (**p<0.01, ***p<0.001) and Cliff’s Delta effect size (ES) measure. APLS, antiphospholipid syndrome; BD, Behçet’s disease; CD, Crohn’s disease; CS, Churg-Strauss disease; FMF, familial Mediterranean fever; GPA, granulomatosis with polyangiitis; HV, healthy volunteer; MY, myositis; OA, osteoarthritis; PCA, principal component analysis; RA, rheumatoid arthritis; SA, spondyloarthritis; SLE, systemic lupus erythematosus; T1D, type 1 diabetes; T2D, type 2 diabetes; TA, Takayasu arteritis; UC, ulcerative colitis.

This first axis was specifically associated with 13 cell parameters (figure 3A,B). Four parameters positively correlated with this PC1 axis—and drove diseases on the right part of the PCA. These parameters consisted of the percentages of LAP+ cells among Tregs, LAG3+ cells among Tregs, CD32− cells among B cells and the absolute number of mDC2. Nine parameters negatively correlated with this PC1 axis and drove diseases on the left part of the PCA. These parameters consisted of multiple cell parameters associated with ICOS+ in Tregs, Th or Tc, as well as the number of type 1 NKT cells.

Heatmap representations of effect size measures for these 13 cell parameters showed a clear gradient of downregulation or upregulation relative to HV along this first axis in the three clusters (figure 3C,D). This set of 13 parameters was almost not significantly impacted in the first cluster (PC1-C1) and was increasingly significantly impacted in the second and the third clusters (PC1-C2 and PC1-C3). The absolute numbers of ICOS+ Tregs and ICOS+ Teffs, and the percentage of LAG3+ cells among Tregs were representative of this gradient (online supplemental figure S3A–C). Of note, we found a strong positive correlation between the absolute numbers of ICOS+ Teffs and ICOS+ Tregs (online supplemental figure S3D).

Furthermore, we identified that the balance between the percentages of LAG3+ Tregs and ICOS+ Tregs was the most determining factor of this gradient (figure 3E,F).

Activated Treg subsets and ILC3 clusters AD and AIF according to the type of affected tissues

We identified that disorders were spread into three clusters along the second axis (PC2) of the PCA (figure 4A). UC, CD and FMF diseases (that are IBD and IBD-like disorders) were localised on the bottom of the PC2 axis, in a first cluster (PC2-C1). RA, BD, SA, SLE and OA (that are arthritis disorders) as well as T1D and T2D (that are metabolic diseases) were positioned in the middle of the PC2 axis, in a second cluster (PC2-C2). Finally, APLS, CS, TA, GPA (that are blood vessel disorders) and MY diseases were located at the top of the PC2 axis, in a C3 disease cluster, with MY (that is a muscle disorder) being at the extreme top of the PCA (PC2-C3). Thus, PC2 appears to cluster diseases based on affected tissues.

Figure 4Figure 4Figure 4

Disorders are spread along a second immunological axis that is mainly driven by Treg subsets and type 3 ILC. (A) Principal component analysis constructed based on Cliff’s Delta values of each group of patients relative to the healthy condition along with the correlation circle of cell parameters exclusively associated with the second principal component (PC2). Conditions are coloured based on the three clusters of disease identified along the PC2 axis. Selected parameters are coloured based on their immunological families. (B) Bar chart representation showing the eigenvalues of cell parameters exclusively associated with the PC2. Cell parameters with negative eigenvalues are driving the disease at the bottom of the PCA representation. Cell parameters with positive eigenvalues are driving the disease on the top part of the PCA representation. (C and D) Heatmap representation showing the Cliff’s Delta measures in each disease relative to healthy volunteers for all cell parameters at the disease or cluster levels. Diseases or clusters of diseases are ordered as projected on the PC2 axis. Parameters significantly dysregulated in one condition are indicated with a red dot. Cell parameters are ordered according to their eigenvalues. APLS, antiphospholipid syndrome; BD, Behçet’s disease; CD, Crohn’s disease; CS, Churg-Strauss disease; FMF, familial Mediterranean fever; GPA, granulomatosis with polyangiitis; HV, healthy volunteer; ILC, innate lymphoid cell; MY, myositis; OA, osteoarthritis; PCA, principal component analysis; RA, rheumatoid arthritis; SA, spondyloarthritis; SLE, systemic lupus erythematosus; T1D, type 1 diabetes; T2D, type 2 diabetes; TA, Takayasu arteritis; UC, ulcerative colitis.

This second axis was specifically associated with 10 cell parameters (figure 4A,B). Seven parameters positively correlated with this PC2 axis, driving the diseases to the top of the PCA. These cell parameters consisted of Tregs and Treg subsets—including LAP+ Tregs, CD45RA− Tregs and FoxP3+ Tregs—central memory CD4+ T cells and CD95+ Th cells. Conversely, three cell populations negatively correlated with this PC2 axis driving the diseases on the bottom of the PCA representation. These parameters included the percentage of TCR Va7.2+ cells among T cells, the percentage of plasmacytoid dendritic cells among CD45+ mononuclear cells and the percentage of ILC3s among ILCs.

Heatmap representations of effect size measures for this set of 10 cell parameters showed a clear gradient of regulations relative to HV along this second axis in the three clusters (figure 4C,D). This set of 10 parameters was almost not significantly impacted in the PC2-C1 cluster and was increasingly significantly impacted in the PC2-C2 and PC2-C3 clusters. The percentages of LAP+ cells among Tregs, the absolute number of LAP+ Tregs, the percentage of ILC3s among ILCs and the absolute number of ILC3s were representative of this gradient of dysregulations (online supplemental figure S4).

Disease clusters can be captured by a restricted core of immune cell populations

Together, the signature of 23 cell parameters significantly associated with PC1 or PC2 axes was enough to capture the five clusters of diseases that were previously identified (figure 5A). The C1 disease cluster contained T1D and T2D—which are two metabolic disorders without inflammation—along with BD, OA, RA and SA—which are arthritis disorders with low inflammation degree. The C2 disease cluster contained MY alone—a muscle disorder with a low inflammation. The C3 disease cluster contained APLS, CS, GPA and TA—which are blood vessel disorders with moderate degrees of inflammation. Finally, the C4 and C5 disease clusters comprised CD, FMF and UC—which are inflammatory bowel or inflammatory bowel-like diseases with high degrees of inflammation. Of note, SLE, which is mainly an arthritis disease generally associated with low inflammation degree, was found to belong to the C3 disease cluster.

Figure 5Figure 5Figure 5

Disease clusters can be captured by a restricted core of immune cell populations. (A) Heatmap representation of Cliff’s Delta effect size measurements relative to healthy volunteers in all diseases for the 23 cell parameters associated with first principal component (PC1) and PC2. Unsupervised hierarchical clustering was used to automatically gather diseases and cell populations. Diseases are coloured based on their associated inflammation levels (no-inflammation levels in yellow, low-inflammation levels in orange, moderate-inflammation levels in red, high-inflammation levels in brown) and shaped based on their type/localisation (stars for metabolic diseases, down triangles for arthritis, up triangles for blood vessels, squares for inflammatory bowel disease (IBD) and lozenge for IBD-like). (B) Heatmap representation of Cliff’s Delta effect size measurements relative to healthy volunteers in the five clusters of diseases for the 23 cell parameters associated with PC1 and PC2. (C) Coexpression network showing the significant correlations between cell parameters and other cell parameters. Parameters included in the immunological signature are represented with circles, while other cell parameters associated with parameters of this signature are represented with hexagonal shapes. Parameters are coloured according to their immunological families. (D) Dot plot and bar plot representations showing cell parameters of the signature or associated with the signature statistically different in diseases relative to healthy volunteers along with their connectivity degrees in the coexpression network and their number of occurrences across diseases. APLS, antiphospholipid syndrome; BD, Behçet’s disease; CD, Crohn’s disease; CS, Churg-Strauss disease; FMF, familial Mediterranean fever; GPA, granulomatosis with polyangiitis; ILC, innate lymphoid cell; MY, myositis; OA, osteoarthritis; RA, rheumatoid arthritis; SA, spondyloarthritis; SLE, systemic lupus erythematosus; T1D, type 1 diabetes; T2D, type 2 diabetes; TA, Takayasu arteritis; UC, ulcerative colitis.

Each cluster of disease was characterised by specific patterns of downregulated or upregulated cell parameters relative to the healthy group along the PCA axes (figure 5B). The C1 disease cluster was characterised by no dysregulations of parameters associated with the PC1 axis and limited significant dysregulations of parameters associated with the PC2 axis. The C2 disease cluster, which comprised MY alone, was characterised by dysregulation of cell parameters mainly associated with the PC2 axis. The third clustering of diseases was characterised by significant dysregulation of cell parameters associated with both PC1 and PC2 axes. Finally, the C4 and C5 disease clusters were characterised by dysregulations of cell parameters mainly associated with the PC1 axis.

To further investigate the characteristics of this signature, we created a coexpression network based on its parameters (figure 5C). We identified a main community of Teff− and Treg− parameters positively correlated. Among them, the number of CD95+ Th/Teffs/Tregs, the number of Tregs and the number of central memories Th/Teff had the highest connectivity degrees, reflecting potential central activities in immune dysregulations and could serve as biomarkers (figure 5D).

The number of LAG3+ Tregs is a key immunological marker in multiple ADs and AIFs

To complement our findings, we generated classification trees capturing parameters or combination of parameters that best separate patients from healthy individuals in each disease (online supplemental figures S5–S7). These supervised analyses aimed to capture the most discriminating features in each disease independently. Each classification tree was generated using all available cell parameters, and we quantified the model classification precisions.

A total of 12 cell parameters were found by classification trees (figure 6A,B). We found that patients from the C1 disease cluster were characterised by heterogeneous disease-specific markers rather than a common set of markers. The p values associated with the selected parameters were less significant, and the accuracy of decision tree predictions was lower for these diseases compared with other clusters of diseases. Patients from C2, C3 and C4 disease clusters were all characterised by a higher number of LAG3+ Tregs compared with HVs, with more significant p values and higher decision tree predictions compared with other clusters of diseases. Finally, the C5 disease cluster was characterised by a higher percentage of NKp44+ cells among NKdim cells relative to HV. Of note, the downregulation of the number of ICOS+ Tregs was found to be the most discriminative parameter to segregate all patients—regardless of their diseases—from controls, with a limited classification precision (online supplemental figure S8).

Figure 6Figure 6Figure 6

Classification decision trees identify LAG3+ Tregs as a key marker in multiple diseases. (A) Dot plot representation showing cell parameters found to be significantly involved in classification decision trees. For each disease, a classification decision tree was generated to identify the best marker or combination of markers that distinguish patients from healthy volunteers. The size of each dot is proportional to the −log10 of the p value, and dots are coloured according to the effect size relative to healthy volunteers. Cell parameters are sorted based on their number of occurrences in the generated models. (B) Accuracy, ranging from 0 to 1, of each classification decision tree are represented using a bar chart representation. (C and D) Boxplot and jitter representations showing the number of LAG3+ Tregs and the percentage of NKp44+ within NKdim cells for each disease. Diseases are gathered by identified disease clusters. Significant comparisons to healthy volunteers are indicated with their p values (*p<0.05, **p<0.01, ***p<0.001) and Cliff’s Delta effect size (ES) measure. APLS, antiphospholipid syndrome; BD, Behçet’s disease; CD, Crohn’s disease; CS, Churg-Strauss disease; FMF, familial Mediterranean fever; GPA, granulomatosis with polyangiitis; HV, healthy volunteer; MY, myositis; OA, osteoarthritis; RA, rheumatoid arthritis; SA, spondyloarthritis; SLE, systemic lupus erythematosus; T1D, type 1 d

留言 (0)

沒有登入
gif