Quantitative proteomics identified circulating biomarkers in lung adenocarcinoma diagnosis

Global proteome characterization of LUAD and healthy plasma

The mass and signal intensity of peptide and fragment ions after peptide fragmentation were obtained by mass spectrometry. The information at the peptide level is called the primary spectrum, and the information at the peptide fragment ion is called the secondary spectrum. Secondary MS data were retrieved by Proteome Discoverer (V2.4.1.15), Homo_sapiens_9606_PR_20201214. fasta database. In this research, proteomic analysis was matched on 10 LUAD peripheral blood (7 female and 3 male) and 10 healthy peripheral blood (Fig. 1a). A total of 1,181,604 secondary mass spectra were collected (Fig. 1b). 340,834 spectra are matching theoretical secondary spectra in the database, with a utilization ratio of 28.85%. A total of 11936 peptide sequences were identified from the matching results, including 10922 unique peptide sequences. During quantification, one protein corresponded to multiple specific peptides, and 2094 identification proteins were identified by specific peptides, resulting in 1772 proteins. Principal component analysis was used to show a general pattern of changes in protein abundance within and between the two groups to observe the similarities and differences between samples. As shown in Fig. 1c, the LUAD group exhibited clustering specificity, while the healthy group spread out randomly. The Pearson correlation coefficient between samples is shown by a heatmap. The correlation coefficient between proteomics data is shown in Fig. 1d, and the correlation between LUAD samples was 0.916–0.947 and between healthy samples ranged from 0.890 to 0.945, indicating a significant correlation.

Fig. 1figure 1

Global proteome characterization of LUAD and healthy plasma a The protocol of detailing experiments b Identification of quantifiable proteins from MS/MS spectra c Principal Component Analysis of quantified proteins, a complete separation of LUAD and NL groups d Pearson’s Correlation Coefficient of LUAD and NL groups. e volcano plot of all proteins in LUAD compared to NL

Proteomic features of DEPs in LUAD

To judge the conspicuousness of the difference in protein expression, we performed a t-test on the Log2FC of each protein in the LUAD group compared with the NL group. When the P value was ≤ 0.05, Log2FC > 1.5 were up-regulated DEPs, and Log2FC < 1/1.5 were down-regulated DEPs, and 317 DEPs were finally obtained. Compared with healthy controls, in the quantifiable proteins, there are 208 up-regulated proteins and 109 down-regulated proteins in the LUAD group signally, like MYH6, POSTN, NAP1L1, EXOC2 and NOTCH1. A volcano plot was drawn exhibiting DEPs in statistics in LUAD in comparison with the NL group (Fig. 1e). Furthermore, the heatmap also represented a hierarchical cluster of the DEPs (Fig. 2a). According to Gene Ontology (GO) functional classification for the 317 DEPs, they were cataloged into three categories and 22 terms, including 12 biological processes, three cellular components, and seven molecular functions (Fig. 2b). These proteins are involved in cellular processes, binding and catalytic activity. Figure 2c shows that about 36% DEPs were in the cytoplasm (115 proteins), 23% in extracellular space (74 proteins), 13% in mitochondria (42 proteins), and 11% in the nucleus (36 proteins), which suggests that DEPs of LUAD may be secreted to carry out signal transduction, participate in tumor energy metabolism through mitochondria, and regulate gene expression in the nucleus. The Clusters of Orthologous Groups of proteins (COG) categories indicated that the DEPs were related to energy metabolism, carbohydrate metabolism, signal pathway and mechanisms, cytoskeleton and protein modification after translation, protein transportation and chaperones (Fig. 2d).

Fig. 2figure 2

Proteomic features of differentially expressed proteins in lung adenocarcinoma a Heatmap with hierarchical clustering of DEPs b The orange indicating high expression and green indicating low expression. Each line represents a protein, and each column represents a sample c GO analysis of DEPs. d subcellular localization prediction of DEPs e KOG categories of DEPs f KEGG enrichment analyses of DEPs

KEGG and GO enrichment analyses of DEPs in plasma

To further predict the possible roles of DEPs, we conducted functional enrichment analysis using Fisher’s exact test, P < 0.05 is required. The GO analysis included biological processes, cellular component, and molecular function annotations, explaining the function of DEPs in multiple angles. The upregulated proteins in the plasma of LUAD patients compared with healthy subjects were significantly enriched in neurogenesis, negative regulation of cell communication and signal transduction, positive regulation of cell death and other biological processes (Fig. 2e). According to the cellular component annotation, the majority of the DEPs originated from the endoplasmic reticulum, extracellular space, Golgi apparatus, and endoplasmic reticulum lumen and are involved in protein processing and transport. The molecular function analysis revealed that DEPs were enriched in ionic binding activity and protein heterodimerization activity (Fig. 2f), which act a pivotal part in transmembrane transports and biological information transfer. The downregulated proteins were centered on phosphorylation, regulation of organelle organization, ribose and nucleoside phosphate metabolic processes. Cellular component annotation was focused on the cytoskeleton, mitochondrion, microtubule cytoskeleton, and other regions related to cell proliferation. The molecular functions of the downregulated proteins were enriched in anion binding, small molecule binding and nucleotide binding, which is consistent with the upregulated proteins.

Development of targeted protein assays using PRM

To further extend the research for application and prognosis role, PRM detection was further tested. The 317 DEPs obtained previously were retrieved from the database, and the next study focused on 40 proteins closely related to prognosis. PRM testing was performed in peripheral blood from an additional 10 LUAD patients and 10 healthy subjects, of which 35 proteins were quantitatively analyzed, limited by certain protein characteristics and expression abundance. 27 proteins were statistically significant, including 20 up-regulated differential proteins and seven down-regulated DEPs (Additional file 1: Table S1). The obtained data were processed by Skyline (v.3.6) to calculate protein relative abundance (Fig. 3a). The DEPs were involved in platelet activation, VEGFA-VEGFR2 signaling pathway, and carbohydrate catabolic process (Fig. 3b).

Fig. 3figure 3

Detection of targeted proteins by PRM a 35 proteins according to the adjusted P value are colored in blue (down-regulated) and yellow (up-regulated) b GO analysis of 27 DEPs detection by PRM c PPI network analysis was performed using the STRING11.0 online software. d the graph demonstrates individually the top 10 significant DEPs, P value < 0.05

As illustrated in Fig. 3D, we researched the altered proteins in STRING (V.11.5) and accessed the PPI network (Fig. 3c); these proteins may have a specific role in the early occurrence of LUAD. GAPDH, TUBA4A, WDR1, and LDHA are the central proteins shown in the network. The candidate proteins GAPDH and RAC1 showed the highest connectivity with other differentially expressed proteins between LUAD and NL using STRING by calculating the ratio of LUAD/NL, the expression levels of the top 10 significant DEPs, RAC1, ACTR2, PFKP, FHL1, UQCRC1, POSTN, RAB27B, ARPC2, RAP1B, and PNP, are shown in Fig. 3d.

KM-plotter was used to evaluate the prognosis performance of every DEP. The KM analysis showed that LUAD patients with positive ATCR2, FHL1, RAB27B, and RAP1B (Fig. 4) expression had observably longer OS than patients with negative expression (P < 0.05). The high expression of ARPC2, PFKP, PNP, RAC1, GAPDH, and TUBA4A was observably negatively correlated with OS prognosis (P < 0.05). ARPC2 is involved in the control of intracellular dynamic changes of actin and facilitates cell migration and tumor metastasis in lung, colon, and breast cancer. PFKP is a platelet-specific phosphofructokinase that makes a critical difference in metabolic reprogramming in certain cancers, including bladder cancer, breast cancer, and lung cancer, and is a potential driver gene in the GEO (Gene Expression Omnibus) database [20,21,22,23,24]. RAC1 (Rac family small GTPase 1) is a driver gene that regulates plenty of cellular events, particularly in cell growth and division, cytoskeletal and synaptic recombination, autophagy, and tumor metastasis. These 10 genes were analyzed for combined KM prognosis (Fig. 4) [25,26,27].

Fig. 4figure 4

Kaplan–Meier survival analysis of LUAD group compared with NL group

Overall survival of ATCR2, FHL1, RAB27B, RAP1B, UQCRC1, ARPC2, PFKP, PNP, POSTN, RAC1, GAPDH and TUBA4A were performed in KM-Plotter online survival analysis site

Potential diagnostic markers in LUAD

The samples used in this study were mainly concentrated in early LUAD, so the next step was to draw ROC curves to check the effect of each DEPs in the diagnosis of LUAD. If the AUC was > 0.70, the proteins could be regarded as a potential independent diagnostic factor. The mean plasma protein expression of 10 healthy people was negative, and ROC analysis was performed on the 27 proteins with differential expression by Medcalc software. 17 out of 27 proteins revealed a high AUC (> 0.80) between the LUAD group and NL group. Among those proteins, CCT7 had an AUC of 0.960, and there were five proteins with an AUC from 0.90 to 0.95, indicating that these central proteins might have the discriminative capacity in LUAD (Fig. 5a), P < 0.05. Logistic regression analysis was performed for 6 plasma proteins with higher AUCs, including CCT7, UQCRC1, PGD, GAPDH, LDHA, and GP5 (Table 2), resulting in a detection rate of 92% (Fig. 5b). UQCRC1 acts on cytochrome C upstream or internally of mitochondrial electron transport and has been studied extensively in Alzheimer's disease, which may play an important role in the targeted therapy of pancreatic cancer [28,29,30].

Fig. 5figure 5

ROC curves of the best protein a The AUC values of six proteins in these datasets b The AUC values of the combination of six-protein

Table 2 Six differentially expressed proteins identified in plasma samplesExperimental verification by western blot and immunohistochemistry

We examined the expression of these 6 potential biomarkers by WB detection in 25 plasma samples (14 patients and 10 controls) and immunohistochemical detection in 22 lung tissues (10 LUAD and 12 tumor-adjacent tissues) (Table 3). The expression of GP5, CCT7, UQCRC1, PGD, GAPDH and LDHA were higher in the LUAD patients’ plasma (Fig. 6a, b), which basically consistent with the PRM’s result. In addition, we also verified the expression of those proteins in LUAD tissues and tumor-adjacent tissues by IHC with a similar trend (Fig. 6c).

Table 3 Clinical information of patients included in WB and IHCFig. 6figure 6

Western blot analysis and immunohistochemistry staining of six proteins a Western blot of GP5, CCT7, UQCRC1, PGD, GAPDH, LDHA in healthy and LUAD plasmas, Ponceau S was used for quantification of total protein, b histogram of protein expression level calculated by line’s IntDen divide total protein per lane (µg). c Immunohistochemistry staining of GP5, CCT7, UQCRC1, PGD, GAPDH and LDHA in LUAD tissues and tumor-adjacent tissues; scale bar = 100 µm (magnification, × 200). (*P value < 0.05, **P value < 0.01)

留言 (0)

沒有登入
gif