Abstract
Background/Aim: Better stratification of the risk of relapse will help select the right patients for adjuvant treatment and improve targeted therapies for patients with colon cancer. Materials and Methods: To understand why a subset of tumors relapse, we compared the proteome of two groups of patients with colon cancer with similar stage, stratified based on the presence or absence of recurrence. Results: Using tumor biopsies from the primary operation, we identified dissimilarity between recurrent and nonrecurrent mismatch satellite stable colon cancer and found that signaling related to immune activation and inflammation was associated with relapse. Conclusion: Immune modulation may have an effect on mismatch satellite stable colon cancer. At present, immune therapy is offered primarily to microsatellite instable colon cancer. Hopefully, immune therapy in mismatch satellite stable colon cancer beyond PD-1 and PD-L1 inhibitors can be implemented.
Colorectal cancer is the third most common type of cancer in men and second in women, accounting for 10 and 9.2%, respectively, of total cancer cases worldwide (1). Colon cancer typically develops from a benign precursor adenoma that is visible on the mucosal surface of the colon. Primary treatment following early detection is surgery, with postoperative adjuvant therapy for selected groups based on the histopathological findings in the operation specimen – in principle when lymph node involvement is detected. Colon cancer is staged according to the Union Internationale Contre le Cancer (UICC) TNM system, stratifying the local growth of the tumour, the spread to lymph nodes and the distant metastases (2). The TNM stage correlates well with the prognosis and the recurrence rate and is used to group the disease into stages as a measure of the severity of cancer.
There are at least three distinct pathways in colorectal cancer pathogenesis: the chromosomal instability (CIN), microsatellite instability (MSI), and CpG island methylator phenotype (CIMP) pathways (3). The molecular mechanisms underlying cancer development are multifactorial, however initial disease mechanisms in colorectal cancer suggest activation of the canonical Wnt signalling pathway, through mutations in the APC (4) and β-catenin (5) genes, followed by accumulation of somatic mutations in KRAS (6) and PI3K (7), and inactivation of TGF-β (8) as well as the p53 pathway (8). The microsatellite instable high (MSI-H) cancers have a defect in mismatch repair (MMR) proteins, leaving spontaneous mutations un-repaired during DNA replication, thus causing frame-shift mutations or silenced genes (9). MSI-H cancers are in general hypermutated, expressing several neoantigens causing a strong immune response (10). This effect is linked to the response to immune therapy by PD-1 and PD-L1 inhibitors (11-14).
The role of immunity and immune activation in colorectal cancer has been thoroughly reviewed by several authors (10, 15-19). A subgroup of microsatellite stable (MSS) tumors has high immune activation, but at present a clear set of criteria for the tumors responding to of immune therapy has not been defined (12). Identifying candidates for immune therapy in MSS may help improve the outcome of subgroups of CRC.
In recent years, the Consensus Molecular Subtypes (CMS) can predict prognosis (20-22). A challenge with this classification is that it is technically demanding and time consuming, and not all patients can be clearly allocated to the four groups. Tools such as ColoType (23) help stratify CMS based on RNA signature with prognostic value. However, CMS stratification is still complex partly due to cell heterogeneity (24) and the predictive utility is still limited. Proteomics and subsequent pathway analyses may play a role in understanding and clarifying the mechanisms underlying the risk of recurrent disease.
Vasaikar et al. (25) has recently published a proteogenomic characterization of colon cancer, analysing 110 patients with a broad set of omics methodologies. A major finding was that proteomics identified an association between decreased CD8 T-cell infiltration and increased glycolysis in MSI-H tumors, suggesting glycolysis as a potential target to overcome the resistance of MSI-H tumors to immune checkpoint blockade. For the non-hypermutated cancers, somatic mutations are most frequent in APC, TP53, SMAD4, PIK3CA, KRAS, TCF7L2, SOX9, ARID1A and FAM123B (26). A better characterization of the molecular mechanisms underlying the differences, in particular between stages TNM2 and TNM3, is warranted. Identifying specific protein signatures for recurrence risk of these two stages are of clinical interest. There are potential markers for stratifying recurrence risk, but none are validated and implemented to current clinical use (27).
The aim of the present study was to identify proteins associated with relapse, which can help in selecting patients who need adjuvant therapy after surgery. This study is a feasibility study based on a limited material and the objective was thus to assess the value of large-scale proteomic analyses of perioperative biopsies from colon cancer patients. The main current clinical problem is that we need to treat many patients unnecessarily to rescue those with high risk of new tumor activity. In order to pinpoint these proteins, we have identified differentially expressed proteins that may contribute to progression into relapse, by comparing relapse versus non-relapse samples. We also describe the proteomic differences between TNM2 and TNM3 as these patient-groups receive different treatment based on staging at initial diagnosis.
Materials and Methods
Patients. The patients included in the present study are from a cohort of 265 patients with localized colon cancer that were included in a prospective study from January 2007 to December 2011 (28), with perioperative biopsies collected by the operating surgeon. In this cohort, 78 patients had pMMR-tumour material stored in RNA-Later. The patients included were selected based on matching similar TNM and remission status groups on tumor site, recurrence status, age and gender to reduce bias. Distant metastases were excluded by chest and abdomen CT imaging. A complete mesocolic resection (CME) with high vascular tie was performed. No patient received neoadjuvant chemotherapy or radiation. Patients under 70 years received adjuvant chemotherapy according to the Nordic FLOX regimen, whereas patients between 70 and 75 years were treated by the Nordic FLv regimen if there was nodal spread (29, 30). The study was conducted according to the guidelines of the Declaration of Helsinki and approved by the Regional Ethics Committee West (REK Vest 15666), the Norwegian Medicines Control Authority and the Data Inspectorate and thus comply with national regulations.
Surgical resection was performed at Haraldsplass Deaconess Hospital, Bergen, Norway. All patients included were diagnosed with MSS colon adenocarcinomas. The patients were followed-up for minimum five years. The regular check-up included clinical examination, CEA blood test and computed tomography (CT) of the chest and abdomen every sixth month for three years, then yearly to the fifth year after surgery. Colonoscopy was done every three years. We had access to the full records of the patients, both during treatment and follow-up for five years, including relapse status. The material included patients with TNM stages I-III of colon cancer.
Processing of biopsy material: Tissue lysis and protein digestion. Per-operative biopsies from tumour tissue and control tissue were obtained from 25 patients with sufficient fresh-frozen material available. Biopsies (n=50, 25 tumour and 25 normal colon tissue samples) were cut and homogenized in 0.5 ml Dulbecco’s phosphate buffered saline (DPBS) and TissueLyzer. Samples were sonicated three times for 30 s at 30% power, followed by centrifugation at 14,000 ×g for 3 min. The supernatant was transferred to a new tube, and the protein concentration was measured using the Bradford assay. From each sample 20 μg total protein were stored at -80°C before further processing. Five samples of 20 μg were pooled (Table I) to make 100 μg samples. The same was done for the respective normal tissue samples and ten pooled samples of 100 μg protein were processed for proteomics.
Patient characteristics, primary tumor site, primary tumor histology, primary tumor grade, lymph nodes and positive lymph nodes.
The entire amount of protein (100 μg) in each sample was reduced, alkylated and digested into peptides with trypsin. The peptide samples were TMT-labelled (10-plex) (Thermo Fisher, San Jose, CA, USA) and fractionated as described in (31).
Tandem mass tag (TMT) 10-plex labelling. TMT reagents were re-suspended in acetonitrile (ACN). Desalted peptides were re-suspended in 100 μl of 200 mM HEPES pH 8.5, 30 μl of ACN, and 10 μl of the TMT reagents were added to the respective peptide samples, gently vortexed, and incubated for 1.5 h at room temperature (RT). To prevent unwanted labelling, the reaction was quenched by adding 10 μl of 5% hydroxylamine and incubated for 15 min at RT. Equal amounts of the TMT-labelled samples were combined and concentrated to near dryness, followed by desalting via C18 solid phase extraction.
Off-line basic pH reversed phase fractionation. The combined labelled peptide samples were pre-fractionated by basic pH reversed phase HPLC as described previously (32), using an Agilent (P/N 770995-902) 300Extend-C18, 5 μm, 250 mm × 4.6 mm id column, connected to an Agilent Technology off-line LC-system (Santa Clara, CA, USA). Solvent A was 5% ACN, 10 mM NH4HCO3 pH 8, and solvent B was 90% ACN, NH4HCO3 pH 8. The samples were re-suspended in 500 μl solvent A and loaded onto the column. Column flow was set to 0.8 ml/min and the gradient length was 70 min, as follows: from 0-35 min solvent 50% A/50% B, and from 35-50 min 100% B, and from 50-70 min 100% A. The labelled peptides were fractionated into 96 fractions, and further combined into a total of 12 fractions. Each fraction was acidified with 1% formic acid, concentrated by vacuum centrifugation to near dryness, and desalted by StageTip. Each fraction was dissolved in 5% ACN/5% formic acid for LC-MS/MS analysis.
LC-MS3 analysis. From each of the 12 fractions, ~5 μg was dissolved in 1% aqueous formic acid (FA) prior to LC-MS/MS analysis on an Orbitrap Fusion mass spectrometer (Thermo Fisher Scientific) coupled to a Proxeon EASY-nLC 1000 liquid chromatography (LC) pump (Thermo Fisher Scientific). Peptides were fractionated on a 75-μm inner diameter microcapillary column packed with ~0.5 cm of Magic C4 resin [5 μm, 100 Å, Michrom Bioresources (Auburn, CA, USA)] followed by ~35 cm of GP-18 resin (1.8 μm, 200 Å, Sepax, Newark, DE, USA). For each analysis, we loaded ~1 μg onto the column.
Peptides were separated using a 3 hr gradient of 6 to 26% ACN in 0.125% FA at a flow rate of ~350 nl/min. Each analysis used the multi-notch MS3-based TMT method (33) on an Orbitrap Fusion mass spectrometer, which has been shown to reduce ion interference compared to MS2 quantification (32). The scan sequence began with an MS1 spectrum (Orbitrap analysis; resolution 120,000; mass range 400-1400 m/z; automatic gain control (AGC) target 2×105; maximum injection time 100 ms). Precursors for MS2/MS3 analysis were selected using a TopSpeed of 2 s. MS2 analysis consisted of collision-induced dissociation (quadrupole ion trap analysis; AGC 4×103; normalized collision energy (NCE) 35; maximum injection time 150 ms). Following acquisition of each MS2 spectrum, we collected an MS3 spectrum using our recently described method (33) in which multiple MS2 fragment ions were captured in the MS3 precursor population using isolation waveforms with multiple frequency notches. MS3 precursors were fragmented by high-energy collision-induced dissociation (HCD) and analysed using the Orbitrap (NCE 55; AGC 5×104; maximum injection time 150 ms, resolution was 60,000 at 400 Th).
Data analysis. Mass spectra were processed using a Sequest-based in-house software pipeline (34), and spectra were converted to mzXML using a modified version of ReAdW.exe. Database searching included all entries from the human UniProt database (March 11, 2014). This database was concatenated with one composed of all protein sequences in the reversed order. Searches were performed using a 50 ppm precursor ion tolerance for total protein level analysis. The product ion tolerance was set to 0.9 Da. These wide mass tolerance windows were chosen to maximize sensitivity in conjunction with Sequest searches and linear discriminant analysis (34, 35). TMT tags on lysine residues and peptide N termini (+229,163 Da) and carbamidomethylation of cysteine residues (+57,021 Da) were set as static modifications, while oxidation of methionine residues (+15,995 Da) was set as a variable modification.
Peptide-spectrum matches (PSMs) were adjusted to a 1% false discovery rate (FDR) (36, 37). PSM filtering was performed using a linear discriminant analysis, as described previously (34), while considering the following parameters: XCorr, ΔCn, missed cleavages, peptide length, charge state, and precursor mass accuracy. For TMT-based reporter ion quantitation, we extracted the summed signal-to-noise (S/N) ratio for each TMT channel and found the closest matching centroid to the expected mass of the TMT reporter ion.
The search space for each reporter ion was limited to a range of 0.003 Th to prevent overlap between the isobaric reporter ions. For protein-level comparisons, PSMs were identified, quantified, and collapsed to a 1% peptide false discovery rate (FDR) and then collapsed further to a final protein-level FDR of 1%. Moreover, protein assembly was guided by principles of parsimony to produce the smallest set of proteins necessary to account for all observed peptides.
Proteins were quantified by summing reporter ion counts across all matching PSMs using in-house software, as described previously (34). PSMs with poor quality, MS3 spectra with more than eight TMT reporter ion channels missing, MS3 spectra with TMT reporter summed signal-to-noise ratio that was less than 100, or no MS3 spectra were excluded from quantitation (38). Protein quantitation values were exported for further analysis in Microsoft Excel (Redmond WA, USA) or SAS JMP (Cary, NC, USA). Each reporter ion channel was summed across all quantified proteins and normalized assuming equal protein loading of all 10 samples.
The pathway analyses were generated using QIAGEN’s Ingenuity Pathway Analysis (IPA®, Redwood City, CA, USA). Briefly, the analyses were performed with the following settings: Expression Value Type (Exp Log Ratio), Reference set (Ingenuity Knowledge Base + Endogenous chemicals), Relationships to consider (Direct and Indirect Relationships), Interaction networks (35 molecules/network; 25 networks/analysis), Data Source (all), Confidence (Experimentally Observed), Species (Human, Mouse, Rat), Tissue & Cell Lines (all), Mutations (all).
The clustering analyses were performed in Perseus version 1.5.6.0 (39). Roughly, data were imported as log2-transformed data. Z-score normalization was performed based on rows. Hierarchical clustering was performed with settings Euclidian distance, averaged linkage with preprocessing k-means both in rows and columns tree. We defined 15 row clusters, exported protein list and analysed on basis of group means with SEM. In selection for proteins with a differential expression, we used a threshold of 1.5 in fold change.
The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE (40, 41) partner repository with the dataset identifier PXD026630.
Results
Per-operative biopsies were resected from tumour tissue and control tissue from 25 patients diagnosed with microsatellite stable colon adenocarcinomas (Patient characteristics are shown in Table I). From a total of 50 samples (tumour and normal tissue samples), tissues from five patients were grouped according to TNM staging (Figure 1A) as well as recurrence status into five pools of tumor tissue (TNM1, TNM2R+, TNM2R-, TNM3R+, TNM3R-) and normal colon tissues resected from the same patients (Figure 1B). The samples were pooled independent of gender, age and location of the tumour at initial diagnosis (right-side or left-side colon cancer). As the focus of this study was evaluating the differences in the proteome of tumors from colon cancer patients with recurrence versus patients without recurrence, we restricted our attention to MSS colon adenocarcinomas diagnosed with TNM stage 1, 2 and 3, including TNM2 and TNM3 with and without recurrence.
Cluster analysis of the cancer samples. (A) Cartoon depicting tumour development and the stages selected for this study according to the TNM system. (B) Experimental design depicting the cohorts employed and workflow. (C) Hierarchical analysis of the proteome landscape in the 10 different samples (n=5/sample/TNM stage). (D) Graphs depicting the centroid of Cluster #1 and Cluster #15 (light green – Cancer samples, black – control samples) and the total number of proteins belonging to each cluster. (E) Scheme depicting the proteome analysis workflow. (F) Top 10 Canonical Pathways characterising the analysed proteome landscape, as determined by Ingenuity Pathway Analysis (IPA) software. (G) Top Predicted Upstream Regulators for the analysed proteome landscape, as inferred by IPA. (H) Top Canonical Pathways and Top Predicted Upstream Regulators characterising the group of proteins with low abundance in the tumour samples (Cluster #1), as determined by pathway analysis (IPA). (I) Top Canonical Pathways and Predicted Upstream Regulators characterising the group of proteins with high abundance in the tumour samples (Cluster #15), as determined by pathway analysis (IPA).
Our proteomics analysis allowed for confident (1% FDR) identification of 4,805 protein quantified across all ten samples with at least one unique peptide sequence. A further elimination of known contaminants left 4,713 proteins (Figure 1B).
Relapse samples were distinguishable by specific protein signatures indicating risk of recurrent colon cancer. To describe the proteome changes that influence the progression to relapse, we clustered the tumor tissue samples based on similarity. Hierarchical clustering across all proteins identified clearly discriminates between the tumor and control samples (Figure 1C), while TNM2 and TNM3 subgroups clustered separately from TNM1. Of interest, the recurring tumor groups (TNM2R+ and TNMR3+) clustered closest. These results suggest that tumor samples and healthy tissue are characterised by considerably different proteome fingerprints and that the tumors of patients with later relapse are defined by specific proteome signatures distinct from the primary tumor of patients without later relapse (Figure 1C).
Cluster analysis revealed proteome signatures associated with protein synthesis, cellular remodelling and stress. As an initial analysis we clustered all 4,713 quantified proteins based on their abundance profiles between patient groups to obtain an unbiased view of the proteome changes occurring during colon cancer progression. Interestingly, our results showed that 2919 proteins (61%) separated cancer and control samples collected from the same patients. Among these, two distinct clusters emerged (Cluster #1 and Cluster #15) separating TNM stage 1-3 as compared to control samples, regardless of the relapse status (Figure 1D).
Pathway analysis of the combined clusters (i.e., all the 2919 proteins separating cancer and control samples, Figure 1E) predict that the 10 most important canonical pathways involver are related to translation, antioxidant activity, endocytosis and cytoskeleton remodelling (Figure 1F). Pathways related with protein synthesis represented the dominant signature, suggesting that the relapsed tumours present an altered protein metabolism (Figure 1F). Importantly, the top 5 upstream regulators responsible for the observed proteome landscape included transcription regulators tightly linked with malignant transformation such as MYC, HNF4A, TP53 and MYCN (Figure 1G).
Subsequently, we addressed each cluster separately to distinguish which of the tumor signatures identified above are defined by increased or decreased abundance of proteins in tumor tissue relative to normal tissue. Pathway analysis revealed that cytoskeleton remodelling fingerprint was characterised by proteins with low abundance in the tumour (Figure 1H). Moreover, this proteome landscape (cluster 1) was compatible with the predicted action of upstream regulators with known cancer connection, such as HRAS (top 1), TGFB1 (top 2) and TP53 (top 3). Of importance, TGFB1I1 and its downstream effector SMAD4 were found in cluster 1, presenting a very low abundance in the cancer samples.
In contrast, proteins with high abundance in tumor (Cluster 15, Figure 1I) strongly defined the proteome fingerprint related to protein synthesis. Based on this subset, the analysis predicted the involvement of another set of key cancer regulators MYC (top 1), HNF4A (top 2) and CST5 (top 3). MYCBP belonged to cluster 15 and displayed a high abundance in the cancer samples.
These data indicate that cancer and control samples are mainly distinguished by differences in protein synthesis and remodelling regardless of the malignant relapse potential, and these processes are likely controlled by critical cell fate regulators, such as MYC, TGFB or TP53.
Of interest, a lower percentage of transcription regulators were present in the cluster encompassing proteins with lower abundance in the cancer samples (cluster #1, 4.26%) than in the cluster of the proteins with higher abundance (cluster 15, 9.48%), suggesting that, at least in this case, most processes driving tumour development are controlled via increases in protein abundance.
At individual level, epigenetic modifiers such as the deacetylase MIDEAS (mitotic deacetylase associated SANT domain protein, ELMSAN1), the methyltransferases METTL7A, METTL7B, and METTL14, and the DNA methylation reader MECP2 presented a lower abundance in the tumour tissues. Of interest, receptors with a key role in growth regulation such as EGFR and critical players of cancer development such as K-RAS, SMAD4, GSK3B, MAPK1, MAPK3, APPL1, and RAF1 were also characterized by low abundance in the cancer samples.
In contrast, critical transcription regulators such as RB1, STAT1, STAT6, MTA2 and YAP1 as well as the growth receptor like INSR displayed higher abundance in the tumour tissue. Moreover, kinases previously demonstrated to drive cancer development such as MTOR, SRC, ROCK2, RAC2, PIK3R4, PIK3CD, PIK3R1 exhibited the same regulation.
Proteins displaying TNM-stage dependent changes in abundance define immune system-related signatures. To map the signatures defining the progression of the disease, we identified two clusters (Cluster #5 and #11) where the abundance of the grouped proteins changes with increasing TNM stage (Figure 2A). As before, we used IPA and performed pathway analysis on the 444 proteins exhibiting the above-described regulatory pattern (Figure 2B). The analysis identified a strong immune system signature, with signaling involved in immune system regulation in the top canonical pathways (Figure 2C). Based on the observed proteome landscape, HNF4A, MAPT, PTP4A3, PSEN1, TGFB1 and NFE2L2 were identified as top predicted upstream regulators.
Pathway analysis of the proteins displaying a gradual variation in abundance according to TNM stage. (A) Graphs depicting the centroid of Cluster #5 and Cluster #11 (light green – Cancer samples, black – control samples, red rectangle highlights the increase (upper panel) of protein abundance according to TNM stage, blue rectangle highlights the decrease (lower panel) of protein abundance according to TNM stage) and the total number of proteins belonging to each cluster. (B) Scheme depicting the proteome analysis workflow. (C) Top Canonical Pathways and Top Predicted Upstream Regulators characterising the analysed proteome landscape, as determined by Ingenuity Pathway Analysis (IPA) software. (D) Top Canonical Pathways and Predicted Upstream Regulators characterising the group of proteins with gradually increased abundance towards TNM3 (Cluster #5), as determined by pathway analysis (IPA). (E) Top Canonical Pathways and Top Predicted Upstream Regulators characterising the group of proteins with gradually decreased abundance towards TNM3 (Cluster #11), as determined by pathway analysis (IPA).
Interestingly, the identified immune signature was defined exclusively by proteins showing increased abundance with TNM stages (cluster 5, Figure 2D, greens). At individual level, this group contained important epigenetic modifiers such as the lysine methyltransferase KMT2D and the histone deacetylases HDAC1 and HDAC2, that all displayed increasingly elevated levels with disease progression.
In contrast, the proteins with opposite regulation controlled small molecule metabolism (cluster 11, Figure 2E, yellow). Moreover, this group contained only two distinct transcription regulators, further confirming its low involvement in the regulatory process.
The analysis of the common relapse proteome landscape revealed a strong immune system signature. To map the molecular mechanisms behind disease relapse we compared tumour tissue from the patient groups that suffered from recurrent disease (TNM2R+ and TNM3R+) with those from the group of patients with the same initial diagnosis but recurrence-free (TNM2R- and TNM3R-). We found 252 differentially expressed proteins (FC>1.5) between R+ (TNM2 and TNM3) and R- (TNM2 and TNM3). These 252 proteins define a common relapse signature and were further analysed using IPA (Figure 3A).
Pathway analysis of the differentially expressed proteins (DEPs) between overall TNMR+ and TNMR-. (A) Scheme depicting the proteome analysis workflow. (B) Top Canonical Pathways with predicted regulation characterizing the TNMR+ vs. TNMR- DEPs landscape, as determined by Ingenuity Pathway Analysis (IPA) software (z-score >0.1). (C) Top 5 Predicted Activated Upstream Regulators and IFNG targeted proteins observed regulated in the DEP set analyzed. (D) Top 5 Predicted Inhibited Upstream Regulators and MAPK1 targeted proteins observed regulated in the DEP set analyzed. (E) Graph of relevant DEPs observed dysregulated in TNMR+.
The pathway analysis revealed signalling related to immune system and inflammation in the top canonical pathways (Figure 3B). Further confirming this result, based on the observed proteome landscape, interferon gamma (IFNG) as well as other critical immune system regulators (IFNA2, IFNL1 and IL27) were predicted as top activated upstream regulators based on the observed proteome landscape (Figure 3C). As top inhibited upstream regulators the program predicted the MAPK (Figure 3D). The inferred activity of both IFNG and MAPK was based on the observed up-regulation of proteins like the transcriptional effector of the interferon pathway STAT1, the major histocompatibility complexes HLA-A, HLA-B and HLA-C, and interferon-induced proteins IFI30, IFIT, amongst others (Figure 3E).
TNM2 relapse signature further corroborated the immune system engagement. To further investigate the proteome adaptations underlying the progression from non-relapsing to relapsing within the group of patients with TNM2 colon adenocarcinomas, the proteome of TNM2R+ versus TNM2R- were compared. We found 735 differentially expressed proteins (FC>1.5) between TNM2R+ and TNM2R- (Figure 4A). Pathway analysis reconfirmed the presence in the top canonical pathways of signalling related to immune system, inflammation and wound healing (Figure 4B). Moreover, in the top upstream regulators the program inferred amongst others the immunity regulator AHR and IFNG as well as the proto-oncogenes MYC and KRAS (Figure 4C). Activation of KRAS, known for its involvement in colorectal cancer, was predicted based on 56 distinct proteins observed dysregulated in our dataset, amongst which the DNA methylase DNMT1, the superoxide dismutase SOD3, the glucose transporter GLUT1 (SLC2A1) and the transcription factor STAT1.
Pathway analysis of the differentially expressed proteins (DEPs) between TNM2R+ and TNM2R-. (A) Scheme depicting the proteome analysis workflow. (B) Top Canonical Pathways with predicted regulation characterizing the TNM2R+ vs. TNM2R- DEPs landscape, as determined by Ingenuity Pathway Analysis (IPA) software (z-score >0.1). (C) Top 5 Predicted Activated Upstream Regulators and KRAS targeted proteins observed regulated in the DEP set analysed (coloured borders define DEPs discussed in text). (D) Top 5 Predicted Inhibited Upstream Regulators and AGT targeted proteins observed regulated in the DEP set analysed. (E) Graph depicting AGT observed down-regulated. (F) Graph depicting relevant DEPs observed down-regulated in TNM2R+. (G) Graph depicting relevant DEPs observed up-regulated in TNM2R+.
Interestingly, angiotensin (AGT) was predicted as the lead inhibited upstream regulator, while the inflammatory cytokine IL1A, the chemokine receptor CCR2 and the key regulator of plasma cell differentiation, B-cell development, and antibody production PRDM1 followed (Figure 4D). Of note, AGT was also observed down-regulated in the analysed DEP group (Figure 4E).
At individual levels, when comparing TNM2R+ versus TNM2R-, the chromatin remodelers ATRX and DNMT1 and the antioxidant enzyme SOD3 were found down-regulated (Figure 4F), while the epigenetic modifier KMT2D, the transcription regulators STAT1 and RB1 as well as the glucose transporter SLC2A1 were observed up-regulated (Figure 4G).
TNM3 relapse signature revealed a link between metabolism and the immune system. To describe the TNM3 relapse signature, we focused on the 348 differentially expressed proteins (FC >1.5) between TNM3R+ and TNM3R- (Figure 5A). Pathway analysis indicated mainly signalling involved in growth metabolism and immune system (Figure 5B). RICTOR, a nutrient and growth factor integrative hub, was the top predicted activated upstream regulator defining the observed proteome landscape, followed by two regulators involved in inflammation and immune system signalling (Figure 5C). Similarly, the top predicted inactivated upstream regulator was MLXIPL, a glucose-controlled regulator of lipid metabolism (Figure 5D). Moreover, the graphical pathway integration of the proteome landscape characterising the TNM3R+/TNM3R-comparison, displayed an intricate interconnection between growth metabolism and immune system (Figure 5E).
Pathway analysis of the differentially expressed proteins (DEPs) between TNM3R+ and TNM3R-. (A) Scheme depicting the proteome analysis workflow. (B) Top Canonical Pathways with predicted regulation characterizing the TNM3R+ vs. TNM3R- DEPs landscape, as determined by Ingenuity Pathway Analysis (IPA) software (z-score >0.1). (C) Top 5 Predicted Activated Upstream Regulators and IL1A targeted proteins observed regulated in the DEP set analyzed (colored borders define DEPs discussed in text). (D) Top 5 Predicted Inhibited Upstream Regulators and IL10RA targeted proteins observed regulated in the DEP set analysed. (E) IPA generated graphical summary of the analyzed proteome landscape. (F) Graph depicting relevant DEPs observed up-regulated in TNM3R+. (G) Graph MECP2 observed down-regulation in TNM3R+.
At individual level, we observed the up-regulation of STAT1 transcription factor, similar to the above analysis of the TNMR2+ samples (Figure 5F). Moreover, the glucose metabolism enzyme hexokinase 3 (HK3) and the interferon-induced proteins IFI30 and IFIT followed the same regulation, while the methylation maintaining factor MECP2 was found down-regulated.
Overall, our results suggest a clear involvement of inflammation and immune system regulation in relapse occurrence in colon cancer. Moreover, the pathway analysis suggests a link between the immune system and growth metabolism in this context.
Discussion
Proteomics profiling in colorectal cancer is a powerful tool for describing the molecular characteristics underlying TNM staging. It can be a useful tool in discovering protein signatures between relapsing and non-relapsing tumours. High-throughput proteomics have the ability to capture changes in thousands of proteins simultaneously and are therefore a useful platform for biomarker discoveries. In this study, we used TMT 10-plex quantitative proteomics approach for profiling the proteome of resected tumour tissue and autologous normal colon tissue. We created pools consisting of five patients in each group, selecting only for TNM-stage and recurrence status. There are several studies stating that there is a difference in the tumor biology dependent on the location of the primary tumor (20, 42, 43). The objective of the present study was to assess the differences in the proteome of colon cancer on basis of TNM stage and histology. The patients included in this study were classified by clinical pathological data. For a future study, we would advocate primary CMS-typing, as well as CIN/CIMP-classification to heighten the quality of the patient material as well as increase the subclassification of tumors included. This is not currently a part of ESMO clinical practice guidelines (44).
There is a need for good studies matching well selected clinical patient material with adequate methodology. A particular finding in this study was the strong association in/between MSS and immune activation, currently not part of standard treatment. This finding is well discussed previously by Mlecnic et al. (10). Our relapse signatures predicted genes such as JAK1, SERPINB3, RICTOR, ACKR2, STAT, TNF and KRAS to be involved, but call for validation in larger patient cohorts. Still, we demonstrated the methodological ability to identify changes that can discern the ability to metastasise at an early timepoint (operations specimen biopsy) in an unbiased fashion.
The finding of differential abundance in IFN between TNM2+ and TNM 3+ versus TNM2- and TNM3- groups correlates with the experimental model by Shankaran et al. (45), suggesting IFN activation in primary tumors play a role in predicting future recurrent disease.
The finding that immune activation promotes tumor activation seems rather contra intuitive. There is currently a lack of clinical models of how tumors adapt to the immune system, but it is well known that immune activation has a large anti-tumor activity in early stages. More advanced tumors seem to adapt to the immune system to promote growth and metastatic colonization. Gonzales et al. (46) presents a thorough review on the roles of the immune system in cancer. In CRC, there are several studies on immune evasion, also in presumed “cold” MSS CRC, in particular those by Tauriello et al. (47), Wang et al. (48), Xiao et al. (49) and Klement et al. (50). This feasibility study lacks the power to elucidate the mechanisms involved concerning immune alteration in tumor microenvironment, but the findings might represent detection of tumor microenvironment alterations in recurrent CRC.
We interpret our results on the basis that the differences in proteome are rather large than small, causing 61% of proteins to separate cancer and control in cluster analyses with FC<1.5. This change in proteome illustrated in Figure 1 represents a phenotypic change in cancer tissue, primarily on cancer-related and an associated alteration in the household proteins. Reducing complexity in a dataset to increase the signal/noise ratio is a challenge, our cluster strategy is an unsupervised way to filter the proteins where statistical significance represents biological significance. This study is underpowered for a full characterization of proteome change in recurrent versus non-recurrent colon cancer but presents insight in the feasibility of such a study. There are few large studies with data on clinical follow-up. In the transition to clinical application, there is a current unmet need for studies of recurrent disease.
Conclusion
In this feasibility study that included MSS colon cancer only, signalling related to immune activation and inflammation were the top canonical pathways associated with relapse. Immunotherapy is known to induce durable responses in both non-metastatic and metastatic MSI-H CRC, but only a minority of MSS patients respond to the current available treatment. Still, as most CRC patients have MSS tumours, identifying subgroups of MSS patients that may respond to immunotherapy will generate new treatment options for a significant number of patients. Our findings in this feasibility study show the ability of proteomics to detect changes in immune activation. If these results are validated in larger experiments, this implies that immune therapy may become an important avenue for selected MSS colon cancer patients with risk of relapse.
Acknowledgements
This research was funded by Helse Vest and Trond Mohn Stiftelse. S.C. was supported by funds from the Research Council of Norway (NFR 251041) and Novo Nordic Foundation (NNF15OC0015054 and NNF21OC0067325).
Footnotes
Authors’ Contributions
Conceptualization: M.B., K.H., H.V., O.D. and M.M.; investigation: M.B., K.H., H.V., S.C., J.P. and M.M.; data curation: J.P.; writing— original draft preparation: M.B. and O.D.; writing—review and editing: M.B., K.H., H.V., S.C., O.D., and M.M..; visualization: H.V. and S.C..; project administration: M.B. and M.M.; funding acquisition: O.D. All Authors have read and agreed to the published version of the manuscript.
Conflicts of Interest
K.H has been on an advisory board for Daiichi Sankyo. The other Authors declare no conflict of interest in relation to this study. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.
- Received December 21, 2021.
- Revision received January 10, 2022.
- Accepted January 12, 2022.
- Copyright © 2022, International Institute of Anticancer Research (Dr. George J. Delinasios), All rights reserved