Abstract
Background/Aim: Circulating mRNA can be a useful source of cancer biomarkers. We took advantage of direct transcriptomic analysis of plasma RNA to identify novel mRNA markers for non-small cell lung cancer (NSCLC). Patients and Methods: Plasma RNA from NSCLC patients and healthy individuals was profiled with cDNA-mediated annealing, selection, extension and ligation (DASL) microarrays. The microarray results were further validated in plasma RNA. Results: Through RNA profiling and online database mining, four gene transcripts were selected as candidate markers of NSCLC. After validation, the PCTAIRE-1 (PCTK1) transcript was identified as a circulating mRNA marker. The diagnostic potential of PCTK1 was evaluated by receiver operating characteristic curve analysis, which gave a sensitivity and specificity of 60% and 85%, respectively. In addition, high plasma PCTK1 levels were correlated with poor progression-free survival (p=0.008). Conclusion: Circulating mRNA can be profiled with the DASL assay. From this profile, we identified plasma PCTAIRE-1 RNA as a novel diagnostic/prognostic biomarker and an indicator of poor survival in NSCLC patients.
The detection of biomarkers in liquid biopsy (i.e., in body fluids) is valuable for the diagnosis and prognosis of malignant diseases, as its sampling is less invasive and can be carried out repeatedly. Some well-known circulating cancer markers, including CEA, PSA, CA125, and CA153, are proteins. These markers do not have sufficient sensitivity and specificity for detecting certain cancers [e.g., non-small cell lung cancer (NSCLC)]. The identification of novel biomarkers in these cancers is thus in great demand. In addition to protein markers, circulating nucleic acids are promising sources for new biomarkers, as circulating nucleic acids provide information regarding the genome or gene expression. In addition, some well-developed technologies, such as polymerase chain reaction (PCR) and high-throughput sequencing, can detect traces of nucleic acids in the circulation, allowing the usage of nucleic acid markers.
Different species of circulating nucleic acid markers, including cell-free DNA (cfDNA), microRNA (miRNA) (1-4), long noncoding RNA (lncRNA) (4-8), and mRNA (9), have been linked with malignant diseases. Among them, mRNA is one of the least investigated species (9). To date, only a few circulating mRNA markers have been reported. For example, tyrosinase mRNA can be detected in the blood of some melanoma patients (10); telomerase mRNAs can be detected in breast cancer patients (11); CK19, mammaglobin, 5T4, and circulating thyroid-stimulating hormone receptor (TSHR) mRNAs can be detected in thyroid cancer patients (12); 5T4 mRNA can be detected in lung cancer patients (13); and CEA and CK19 mRNAs can be detected in colorectal cancer patients (14). These studies have revealed that circulating mRNAs can be potential biomarkers.
A difficulty in identifying circulating mRNA markers is that the RNA profile in plasma is different from that in tumor tissues. As blood is rich in background RNA from the whole body, most RNA markers found in tumor tissues may be masked by the background and cannot be used. High-throughput analysis of blood samples can instead be used directly to discover mRNA markers that are not masked. Another difficulty is that mRNA in plasma is usually fragmented and present at low abundance. A special amplification technology is required to enrich it before subsequent analyses. In our previous study, we showed that circulating mRNA could be amplified and profiled using the cDNA-mediated annealing, selection, extension, and ligation (DASL) assay (15). In the current study, we applied the DASL assay to identify novel circulating mRNA biomarkers in NSCLC patients.
PCTAIRE-1 (PCTK1), also known as cyclin-dependent kinase 16 (CDK16), belongs to the PCTAIRE family, which comprises a conserved central kinase domain with an amino acid sequence highly similar to that of CDK family members (16). PCTK1 is highly expressed in the brain and testis but not in other tissues (17-20). PCTK1 is involved in a wide range of biological functions, including neurite outgrowth (21), spermatogenesis (22), secretory transport (23, 24) and insulin secretion (25). Recently, a few studies have revealed the involvement of PCTK1 in cell proliferation or anti-apoptosis in several types of malignancies (26, 27).
In this study, we found that the levels of PCTK1 RNA are high in the plasma of NSCLC patients. We also demonstrated that high levels of PCTK1 RNA are correlated with poor survival. NSCLC is the most common type of lung cancer and an important cause of death worldwide (28-31). The development of diagnostic and prognostic biomarkers will be helpful in cancer management. The identification of circulating PCTK1 as a novel mRNA biomarker for NSCLC is valuable for the future development of diagnostic or monitoring tools.
Patients and Methods
Patient samples and plasma collection. Forty-three NSCLC patients were recruited from the Chang Gung Memorial Hospital. Another forty-three healthy controls were recruited from the Health Check Center of the same hospital. Written informed consent from each participant was obtained before sample collection. This study was approved by the Chang Gung Memorial Hospital Institutional Review Board (with approval numbers 100-2065A3, 102-1749C, 102-1584C1, 103-7926C1, and 104-7490D). The clinical stage of each patient was classified according to the AJCC staging system. All participants were divided into two groups. Five patients (1 male and 4 females, ranging from 49 to 90 years of age, mean age: 69.2±14.9) with primary stage IV NSCLC and six healthy individuals (2 males and 4 females, ranging from 56 to 61 years of age, mean age: 58.1±1.7) were enlisted in the discovery phase. The NSCLC patients provided blood samples before (Nontreatment group) and two weeks after the first treatment with chemotherapy or targeted therapy (Treatment group). The control individuals provided blood samples once (Healthy group). Thirty-eight treatment-naive patients with non-small cell lung cancer regardless of stage (21 males and 17 females, ranging from 38 to 81 years of age, mean age: 61.7±12.2 years) and thirty-seven healthy individuals with a similar distribution of age (28 males and 5 females, ranging from 53 to 79 years of age, mean age: 62.9±7.7 years) were enrolled in the validation phase. Peripheral blood from the participants was collected into an EDTA tube and centrifuged at 1600 × g for 10 min at room temperature. The supernatants (plasma) were separated and stored at −80°C until further processing.
Isolation of circulating mRNAs. Total RNA was extracted from plasma using TRIzol LS reagent (Thermo Fisher Scientific, Waltham, MA, USA) according to the manufacturer's instructions. In the discovery phase, RNA from each sample was extracted from 5 ml plasma. In the validation phase, RNA was extracted from 250 μl plasma. Briefly, a 250-μl plasma aliquot was mixed with one milliliter TRIzol LS reagent. After 5 min of incubation, 266 μl of chloroform was added to the mixture. Then, the mixture was centrifuged at 14000 × g for 15 min at 4°C. The aqueous layer was transferred to a new tube. An equal volume of isopropanol and 1 μl GlycoBlue (Thermo Fisher Scientific) was added to the solution to precipitate RNA. The solution was further centrifuged at 14000 × g for 15 min at 4°C. The RNA pellet was washed with cold 70% ethanol, air dried for 10 min, and resuspended in 30 μl DEPC-treated water at 56°C for 10 min. RNA concentration was estimated with a Nanodrop ND-1000 (Thermo Fisher Scientific) and the RiboGreen RNA assay on a Qubit fluorescence reader (Thermo Fisher Scientific).
Microarray analysis for circulating mRNA profiling. The DASL assay (Illumina, San Diego, CA, USA) was used to preamplify and label the mRNA transcripts from plasma samples. The preamplified cDNA products were then hybridized to a Human HT-12 v4 Expression BeadChip (Illumina), which contains probes for 23,811 coding transcripts. The DASL assay and expression profiling experiments were performed at Genetech Biotech (Taipei, Taiwan). Raw data were processed using Partek Genomics Suite (PGS) software (St. Louis, MO, USA) for bioinformatics analysis (32). The similarities and differences in circulating mRNA profiles between NSCLC patients and healthy controls were assessed by using principal component analysis (PCA) and hierarchical clustering, which group samples according to their expression features. Hierarchical clustering was performed with the Euclidean and Pearson's dissimilarity algorithms. The differentially expressed genes were selected for further analysis if they met the following criteria: genes that were not low in abundance and did not show an abnormal pattern in all samples; genes whose average expression levels differed significantly between patients and controls (p<0.05) in one-way ANOVA; and genes whose mean expression level in one group was at least two-fold greater than that in the other group.
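The last two selection criteria can be illustrated with a minimal Python sketch. It assumes the ANOVA p-values have already been computed elsewhere (for example by the microarray software) and that intensities are on a linear scale. The function name, the data layout, and the gene labels are hypothetical. The abundance and pattern filter is omitted because the text gives no numeric threshold for it.

```python
from statistics import mean

def select_candidates(expr, pvalues, fold=2.0, alpha=0.05):
    """Return genes passing the p-value and fold-change criteria.

    expr:    gene -> (patient_intensities, control_intensities), linear scale
    pvalues: gene -> precomputed one-way ANOVA p-value (assumed input)
    """
    selected = []
    for gene, (patients, controls) in expr.items():
        mean_p, mean_c = mean(patients), mean(controls)
        low, high = min(mean_p, mean_c), max(mean_p, mean_c)
        if low <= 0:
            continue  # a ratio is undefined for non-positive means
        if pvalues[gene] < alpha and high / low >= fold:
            selected.append(gene)
    return selected

# Hypothetical intensities, for illustration only
expr = {
    "GENE_A": ([400, 420, 380], [150, 160, 170]),  # 2.5-fold, significant
    "GENE_B": ([200, 210, 190], [180, 170, 190]),  # below two-fold
    "GENE_C": ([900, 100, 500], [200, 210, 190]),  # 2.5-fold, not significant
}
pvalues = {"GENE_A": 0.001, "GENE_B": 0.03, "GENE_C": 0.40}
candidates = select_candidates(expr, pvalues)
```

Here only GENE_A passes both filters.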
Online database mining. To confirm the differentially expressed genes that had also been identified in lung tissue mRNA profiles, the selected genes were then filtered in the Oncomine database (http://www.oncomine.org) (33, 34). In the Oncomine search for gene expression (cancer vs. normal), a fold change above 1.5 and a p-value below 0.05 were set as the thresholds.
Reverse transcription quantitative PCR (RT-qPCR). To quantify the target mRNA level, the extracted RNA was reverse transcribed into cDNA with MMLV reverse transcriptase (Promega, Madison, WI, USA). Then, 2 μl of the cDNA was used as a template for qPCR, which included Platinum Taq polymerase and its reaction buffer (Thermo Fisher Scientific), 4 dNTPs, primers, and SYBR Green I. qPCR was performed in triplicate with the following thermal cycling conditions: incubation at 95°C for 3 min, followed by 45 thermal cycles (95°C for 15 sec, 60°C for 30 sec and 72°C for 30 sec), and a final incubation at 72°C for 3 min. The fluorescent signals were detected at each cycle at 60°C, which generated an amplification curve of each reaction. The relative level of a target gene to a housekeeping gene, β2-microglobulin (B2M), was calculated from their threshold cycle (Ct) values with the equation 2^(Ct_B2M − Ct_gene). The primer sequences for PCTK1 were 5'-GTCAGCCTATCTGAGATTGG-3' (forward) and 5'-TTCATATTCCAGTCTGATCTC-3' (reverse). Those for B2M were 5'-CATTCAGACTTGTCTTTCAG-3' (forward) and 5'-TTCAAACCTCCATGATGC-3' (reverse).
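As a worked illustration of the relative quantification equation, the following Python sketch computes the level of a target gene relative to B2M. Averaging the triplicate Ct values before taking the difference is an assumption, since the text does not state how replicates were combined. The Ct values shown are hypothetical.

```python
from statistics import mean

def relative_level(ct_target, ct_b2m):
    """Relative level of the target to B2M: 2 ** (Ct_B2M - Ct_target).

    Both arguments are lists of replicate Ct values; replicates are
    averaged first (an assumption, not stated in the Methods).
    """
    return 2 ** (mean(ct_b2m) - mean(ct_target))

# Hypothetical triplicates: the target crosses the threshold 5 cycles after B2M
level = relative_level([30.0, 30.0, 30.0], [25.0, 25.0, 25.0])
```

A target that reaches threshold five cycles later than B2M therefore has a relative level of 2^(−5), or about 3% of the B2M level.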
Immunohistochemical staining. The formalin-fixed, paraffin-embedded (FFPE) tissue sections from NSCLC patients were obtained from the Tissue Bank in Chang Gung Memorial Hospital. One of the serial sections was stained with hematoxylin and eosin to identify the most representative regions of the tumor. Immunohistochemistry was performed according to a protocol from Abcam (https://www.abcam.com/ps/pdf/protocols/ihc_p.pdf), with slight modifications. Briefly, the tissue section was first treated with heated 0.01 M citrate buffer (pH 6.0) to retrieve antigens, then soaked in 3% H2O2 to inactivate endogenous peroxidase, and in Antibody Diluent solution (Dako, Glostrup, Denmark) to block nonspecific signals. Then, samples were incubated with antibodies against the PCTK1 protein (1:200 dilution; HPA 001366, Sigma-Aldrich, St. Louis, MO, USA) overnight at 4°C and then with anti-rabbit antibodies conjugated with horseradish peroxidase (1:250 dilution; Calbiochem, Darmstadt, Germany) for 60 min at room temperature. Color was revealed by treatment with DAB chromogen (DAKO) and hematoxylin. The slides were scored for PCTK1 staining under the microscope by a senior pathologist without knowledge of the patients' histories. The PCTK1 score was calculated according to a simplified H score system (35) based on the percentage of stained cells (3, >90%; 2, 50-89%; 1, 10-49%; and 0, 0-9%) and the intensity of staining (3, strong; 2, moderate; 1, weak; or 0, none). The two scores were multiplied by each other and divided by 3 to obtain the final score. Positive staining was defined as a final score ≥1.
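The scoring rule can be written out as a short Python sketch. The function names are hypothetical. Because the text lists ">90%" for a score of 3 and "50-89%" for a score of 2, the handling of exactly 90% is not specified; this sketch assigns it a score of 3 as an assumption.

```python
def percent_score(pct_stained):
    """Map the percentage of stained cells to a 0-3 score."""
    if pct_stained >= 90:   # assumption: exactly 90% is scored as 3
        return 3
    if pct_stained >= 50:
        return 2
    if pct_stained >= 10:
        return 1
    return 0

def simplified_h_score(pct_stained, intensity):
    """Final score = percentage score x intensity score (0-3) / 3."""
    if intensity not in (0, 1, 2, 3):
        raise ValueError("intensity must be 0, 1, 2 or 3")
    return percent_score(pct_stained) * intensity / 3

def is_positive(pct_stained, intensity):
    """Positive staining is defined as a final score >= 1."""
    return simplified_h_score(pct_stained, intensity) >= 1

strong_diffuse = simplified_h_score(95, 3)  # 3 * 3 / 3
weak_focal = simplified_h_score(20, 1)      # 1 * 1 / 3
```

With this rule the final score ranges from 0 to 3, and a section needs a product of at least 3 (for example, moderate staining in 50% or more of the cells scores 4/3) to count as positive.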
Statistical analysis. Statistical analysis was conducted with SPSS software for Windows version 17.0 (SPSS, Inc., Chicago, IL, USA). A nonparametric Mann-Whitney test was used to compare gene expression levels or IHC scores between different groups. Receiver operating characteristic (ROC) curve and area under curve analyses were used to estimate the discriminative power and determine the cut-off value of the PCTK1 RNA level in specimens. A chi-square test was used to determine the association between circulating PCTK1 levels and clinical-pathological characteristics. The survival curves were plotted with the Kaplan–Meier method and analyzed using the log-rank test. A p-value≤0.05 was regarded as statistically significant.
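To illustrate how an ROC analysis yields both an area under the curve and a cut-off, the following Python sketch computes the AUC as the fraction of patient-control pairs ranked correctly (ties counted as one half) and picks the threshold that maximizes Youden's J (sensitivity + specificity − 1). The use of Youden's J is an assumption, as the criterion for the optimal threshold is not stated here, and the input values are hypothetical rather than study data.

```python
def auc(cases, controls):
    """AUC as the probability that a case value exceeds a control value."""
    wins = sum((c > h) + 0.5 * (c == h) for c in cases for h in controls)
    return wins / (len(cases) * len(controls))

def best_cutoff(cases, controls):
    """Return (threshold, sensitivity, specificity) maximizing Youden's J.

    A sample is called positive when its value is >= threshold.
    """
    best = None
    for t in sorted(set(cases) | set(controls)):
        sens = sum(c >= t for c in cases) / len(cases)
        spec = sum(h < t for h in controls) / len(controls)
        j = sens + spec - 1
        if best is None or j > best[0]:
            best = (j, t, sens, spec)
    return best[1], best[2], best[3]

# Hypothetical relative RNA levels, for illustration only
cases = [5, 6, 7, 8, 2]
controls = [1, 2, 3, 4, 6]
area = auc(cases, controls)
cutoff, sensitivity, specificity = best_cutoff(cases, controls)
```

Note that the AUC summarizes the whole curve and does not depend on the chosen cut-off, whereas sensitivity and specificity are properties of one particular threshold.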
Results
Experimental design. The aim of this study was to identify circulating mRNA markers of NSCLC. The study design included a discovery phase and a validation phase. The whole experimental strategy is simplified in a flow chart in Figure 1.
Circulating mRNA profiling in the discovery phase. Five NSCLC patients and six healthy controls were included in the discovery phase. The NSCLC patients provided blood samples before (Nontreatment group) and two weeks after the first treatment with chemotherapy or targeted therapy (Treatment group). The control individuals provided blood samples once (Healthy group). RNA was extracted from the samples and profiled with the DASL assay and an HT-12 bead-based microarray, which were designed to amplify fragmented RNA. The microarray detected approximately 15,000 genes in the plasma RNA of each sample. There was no significant difference in the number of detected genes or the average intensity between samples or between groups. PCA plots demonstrated that the NSCLC patients were grouped together and distinguished from healthy controls (Figure 2A). Hierarchical clustering also confirmed that the NSCLC patients were clustered apart from the healthy controls (Figure 2B). However, in the same patient, the plasma RNA profiles before and after treatment were not significantly different.
Selection of candidate genes. We then selected genes whose levels were elevated in the NSCLC patients by comparing the profiles of the Nontreatment group and the Healthy group (the criteria are detailed in the Patients and Methods section). Ninety-nine genes were selected at this stage (not shown). We then determined the levels of these genes in lung cancer tissues in the Oncomine database. Only four genes revealed pronounced elevation in most analyses in the database: MAP1B, MMP1, PCTK1 and PTPRF. We then determined the circulating mRNA levels of the four genes in the validation cohort, including 75 plasma samples (38 from NSCLC patients and 37 from healthy controls). Among the four candidate genes, only PCTK1 demonstrated significant elevation in patients' plasma (Figure 3A).
Diagnostic and prognostic value of circulating PCTK1 RNA. ROC curve analysis was applied to test the ability of PCTK1 RNA to diagnose NSCLC. The area under the ROC curve was 0.7745, and the optimal threshold gave a sensitivity and specificity of 60% and 85%, respectively (Figure 3B). Based on this threshold, the patients were divided into two groups: PCTK1-high and PCTK1-low. In the survival analysis, the PCTK1-high group had significantly (p=0.008) shorter progression-free survival (PFS) (mean=5.5 months) than the PCTK1-low group (mean=14.3 months) in the log-rank test (Figure 4A). The PCTK1-high group also had shorter overall survival (OS) (mean=22.7 months) than the PCTK1-low group (mean=55.5 months), although the difference was not statistically significant (p=0.195) (Figure 4B). In contrast, the PCTK1 levels were not associated with other clinical-pathological factors, such as age, sex, cancer stage, tumor pathology, smoking history, TKI treatment, or EGFR status (Table I).
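The survival curves compared above are Kaplan–Meier estimates. As a minimal sketch of how such a curve is built for one group, the Python function below multiplies, at each time with an observed event, the running survival probability by (1 − events / number at risk). The function name and the follow-up data are hypothetical, and the log-rank comparison between groups is not shown.

```python
def kaplan_meier(times, events):
    """Kaplan-Meier estimate for one group.

    times:  follow-up time per patient (e.g. months)
    events: 1 if progression or death was observed, 0 if censored
    Returns a list of (time, survival probability) at each event time.
    """
    at_risk = len(times)
    survival = 1.0
    curve = []
    for t in sorted(set(times)):
        n_events = sum(1 for tt, e in zip(times, events) if tt == t and e)
        n_leaving = sum(1 for tt in times if tt == t)
        if n_events:
            survival *= 1 - n_events / at_risk
            curve.append((t, survival))
        at_risk -= n_leaving  # events and censored cases both leave the risk set
    return curve

# Hypothetical follow-up (months); 0 marks a censored observation
curve = kaplan_meier([2, 4, 4, 6, 8], [1, 1, 0, 1, 0])
```

In this toy example the estimated survival falls to 0.8 at 2 months, 0.6 at 4 months and 0.3 at 6 months; censored patients reduce the number at risk without lowering the curve.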
Confirmation of PCTK1 overexpression in NSCLC tissues. Then, the PCTK1 protein levels were determined in 29 FFPE tissues from lung cancer patients. Among them, 18 were from needle biopsies, and 11 were from surgically removed tissues. Only the surgically removed tissues had an obvious normal component in the tissue sections. Immunohistochemical staining showed that cancer tissues had strong positive staining for the PCTK1 protein that was mainly cytoplasmic (Figure 5A). The H score also showed that the tumor tissues had significantly higher staining than the normal tissues (Figure 5B). An H score ≥1 was defined as positive; 93% (27/29) of the tumor tissues were positive, but only 18% (2/11) of the normal tissues were positive.
Discussion
In this study, we successfully isolated and profiled circulating mRNAs from NSCLC patients by using the DASL assay. The DASL assay is specialized for the amplification of poor-quality and fragmented RNA extracted from FFPE tissues (15, 25, 36, 37). It can be adapted for the amplification of circulating mRNA in plasma samples. Through the DASL assay and online database mining, we selected certain transcripts as candidate mRNA markers. In the subsequent validation experiments, we confirmed the elevation of PCTK1 in NSCLC patients, both in peripheral blood and in tumor tissues, and that the elevation of circulating PCTK1 RNA is associated with a poor prognosis.
RNA overexpressed in cancer tissues is not necessarily expressed at a high level in the peripheral blood. This is because the tumor mass accounts only for a small portion of the body, and both the tumor tissue and other normal tissues contribute RNAs to the circulation. Most RNAs elevated in tumor tissues are thus masked by background RNAs from normal tissues. Therefore, the RNA candidates overexpressed in tumor tissues are not necessarily good markers in the circulation. Our results confirmed this argument, as only four gene transcripts selected in the plasma matched the overexpressed genes in NSCLC tissues in the Oncomine database. Our study demonstrated that the direct profiling of circulating RNA is a feasible strategy and can be applied to biomarker selection in other diseases.
Recent studies have indicated that cells can release different particles, including exosomes (38, 39), microparticles (40) and apoptotic bodies (41). These particles are enclosed by membranes and are considered the major sources of circulating RNA. This notion is supported by the observation that naked RNA is very unstable in the blood, which is rich in RNase activity (42). Information about which particle PCTK1 mRNA is derived from will be helpful in the development of better diagnostic methods. For example, different methods have been developed to capture extracellular vesicles (43, 44). If PCTK1 mRNA exists in the captured vesicles, these technologies can be applied to enrich circulating PCTK1 RNA in the detection procedure.
In current clinical practice, CEA and CYFRA21-1 are frequently used as biomarkers for the diagnosis or follow-up of NSCLC (45), but their specificity and sensitivity are not satisfactory. Over the past few years, several mRNA biomarkers for NSCLC have been reported in peripheral blood mononuclear cells (PBMCs). Some of these PBMC RNAs show high specificity and sensitivity in the detection of lung cancer, including angiopoietin-2 (Ang-2) (46), A-kinase anchor protein 4 (AKAP4) (47), cytokeratin 7 (CK7) (48), E74-like factor 3 (ELF3) (48), epidermal growth factor receptor (EGFR) (48), and erythropoietin-producing hepatocellular carcinoma receptor B4 (EphB4) (48). Our current study used plasma RNA instead of PBMC RNA and identified PCTK1 as a new plasma biomarker with 85% specificity and 60% sensitivity. In addition, a high level of plasma PCTK1 RNA is an indicator of poor survival. The combination of these markers and other circulating RNA species, such as miRNA or lncRNA, may have the potential to form a marker panel for the noninvasive diagnosis or follow-up of NSCLC.
PCTK1 is a cyclin-dependent kinase-related kinase that is involved in cell cycle regulation. PCTK1 was initially reported to participate in neurite outgrowth (21) and spermatogenesis (22). Other studies have discovered its modulatory role in protein exocytosis (24) and the export of secretory proteins from the endoplasmic reticulum to the Golgi apparatus (23). In recent years, several lines of evidence have shown that PCTK1 is involved in various human cancers, such as liver (49), colon, breast and prostate cancers and malignant melanoma (27, 50, 51). Specifically, Wang et al. found that PCTK1 is overexpressed in NSCLC and plays a role in cancer cell growth and anti-apoptosis (26). Our finding that PCTK1 is upregulated in lung cancer tissues confirmed their observation. Together, these findings highlight the role of PCTK1 in promoting tumorigenesis and make it a potential target for cancer treatment.
Conclusion
Our study demonstrated a strategy for discovering circulating RNA markers and provided a novel biomarker: plasma PCTK1 mRNA. This biomarker can be valuable for the diagnosis/prognosis of NSCLC, and the PCTK1 gene may also be a potential therapeutic target.
Acknowledgements
This research was supported by grants from Chang Gung Memorial Hospital (CMRPD1C0321-3, CMRPD1B0091-3, and CMRPG3A0661). The Authors thank Miss Zi-Ming Huang for assistance in preparing the manuscript.
Footnotes
# These Authors contributed equally to this study.
Authors' Contributions
Conceived and designed the experiments: CC Chiou and J Chang. Conducted the experiments: CL Shih, CL Wang, JD Luo, and JJ Hsieh. Analyzed the data: CC Chiou, J Chang, CW Wang, and CJ Yu. Wrote the paper: CC Chiou.
This article is freely accessible online.
Conflicts of Interest
The Authors declare no competing financial interests regarding this study.
- Received September 4, 2019.
- Revision received September 26, 2019.
- Accepted September 27, 2019.
- Copyright© 2020, International Institute of Anticancer Research (Dr. George J. Delinasios), All rights reserved