Exploring functions of long noncoding RNAs across multiple cancers through co-expression network

Li, Suqing; Li, Bin; Zheng, Yuanting; Li, Menglong; Shi, Leming; Pu, Xuemei

doi:10.1038/s41598-017-00856-8

Download PDF

Article
Open access
Published: 07 April 2017

Exploring functions of long noncoding RNAs across multiple cancers through co-expression network

Suqing Li¹,
Bin Li²,
Yuanting Zheng^2,3,
Menglong Li¹,
Leming Shi^2,3 &
…
Xuemei Pu¹

Scientific Reports volume 7, Article number: 754 (2017) Cite this article

6589 Accesses
36 Citations
11 Altmetric
Metrics details

Subjects

Abstract

In contrast to protein-coding genes, long-noncoding RNAs (lncRNAs) are much less well understood, despite increasing evidence indicating a wide range of their biological functions, and possible roles in various cancers. Based on public RNA-seq datasets of four solid cancer types, we here utilize Weighted Correlation Network Analysis (WGCNA) to propose a strategy for exploring the functions of lncRNAs altered in more than two cancer types, which we call onco-lncRNAs. Results indicate that cancer-expressed lncRNAs show high tissue specificity and are weakly expressed, more so than protein-coding genes. Most of the 236 onco-lncRNAs we identified have not been reported to have associations with cancers before. Our analysis exploits co-expression network to reveal that onco-lncRNAs likely play key roles in the multistep development of human cancers, covering a wide range of functions in genome stability maintenance, signaling, cell adhesion and motility, morphogenesis, cell cycle, immune and inflammatory response. These observations contribute to a more comprehensive understanding of cancer-associated lncRNAs, while demonstrating a novel and efficient strategy for subsequent functional studies of lncRNAs.

Integrative competing endogenous RNA network analyses identify novel lncRNA and genes implicated in metastatic breast cancer

Article Open access 10 February 2023

Dulari K. Jayarathna, Miguel E. Rentería, … Neha S. Gandhi

Network of clinically-relevant lncRNAs-mRNAs associated with prognosis of hepatocellular carcinoma patients

Article Open access 07 July 2020

Lee Jin Lim, Yu Jin, … Caroline G. Lee

Long noncoding RNAs: fine-tuners hidden in the cancer signaling network

Article Open access 11 October 2021

Shanshan Zhao, Xue Zhang, … Song Zhang

Introduction

Long noncoding RNA (lncRNA) belongs to a class of noncoding RNAs longer than 200 nucleotides^{1, 2}. With the development of RNA sequencing, epigenomic technologies and computational techniques, an increasing number of lncRNAs have been discovered³. Although lncRNAs were previously regarded as “noise” in the genome owing to lack of protein-encoding capacity, more and more emerging evidences have indicated that the lncRNAs play a wide range of roles, covering biological functions like cell proliferation, survival, differentiation, and chromatin remodeling^4,5,6,7. Consequently, it is not surprising that the dysregulation of lncRNA genes was implicated in tumor biology⁸.

However, compared to well-studied protein-coding genes, the functions of most lncRNAs have not been elucidated despite of their large proportions in genomes. In the lncRNAdb v2.0⁹, less than 1% of lncRNAs have been individually characterized among nearly 16,000 annotated lncRNA genes in GENCODE. Thus, it remains a great challenge in understanding the functional characteristics of lncRNAs. In general, loss- and gain-of function biological experiments through gene knockdown, overexpression or editing are considered to be golden standards to define the functions of lncRNAs¹⁰. However, the characterization through the experimental approaches is still limited due to their low throughput and demand for prior knowledge about potential mechanisms of the candidates¹¹.

Alternatively, computational analysis provides another way to explore the functions of the lncRNAs. Some computational work predicted lncRNA structures based on their sequences^{12, 13}. However, the structures predicted by the computational methods still remain a high false-positive rate, and the distinct structure–function relationships for many lncRNAs are still unknown¹⁴. In addition, some computational studies explored the potential functions of lncRNAs through identifying molecules interacting with them^{15, 16}. But, the lack of molecular interaction data for many lncRNAs also hampers their functional annotation.

It is well accepted that co-expressed genes are more likely to be co-regulated and functionally related¹⁷. Therefore, identifying co-expressed protein-coding genes can help assign the functions of lncRNAs¹¹. Weighted correlation network analysis (WGCNA), a powerful guilt-by-association (GBA) method for constructing co-expression network based on expression data, can reconstruct gene co-expression modules and summarize such modules using module eigengenes and intramodular hub genes¹⁸. It has been successfully applied to study protein-coding genes, like distinguishing dysfunctional regulatory subnetworks and finding candidate biomarkers¹⁹. However, few studies used it to investigate cancer-associated lncRNAs²⁰. Cogill et al.²¹ used known cancer-associated coding genes from COSMIC to find co-expressed lncRNAs only from microarray expression data of normal tissues rather than cancer tissues, and constructed a co-expression network using WGCNA to explore their potential functions.

In recent years, the Cancer Genome Atlas (TCGA, https://cancergenome.nih.gov/) project has generated comprehensive, multi-dimensional maps of the key genomic changes in 33 types of cancers, which help us understanding how such changes interact to drive diseases. From this project, researchers found that cancers from different tissues could share some common features like mutations, methylation, and transcriptomic changes, and the cross-cancer aberrations are more likely to act as oncogenic contributors and can provide an opportunity to find new therapeutic biomarkers in clinics^{22, 23}. In fact, some previous work showed that a few lncRNAs are altered in multiple cancers. For example, MALAT1 was first identified as a prognostic biomaker for lung cancer survival²⁴. Later, its expression dysregulation was also observed in other types of tumors, including malignancy in liver²⁵, breast²⁶ and colon²⁷. In addition, other lncRNAs like HOTAIR, PTENP1, MEG3 and CONCR were reported to be dysregulated in several cancer types^{28, 29}. Yan et al.³⁰ also observed that some lncRNAs are abnormally expressed in several cancers. However, most works focused on cancer-associated lncRNAs only in independent cancer type³¹ while studies on the lncRNAs across multiple cancers have been absent. In fact, this kind of lncRNAs may be proved as potential oncogenes or tumor suppressors across multiple cancers and extend our understanding of the common events across tumor types. Thus, it is highly desired to study these poorly understood but crucial regulators across multiple cancers.

In this work, we utilized a computational strategy to perform a systematic study on lncRNAs significantly altered in more than 2 cancer types, based on public RNA-seq datasets of four common solid cancer types (prostate cancer, bladder cancer, lung adenocarcinoma and breast cancer). RNA-seq is a revolutionary technology based on next-generation sequencing, and is considered as the most comprehensive way for studying complete transcriptome in more details and with more accurate measurements than other techniques of lncRNAs expression profiling like microarray and serial analysis of gene expression (SAGE)⁴. Finally, 236 onco-lncRNAs were identified in our work, and most of them have not been reported to be related with cancers. WGCNA combined with DAVID (the database for annotation, visualization and integrated discovery)³² were used to explore their functions. We revealed that the onco-lncRNAs likely take key roles in the multistep development of human cancers, covering a wide range of functions in genome stability maintenance, signaling, cell adhesion and motility, morphogenesis, cell cycle, immune and inflammatory response. Our study contributes to a comprehensive understanding of the onco-lncRNAs with the aid of the co-expression network, which may guide subsequently experimental studies on the altered lncRNAs in cancers.

Results

Expression profiles of lncRNAs across cancers

We downloaded public RNA-seq datasets containing four cancer types for our analysis: bladder cancer (BLC)³³, prostate cancer (PRC)³⁴, lung adenocarcinoma (ADC)³⁵ and estrogen receptor positive (ER+) breast cancer (EBC)³⁶ (Supplementary Table S1). GENCODE v23 gtf file, containing 19,797 protein-coding genes and 15,931 lncRNA genes, was used for annotation.

After mapping and quantification, we defined expressed genes based on a threshold of FPKM ≥ 1 in more than 80% of normal samples or 80% of tumor samples for each cancer type. Consequently, there are total 14,470 expressed protein-coding genes (PCGs) (73.1% of all annotated protein-coding genes in the GENCODE) and 2,902 expressed lncRNA genes (18.2% of all annotated lncRNA genes in the GENCODE) in the four cancer types. For all the expressed lncRNAs and PCGs, we calculated the number and proportion of expressed genes appearing in different number of cancers (Fig. 1a–c). We found that the majority (77.0%) of the expressed PCGs are detected in all the four cancers compared with 30.1% of the lncRNAs. Meanwhile, a minority (9.2%) of the PCGs show expression in only one cancer in contrast with a bigger proportion (34.6%) of the lncRNAs. We also computed distributions of FPKM values for the expressed lncRNAs and PCGs in each cancer type. As shown in Fig. 1d, the lncRNAs have a lower expression level than the PCGs in all the cancer types. The observation provides further support for previous observations that the expression of lncRNA genes displays much more tissue-specific and lower expression than PCGs³⁷.

Differentially expressed lncRNA genes

We defined differentially expressed genes between the tumor samples and matched normal samples based on the following criteria: fold change ≥2 or ≤0.5 and FDR ≤ 0.01. And we got 357, 321, 267 and 375 differentially expressed lncRNAs (DELs) in BLC, PRC, ADC and EBC, respectively. The total number of DELs for the four cancer types is 1,010. To obtain an overview of the expression profile for DELs in each cancer, we performed hierarchical cluster analysis (Fig. 2). It can be seen that all heatmaps show a distinct regulating direction and a clear separation between the normal samples and the tumor ones for the DELs. In addition, we also identified 5,595 differential expressed protein-coding genes (DEPs) in all.

Among all the DELs, there are 774 (76.6% of the 1,010 DELs) lncRNAs which are differentially expressed only in one cancer type (Supplementary Table S2). Only few DELs here were indicated by earlier works to be associated with cancers (Supplementary Table S3). For instance, UCA1 was reported to play a regulatory role in promoting human bladder cancer proliferation³⁸. In our analysis, it is up-regulated (log2FC = 3.3, FDR = 4.5 × 10⁻⁴) in BLC. AATBC is also differentially expressed (log2FC = 3.6, FDR = 2.5 × 10⁻⁹) in BLC, which was reported to facilitate proliferation and inhibit cell apoptosis in bladder cancer³⁹. PCAT29, as a new biomarker in prostate cancer⁴⁰, is the most significant DELs (log2FC = 2.68, FDR = 4.43 × 10⁻¹⁵) in PRC from our analysis. CTBP1-AS is observed to be significantly altered (log2FC = 2.7, FDR = 8.2 × 10⁻¹⁰) in PRC, which was reported to be an androgen-responsive lncRNA in prostate cancer⁴¹. The consistency between our results and the findings from earlier works confirms the reliability of our analysis method. Intriguingly, most of DELs altered in only one cancer type have not been reported to be related to cancer yet, for example, MIR99AHG, the most down-regulated DEL (log2FC = −6.44, FDR = 1.23 × 10⁻²⁷) in BLC, and LINC00968, a significantly down-regulated DEL (log2FC = −3.77, FDR = 1.58 × 10⁻⁴⁹) in ADC. These unreported lncRNAs could provide helpful information for possible biomarkers in further experiments owing to their significant dysregulation in the specific tumor type.

The remaining 236 (23.4%) DELs are altered in more than two cancer types (Supplementary Table S4). Previous studies indicated that lncRNAs differentially expressed in multiple cancer types may have conserved oncogenic or tumor suppressor roles⁴². Thus, we defined the 236 DELs as onco-lncRNAs in our study. Among all onco-lncRNAs, there were only 9 DELs dysregulated in all the four cancer types: CTD-2047H16.2, CTD-2517M22.14, CTD-2574D22.3, FGF14-AS2, PVT1, RP11-196G18.22, RP11-346D14.1, RP11-498C9.4 and RP11-510N19.5. Majority of the 236 DELs were missed in earlier studies (Supplementary Table S5). Only 11 onco-lncRNAs were reported to have a bearing on tumorigenesis previously^{42,43,44,45,46,47,48,49,50,51,52} (Fig. 3), two of which (PVT1 and MEG3) were confirmed by conclusive evidences as cancer-associated lncRNAs in multiple cancers^{43, 44}. The other nine known cancer-associated lncRNAs were only studied in one cancer type and there have been no experimental evidences and clinic data to support their associations with multiple cancers. In contrast, we also found 2,017 PCGs significantly altered in more than two cancer types (Supplementary Table S6), in which 92 genes were reported as oncogenes in COSMIC database (https://cancer.sanger.ac.uk/census) (Supplementary Table S7), for example, MYC, NOTCH1 and MET.

To gain insight into the associations between the onco-lncRNAs and multiple cancers, we did a survival analysis through an online tool Kaplan-Meier Plotter, which contains a large number of microarray datasets of breast cancer, lung cancer, gastric cancer and ovarian cancer^53,54,55,56. We chose three known cancer-associated lncRNAs (ADAMTS9-AS2, FGF14-AS2 and PCAT19) whose Affymetrix id can be found in this tool to perform the survival analysis over the four cancer types included in the Kaplan-Meier Plotter (Supplementary Fig. S1). For all the three lncRNAs, the survival time is significantly separated between high-expression groups and low-expression ones in all the four tumor types (p ≤ 0.05). And overexpression of the three lncRNAs exhibits a good prognostic effect in all the cancer types, except for FGF14-AS2 whose low expression is good for the survival in the gastric and ovarian cancer. Although the three lncRNAs were identified as cancer-related by previous studies on single cancer types^{45, 46, 51}, our analysis has revealed for the first time that their expression levels are significantly associated with clinical prognosis across multiple cancers. The results imply that the three lncRNAs may play important roles in multiple cancers, which could also provide support for the other onco-lncRNAs identified here being important in multiple cancers.

Module-based functional characterization of onco-lncRNAs with co-expression network analysis

In order to explore potential functions of the 236 onco-lncRNAs, we used WGCNA to construct a co-expression network based on their normalized expression data of all the 236 onco-lncRNAs and 6,316 PCGs whose expression profile are highly correlated with at least five onco-lncRNAs (see Materials and Methods). Finally, we got 18 modules with sizes ranging from 34 to 1,463 genes, in which the number of onco-lncRNAs varies between 0 and 67 (Supplementary Fig. S2 and Supplementary Table S4). We took the first principal component as a module eigengene and used it to represent the overall expression profile of a module¹⁸, as shown in Supplementary Fig. S3. We obtained the variation of the eigengene between the normal tissues and the tumor ones by one-way analysis of variance (ANOVA) with FDR corrected p-value. The p-value cutoff was set to be 0.0001. Consequently, 12 modules containing the onco-lncRNAs were selected for downstream analysis. The details of the 12 modules are listed in Table 1.

Table 1 Overview of 12 cancer-associated modules.

Full size table

To further determine the biological functions of the onco-lncRNAs in the 12 modules, DAVID³² was used to mine the modules’ biological significance including GO biological process (BP) terms and KEGG pathways. Supplementary Table S8 lists all significantly enriched GO BP terms (p ≤ 0.05) for each module. Figure 4 displays three representative terms for each module. Table 2 lists the significant KEGG pathways with p ≤ 0.05 and gene counts ≥5. According to major biological processes, the 12 modules were parceled out in the following six sections.

Table 2 Significant KEGG pathways (P ≤ 0.05, counts ≥ 5) associated with 12 cancer-associated modules.

Full size table

Modules associated with signal transduction

The blue module contains 41 onco-lncRNAs and 1,402 PCGs. As observed from Fig. 4 and Supplementary Table S8, PCGs in this module are significantly enriched in processes like regulation of Ras protein signal transductions, protein amino acid phosphorylation and protein kinase cascade. The significant KEGG pathways included several signaling pathways relevant to cancer, like VEGF signaling pathway, Notch signaling pathway and MAPK signaling pathway (Table 2). VEGF signaling pathway can mediate proliferation and migration of endothelial cells and promote their survival and vascular permeability. Inappropriate regulation of VEGF was observed to have effects on cell migration and survival in cancers⁵⁷. The black module contains four onco-lncRNAs and their enriched functions are similar to the blue module, as reflected by Fig. 4, Table 2 and Supplementary Table S8.

In the two modules, only two onco-lncRNAs (PVT1 and PCAT6 in the blue module), which exhibit connections to the genes with topological overlaps (w) greater than or equal to 0.05 in the network, were reported to have associations with cancers^{43, 47}. Supplementary Fig. S4a,b show distributions of their expression profiles in the four cancers. It can be seen that PVT1 displays significant overexpression in all the four tumor tissues while PCAT6 is significantly overexpressed in three types of tumor tissue except for PRC. In addition, the two lncRNAs are highly connected (w ≥ 0.05) to genes showing functions in the signaling cascade (Supplementary Fig. S5). For example, HGS is a positive regulator of VEGF and insulin signaling⁵⁸, and the absence of KCTD13 is likely to lead to hyperactivation of the RhoA signaling pathway⁵⁹.

PVT1, as a candidate oncogene, was revealed to be related with cell proliferation and tumor progression in many neoplastic diseases⁴³. Tseng⁶⁰ indicated that high MYC protein levels in 8q24-amplified human cancer cells require gain of PVT1 expression to suppress phosphorylation of T58, in turn protecting MYC protein from degradation. Indeed, our functional analysis of the blue module shows that phosphorylation is a significant functional term (Supplementary Table S8) and the nodes connected to PVT1 contain a MYC-related gene, EHMT1 (PCC = 0.72) (Supplementary Fig. S5), which is part of the E2F6 complex involved in silencing of MYC-responsive genes and G0/G1 cell cycle transition⁶¹. The consistency between our computational analysis and the earlier observations confirms reliability of our predicted results. Thus, it is reasonable to speculate that the onco-lncRNAs clustered in the blue and black modules very likely play important roles in many signaling circuits, in turn influencing the cancer progress.

Modules associated with response to immune activity and stimulus

The yellow module, which contains 12 onco-lncRNAs and 441 PCGs, shows functional enrichment in response to endogenous stimulus, organic substances, and blood vessel development (Fig. 4, Supplementary Table S8). Pathway analysis further reveals that the genes in the module are enriched in some signal transduction pathways like insulin signaling pathway and chemokine signaling pathway, and also in some cancer-associated pathways like melanoma (Table 2). The cyan and green modules, which contain 8 and 7 onco-lncRNAs, respectively, share the BP terms about vascular system with the yellow module. In addition, the cyan and green modules also show enrichment in immune response, and inflammatory regulation (Fig. 4, Supplementary Table S8).

The lncRNA FGF14-AS2 in the yellow module, was reported to be a breast-cancer-associated lncRNA and may act as a tumor suppressor⁴⁵. This gene is observed to be down-regulated in the tumor tissues of all the four cancer types, as evidenced by Supplementary Fig. S4c. Some protein-coding genes connected to it (w ≥ 0.05) take roles in the function of response to stimulus (Supplementary Fig. S6). In addition, FGF14-AS2 shows a high correlation with a famous cancer-associated gene VEGFB (PCC = 0.70), which is a member of vascular endothelial growth factor family and dysregulated in many cancers⁶².

The cyan module contains four lncRNAs (MIR22HG, PCAT19, FENDRR and DIO3OS) whose functions were characterized^{51, 63,64,65}. Although there have been no studies to provide evidence for their associations with cancers, in our study, they exhibit an accordant down-expression pattern in the cancer tissues when they were significantly dys-expressed (Supplementary Fig. S4d–g). In addition, some protein-coding genes connected to the four lncRNAs (w ≥ 0.05) are also associated with the response to stimulus (Supplementary Fig. S7). For MIR22HG, two studies revealed its roles in chemical stress responses^{63, 66}. In our study, MIR22HG shows a positive correlation with IL6 (PCC = 0.68), which was implicated in inflammation, hematopoiesis and carcinogenesis⁶⁷. In addition, MIR22HG also exhibits a positive correlation with CCL2 (PCC = 0.80), which was reported to be involved in immunoregulatory and inflammatory processes of multiple cancers⁶⁸. FENDRR presents a strong positive correlation with FOXF1 (PCC = 0.85), consistent with a previous study⁴². FOXF1 was indicated to play roles in response to wounding and chemical stimulus, and be important in human development and tissue repair⁶⁹.

The observations above indicate that the onco-lncRNAs in these modules may contribute to functions associated with the response to stimulus and immunity. Previous researches proposed that the immune response is an attempt by the immune system to eradicate tumor, and could enhance tumorigenesis and progression⁷⁰.

Modules associated with cell adhesion

There are 16 onco-lncRNAs and 288 PCGs in the red module. Our analysis shows that the genes in this module are significantly enriched in BP terms like cytoskeleton organization, cell junction assembly and cell adhesion (Fig. 4 and Supplementary Table S8). Further analysis indicates that they show enrichments in pathways associated with focal adhesion, regulation of actin cytoskeleton, tight junction and ECM-receptor interaction (Table 2), consistent with the observations from BP terms. Similarly, genes in the greenyellow module containing four onco-lncRNAs also show significant enrichment in the functions involved in cell migration and cell adhesion (Fig. 4 and Supplementary Table S8).

There are two reported lncRNAs (ADAMTS9-AS2 and MIR143HG) in the red module. MIR143HG, as a cardiac mesoderm enhancer-associated non-coding RNA⁷¹, is observed to be significantly down-regulated in BLC and EBC (Supplementary Fig. S4i). ADAMTS9-AS2 was reported to be significantly down-regulated in glioma tumor tissues and its overexpression would result in significant inhibition of glioma cell migration⁴⁶. In our study, ADAMTS9-AS2 is significantly down-expressed in BLC, ADC and EBC (Supplementary Fig. S4h). Furthermore, it can be seen from Supplementary Fig. S8 that most genes directly connected to ADAMTS9-AS2 (w ≥ 0.05) also participate in the functions like cell adhesion and migration, for example, NCAM1⁷² (PCC = 0.70) and PALLD⁷³ (PCC = 0.70). Similar to ADAMTS9-AS2, MIR143HG also exhibits connections with genes involved in cell adhesion like TGFB1I1⁷⁴ (PCC = 0.80).

Thus, it can be conjectured that the onco-lncRNAs in the red and greenyellow modules most possibly participate in maintaining cell shape and changing attachment to other cells or extracellular matrix. The dysregulation in these functions can promote migration of cancer cells, leading to local invasion and distant metastasis⁷⁰.

Module associated with genomic stability

The brown module contains the most onco-lncRNAs (67) among all the modules and 844 PCGs. However, none of the 67 onco-lncRNAs have been reported to have associations with cancers. The genes in this module significantly contribute to response to DNA damage stimulus, DNA repair and chromosome organization processes (Fig. 4 and Supplementary Table S8). The BP terms are associated with functions of maintaining genomic stability and their disorders were revealed to be connected with predisposition to cancer^{75, 76}. In addition, the genes are mainly enriched in spliceosome pathway and base excision repair pathway (Table 2), which are also associated with regulation of genomic stability. Therefore, it is reasonable to infer that the 67 onco-lncRNAs may play roles in maintaining genomic stability under normal circumstances and their imbalances could promote the cancerization progress.

Module associated with morphogenesis

The magenta module contains 29 onco-lncRNAs and 182 PCGs. None of the 29 onco-lncRNAs have been reported to be correlated with cancers. The enrichment analysis for BP terms reveals that most significant terms are involved in embryogenesis progresses, like embryonic skeletal system morphogenesis, embryonic organ morphogenesis and embryonic morphogenesis (Fig. 4 and Supplementary Table S8). Some evidences indicated that genes with functions involved in the embryo development stage would play a role in carcinogenesis⁷⁷. Thus, it can be inferred that the onco-lncRNAs in the module magenta may be associated with steps of the embryonic morphogenesis and would advance carcinoma to progress to higher pathological grades^78,79,80.

Modules associated with cell cycle

The tan, lightgreen and salmon modules contain three, four and five onco-lncRNAs, and the number of PCGs are 118, 30 and 109, respectively. The enrichment function terms in the three modules are all involved in cell cycles (Fig. 4 and Supplementary Table S8). As accepted, the most fundamental trait of cancer cells is to sustain proliferation and the abnormality of the cell cycle has been considered to be a common feature of cancers⁸¹. Thus, it can be speculated that the 12 onco-lncRNAs in the three modules could play an important role in cancers through dysregulation of the cell cycle.

In summary, the module analysis above indicates that the functions of onco-lncRNAs in the 12 modules are involved in the biological roles relevant to malignancies. Compared to the previous observations^{21, 31}, some new functions of the lncRNAs like the morphogenesis and immune regulation are revealed by our work.

Hub-based analysis

Highly connected hub nodes are central to the network’s architecture¹⁸ and some studies suggested that genes more centralized in the network are more likely to be key drivers to proper cellular function than peripheral genes⁸². As observed above, the brown module has the most number of onco-lncRNAs and its eigengene shows a significant difference between the tumor and normal samples. Taking the brown module as an example, we further identify its intramodular hub genes. Although the filter for genes used in building the network may lead the onco-lncRNA connectivity towards higher values, we could obtain more important nodes from the onco-lncRNAs through identifying hub nodes. We selected the 5% of nodes (51/911) with the highest connectivity as hub genes from the brown module, which contain 11 onco-lncRNAs and 40 PCGs. Figure 5a is a network of these hub genes, which only displays connections with w above a threshold of 0.2. It can be seen that the 11 onco-lncRNAs exhibit high connectivity with neighboring genes (RAD50, CHD9, KMT2A, ARID4B and RING1) whose functions are involved in maintaining genome stability. Especially, RAD50, KMT2A and ARID4B were revealed as biomarkers in a broad range of human malignancies^{83, 84}. In addition, the 51 hub genes are overexpressed in the tumor tissues of each cancer type (Fig. 5b). The observation further indicates that the 11 onco-lncRNAs may play important roles in the regulation of genome stability in tumor biology. Meanwhile, their higher connectivity than other onco-lncRNAs in brown module suggests that they may play more crucial roles in the biological functions and the development of cancers than the other onco-lncRNAs in this module.

Conclusion

Although accumulating studies have indicated that the lncRNAs play important roles in tumor progress, the functions for most lncRNAs have not been unraveled, in particular for potential oncogenic lncRNAs across multiple cancers. In this study, we mainly utilized the gene co-expression network to study the functions of the onco-lncRNAs for the four solid cancer types.

The 236 onco-lncRNAs altered in multiple cancers were identified, majority of which were unreported previously to have associations with cancers. Our co-expression network and function enrichment analysis indicate that the onco-lncRNAs should play carcinogenic roles in the most fundamental functions involved in regulating proliferation and genome stability, providing further supports for the previous observations³¹. More importantly, our results reveal that the onco-lncRNAs are also associated with some biological capabilities implicated in the processes related to major hallmarks of cancers, like cell adhesion and motility, morphogenesis, immune and inflammatory response.

Overall, our study is the first time to use WGCNA approach to investigate the functions of the lncRNAs across multiple cancers based on RNA-seq data. Although the biological importance of the unreported onco-lncRNAs need further evaluation by experiments, our study proposed a facile yet efficient strategy to identify important lncRNAs associated with cancers and predict their potential functional roles, which may guide subsequently experimental studies.

Materials and Methods

RNA-seq datasets

Raw fastaq files of RNA-seq datasets for the four cancer types were downloaded from the European Nucleotide Archive (http://www.ebi.ac.uk/ena), including bladder cancer (SRP018008)³³, prostate cancer (ERP000550)³⁴, lung adenocarcinoma (SRP012656)³⁵ and estrogen receptor positive (ER+) breast cancer (SRP042620)³⁶ (Supplementary Table S1). In each dataset, we only choose those tumor samples with matched adjacent normal samples. Finally, we obtained 132 samples, of which at least 11 sample pairs for each cancer type, to analyze downstream.

Raw reads alignment and expression quantification

After the sequence quality control on the raw sequence data by fastqc v0.11.5 (http://www.bioinformatics.babraham.ac.uk/projects/fastqc/), raw reads were mapped back to the reference genome GRCh38.p3 by TopHat v2.0.13^{85, 86}, and we used the GENCODE v23 gtf file (ftp://ftp.sanger.ac.uk/pub/gencode/Gencode_human/release_23/gencode.v23.annotation.gtf.gz) as annotation file, which contains 15,931 lncRNA genes. Then we used Cufflinks v2.2.1^{86, 87} for the gene assembly and quantification. We obtained the gene expression levels by summarizing the FPKM value (Fragment Per Kilobase per Million mapped reads). In order to minimize the false positive and maintain a high number of differential expressed genes in downstream analysis, we only kept the expressed genes in terms of the criterion of FPKM ≥ 1 in more than 80% of the normal samples or 80% of the tumor samples for each cancer type according to Supplementary Fig. S9.

Differential expression analysis

We performed differential expression analysis on each cancer, based on BAM files derived from TopHat. DESeq2 v1.12.4⁸⁸ was used to test differential expression between the tumor and normal samples. A gene is defined as a differentially expressed gene between the normal sample and tumor one when the FDR adjusted p value is less than 0.01 (FDR ≤ 0.01) and the fold change (FC) is at least 2 times higher or lower (|log2FC| ≥ 1).

Co-expression network construction of onco-lncRNAs

We defined the lncRNAs significantly altered in more than two cancer types as onco-lncRNAs. In order to predict their functions, WGCNA v1.51¹⁸ was used to construct a co-expression network between the onco-lncRNAs and their “closely correlated” PCGs, based on the signed Pearson Correlation Coefficient (PCC) between their normalized expression levels as provided by Cuffnorm⁸⁶. A PCG is defined to be “closely correlated” with the onco-lncRNAs when its absolute values of Pearson correlation coefficients with more than 5 onco-lncRNAs are equal or greater than 0.5. Consequently, 6,316 correlated-PCGs were obtained. We then calculated a correlation matrix containing the absolute values of pairwise Pearson correlations among all the onco-lncRNAs and the correlated PCGs for the samples under study. In order to achieve a scale-free topology, we set β = 9 in terms of Supplementary Fig. S10 and converted the pairwise correlation into an adjacency matrix of connection strengths through soft-thresholding approach (connection strength = |correlation|^β). A dissimilarity matrix based on topological overlap measure (TOM) was used to identify gene modules through a dynamic tree-cutting algorithm¹⁸. All modules were assigned to the corresponding color. The module eigengene was used to represent each module, which was calculated by the first principal component. Using the module eigengenes, Module-Cancer relationships were estimated by one-way ANOVA with FDR corrected p-value between the module eigengene and the tissue type (normal and tumor). Then we selected 12 significantly cancer-associated modules (p-value ≤ 0.0001) for the downstream analysis. We also analyzed the hub genes of the brown module, which were derived from top 5% genes with the highest connectivity.

Functional enrichment analysis of onco-lncRNA-containing modules

We used DAVID v6.7³² (https://david-d.ncifcrf.gov/) to perform the functional enrichment analysis for each module. The tool computes a modified Fisher exact test p-value. In the main text, we only show the three representative terms from top 10 most significantly GO BP terms of each module. But, all significant terms (p ≤ 0.05) are listed in Supplementary Table S8. In addition, we only concerned significant KEGG pathways with p-value ≤ 0.05 and the number of enriched genes ≥ 5 (Table 2).

Statistical analysis and visualization

Statistical analysis was performed using R-3.3.1. Most of the visualizations were also presented by R, except for the survival analysis and the network visualization, where the Kaplan-Meier Plotter (http://kmplot.com/) and Cytoscape v3.3.0 (http://www.cytoscape.org/) tools were used. For the survival analysis, we chose recommended parameters from the web server to analyze the association between a queried gene and the survival time. Samples were grouped according to the median expression of the selected gene. All the survival curves denote overall survival (OS).

References

Kapranov, P. et al. RNA Maps Reveal New RNA Classes and a Possible Function for Pervasive Transcription. Science 316, 1484–1488, doi:10.1126/science.1138341 (2007).
Article ADS CAS PubMed Google Scholar
St Laurent, G., Wahlestedt, C. & Kapranov, P. The Landscape of long noncoding RNA classification. Trends Genet 31, 239–251, doi:10.1016/j.tig.2015.03.007 (2015).
Article Google Scholar
Kapusta, A. & Feschotte, C. Volatile evolution of long noncoding RNA repertoires: mechanisms and biological implications. Trends Genet 30, 439–452, doi:10.1016/j.tig.2014.08.004 (2014).
Article CAS PubMed PubMed Central Google Scholar
Fatica, A. & Bozzoni, I. Long non-coding RNAs: new players in cell differentiation and development. Nat Rev Genet 15, 7–21, doi:10.1038/nrg3606 (2014).
Article CAS PubMed Google Scholar
Satpathy, A. T. & Chang, H. Y. Long noncoding RNA in hematopoiesis and immunity. Immunity 42, 792–804, doi:10.1016/j.immuni.2015.05.004 (2015).
Article CAS PubMed Google Scholar
Devaux, Y. et al. Long noncoding RNAs in cardiac development and ageing. Nat Rev Cardiol 12, 415–425, doi:10.1038/nrcardio.2015.55 (2015).
Article CAS PubMed Google Scholar
Quinn, J. J. & Chang, H. Y. Unique features of long non-coding RNA biogenesis and function. Nat Rev Genet 17, 47–62, doi:10.1038/nrg.2015.10 (2016).
Article CAS PubMed Google Scholar
Batista, P. J. & Chang, H. Y. Long noncoding RNAs: cellular address codes in development and disease. Cell 152, 1298–1307, doi:10.1016/j.cell.2013.02.012 (2013).
Article CAS PubMed PubMed Central Google Scholar
Quek, X. C. et al. lncRNAdb v2.0: expanding the reference database for functional long noncoding RNAs. Nucleic Acids Res 43, D168–173, doi:10.1093/nar/gku988 (2015).
Article PubMed Google Scholar
Jiang, Q. et al. LncRNA2Function: a comprehensive resource for functional investigation of human lncRNAs based on RNA-seq data. BMC genomics 16, 1, doi:10.1186/1471-2164-16-S3-S2 (2015).
Article Google Scholar
Signal, B., Gloss, B. S. & Dinger, M. E. Computational Approaches for Functional Prediction and Characterisation of Long Noncoding RNAs. Trends Genet 32, 620–637, doi:10.1016/j.tig.2016.08.004 (2016).
Article CAS PubMed Google Scholar
Volders, P. J. et al. LNCipedia: a database for annotated human lncRNA transcript sequences and structures. Nucleic Acids Res 41, D246–251, doi:10.1093/nar/gks915 (2013).
Article CAS PubMed Google Scholar
Burge, S. W. et al. Rfam 11.0: 10 years of RNA families. Nucleic Acids Res 41, D226–232, doi:10.1093/nar/gks1005 (2013).
Article CAS PubMed Google Scholar
Guo, X. et al. Advances in long noncoding RNAs: identification, structure prediction and function annotation. Brief Funct Genomics 15, 38–46, doi:10.1093/bfgp/elv022 (2016).
PubMed Google Scholar
Chu, C., Qu, K., Zhong, F. L., Artandi, S. E. & Chang, H. Y. Genomic maps of long noncoding RNA occupancy reveal principles of RNA-chromatin interactions. Mol Cell 44, 667–678, doi:10.1016/j.molcel.2011.08.027 (2011).
Article CAS PubMed PubMed Central Google Scholar
Simon, M. D. et al. The genomic binding sites of a noncoding RNA. Proceedings of the National Academy of Sciences 108, 20497–20502, doi:10.1073/pnas.1113536108 (2011).
Article ADS CAS Google Scholar
Stuart, J. M., Segal, E., Koller, D. & Kim, S. K. A gene-coexpression network for global discovery of conserved genetic modules. science 302, 249–255, doi:10.1126/science.1087447 (2003).
Article ADS CAS PubMed Google Scholar
Langfelder, P. & Horvath, S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics 9, 559, doi:10.1186/1471-2105-9-559 (2008).
Article PubMed PubMed Central Google Scholar
Li, J., Li, Y.-X. & Li, Y.-Y. Differential Regulatory Analysis Based on Coexpression Network in Cancer Research. BioMed Research International 2016, 8, doi:10.1155/2016/4241293 (2016).
Google Scholar
Cui, W. et al. Discovery and characterization of long intergenic non-coding RNAs (lincRNA) module biomarkers in prostate cancer: an integrative analysis of RNA-Seq data. BMC genomics 16, 1, doi:10.1186/1471-2164-16-S7-S3 (2015).
Article Google Scholar
Cogill, S. B. & Wang, L. Co-expression Network Analysis of Human lncRNAs and Cancer Genes. Cancer Inform 13, 49–59, doi:10.4137/CIN.S14070 (2014).
PubMed PubMed Central Google Scholar
Kaczkowski, B. et al. Transcriptome Analysis of Recurrently Deregulated Genes across Multiple Cancers Identifies New Pan-Cancer Biomarkers. Cancer Res 76, 216–226, doi:10.1158/0008-5472.CAN-15-0484 (2016).
Article CAS PubMed Google Scholar
Cancer Genome Atlas Research, N. et al. The Cancer Genome Atlas Pan-Cancer analysis project. Nat Genet 45, 1113–1120, doi:10.1038/ng.2764 (2013).
Article Google Scholar
Ji, P. et al. MALAT-1, a novel noncoding RNA, and thymosin beta4 predict metastasis and survival in early-stage non-small cell lung cancer. Oncogene 22, 8031–8041, doi:10.1038/sj.onc.1206928 (2003).
Article PubMed Google Scholar
Lin, R., Maeda, S., Liu, C., Karin, M. & Edgington, T. S. A large noncoding RNA is a marker for murine hepatocellular carcinomas and a spectrum of human carcinomas. Oncogene 26, 851–858, doi:10.1038/sj.onc.1209846 (2007).
Article CAS PubMed Google Scholar
Guffanti, A. et al. A transcriptional sketch of a primary human breast cancer by 454 deep sequencing. BMC Genomics 10, 163, doi:10.1186/1471-2164-10-163 (2009).
Article PubMed PubMed Central Google Scholar
Xu, C., Yang, M., Tian, J., Wang, X. & Li, Z. MALAT-1: a long non-coding RNA and its important 3′ end functional motif in colorectal cancer metastasis. Int J Oncol 39, 169–175, doi:10.3892/ijo.2011.1007 (2011).
PubMed Google Scholar
Huarte, M. The emerging role of lncRNAs in cancer. Nat Med 21, 1253–1261, doi:10.1038/nm.3981 (2015).
Article CAS PubMed Google Scholar
Marchese, F. P. et al. A Long Noncoding RNA Regulates Sister Chromatid Cohesion. Mol Cell 63, 397–407, doi:10.1016/j.molcel.2016.06.031 (2016).
Article CAS PubMed Google Scholar
Yan, X. et al. Comprehensive Genomic Characterization of Long Non-coding RNAs across Human Cancers. Cancer Cell 28, 529–540, doi:10.1016/j.ccell.2015.09.006 (2015).
Article CAS PubMed PubMed Central Google Scholar
Iyer, M. K. et al. The landscape of long noncoding RNAs in the human transcriptome. Nat Genet 47, 199–208, doi:10.1038/ng.3192 (2015).
Article CAS PubMed PubMed Central Google Scholar
Huang da, W., Sherman, B. T. & Lempicki, R. A. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc 4, 44–57, doi:10.1038/nprot.2008.211 (2009).
Article PubMed Google Scholar
Guo, G. et al. Whole-genome and whole-exome sequencing of bladder cancer identifies frequent alterations in genes involved in sister chromatid cohesion and segregation. Nat Genet 45, 1459–1463, doi:10.1038/ng.2798 (2013).
Article CAS PubMed Google Scholar
Ren, S. et al. RNA-seq analysis of prostate cancer in the Chinese population identifies recurrent gene fusions, cancer-associated long noncoding RNAs and aberrant alternative splicings. Cell Res 22, 806–821, doi:10.1038/cr.2012.30 (2012).
Article CAS PubMed PubMed Central Google Scholar
Kim, S. C. et al. A high-dimensional, deep-sequencing study of lung adenocarcinoma in female never-smokers. PloS one 8, e55596, doi:10.1371/journal.pone.0055596 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Varley, K. E. et al. Recurrent read-through fusion transcripts in breast cancer. Breast Cancer Res Treat 146, 287–297, doi:10.1007/s10549-014-3019-2 (2014).
Article PubMed PubMed Central Google Scholar
Harrow, J. et al. GENCODE: the reference human genome annotation for The ENCODE Project. Genome Res 22, 1760–1774, doi:10.1101/gr.135350.111 (2012).
Article CAS PubMed PubMed Central Google Scholar
Li, H. J. et al. Long non-coding RNA UCA1 promotes glutamine metabolism by targeting miR-16 in human bladder cancer. Jpn J Clin Oncol 45, 1055–1063, doi:10.1093/jjco/hyv132 (2015).
Article CAS PubMed Google Scholar
Zhao, F. et al. Knockdown of a novel lincRNA AATBC suppresses proliferation and induces apoptosis in bladder cancer. Oncotarget 6, 1064–78, doi:10.18632/oncotarget.2833 (2015).
Article PubMed Google Scholar
Malik, R. et al. The lncRNA PCAT29 inhibits oncogenic phenotypes in prostate cancer. Mol Cancer Res 12, 1081–1087, doi:10.1158/1541-7786.MCR-14-0257 (2014).
Article CAS PubMed PubMed Central Google Scholar
Takayama, K. et al. Androgen-responsive long noncoding RNA CTBP1-AS promotes prostate cancer. EMBO J 32, 1665–1680, doi:10.1038/emboj.2013.99 (2013).
Article CAS PubMed PubMed Central Google Scholar
Cabanski, C. R. et al. Pan-cancer transcriptome analysis reveals long noncoding RNAs with conserved function. RNA Biol 12, 628–642, doi:10.1080/15476286.2015.1038012 (2015).
Article PubMed PubMed Central Google Scholar
Colombo, T., Farina, L., Macino, G. & Paci, P. PVT1: a rising star among oncogenic long noncoding RNAs. Biomed Res Int 2015, 304208–10, doi:10.1155/2015/304208 (2015).
Article PubMed PubMed Central Google Scholar
Zhou, Y., Zhang, X. & Klibanski, A. MEG3 noncoding RNA: a tumor suppressor. J Mol Endocrinol 48, R45–53, doi:10.1530/JME-12-0008 (2012).
Article CAS PubMed PubMed Central Google Scholar
Yang, F. et al. A novel long non-coding RNA FGF14-AS2 is correlated with progression and prognosis in breast cancer. Biochem Biophys Res Commun 470, 479–483, doi:10.1016/j.bbrc.2016.01.147 (2016).
Article CAS PubMed Google Scholar
Yao, J. et al. A new tumor suppressor LncRNA ADAMTS9-AS2 is regulated by DNMT1 and inhibits migration of glioma cells. Tumour Biol 35, 7935–7944, doi:10.1007/s13277-014-1949-2 (2014).
Article CAS PubMed Google Scholar
Du, Z. et al. Integrative genomic analyses reveal clinically relevant long noncoding RNAs in human cancer. Nat Struct Mol Biol 20, 908–913, doi:10.1038/nsmb.2591 (2013).
Article CAS PubMed PubMed Central Google Scholar
Xu, T.-p. et al. Decreased expression of the long non-coding RNA FENDRR is associated with poor prognosis in gastric cancer and FENDRR regulates gastric cancer cell metastasis by affecting fibronectin1 expression. Journal of hematology & oncology 7, 63, doi: 10.1186/s13045-014-0063-7 (2014).
Article CAS Google Scholar
Sakurai, K., Reon, B. J., Anaya, J. & Dutta, A. The lncRNA DRAIC/PCAT29 Locus Constitutes a Tumor-Suppressive Nexus. Mol Cancer Res 13, 828–838, doi:10.1158/1541-7786.MCR-15-0016-T (2015).
Article CAS PubMed PubMed Central Google Scholar
Muys, B. R. et al. Placenta-Enriched LincRNAs MIR503HG and LINC00629 Decrease Migration and Invasion Potential of JEG-3 Cell Line. PLoS One 11, e0151560, doi:10.1371/journal.pone.0151560 (2016).
Article PubMed PubMed Central Google Scholar
Hazelett, D. J. et al. Comprehensive functional annotation of 77 prostate cancer risk loci. PLoS Genet 10, e1004102, doi:10.1371/journal.pgen.1004102 (2014).
Article PubMed PubMed Central Google Scholar
Arab, K. et al. Long noncoding RNA TARID directs demethylation and activation of the tumor suppressor TCF21 via GADD45A. Mol Cell 55, 604–614, doi:10.1016/j.molcel.2014.06.031 (2014).
Article CAS PubMed Google Scholar
Györffy, B. et al. An online survival analysis tool to rapidly assess the effect of 22,277 genes on breast cancer prognosis using microarray data of 1,809 patients. Breast Cancer Research and Treatment 123, 725–731, doi:10.1007/s10549-009-0674-9 (2010).
Article PubMed Google Scholar
Gyorffy, B., Surowiak, P., Budczies, J. & Lanczky, A. Online survival analysis software to assess the prognostic value of biomarkers using transcriptomic data in non-small-cell lung cancer. PLoS One 8, e82241, doi:10.1371/journal.pone.0082241 (2013).
Article ADS PubMed PubMed Central Google Scholar
Szász, A. M. et al. Cross-validation of survival associated biomarkers in gastric cancer using transcriptomic data of 1,065 patients. Oncotarget 7, 49322–49333, doi:10.18632/oncotarget.10337 (2016).
PubMed PubMed Central Google Scholar
Gyorffy, B., Lanczky, A. & Szallasi, Z. Implementing an online tool for genome-wide validation of survival-associated biomarkers in ovarian-cancer using microarray data from 1287 patients. Endocr Relat Cancer 19, 197–208, doi:10.1530/ERC-11-0329 (2012).
Article CAS PubMed Google Scholar
Cross, M. J., Dixelius, J., Matsumoto, T. & Claesson-Welsh, L. VEGF-receptor signal transduction. Trends in Biochemical Sciences 28, 488–494, doi:10.1016/s0968-0004(03)00193-2 (2003).
Article CAS PubMed Google Scholar
Hasseine, L. K. et al. Hrs is a positive regulator of VEGF and insulin signaling. Exp Cell Res 313, 1927–1942, doi:10.1016/j.yexcr.2007.02.034 (2007).
Article CAS PubMed Google Scholar
Packer, A. Actin remodeling. Cell (2013).
Tseng, Y. Y. et al. PVT1 dependence in cancer with MYC copy-number increase. Nature 512, 82–86, doi:10.1038/nature13311 (2014).
ADS CAS PubMed PubMed Central Google Scholar
Ogawa, H., Ishiguro, K.-i, Gaubatz, S., Livingston, D. M. & Nakatani, Y. A complex with chromatin modifiers that occupies E2F-and Myc-responsive genes in G0 cells. Science 296, 1132–1136, doi:10.1126/science.1069861 (2002).
Article ADS CAS PubMed Google Scholar
Bry, M., Kivela, R., Leppanen, V. M. & Alitalo, K. Vascular endothelial growth factor-B in physiology and disease. Physiol Rev 94, 779–794, doi:10.1152/physrev.00028.2013 (2014).
Article CAS PubMed Google Scholar
Tani, H. & Torimura, M. Identification of short-lived long non-coding RNAs as surrogate indicators for chemical stress response. Biochem Biophys Res Commun 439, 547–551, doi:10.1016/j.bbrc.2013.09.006 (2013).
Article CAS PubMed Google Scholar
Grote, P. et al. The tissue-specific lncRNA Fendrr is an essential regulator of heart and body wall development in the mouse. Dev Cell 24, 206–214, doi:10.1016/j.devcel.2012.12.012 (2013).
Article CAS PubMed PubMed Central Google Scholar
Hernandez, A., Martinez, M. E., Croteau, W. & St Germain, D. L. Complex organization and structure of sense and antisense transcripts expressed from the DIO3 gene imprinted locus. Genomics 83, 413–424, doi:10.1016/j.ygeno.2003.08.024 (2004).
Article CAS PubMed Google Scholar
Tani, H., Onuma, Y., Ito, Y. & Torimura, M. Long non-coding RNAs as surrogate indicators for chemical stress responses in human-induced pluripotent stem cells. PloS one 9, e106282, doi:10.1371/journal.pone.0106282 (2014).
Article ADS PubMed PubMed Central Google Scholar
Schaper, F. & Rose-John, S. Interleukin-6: Biology, signaling and strategies of blockade. Cytokine Growth Factor Rev 26, 475–487, doi:10.1016/j.cytogfr.2015.07.004 (2015).
Article CAS PubMed Google Scholar
Mizutani, K. et al. The Chemokine CCL2 Increases Prostate Tumor Growth and Bone Metastasis through Macrophage and Osteoclast Recruitment. Neoplasia 11, 1235–1242, doi:10.1593/neo.09988 (2009).
Article CAS PubMed PubMed Central Google Scholar
Saito, R. A. et al. Forkhead box F1 regulates tumor-promoting properties of cancer-associated fibroblasts in lung cancer. Cancer Res 70, 2644–2654, doi:10.1158/0008-5472.CAN-09-3644 (2010).
Article CAS PubMed Google Scholar
Hanahan, D. & Weinberg, R. A. Hallmarks of cancer: the next generation. Cell 144, 646–674, doi:10.1016/j.cell.2011.02.013 (2011).
Article CAS PubMed Google Scholar
Ounzain, S. et al. CARMEN, a human super enhancer-associated long noncoding RNA controlling cardiac specification, differentiation and homeostasis. J Mol Cell Cardiol 89, 98–112, doi:10.1016/j.yjmcc.2015.09.016 (2015).
Article CAS PubMed Google Scholar
Xu, S., Li, X., Zhang, J. & Chen, J. Prognostic value of CD56 in patients with acute myeloid leukemia: a meta-analysis. J Cancer Res Clin Oncol 141, 1859–1870, doi:10.1007/s00432-015-1977-3 (2015).
Article CAS PubMed Google Scholar
Najm, P. & El-Sibai, M. Palladin regulation of the actin structures needed for cancer invasion. Cell Adh Migr 8, 29–35, doi:10.4161/cam.28024 (2014).
Article PubMed Google Scholar
Wu, J.-R. et al. Hydrogen peroxide inducible clone-5 mediates reactive oxygen species signaling for hepatocellular carcinoma progression. Oncotarget 6, 32526–44, doi:10.18632/oncotarget.5322 (2015).
PubMed PubMed Central Google Scholar
Curtin, N. J. DNA repair dysregulation from cancer driver to therapeutic target. Nat Rev Cancer 12, 801–817, doi:10.1038/nrc3399 (2012).
Article CAS PubMed Google Scholar
Zink, D., Fischer, A. H. & Nickerson, J. A. Nuclear structure in cancer cells. Nat Rev Cancer 4, 677–687, doi:10.1038/nrc1430 (2004).
Article CAS PubMed Google Scholar
Hendrix, M. J. et al. Reprogramming metastatic tumour cells with embryonic microenvironments. Nat Rev Cancer 7, 246–255, doi:10.1038/nrc2108 (2007).
Article CAS PubMed Google Scholar
Yang, J. et al. Twist, a master regulator of morphogenesis, plays an essential role in tumor metastasis. Cell 117, 927–939, doi:10.1016/j.cell.2004.06.006 (2004).
Article CAS PubMed Google Scholar
Brabletz, T., Jung, A. & Kirchner, T. Beta-catenin and the morphogenesis of colorectal cancer. Virchows Arch 441, 1–11, doi:10.1007/s00428-002-0642-9 (2002).
Article CAS PubMed Google Scholar
Friedl, P. & Gilmour, D. Collective cell migration in morphogenesis, regeneration and cancer. Nat Rev Mol Cell Biol 10, 445–457, doi:10.1038/nrm2720 (2009).
Article CAS PubMed Google Scholar
Hartwell, L. H. & Kastan, M. B. Cell cycle control and cancer. Science 266, 1821–8, doi:10.1126/science.7997877 (1994).
Article ADS CAS PubMed Google Scholar
Yang, Y. et al. Gene co-expression network analysis reveals common system-level properties of prognostic genes across cancer types. Nat Commun 5, 3231, doi:10.1038/ncomms4231 (2014).
ADS PubMed PubMed Central Google Scholar
Zhang, M. et al. Copy number deletion of RAD50 as predictive marker of BRCAness and PARP inhibitor response in BRCA wild type ovarian cancer. Gynecol Oncol 141, 57–64, doi:10.1016/j.ygyno.2016.01.004 (2016).
Article CAS PubMed PubMed Central Google Scholar
De Boer, J., Walf-Vorderwülbecke, V. & Williams, O. In focus: MLL-rearranged leukemia. Leukemia 27, 1224–1228, doi:10.1038/leu.2013.78 (2013).
Article PubMed Google Scholar
Trapnell, C., Pachter, L. & Salzberg, S. L. TopHat: discovering splice junctions with RNA-Seq. Bioinformatics 25, 1105–1111, doi:10.1093/bioinformatics/btp120 (2009).
Article CAS PubMed PubMed Central Google Scholar
Trapnell, C. et al. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat Protoc 7, 562–578, doi:10.1038/nprot.2012.016 (2012).
Article CAS PubMed PubMed Central Google Scholar
Trapnell, C. et al. Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat Biotechnol 28, 511–515, doi:10.1038/nbt.1621 (2010).
Article CAS PubMed PubMed Central Google Scholar
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol 15, 550, doi:10.1186/s13059-014-0550-8 (2014).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This project is supported in part by the National Natural Science Foundation of China (Grant Nos 21573151, 21273154, 31471239, 31671368) and Sichuan Province Science and Technology Support Program (Grant No. 2015GZ0193).

Author information

Authors and Affiliations

College of Chemistry, Sichuan University, Chengdu, 610064, China
Suqing Li, Menglong Li & Xuemei Pu
Center for Pharmacogenomics, School of Life Sciences, and State Key Laboratory of Genetic Engineering and Shanghai Cancer Center/Cancer Institute, Fudan University, Shanghai, 201203, China
Bin Li, Yuanting Zheng & Leming Shi
Collaborative Innovation Center for Genetics and Development, Fudan University, Shanghai, 200438, China
Yuanting Zheng & Leming Shi

Authors

Suqing Li
View author publications
You can also search for this author in PubMed Google Scholar
Bin Li
View author publications
You can also search for this author in PubMed Google Scholar
Yuanting Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Menglong Li
View author publications
You can also search for this author in PubMed Google Scholar
Leming Shi
View author publications
You can also search for this author in PubMed Google Scholar
Xuemei Pu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

X.P. and L.S. designed the experiments; S.L., B.L. and Y.Z. performed computation and data analysis. S.L. wrote the main manuscript text and prepared all the figures. X.P., L.S. and M.L. discussed the results and revised the manuscript. All authors contributed to discussions about the results and the manuscript.

Corresponding authors

Correspondence to Leming Shi or Xuemei Pu.

Ethics declarations

Competing Interests

The authors declare that they have no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Li, S., Li, B., Zheng, Y. et al. Exploring functions of long noncoding RNAs across multiple cancers through co-expression network. Sci Rep 7, 754 (2017). https://doi.org/10.1038/s41598-017-00856-8

Download citation

Received: 02 December 2016
Accepted: 15 March 2017
Published: 07 April 2017
DOI: https://doi.org/10.1038/s41598-017-00856-8

This article is cited by

TP63–TRIM29 axis regulates enhancer methylation and chromosomal instability in prostate cancer
- R. Sultanov
- A. Mulyukina
- G. Arapidi
Epigenetics & Chromatin (2024)
The roles of lncRNAs in Th17-associated diseases, with special focus on JAK/STAT signaling pathway
- Han Wang
- Lanlan Yu
- Zhigang Guo
Clinical and Experimental Medicine (2023)
Lantern: an integrative repository of functional annotations for lncRNAs in the human genome
- Swapna Vidhur Daulatabad
- Rajneesh Srivastava
- Sarath Chandra Janga
BMC Bioinformatics (2021)
CORAZON: a web server for data normalization and unsupervised clustering based on expression profiles
- Thaís A. R. Ramos
- Vinicius Maracaja-Coutinho
- Thaís G. do Rêgo
BMC Research Notes (2020)
PLAIDOH: a novel method for functional prediction of long non-coding RNAs identifies cancer-specific LncRNA activities
- Sarah C. Pyfrom
- Hong Luo
- Jacqueline E. Payton
BMC Genomics (2019)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.