In Silico Comparative Genomic Analysis of Two Non-small Cell Lung Cancer Subtypes and their Potentials for Cancer Classification

JINDONG LI; DONGFANG LI; XUDONG WEI; YANHE SU

Abstract

Background/Aim: Lung adenocarcinoma (AC) and squamous cell lung carcinoma (SCC) are the two main subtypes of non-small cell lung cancer. To understand their biological differences, we conducted an in silico comparative genomic analysis of their expression profiles. Materials and Methods: We used published microarray data from 18 SCC samples and 40 AC samples to identify genes differentially expressed between SCC and AC. These genes were used to construct a functional module network and to build a support vector machine classifier. A second set of published non-small cell lung cancer microarray data was used to test the predictive accuracy of the classifier. Results: SCC showed elevated expression of genes related to cell division and DNA replication, whereas AC showed elevated expression of genes related to protein transport and cell junction. ROC analysis demonstrated that the support vector machine classifier has a high classification accuracy for AC and SCC. Conclusion: AC and SCC differ distinctly in certain biological network modules, which suggests that different pathological mechanisms are involved in these two non-small cell lung cancer subtypes.

Lung cancer is one of the major causes of cancer-related death worldwide, bringing unbearable agony to patients and a hefty burden to health systems. Since 2008, it has replaced liver cancer as the most common malignant tumor in China (1). About 80% of lung cancers are non-small-cell lung cancers (NSCLC) (2), which are divided into three major subtypes: lung adenocarcinoma (AC), squamous cell lung carcinoma (SCC), and large cell lung carcinoma.

AC (nearly 40% of NSCLCs) and SCC (about 30% of NSCLCs) are more common than large cell lung carcinoma (2, 3). Together they account for more than half of all lung cancer cases. AC is more strongly associated with patients who have no history of smoking, although smoking is an important risk factor for lung cancer, especially in developed countries (4). SCC is more common in males than in females and is closely associated with a history of smoking (5). These facts suggest that AC and SCC have different pathological mechanisms.

Microarray-based gene expression profiling has been used to describe the expression profiles of AC and SCC (6, 7). Genes differentially expressed between AC and SCC have been identified, and some of them are reported to be biomarkers for non-small cell lung cancer (8, 9). However, it remains unclear which genes affect the pathological process of AC or SCC. So far, most studies have focused on only a small set of signature genes of AC or SCC and their possible roles in NSCLCs (10-13). Most of the information in AC and SCC microarray data has been overlooked, and the biological processes and molecular pathways hidden in these data remain to be uncovered.

The long-term survival rate of non-small cell lung cancer patients is abysmal (5-year overall survival of about 15% and 10-year overall survival of 8-10%) (14, 15). Improving long-term survival requires correct diagnosis of the NSCLC subtype at an early stage and a proper treatment plan, both of which demand an understanding of the molecular basis underlying the different NSCLC subtypes. Using published AC and SCC microarray data (16), we conducted a comparative genomic analysis of AC and SCC expression profiles. We focused on the differences between AC and SCC in their biological process, cellular component and molecular function networks, and identified the major differences in these networks. In addition, we built a support vector machine (SVM) classifier based on the genes involved in these networks, which may be of value in AC and SCC diagnosis. We also used a second set of NSCLC microarray data to test the accuracy of the classifier (8), which showed high predictive accuracy.

Figure 1.

The highly interconnected modules extracted from the biological process network. Circles represent modules and arrows represent sub-network flow. The number beside SCC is the number of SCC up-regulated genes in the module, and the number beside AC is the number of AC up-regulated genes in the module.

Figure 2.

The highly interconnected modules extracted from the cellular component network. Circles represent modules and arrows represent sub-network flow. The number beside SCC is the number of SCC up-regulated genes in the module, and the number beside AC is the number of AC up-regulated genes in the module.

Materials and Methods

Microarray data. Microarray data comprising 18 SCC samples and 40 AC samples were downloaded from the GEO database (GSE10245). This data set was used for gene expression profile analysis and for building the support vector machine classifier. The test data for the classifier were also downloaded from the GEO database (GSE19804). This set contains 60 NSCLC samples (56 adenocarcinoma samples, 3 bronchioloalveolar carcinoma samples, and 1 squamous cell carcinoma sample). Both data sets provide normalized counts and were collected using the Affymetrix Human Genome U133 Plus 2.0 Array (GPL570) (8, 16).

Statistical methods for identifying differentially expressed genes. A one-way ANOVA test was used to compare the mean normalized counts between AC and SCC samples in the first data set. Statistical analysis was performed in R, and a p-value smaller than 0.01 was considered statistically significant.
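The per-gene comparison described above can be illustrated with a minimal sketch of the one-way ANOVA F statistic. This is not the authors' R code: the function name and the expression values are hypothetical, and the p-value step (looking up F in the F distribution with the returned degrees of freedom) is left out to keep the example dependency-free.

```python
from statistics import mean

def one_way_anova_f(groups):
    """Return (F, df_between, df_within) for a list of sample groups."""
    all_values = [x for g in groups for x in g]
    grand_mean = mean(all_values)
    # Between-group sum of squares: group size times squared offset of the group mean.
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    # Within-group sum of squares: squared deviations from each group's own mean.
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)
    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    # Assumes ss_within > 0, i.e. the values are not constant within every group.
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within

# Hypothetical normalized values for a single probe set (not taken from GSE10245).
ac_values = [1.0, 2.0, 3.0]
scc_values = [4.0, 5.0, 6.0]
f_stat, df1, df2 = one_way_anova_f([ac_values, scc_values])
```

With only two groups, as here, the F statistic equals the square of the pooled-variance two-sample t statistic, so the test is equivalent to a two-sided t-test.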

Functional enrichment analysis for differentially expressed genes. We used the DAVID online resource to perform the gene-GO term enrichment analysis for the differentially expressed genes between AC and SCC (17, 18). The differentially expressed genes were classified as AC up-regulated genes and SCC up-regulated genes.
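As a rough illustration of what a gene-GO term enrichment test measures, the sketch below computes a plain hypergeometric upper-tail probability. It is an assumption-laden stand-in, not DAVID's implementation: DAVID reports a modified Fisher exact P-value (the EASE score), and the function name and the small counts used here are hypothetical.

```python
from math import comb

def hypergeom_upper_tail(k, n, K, N):
    """P(X >= k): chance of seeing at least k annotated genes in a list of n,
    when K of the N background genes carry the annotation."""
    total = comb(N, n)
    upper = min(n, K)
    hits = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return hits / total

# Hypothetical toy numbers: 20 background genes, 5 annotated with a GO term,
# and a list of 5 differentially expressed genes of which 3 carry the term.
p_value = hypergeom_upper_tail(k=3, n=5, K=5, N=20)
```

A small probability means the term appears in the gene list more often than random sampling from the background would explain.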

Functional module network construction. The differentially expressed genes mapped by DAVID were used for building a functional module network. BiNGO was used to cluster genes into functional modules and create the biological network based on these modules (19). MCODE was used to find the highly interconnected nodes in the networks (20).

Building a support vector machine classifier. The support vector machine classifier for AC and SCC was built with LIBSVM (21). The LIBSVM parameters were optimized with grid search and 5-fold cross-validation. Only differentially expressed genes mapped by the DAVID database were used as SVM features. We selected a linear kernel for the SVM classifier because the number of samples is much smaller than the number of features (22). Both the training and test data sets were normalized to the range [0, 1].
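Two of the preprocessing steps named above, scaling to [0, 1] and 5-fold cross-validation, can be sketched in plain Python. The helpers below are hypothetical and are not LIBSVM's API. The sketch also assumes that the test set is scaled with the ranges learned from the training set; the text does not state whether the authors did this or scaled each set separately.

```python
def fit_min_max(rows):
    """Per-feature (min, max) pairs learned from the training rows."""
    return [(min(col), max(col)) for col in zip(*rows)]

def apply_min_max(rows, ranges):
    """Map each feature using the training ranges; constant features map to 0."""
    scaled = []
    for row in rows:
        scaled.append([
            (x - lo) / (hi - lo) if hi > lo else 0.0
            for x, (lo, hi) in zip(row, ranges)
        ])
    return scaled

def stratified_folds(labels, k=5):
    """Assign sample indices to k folds, dealing each class out round-robin."""
    folds = [[] for _ in range(k)]
    for cls in sorted(set(labels)):
        members = [i for i, y in enumerate(labels) if y == cls]
        for pos, idx in enumerate(members):
            folds[pos % k].append(idx)
    return folds

# Hypothetical two-feature toy data to show the scaling step.
train = [[1.0, 10.0], [3.0, 30.0]]
ranges = fit_min_max(train)
scaled_test = apply_min_max([[2.0, 40.0]], ranges)

# Class sizes from the training set described in the text: 18 SCC and 40 AC.
labels = ["SCC"] * 18 + ["AC"] * 40
folds = stratified_folds(labels, k=5)
fold_sizes = [len(f) for f in folds]
```

Under this assumption a test value can fall outside [0, 1] when it lies beyond the training range, as the second feature does here. A grid search would repeat training over candidate parameter values, holding out each fold in turn and keeping the setting with the best mean held-out accuracy.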

Figure 3.

The highly interconnected modules extracted from the molecular function network. Circles represent modules and arrows represent sub-network flow. The number beside SCC is the number of SCC up-regulated genes in the module, and the number beside AC is the number of AC up-regulated genes in the module.

Results

Differentially expressed genes between AC and SCC. Using the normalized gene expression data from the first microarray data set, we identified 3,544 genes differentially expressed between AC and SCC by ANOVA (p-value <0.01). According to their expression levels in AC and SCC, we classified them into two categories: AC up-regulated genes and SCC up-regulated genes. Of the 3,544 differentially expressed genes, 1,485 are up-regulated in AC and 2,059 are up-regulated in SCC.

DAVID analysis of differentially expressed genes. To understand the possible functions of these differentially expressed genes, we performed GO term enrichment analysis using the DAVID functional annotation software (17, 18). Of the 3,544 differentially expressed genes, 2,857 could be mapped in the DAVID database: 1,216 are AC up-regulated genes and 1,641 are SCC up-regulated genes. The functional enrichment result is shown in Table I. According to this result, the differentially expressed genes between AC and SCC are mainly involved in cell cycle, condensed chromosome, purine nucleotide binding and DNA replication.
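The gene counts reported in the two paragraphs above can be cross-checked with a few lines of arithmetic. The variable names are illustrative only; the numbers are the ones stated in the text.

```python
total_deg = 3544                        # differentially expressed genes
up = {"AC": 1485, "SCC": 2059}          # up-regulated genes per subtype
mapped = {"AC": 1216, "SCC": 1641}      # genes mapped by DAVID per subtype

counts_consistent = sum(up.values()) == total_deg
mapped_total = sum(mapped.values())
mapped_fraction = mapped_total / total_deg
# Share of each subtype's up-regulated genes that DAVID could annotate.
mapped_share = {name: mapped[name] / up[name] for name in up}
```

The per-subtype counts add up to the reported totals, and roughly 81% of the differentially expressed genes were mapped by DAVID.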

Figure 4.

ROC analysis of the support vector machine classifier built on 2,857 mapped differentially expressed genes between AC and SCC.

Functional module network analysis of differentially expressed genes. Although the DAVID analysis yielded information about the possible functions of these differentially expressed genes, it is still unclear how these functions contribute to the unique biological profile of AC or SCC. To investigate the relationship between these functions and the molecular basis of AC or SCC, we constructed the biological interaction networks represented by these functions using a Cytoscape plugin, BiNGO (19). Networks of biological process, cellular component and molecular function were constructed. However, these networks are too large and too complex to analyze directly (data not shown). We therefore used another Cytoscape plugin, MCODE, to refine them (20). The highly interconnected nodes extracted with MCODE represent the important functional sub-networks of the whole network. We identified two functional sub-networks in the biological process network, one in the cellular component network and one in the molecular function network (Figures 1, 2 and 3). The sub-networks in the biological process network are responsible for DNA replication and mitosis (Figure 1A and B) and are mainly composed of SCC up-regulated genes. The highly interconnected modules in the cellular component network consist of intracellular organelles that ultimately contribute to the spindle apparatus, and the highly interconnected modules in the molecular function network carry out purine and adenyl nucleotide binding (Figures 2 and 3). About two-thirds of the genes involved in these modules are up-regulated in SCC. In all important sub-networks, more genes are up-regulated in SCC than in AC. We also searched for modules with more AC up-regulated genes than SCC up-regulated ones; the result is shown in Table II. Although they are not highly interconnected with other modules, these modules show that AC is more active in the expression of genes related to protein transport, endoplasmic reticulum, Golgi apparatus, and cell junction.

Table I.

DAVID functional annotation analysis of differentially expressed genes between AC and SCC.

The performance of the support vector machine classifier. The functional module network analysis shows that SCC up-regulated genes mainly take part in DNA replication and the cell cycle, while AC up-regulated genes are mainly involved in protein transport and cell junction. These genes are therefore useful features for separating the two major subtypes of NSCLC. We used the 2,857 DAVID-mapped genes to build the support vector machine classifier for SCC and AC, with the 18 SCC samples and 40 AC samples from GSE10245 as the training data set. We selected a linear kernel because the number of genes is much larger than the number of samples. First, we used the training data set to test the predictive accuracy of the classifier; the classification accuracy was 100%. Second, we tested the classifier on an independent microarray data set comprising 56 adenocarcinoma samples, 3 bronchioloalveolar carcinoma samples, and 1 squamous cell carcinoma sample. On these samples, the classifier achieved an AUC of 0.9831 in the ROC analysis (Figure 4). It correctly identified the AC and SCC samples, and predicted 2 bronchioloalveolar carcinoma samples as AC and 1 bronchioloalveolar carcinoma sample as SCC. Bronchioloalveolar carcinoma is usually considered a subtype of lung adenocarcinoma (23, 24), so the predictive accuracy of the classifier on AC is 98.3% (58/59). This result supports its possible value in AC and SCC diagnosis.
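The ROC figures above can be put in context with a small sketch of how AUC is computed when the test set holds a single SCC sample. The decision values below are invented for illustration and are not the classifier's actual outputs; the sketch also assumes that bronchioloalveolar carcinoma samples are counted with AC. Whether the reported AUC arose from exactly this ranking is not stated in the text.

```python
def roc_auc(scores, labels, positive):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.
    Tied pairs count as half."""
    pos = [s for s, y in zip(scores, labels) if y == positive]
    neg = [s for s, y in zip(scores, labels) if y != positive]
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))

# Hypothetical decision values, higher meaning "more SCC-like":
# one SCC sample, one AC-group sample scored above it, 58 scored below it.
labels = ["SCC"] + ["AC"] * 59
scores = [0.9] + [1.2] + [-1.0] * 58
auc = roc_auc(scores, labels, positive="SCC")

# Accuracy on the AC group as reported: 58 of 59 samples called AC.
accuracy_ac = 58 / 59
```

With one positive sample, the AUC is simply the share of the 59 other samples ranked below it, so a single misranked sample yields 58/59, which rounds to 0.9831.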

Discussion

The identification of genes differentially expressed between AC and SCC could help elucidate their different oncogenic mechanisms. Our study shows that more of the differentially expressed genes are up-regulated in SCC than in AC. This suggests that the pathological process underlying SCC is more complex than that of AC. The pathogenesis of SCC might require more steps of somatic mutation which, in turn, recruit more abnormally expressed genes. Clinical statistics show that SCC is less common than AC among NSCLCs (2, 3), which circumstantially supports this view. Males with a history of tobacco use are more susceptible to SCC, while AC is the most common type of lung cancer in non-smokers (4, 5). Carcinogens in tobacco smoke, such as benzopyrene, could cause and facilitate the transformation of normal cells into SCC cells. Although the extent to which genetic factors contribute to AC development is unknown, they do play a part in the carcinogenesis of AC (25, 26).

Table II.

The functional modules in biological networks with more AC up-regulated genes than SCC up-regulated ones.

In the present study, the majority of the identified differentially expressed genes were annotated by the DAVID database. The DAVID analysis shows that their biological process, cellular component, molecular function, and KEGG pathway annotations mainly involve cell cycle, condensed chromosome, purine nucleotide binding, and DNA replication, respectively. Since one major feature of cancer is uncontrolled proliferation, it is expected that their functions are related to cell division. The functional module network analysis of these genes revealed more detailed information about the difference between SCC and AC. In all highly interconnected modules of the biological process, cellular component, and molecular function networks, there are more SCC up-regulated genes than AC up-regulated ones. Their elevated expression in SCC indicates that SCC has a faster rate of DNA replication and cell division than AC, which is consistent with a previous study (27). In the cellular component network, the organelle genes form a sub-network that ultimately contributes to the spindle apparatus. Because the spindle apparatus is indispensable for cell division, the high expression of spindle genes in SCC is consistent with its hyperactivity in cell division. The modules in the molecular function sub-network are responsible for nucleotide and ribonucleotide binding. The fact that SCC has more up-regulated genes in these modules suggests a fast rate of DNA synthesis. On the other hand, AC up-regulated genes are more concentrated in the modules related to protein transport, endoplasmic reticulum, Golgi apparatus, and cell junction. While these modules do not form an interconnected sub-network, they still suggest that AC might result from abnormal expression of cell-cell interaction genes and cell junction genes. The functional module network analysis thus indicates that AC and SCC carcinogenesis have different molecular bases. Although the crucial genes influencing the process of AC or SCC carcinogenesis remain unknown, our study suggests that different drugs and treatment strategies should be considered for different NSCLC subtypes in lung cancer therapy.

The SVM classifier built on the DAVID-mapped genes shows a high accuracy for identifying AC samples. Because the test data set contains only one SCC sample, its predictive accuracy for SCC still needs to be evaluated. We nevertheless have confidence in the predictive power of the classifier: when tested with the training data set, its predictive accuracy for both AC and SCC was 100%. Its performance on bronchioloalveolar carcinoma is intriguing. It classified 2 bronchioloalveolar carcinoma samples as AC and 1 bronchioloalveolar carcinoma sample as SCC. Bronchioloalveolar carcinoma is commonly regarded as a subtype of AC (23, 24). However, our result suggests that this might not be the case. If our result is correct, bronchioloalveolar carcinoma should be viewed as a mix of several lung cancer variants rather than a single variant of lung cancer. Further studies are required to elucidate the pathological classification of this less common type of lung cancer.

In conclusion, our analysis showed that 3,544 genes are differentially expressed between AC and SCC, and that SCC has a greater number of up-regulated differentially expressed genes than AC. The functional enrichment analysis shows that these genes are mainly involved in cell cycle and DNA replication, and the functional module network analysis indicates that SCC and AC have different molecular bases and biological profiles. SCC has an elevated expression of genes related to cell division and DNA replication, while AC has an elevated expression of genes related to protein transport and cell junction. These results indicate that SCC and AC have different pathological mechanisms. Further investigations are required to identify the genes and molecular pathways controlling these mechanisms before they can be exploited as bases for NSCLC treatment. We also used the differentially expressed genes to build a support vector machine classifier for SCC and AC. It demonstrates high predictive accuracy for AC and has potential value in NSCLC diagnosis.

Received August 15, 2014.
Revision received September 29, 2014.
Accepted October 1, 2014.

References

1. She J, Yang P, Hong Q, Bai C: Lung cancer in China: challenges and interventions. Chest 143: 1117–1126, 2013.
2. Nugent WC, Edney MT, Hammerness PG, Dain BJ, Maurer LH, Rigas JR: Non-small cell lung cancer at the extremes of age: impact on diagnosis and treatment. The Annals of thoracic surgery 63: 193–197, 1997.
3. Kenfield SA, Wei EK, Stampfer MJ, Rosner BA, Colditz GA: Comparison of aspects of smoking among the four histological types of lung cancer. Tobacco control 17: 198–204, 2008.
4. Subramanian J, Govindan R: Lung cancer in never smokers: a review. Journal of clinical oncology: official journal of the American Society of Clinical Oncology 25: 561–570, 2007.
5. Wood ME, Kelly K, Mullineaux LG, Bunn PA Jr: The inherited nature of lung cancer: a pilot study. Lung Cancer 30: 135–144, 2000.
6. Kikuchi T, Daigo Y, Katagiri T, Tsunoda T, Okada K, Kakiuchi S, Zembutsu H, Furukawa Y, Kawamura M, Kobayashi K, Imai K, Nakamura Y: Expression profiles of non-small cell lung cancers on cDNA microarrays: identification of genes for prediction of lymph-node metastasis and sensitivity to anti-cancer drugs. Oncogene 22: 2192–2205, 2003.
7. Tanney A, Oliver GR, Farztdinov V, Kennedy RD, Mulligan JM, Fulton CE, Farragher SM, Field JK, Johnston PG, Harkin DP, Proutski V, Mulligan KA: Generation of a non-small cell lung cancer transcriptome microarray. BMC medical genomics 1: 20, 2008.
8. Lu TP, Tsai MH, Lee JM, Hsu CP, Chen PC, Lin CW, Shih JY, Yang PC, Hsiao CK, Lai LC, Chuang EY: Identification of a novel biomarker, SEMA5A, for non-small cell lung carcinoma in nonsmoking women. Cancer epidemiology, biomarkers & prevention: a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology 19: 2590–2597, 2010.
9. Botling J, Edlund K, Lohr M, Hellwig B, Holmberg L, Lambe M, Berglund A, Ekman S, Bergqvist M, Ponten F, Konig A, Fernandes O, Karlsson M, Helenius G, Karlsson C, Rahnenfuhrer J, Hengstler JG, Micke P: Biomarker discovery in non-small cell lung cancer: integrating gene expression profiling, meta-analysis, and tissue microarray validation. Clinical cancer research: an official journal of the American Association for Cancer Research 19: 194–204, 2013.
10. Au NH, Gown AM, Cheang M, Huntsman D, Yorida E, Elliott WM, Flint J, English J, Gilks CB, Grimes HL: P63 expression in lung carcinoma: a tissue microarray study of 408 cases. Applied immunohistochemistry & molecular morphology: AIMM/official publication of the Society for Applied Immunohistochemistry 12: 240–247, 2004.
11. Shi I, Hashemi Sadraei N, Duan ZH, Shi T: Aberrant signaling pathways in squamous cell lung carcinoma. Cancer informatics 10: 273–285, 2011.
12. Ibrahim R, Matsubara D, Osman W, Morikawa T, Goto A, Morita S, Ishikawa S, Aburatani H, Takai D, Nakajima J, Fukayama M, Niki T, Murakami Y: Expression of PRMT5 in lung adenocarcinoma and its significance in epithelial-mesenchymal transition. Human pathology 45: 1397–1405, 2014.
13. Seike M, Yanaihara N, Bowman ED, Zanetti KA, Budhu A, Kumamoto K, Mechanic LE, Matsumoto S, Yokota J, Shibata T, Sugimura H, Gemma A, Kudoh S, Wang XW, Harris CC: Use of a cytokine gene expression signature in lung adenocarcinoma and the surrounding tissue as a prognostic classifier. Journal of the National Cancer Institute 99: 1257–1269, 2007.
14. Mountain CF, Lukeman JM, Hammar SP, Chamberlain DW, Coulson WF, Page DL, Victor TA, Weiland LH: Lung cancer classification: the relationship of disease extent and cell type to survival in a clinical trials population. Journal of surgical oncology 35: 147–156, 1987.
15. Naruke T, Goya T, Tsuchiya R, Suemasu K: Prognosis and survival in resected lung carcinoma based on the new international staging system. The Journal of thoracic and cardiovascular surgery 96: 440–447, 1988.
16. Kuner R, Muley T, Meister M, Ruschhaupt M, Buness A, Xu EC, Schnabel P, Warth A, Poustka A, Sultmann H, Hoffmann H: Global gene expression analysis reveals specific patterns of cell junctions in non-small cell lung cancer subtypes. Lung Cancer 63: 32–38, 2009.
17. Huang da W, Sherman BT, Lempicki RA: Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic acids research 37: 1–13, 2009.
18. Huang da W, Sherman BT, Lempicki RA: Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nature protocols 4: 44–57, 2009.
19. Maere S, Heymans K, Kuiper M: BiNGO: a Cytoscape plugin to assess overrepresentation of gene ontology categories in biological networks. Bioinformatics 21: 3448–3449, 2005.
20. Rivera CG, Vakil R, Bader JS: NeMo: Network Module identification in Cytoscape. BMC bioinformatics 11(Suppl 1): S61, 2010.
21. Chang CC, Lin CJ: LIBSVM: a library for support vector machines. ACM Transactions on Intelligent Systems and Technology 2: 1–27, 2011.
22. Fan RE, Chang KW, Hsieh CJ, Wang XR, Lin CJ: LIBLINEAR: A library for large linear classification. Journal of Machine Learning Research: 1871–1874, 2008.
23. Dermer GB: Origin of bronchioloalveolar carcinoma and peripheral bronchial adenocarcinoma. Cancer 49: 881–887, 1982.
24. Raz DJ, Kim JY, Jablons DM: Diagnosis and treatment of bronchioloalveolar carcinoma. Current opinion in pulmonary medicine 13: 290–296, 2007.
25. Hsiung CA, Lan Q, Hong YC, Chen CJ, Hosgood HD, Chang IS, Chatterjee N, Brennan P, Wu C, Zheng W, Chang GC, Wu T, Park JY, Hsiao CF, Kim YH, Shen H, Seow A, Yeager M, Tsai YH, Kim YT, Chow WH, Guo H, Wang WC, Sung SW, Hu Z, Chen KY, Kim JH, Chen Y, Huang L, Lee KM, Lo YL, Gao YT, Liu L, Huang MS, Jung TH, Jin G, Caporaso N, Yu D, Kim CH, Su WC, Shu XO, Xu P, Kim IS, Chen YM, Ma H, Shen M, Cha SI, Tan W, Chang CH, Sung JS, Zhang M, Yang TY, Park KH, Yuenger J, Wang CL, Ryu JS, Xiang Y, Deng Q, Hutchinson A, Kim JS, Cai Q, Landi MT, Yu CJ, Tucker M, Hung JY, Lin CC, Perng RP, Boffetta P, Chen CY, Chen KC, Yang SY, Hu CY, Chang CK, Fraumeni JF Jr, Chanock S, Yang PC, Rothman N, Lin D: The 5p15.33 locus is associated with risk of lung adenocarcinoma in never-smoking females in Asia. PLoS genetics 6, 2010.
26. Hosgood HD 3rd, Wang WC, Hong YC, Wang JC, Chen K, Chang IS, Chen CJ, Lu D, Yin Z, Wu C, Zheng W, Qian B, Park JY, Kim YH, Chatterjee N, Chen Y, Chang GC, Hsiao CF, Yeager M, Tsai YH, Wei H, Kim YT, Wu W, Zhao Z, Chow WH, Zhu X, Lo YL, Sung SW, Chen KY, Yuenger J, Kim JH, Huang L, Chen YH, Gao YT, Huang MS, Jung TH, Caporaso N, Zhao X, Huan Z, Yu D, Kim CH, Su WC, Shu XO, Kim IS, Bassig B, Chen YM, Cha SI, Tan W, Chen H, Yang TY, Sung JS, Wang CL, Li X, Park KH, Yu CJ, Ryu JS, Xiang Y, Hutchinson A, Kim JS, Cai Q, Landi MT, Lee KM, Hung JY, Tucker M, Lin CC, Ren Y, Perng RP, Chen CY, Jin L, Chen KC, Li YJ, Chiu YF, Tsai FY, Yang PC, Fraumeni JF Jr, Seow A, Lin D, Zhou B, Chanock S, Hsiung CA, Rothman N, Lan Q: Genetic variant in TP63 on locus 3q28 is associated with risk of lung adenocarcinoma among never-smoking females in Asia. Human genetics 131: 1197–1203, 2012.
27. Inamura K, Fujiwara T, Hoshida Y, Isagawa T, Jones MH, Virtanen C, Shimane M, Satoh Y, Okumura S, Nakagawa K, Tsuchiya E, Ishikawa S, Aburatani H, Nomura H, Ishikawa Y: Two subclasses of lung squamous cell carcinoma with different gene expression profiles and prognosis identified by hierarchical clustering and non-negative matrix factorization. Oncogene 24: 7105–7113, 2005.

In this issue

Download PDF

Article Alerts

Email Article

Citation Tools

Reprints and Permissions

Cited By...

Google Scholar

Keywords

[1] ↵
She J,
Yang P,
Hong Q,
Bai C
: Lung cancer in China: challenges and interventions. Chest 143: 1117–1126, 2013.
OpenUrl PubMed

[2] She J,

[3] Yang P,

[4] Hong Q,

[5] Bai C

[6] ↵
Nugent WC,
Edney MT,
Hammerness PG,
Dain BJ,
Maurer LH,
Rigas JR
: Non-small cell lung cancer at the extremes of age: impact on diagnosis and treatment. The Annals of thoracic surgery 63: 193–197, 1997.
OpenUrl CrossRef PubMed

[7] Nugent WC,

[8] Edney MT,

[9] Hammerness PG,

[10] Dain BJ,

[11] Maurer LH,

[12] Rigas JR

[13] ↵
Kenfield SA,
Wei EK,
Stampfer MJ,
Rosner BA,
Colditz GA
: Comparison of aspects of smoking among the four histological types of lung cancer. Tobacco control 17: 198–204, 2008.
OpenUrl Abstract/FREE Full Text

[14] Kenfield SA,

[15] Wei EK,

[16] Stampfer MJ,

[17] Rosner BA,

[18] Colditz GA

[19] ↵
Subramanian J,
Govindan R
: Lung cancer in never smokers: a review. Journal of clinical oncology: official journal of the American Society of Clinical Oncology 25: 561–570, 2007.
OpenUrl PubMed

[20] Subramanian J,

[21] Govindan R

[22] ↵
Wood ME,
Kelly K,
Mullineaux LG,
Bunn PA Jr..
: The inherited nature of lung cancer: a pilot study. Lung Cancer 30: 135–144, 2000.
OpenUrl CrossRef PubMed

[23] Wood ME,

[24] Kelly K,

[25] Mullineaux LG,

[26] Bunn PA Jr..

[27] ↵
Kikuchi T,
Daigo Y,
Katagiri T,
Tsunoda T,
Okada K,
Kakiuchi S,
Zembutsu H,
Furukawa Y,
Kawamura M,
Kobayashi K,
Imai K,
Nakamura Y
: Expression profiles of non-small cell lung cancers on cDNA microarrays: identification of genes for prediction of lymph-node metastasis and sensitivity to anti-cancer drugs. Oncogene 22: 2192–2205, 2003.
OpenUrl CrossRef PubMed

[28] Kikuchi T,

[29] Daigo Y,

[30] Katagiri T,

[31] Tsunoda T,

[32] Okada K,

[33] Kakiuchi S,

[34] Zembutsu H,

[35] Furukawa Y,

[36] Kawamura M,

[37] Kobayashi K,

[38] Imai K,

[39] Nakamura Y

[40] ↵
Tanney A,
Oliver GR,
Farztdinov V,
Kennedy RD,
Mulligan JM,
Fulton CE,
Farragher SM,
Field JK,
Johnston PG,
Harkin DP,
Proutski V,
Mulligan KA
: Generation of a non-small cell lung cancer transcriptome microarray. BMC medical genomics 1: 20, 2008.
OpenUrl PubMed

[41] Tanney A,

[42] Oliver GR,

[43] Farztdinov V,

[44] Kennedy RD,

[45] Mulligan JM,

[46] Fulton CE,

[47] Farragher SM,

[48] Field JK,

[49] Johnston PG,

[50] Harkin DP,

[51] Proutski V,

[52] Mulligan KA

[53] ↵
Lu TP,
Tsai MH,
Lee JM,
Hsu CP,
Chen PC,
Lin CW,
Shih JY,
Yang PC,
Hsiao CK,
Lai LC,
Chuang EY
: Identification of a novel biomarker, SEMA5A, for non-small cell lung carcinoma in nonsmoking women. Cancer epidemiology, biomarkers & prevention: a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology 19: 2590–2597, 2010.
OpenUrl

[54] Lu TP,

[55] Tsai MH,

[56] Lee JM,

[57] Hsu CP,

[58] Chen PC,

[59] Lin CW,

[60] Shih JY,

[61] Yang PC,

[62] Hsiao CK,

[63] Lai LC,

[64] Chuang EY

[65] ↵
Botling J,
Edlund K,
Lohr M,
Hellwig B,
Holmberg L,
Lambe M,
Berglund A,
Ekman S,
Bergqvist M,
Ponten F,
Konig A,
Fernandes O,
Karlsson M,
Helenius G,
Karlsson C,
Rahnenfuhrer J,
Hengstler JG,
Micke P
: Biomarker discovery in non-small cell lung cancer: integrating gene expression profiling, meta-analysis, and tissue microarray validation. Clinical cancer research: an official journal of the American Association for Cancer Research 19: 194–204, 2013.
OpenUrl PubMed

[66] Botling J,

[67] Edlund K,

[68] Lohr M,

[69] Hellwig B,

[70] Holmberg L,

[71] Lambe M,

[72] Berglund A,

[73] Ekman S,

[74] Bergqvist M,

[75] Ponten F,

[76] Konig A,

[77] Fernandes O,

[78] Karlsson M,

[79] Helenius G,

[80] Karlsson C,

[81] Rahnenfuhrer J,

[82] Hengstler JG,

[83] Micke P

[84] ↵
Au NH,
Gown AM,
Cheang M,
Huntsman D,
Yorida E,
Elliott WM,
Flint J,
English J,
Gilks CB,
Grimes HL
: P63 expression in lung carcinoma: a tissue microarray study of 408 cases. Applied immunohistochemistry & molecular morphology: AIMM/official publication of the Society for Applied Immunohistochemistry 12: 240–247, 2004.
OpenUrl

[85] Au NH,

[86] Gown AM,

[87] Cheang M,

[88] Huntsman D,

[89] Yorida E,

[90] Elliott WM,

[91] Flint J,

[92] English J,

[93] Gilks CB,

[94] Grimes HL

[95] Shi I,
Hashemi Sadraei N,
Duan ZH,
Shi T
: Aberrant signaling pathways in squamous cell lung carcinoma. Cancer informatics 10: 273–285, 2011.
OpenUrl PubMed

[96] Shi I,

[97] Hashemi Sadraei N,

[98] Duan ZH,

[99] Shi T

[100] Ibrahim R,
Matsubara D,
Osman W,
Morikawa T,
Goto A,
Morita S,
Ishikawa S,
Aburatani H,
Takai D,
Nakajima J,
Fukayama M,
Niki T,
Murakami Y
: Expression of PRMT5 in lung adenocarcinoma and its significance in epithelial-mesenchymal transition. Human pathology 45: 1397–1405, 2014.
OpenUrl PubMed

Seike M, Yanaihara N, Bowman ED, Zanetti KA, Budhu A, Kumamoto K, Mechanic LE, Matsumoto S, Yokota J, Shibata T, Sugimura H, Gemma A, Kudoh S, Wang XW, Harris CC: Use of a cytokine gene expression signature in lung adenocarcinoma and the surrounding tissue as a prognostic classifier. Journal of the National Cancer Institute 99: 1257–1269, 2007.

Mountain CF, Lukeman JM, Hammar SP, Chamberlain DW, Coulson WF, Page DL, Victor TA, Weiland LH: Lung cancer classification: the relationship of disease extent and cell type to survival in a clinical trials population. Journal of surgical oncology 35: 147–156, 1987.

Naruke T, Goya T, Tsuchiya R, Suemasu K: Prognosis and survival in resected lung carcinoma based on the new international staging system. The Journal of thoracic and cardiovascular surgery 96: 440–447, 1988.

Kuner R, Muley T, Meister M, Ruschhaupt M, Buness A, Xu EC, Schnabel P, Warth A, Poustka A, Sultmann H, Hoffmann H: Global gene expression analysis reveals specific patterns of cell junctions in non-small cell lung cancer subtypes. Lung Cancer 63: 32–38, 2009.

Huang da W, Sherman BT, Lempicki RA: Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic acids research 37: 1–13, 2009.

Huang da W, Sherman BT, Lempicki RA: Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nature protocols 4: 44–57, 2009.

Maere S, Heymans K, Kuiper M: BiNGO: a Cytoscape plugin to assess overrepresentation of gene ontology categories in biological networks. Bioinformatics 21: 3448–3449, 2005.

Rivera CG, Vakil R, Bader JS: NeMo: Network Module identification in Cytoscape. BMC bioinformatics 11(Suppl 1): S61, 2010.

Chang C-C, Lin C-J: LIBSVM: a library for support vector machines. ACM Transactions on Intelligent Systems and Technology 2: 1–27, 2011.

Fan R-E, Chang K-W, Hsieh C-J, Wang X-R, Lin C-J: LIBLINEAR: A library for large linear classification. Journal of Machine Learning Research 9: 1871–1874, 2008.

Dermer GB: Origin of bronchioloalveolar carcinoma and peripheral bronchial adenocarcinoma. Cancer 49: 881–887, 1982.

Raz DJ, Kim JY, Jablons DM: Diagnosis and treatment of bronchioloalveolar carcinoma. Current opinion in pulmonary medicine 13: 290–296, 2007.

Hsiung CA, Lan Q, Hong YC, Chen CJ, Hosgood HD, Chang IS, Chatterjee N, Brennan P, Wu C, Zheng W, Chang GC, Wu T, Park JY, Hsiao CF, Kim YH, Shen H, Seow A, Yeager M, Tsai YH, Kim YT, Chow WH, Guo H, Wang WC, Sung SW, Hu Z, Chen KY, Kim JH, Chen Y, Huang L, Lee KM, Lo YL, Gao YT, Liu L, Huang MS, Jung TH, Jin G, Caporaso N, Yu D, Kim CH, Su WC, Shu XO, Xu P, Kim IS, Chen YM, Ma H, Shen M, Cha SI, Tan W, Chang CH, Sung JS, Zhang M, Yang TY, Park KH, Yuenger J, Wang CL, Ryu JS, Xiang Y, Deng Q, Hutchinson A, Kim JS, Cai Q, Landi MT, Yu CJ, Tucker M, Hung JY, Lin CC, Perng RP, Boffetta P, Chen CY, Chen KC, Yang SY, Hu CY, Chang CK, Fraumeni JF Jr, Chanock S, Yang PC, Rothman N, Lin D: The 5p15.33 locus is associated with risk of lung adenocarcinoma in never-smoking females in Asia. PLoS genetics 6, 2010.

Hosgood HD 3rd, Wang WC, Hong YC, Wang JC, Chen K, Chang IS, Chen CJ, Lu D, Yin Z, Wu C, Zheng W, Qian B, Park JY, Kim YH, Chatterjee N, Chen Y, Chang GC, Hsiao CF, Yeager M, Tsai YH, Wei H, Kim YT, Wu W, Zhao Z, Chow WH, Zhu X, Lo YL, Sung SW, Chen KY, Yuenger J, Kim JH, Huang L, Chen YH, Gao YT, Huang MS, Jung TH, Caporaso N, Zhao X, Huan Z, Yu D, Kim CH, Su WC, Shu XO, Kim IS, Bassig B, Chen YM, Cha SI, Tan W, Chen H, Yang TY, Sung JS, Wang CL, Li X, Park KH, Yu CJ, Ryu JS, Xiang Y, Hutchinson A, Kim JS, Cai Q, Landi MT, Lee KM, Hung JY, Tucker M, Lin CC, Ren Y, Perng RP, Chen CY, Jin L, Chen KC, Li YJ, Chiu YF, Tsai FY, Yang PC, Fraumeni JF Jr, Seow A, Lin D, Zhou B, Chanock S, Hsiung CA, Rothman N, Lan Q: Genetic variant in TP63 on locus 3q28 is associated with risk of lung adenocarcinoma among never-smoking females in Asia. Human genetics 131: 1197–1203, 2012.

Inamura K, Fujiwara T, Hoshida Y, Isagawa T, Jones MH, Virtanen C, Shimane M, Satoh Y, Okumura S, Nakagawa K, Tsuchiya E, Ishikawa S, Aburatani H, Nomura H, Ishikawa Y: Two subclasses of lung squamous cell carcinoma with different gene expression profiles and prognosis identified by hierarchical clustering and non-negative matrix factorization. Oncogene 24: 7105–7113, 2005.
