Identification of early-stage lung adenocarcinoma prognostic signatures based on statistical modeling

Cancer Biomark. 2017;18(2):117-123. doi: 10.3233/CBM-151368.

Abstract

Background: Current staging methods are lack of precision in predicting prognosis of early-stage lung adenocarcinomas.

Objective: We aimed to develop a gene expression signature to identify high- and low-risk groups of patients.

Methods: We used the Bayesian Model Averaging algorithm to analyze the DNA microarray data from 442 lung adenocarcinoma patients from three independent cohorts, one of which was used for training.

Results: The patients were assigned to either high- or low-risk groups based on the calculated risk scores based on the identified 25-gene signature. The prognostic power was evaluated using Kaplan-Meier analysis and the log-rank test. The testing sets were divided into two distinct groups with log-rank test p-values of 0.00601 and 0.0274 respectively.

Conclusions: Our results show that the prognostic models could successfully predict patients' outcome and serve as biomarkers for early-stage lung adenocarcinoma overall survival analysis.

Keywords: Bayesian Model Averaging; Lung adenocarcinoma; gene expression; prognosis.

MeSH terms

  • Adenocarcinoma / genetics*
  • Adenocarcinoma / mortality*
  • Adenocarcinoma / pathology
  • Adenocarcinoma of Lung
  • Aged
  • Algorithms
  • Bayes Theorem
  • Cohort Studies
  • Female
  • Humans
  • Kaplan-Meier Estimate
  • Lung Neoplasms / genetics*
  • Lung Neoplasms / mortality*
  • Lung Neoplasms / pathology
  • Male
  • Middle Aged
  • Models, Statistical*
  • Oligonucleotide Array Sequence Analysis / statistics & numerical data*