Somatic copy number profiling from hepatocellular carcinoma circulating tumor cells

Court, Colin M.; Hou, Shuang; Liu, Lian; Winograd, Paul; DiPardo, Benjamin J.; Liu, Sean X.; Chen, Pin-Jung; Zhu, Yazhen; Smalley, Matthew; Zhang, Ryan; Sadeghi, Saeed; Finn, Richard S.; Kaldas, Fady M.; Busuttil, Ronald W.; Zhou, Xianghong J.; Tseng, Hsian-Rong; Tomlinson, James S.; Graeber, Thomas G.; Agopian, Vatche G.

doi:10.1038/s41698-020-0123-0

Download PDF

Article
Open access
Published: 02 July 2020

Somatic copy number profiling from hepatocellular carcinoma circulating tumor cells

Colin M. Court ORCID: orcid.org/0000-0002-9357-8957^1,2,3,
Shuang Hou¹,
Lian Liu⁴,
Paul Winograd^1,2,
Benjamin J. DiPardo^1,2,
Sean X. Liu⁵,
Pin-Jung Chen⁵,
Yazhen Zhu ORCID: orcid.org/0000-0002-2130-8085⁵,
Matthew Smalley ORCID: orcid.org/0000-0002-0509-6502⁵,
Ryan Zhang⁵,
Saeed Sadeghi⁶,
Richard S. Finn⁶,
Fady M. Kaldas¹,
Ronald W. Busuttil^1,7,
Xianghong J. Zhou⁸,
Hsian-Rong Tseng ORCID: orcid.org/0000-0003-0942-5905^5,7,9,
James S. Tomlinson^1,2,9,
Thomas G. Graeber ORCID: orcid.org/0000-0001-8574-9181^3,5,7,9 &
…
Vatche G. Agopian^1,9

npj Precision Oncology volume 4, Article number: 16 (2020) Cite this article

3097 Accesses
11 Citations
6 Altmetric
Metrics details

Subjects

Abstract

Somatic copy number alterations (SCNAs) are important genetic drivers of many cancers. We investigated the feasibility of obtaining SCNA profiles from circulating tumor cells (CTCs) as a molecular liquid biopsy for hepatocellular carcinoma (HCC). CTCs from ten HCC patients underwent SCNA profiling. The Cancer Genome Atlas (TCGA) SCNA data were used to develop a cancer origin classification model, which was then evaluated for classifying 44 CTCs from multiple cancer types. Sequencing of 18 CTC samples (median: 4 CTCs/sample) from 10 HCC patients using a low-resolution whole-genome sequencing strategy (median: 0.88 million reads/sample) revealed frequent SCNAs in previously reported HCC regions such as 8q amplifications and 17p deletions. SCNA profiling revealed that CTCs share a median of 80% concordance with the primary tumor. CTCs had SCNAs not seen in the primary tumor, some with prognostic implications. Using a SCNA profiling model, the tissue of origin was correctly identified for 32/44 (73%) CTCs from 12/16 (75%) patients with different cancer types.

Genomic characterization of rare molecular subclasses of hepatocellular carcinoma

Article Open access 04 October 2021

Jeffrey S. Damrauer, Markia A. Smith, … Katherine A. Hoadley

Hydro-Seq enables contamination-free high-throughput single-cell RNA-sequencing for circulating tumor cells

Article Open access 15 May 2019

Yu-Heng Cheng, Yu-Chih Chen, … Euisik Yoon

Landscape of somatic alterations in large-scale solid tumors from an Asian population

Article Open access 23 July 2022

Liqun Wu, Herui Yao, … Minghui Wang

Introduction

Somatic copy number alterations (SCNAs) are found in 90% of solid tumors and are increasingly recognized as playing a vital role in activating oncogenes and inactivating tumor suppressors through changes in gene dosage and structure¹. Newer sequencing methods and large-scale genetic studies indicate that SCNAs affect a larger fraction of the genome in cancer than any other somatic alteration¹. In fact, SCNAs have recently been shown to provide the largest contribution to a pan-cancer tumor classification model, greater than that provided by transcriptome and methylome alterations². Research has also found that larger SCNAs, such as whole chromosome, arm, and cytoband length events, are likely more important than focal SCNAs in the development of cancer^1,3. These larger SCNAs are easily detectable using next-generation sequencing (NGS) techniques, and result in a very robust and reproducible signal⁴. These favorable characteristics make NGS-based SCNA profiling an ideal molecular study for limited template samples like fine needle aspirates or circulating tumor cells (CTCs).

CTCs, cells of tumor origin that circulate in the blood, are a promising new biomarker for many solid tumors^5,6. As potential metastatic precursors⁷, CTCs are thought to represent the subclones of the primary tumor that are more invasive^8,9. Their presence is associated with a higher risk of recurrence and mortality in many cancer types and across all stages of disease^5,6. Recently, advances in CTC isolation and sequencing has evolved to the point that CTCs may soon serve as a form of “liquid biopsy” for cancer patients, tests done on the blood to look for cancer cells from a tumor that are circulating in the blood⁸.

The majority of studies on the molecular characterization of CTCs in solid tumors have focused on the detection of actionable somatic point mutations through targeted or exome sequencing. However, due to the combination of cost, analysis time, and risk of false positives at typical exome sequencing depths, exome sequencing of CTCs has not seen significant clinical use. In contrast, SCNA profiling of CTCs via low-resolution NGS-based whole-genome sequencing (WGS) has recently been described and has the advantage of offering a robust signal at a significantly lower cost than exome sequencing⁴. Given the recent advances in our understanding of the importance of SCNAs for cancer prognosis and treatment, SCNA analysis of CTCs has significant potential as a biomarker^10,11,12,13.

To better understand the feasibility and potential of SCNA profiling of CTCs as a liquid biopsy, we attempted to address several important methodological questions. These included validating the tumor origin of the isolated CTCs, as well as demonstrating the reliability and reproducibility of the assays that are utilized for CTC characterization¹⁴. We recently developed a HCC-specific CTC isolation method and demonstrated its efficacy for isolating and enumerating CTCs in hepatocellular carcinoma (HCC)¹⁵. We have now developed a modification to that assay that allows for low-resolution WGS of the isolated CTCs using whole-genome amplification (WGA) and NGS. Using this methodology, our current study investigates NGS-based SCNA profiling of CTCs as a potential molecular biomarker for HCC patients.

To that end, we performed a pilot study to investigate SCNA profiling of HCC CTCs isolated immediately prior to surgical resection, and compared them to the SCNA profiles from the surgically resected HCC tumor tissue, peritumoral non-cancerous liver, and genomic DNA from pooled white blood cells. With a robust analysis of multiple DNA sources from individual patients, we aimed to extensively validate the tumor origin of the isolated CTCs, to establish that CTC-derived SCNA profiles can serve as surrogates of the underlying molecular alterations in the primary HCC tumor, and to demonstrate the utility of a CTC-derived SCNA assay as a clinically useful liquid biopsy in HCC.

Results

Patient characteristics and sample collection

Ten consecutive patients undergoing surgical resection of HCC from our ongoing prospective HCC biomarker protocol between March, 2016 and January, 2017 were included for analysis. The clinical, laboratory, radiologic, and treatment characteristics of the ten patients are summarized in Supplementary Table 1. Nine of the patients underwent resection of a primary liver tumor and one patient underwent resection of a retroperitoneal metastasis that developed 10 years after liver transplantation for HCC (patient H167). Nine of the patients had cirrhosis at the time of blood draw, with the majority of these patient’s cirrhosis resulting from HCV infection. All patients had DNA from whole blood, peritumoral normal liver, HCC tumor tissue, and CTCs available for molecular analysis (Fig. 1b).

**Fig. 1: Experimental design of the study.**

CTC enumeration and sequencing

CTC enumeration revealed a median of four (IQR: 2–11, range: 1–16) CTCs per 4-mL VB. Multiple displacement amplification-based WGA was successfully performed on CTC samples, and low-resolution WGS performed on a NextSeq 500. Sequencing reads for the CTC samples were processed using a variable bin algorithm specifically designed for low-resolution single cell copy number determination^4,16. We obtained an average of 0.88 million uniquely mapped reads (range: 0.51–1.9 million uniquely mapped reads) from the CTC samples, with an average depth of coverage of 0.022× and a minimum read count of 0.5 million reads for CTC samples. Blood, peritumoral normal liver, and HCC tumor tissue DNA were extracted as detailed in the methods before being subjected to the same sequencing and analysis pipeline as the CTC samples, resulting in a median of 0.99 million uniquely mapped reads (IQR: 0.86–1.2 million reads, range: 0.38–2.1 million reads).

CTC versus primary tumor SCNA profiles

We performed STR typing of the blood, primary tumor, and CTCs to confirm that contamination had not occurred during WGA processing. While STR typing confirmed that CTCs had originated from the same individual for 9/10 patients, it revealed that the CTCs for patient H169 were likely from a different source and this patient was excluded from further analyzes (Supplementary Fig. 1). Next, whole-genome SCNA profiles were obtained by visualizing the copy number state at each 250-kbps bin along the entire genome for the DNA from whole blood, normal liver, the primary tumor and CTC samples (Fig. 2). For the nine patients with confirmatory STR typing, the whole-genome SCNA profiles were compared with determine if CTCs exhibit the somatic SCNAs found in the primary tumor. Inspection of the copy number profiles clearly demonstrates the recapitulation of the somatic changes of the primary tumor by the CTCs, helping to establish the tumor origin of the CTCs. Most of the SCNAs seen in Fig. 2 are in regions of the genome previously identified as somatic SCNAs associated with HCC (Supplementary Table 2). Overall, the median sensitivity of CTC SCNA profiling for identifying individual gene-level SCNAs in the primary tumor for all patients was 91% (IQR: 0.87–0.98), while the specificity was 97% (IQR: 0.94–0.99). We confirmed the SCNA profiles found by NGS using array CGH for a subset of six patients with sufficient DNA available, and again found that the CTCs recapitulated the somatic changes found in the primary tumor (Supplementary Fig. 2).

Fig. 2: Global copy number profiles for blood, peritumoral normal liver, CTCs, and primary tumor (n = 8) or solitary metastasis (n = 1, H167) demonstrates the recapitulation of SCNAs from the tumor in CTC samples, supporting the potential of CTCs to act as a liquid biopsy of the tumor.

CTC clustering analysis

To statistically verify the visualized results of Fig. 2, we developed a 59 loci panel of previously identified regions or genes amplified or deleted in HCC (Supplementary Table 2)^{17,18,19,20,21,22,23,24}. We compared the segmented, normalized copy number data of CTCs and primary tumors at the 59 loci in the panel, and found frequent somatic changes for all patients, consistent with previous studies of HCC tumors (Fig. 3). Unsupervised hierarchical clustering of the CTC and tumor copy number values demonstrated that CTC samples clustered with their respective primary tumor for all nine patients. Analysis of the global copy number profiles by a Spearman correlation matrix (Supplementary Fig. 3) demonstrated that the CTC SCNA profiles shared an average of 95% (IQR: 86–97%) of the gains and losses found in the primary tumor.

Fig. 3: Heatmap showing copy number profiles at 59 loci frequently amplified or lost in HCC. Gain (red) or loss (blue) at each of the 59 cytobands (identified on the right of the heatmap) in both CTC and tumor samples from all nine patients that passed STR analysis are shown.

CTC SCNA profile reproducibility

To assess the reproducibility of our assay, we isolated and sequenced seven independent CTC samples of 3–4 cells from a single patient (patient H195). Principle component analysis of the whole blood, normal liver, primary tumor, and all seven CTC samples for this patient revealed that five of the CTC SCNA profiles clustered closely with the primary tumor (Supplementary Fig. 4A). One CTC sample (CTC 7) clustered with the peritumoral normal and blood samples and on examination of CTC 7’s copy number profile, it demonstrates a substantially different profile than that of the other CTCs, and is closer to that of the germline DNA, likely representing either a false positive CTC call or contamination with normal human DNA (Supplementary Fig. 4B). One additional CTC sample (CTC.4) clustered on its own, and examination of its copy number profile revealed significant discordance with all other samples (Supplementary Fig. 4B).

CTC SCNA sequencing as a molecular biomarker

To investigate the potential clinical relevance of our CTC SCNA profiling to act as a surrogate for the primary tumor of interest, we explored our ability to detect prognostic or actionable copy number changes identified by prior studies of HCC genetics. One such example is illustrated in Fig. 4. Patient H199 is a 64-year-old male with cryptogenic cirrhosis and a 9.0 cm segment 2–3 AFP-nonproducing lesion who underwent a left hepatectomy. On pathologic examination, his tumor was found to be moderately differentiated with evidence of microvascular invasion but no macrovascular invasion.

Fig. 4: Comparison of low-resolution whole-genome copy number profiles for primary tumor, CTCs 1 and 2, normal liver, and whole blood reveals reproduction of the majority of the SCNAs from the primary tumor in both CTC samples.

Examination of the copy number profile from the primary tumor revealed copy number loss at chromosome 1p, 4p, 8p, 10q and 17p (Fig. 4a). Of particular note is the chromosome 17p loss, as it contains the well-known tumor suppressor TP53 gene, the most frequently mutated or lost gene in HCC (Fig. 4b)¹⁷. When examining the two CTC samples from this patient, all of the losses found in the primary tumor were detected; however, an additional amplification of chromosome 20 was detected from both CTC samples (Fig. 4c). Chromosome 20 amplifications are a recurrent somatic SCNA in HCC associated with 2 important oncogenes^{25,26,27,28,29}. Overexpression of AIB1 is frequently described in HCC, and has previously been associated with invasiveness and sensitivity to the sorafenib therapy^25,29. In addition, recent research has demonstrated that the oncogenic effects of MYC dysregulation, a common occurrence in HCC, require overexpression of AURKA for stabilization²⁸. This discovery led to recent preclinical study which found that in p53-altered HCC patients, the MYC-AURKA complex is an actionable drug target²⁶. While an intriguing finding, these studies are all preclinical at this time. We further investigated if any recurrent SCNAs were found at a higher frequency in the CTCs when compared with the primary tumor samples. No arm or chromosome level SCNAs were identified; however, we did find losses of cytoband 19p12, 2q33.2, 4p14, and 5q13.2 as well as amplifications of 11p15.5 more frequently in the CTCs than in the primary tumor samples.

Cancer type classification using SCNA profiles

As CTCs are universally shed by all tumor types, and CTCs demonstrate similar SCNA patterns to the primary tumor, we investigated the ability of CTC SCNAs to determine the site of origin of the primary tumor. To do so we obtained whole-genome copy number data for 10,478 samples from 31 tumor types available in the TCGA dataset³⁰. The copy number state at 268 cancer-associated cytobands for all samples of the 31 cancer types was evaluated visually through 2-dimensional transformation using t-SNE (Fig. 5; Supplementary Fig. 5)³¹. While some cancers such as glioblastoma multiforme (GBM) or testicular germ cell tumors (TGCT) demonstrated clear clustering, others such as bladder cancer or esophageal cancer (ESCA) had samples scattered across the t-SNE space with no clear clustering identified. We used the copy number state of each sample to calculate three whole-genome metrics to help with tumor origin classification: a chromosomal instability number (CIN) score as well as the two t-SNE dimension variables. We then trained a random forest model to predict the tumor site of origin on the training set (80% of samples) and obtained an overall model accuracy of 0.58 (95% CI: 0.56–0.60) for the test set. Analysis of the misclassification rate revealed that many misclassifications were occurring between expected classes such as low-grade glioma and GBM or between different types of kidney tumors (KICH, KIRC, and KIRP). In addition, several cancer types did not demonstrate a significant site-specific copy number pattern using either t-SNE or random forest classification. Thus, to improve our model we eliminated poorly clustering tumor types (n = 10, Supplementary Fig. 5), and grouped the remaining 21 cancer types into 11 cancer classes based on data from prior studies on cancer subtype classification (Supplementary Table 3)². Repeating the modeling on the 11 cancer class model resulted in an overall accuracy of 0.83 (95% CI: 0.78–0.88) with individual cancer class balanced accuracies ranging from 0.75–0.98.

**Fig. 5: Copy number alterations at all genes for the 32 cancer types were transformed using t-SNE into a 2-dimensional space.**

CTC cancer type classification

Given the circulating nature of CTCs we next sought to determine if CTC SCNA profiles could be used to determine the tissue of origin from which the CTCs originated. A total of 9/15 (66%) HCC CTCs from our patients were correctly classified, and 5/9 (56%) patients had at least 1 CTC sample identify HCC as the tissue of origin (Fig. 6). To further investigate our model, we searched for additional CTC SCNA studies but identified only a single study for which data were available. This study by Ni et al. looked at 29 CTC samples from 7 lung cancer patients¹⁰. For these lung cancer CTCs, the model identified the tissue of origin correctly for 23/29 (79%) CTCs and all 7 patients had at least 1 CTC sample identifying lung adenocarcinoma as the cancer type correctly (Supplementary Fig. 6). Overall, our model correctly identified the cancer type for 32/44 (75%) CTCs for the two cancer types.

**Fig. 6: Cancer class prediction based on SCNA profile for each of the 15 HCC CTC samples using the classification model.**

Discussion

Blood-based “liquid” biopsies hold numerous potential benefits over traditional percutaneous or surgical biopsies such as reduced risk, cost, and patient discomfort. Furthermore, they are increasingly recognized as a necessary component of a precision oncology treatment strategy, given the ongoing need for tumor tissue as the tumor adapts to new therapies³². Despite these benefits, liquid biopsies have yet to enter clinical practice due to issues including reproducibility and applicability for molecular analysis³³. One reason may be that most molecular liquid biopsy studies to date have investigated detection of point mutations, due to the much larger number of actionable mutations compared with actionable SCNAs. However, accurate mutation calling from single cells or limited template samples is difficult due to the relatively high error rate involved^34,35. In contrast, whole-genome NGS SCNA profiling of liquid biopsies results in a robust signal that is highly reproducible from as few as a couple hundred thousand reads^4,36. Recent studies have found that SCNAs represent the largest portion of the somatic genetic changes across all cancer types¹¹, may be more important driver events than somatic mutations for many cancer types³⁷, and their number within a given tumor directly correlate with outcomes such as recurrence and mortality³⁸. However, the specific gene-level drivers within large SCNA requires further investigation before their biological and clinical significance is established. While some cancer types have known SCNAs with therapeutic implications, such as gastric cancer and FGFR2 or breast cancer and HER2, the majority do not currently³⁹. Hopefully that will change soon given the numerous ongoing studies into the clinical importance of SCNAs in many different cancer types. To that end, we sought to investigate the validity of NGS-based SCNA profiling of CTCs and to demonstrate the potential utility of such an assay for different clinical applications. The current work represents the first reported work of SCNA analysis of HCC CTCs, confirming their tumor origin and demonstrating potential prognostic importance.

Previous research into low-resolution copy number profiling has demonstrated that as few as 250–350k reads are sufficient for calling copy number events larger than 500 kbp⁴. Thus, our sequencing method, which resulted in less than a million reads per sample, potentially limits our detection of smaller focal SCNAs. However, current research indicates that whole chromosome, arm, and cytoband length SCNAs are more important in the development of cancer than smaller SCNAs, making our depth sufficient^1,3. When run in bulk, our assay is also surprisingly affordable; a prior study using a similar protocol showed that the total reagent and sequencing costs for such a SCNA profiling assay would be as low as US$30, a significant improvement over older array CGH-based SCNA assays⁴.

A fundamental concern for molecular profiling of CTCs is ensuring that the CTCs are in fact of tumor origin and not just circulating epithelial cells⁴⁰. We employ a strict CTC definition which helps eliminate false positive CTC calls; however, it also results in fewer CTCs captured versus other CTC platforms. Due to the relatively few cells in the resulting sample, single-cell sequencing techniques are required for analysis, which introduces errors common to those techniques⁴¹. Of the 16 CTC samples sequenced, only 3 (19%) of them demonstrated problems. One sample did not pass STR typing analysis indicating potential contamination, one sample demonstrated decreased signal and missing SCNAs indicative of contamination with germline DNA, and one sample had an uninterpretable SCNA profile indicative of failure at some step of the protocol. These failures are all common problems when working with single-cell sequencing and the failure rate could likely be greatly reduced by implementing clean room controls and automation⁴¹. For the samples that were successfully sequenced, we found that CTC SCNA profiles consistently demonstrated that the copy number changes seen in the primary tumor are also found in the CTCs. This finding makes us confident that the CTCs we sequenced likely originated from the primary tumor, and that CTCs can act as molecular surrogates of the primary tumor for SCNA profiling. We further tested the reproducibility of the assay by sequencing multiple CTC samples from a single patient and found similar CTC profiles from almost all of the CTC samples.

In addition to confirming the tumor origin of our samples, we demonstrated two further potential applications of our CTC SCNA profiling assay: identification of prognostic or targetable SCNAs and determining the tumor site of origin for CTCs. Many prognostic and targetable SCNAs were found in our CTC samples. For example, patient H199’s two CTC samples both demonstrated the chromosome 17p loss seen in the primary tumor. However, they both also showed chromosome 20 amplifications, a prognostic and potentially actionable finding^26,29. Prior studies have demonstrated that metastases tend to arise from a single subclone of the primary tumor⁴², and that CTCs have been shown to be oligoclonal precursors of metastases in breast cancer⁷. Thus, it is plausible that CTCs originate from the more aggressive subclones of the primary tumor. While further studies are necessary to investigate this hypothesis, if true, it would lend credence to the idea that CTC-based liquid biopsies selectively sample aggressive subclones at increased risk of metastasizing. This would potentially allow CTCs to overcome the abundant tumor heterogeneity that can limit the clinical utility of traditional percutaneous biopsies for some cancer types such as HCC⁴³.

Prior studies investigating the classification of tumors by site of origin have found that SCNAs are the most important genetic determinant in classification models, contributing more information than mRNA expression, miRNA expression, or DNA methylation data². Our finding that some cancer types were well classified by SCNAs while others were not has previously been reported in other studies investigating SCNA-based classification^37,44. Overall, we found that SCNA data alone could correctly identify the tumor site of origin for most cancer types, and that most of the errors were due to known similarities between some cancers such as between head and neck squamous cell and esophageal squamous cell cancers. Applying our model to CTC SCNA data we could determine the site of origin of the CTCs for the majority of samples, both from our own CTC data of HCC patients in addition to SCNA data from a prior study of lung cancer CTCs¹⁰. While we do see a direct clinical use for this finding, further investigation and refinement of the model could help with difficult scenarios such as identifying the site of origin for patients with tumors of unknown origin, or in cases of recurrence with multiple prior primary tumors.

We investigated the feasibility of low-resolution NGS SCNA profiling of HCC CTCs as a molecular liquid biopsy and present the potential applications of such an assay. Analysis of CTCs and primary tumor tissue demonstrated concordant alterations that were not present in the peritumoral normal liver or blood genomic DNA. This supports the potential use of CTC-derived SCNA profiling as a clinically relevant surrogate of the primary tumor. To our knowledge, this is the first paper looking at SCNA analysis of HCC CTCs and believe that future studies involving larger sample sizes will be needed to better address the clinical utility of the assay. The current study demonstrates proof-of-principle for CTC SCNA analysis, and provides a new potential method to utilize CTCs for precision oncology.

Methods

Patient recruitment, sample processing, and CTC isolation

We prospectively enrolled patients undergoing surgical resection of HCC under our Institutional Review Board (IRB) approved protocol at the University of California, Los Angeles (IRB #14-001932)⁴⁵. All participants in the study provided written informed consent. Following discard of 5 mL of peripheral venous blood to prevent epithelial contamination, 10 mL of venous blood was collected in the operating room immediately prior to surgical resection into ACD solution A tubes (BD Pharmigen, Franklin Lakes, NJ), and stored at 4 °C until processed. All samples were processed within 24 h of collection. Following initial density gradient centrifugation, the buffy coat is incubated with a cocktail of biotinylated CTC capture antibodies against the HCC cell surface markers asialoglycoprotein receptor (Abcam, Cambridge, UK), glypican-3 (Santa Cruz Biotechnology, Santa Cruz, CA), and epithelial cell-adhesion-molecule (EpCAM; Cell Signaling, Danvers, MA). Following capture antibody incubation, cells are washed and re-suspended in PBS (Gibco, Carlsbad, CA) and processed on the NanoVelcro platform (Fig. 1a)⁴⁵.

In addition to peripheral blood samples, all patients had a single radiographically apparent lesion, and had both a section of the primary tumor and a section of the peritumoral normal liver isolated and flash frozen for subsequent molecular analysis. All patients underwent a blood draw prior to surgery, with a portion of the venous blood sample being processed to obtain germline genomic DNA and the remainder used for CTC isolation. Thus, all patients had DNA from whole blood, peritumoral normal liver, primary HCC tumor tissue, and CTCs available for molecular analysis (Fig. 1b).

P-NanoVelcro CTC chip processing, immunocytochemistry, and chip scanning

The assembly, operation, and staining of P-NanoVelcro CTC chips^45,46 uses an electro-spin method to assemble the Poly(lactic-co-glycolic acid) (PLGA) nano-spun chips onto a laser microdissection slide (Leica, Wetzlar, Germany) with an overlaid custom polydimethylsiloxane microfluidic component and attached to a syringe-based microfluidic pump (KD Scientific, Holliston, MA). Chips are stained and CTCs identified via scanning fluorescent microscopy on a Nikon Eclipse 90i using immunocytochemistry (ICC) and NIS Elements 4.1 software. Chips are first scanned at 40× power followed by higher magnification manual imaging of candidate cells at 400× power for verification (Supplementary Fig. 7A). For the resulting multi-channel ICC image, CTCs are defined as round/ovoid cells, DAPI⁺/CD45⁻/CK⁺, with size ≥ 6 µm. WBCs are defined as round/ovoid cells, DAPI⁺/CD45⁺/CK⁻, with size ≤ 6 µm. Any cell displaying CD45 positivity greater than 2× background were excluded as CTCs. CTC enumerations are reported as total counts per 4-mL venous blood, and were performed by the same blinded researcher (S.H.).

Laser micro-dissection

After CTC identification as outlined above, CTC chips were transferred to an ArcturusXT laser capture microdissection system (Thermo Fisher Scientific, Waltham, MA) attached to a Nikon Eclipse Ti microscope, and the CTCs were isolated into CapSure HS Caps (Thermo). Cell transfer to the cap was confirmed by light microscopy, and cells re-suspended in 4-µL PBS using a sterile pipette tip (Supplementary Fig. 7C). All candidate CTC cells from a single PLGA slide (equivalent to 2-mL of whole blood) were re-suspended on to a single cap with the exception of patients who had >5 CTCs per slide (n = 2).

Whole-genome amplification and genomic analysis

Re-suspended cells were lysing and whole-genome DNA amplified using the REPLI-g Single Cell Kit (Qiagen) using the manufacturer’s recommended protocol. Amplified DNA was purified by AMPure XP beads (Beckman Coulter, Brea, CA) using the manufacturer’s recommended protocol resulting in 25 µL of purified WGA product.

Purified WGA products were sheared to generate DNA fragments averaging 350 bps by sonication (Covaris, Woburn, MA). Sonicated DNA was cleaned, end-repaired, ligated, and amplified using the KAPA DNA Library Preparation Kit (KAPA Biosystems, Wilmington, MA) according to the manufacturer’s protocol. Sequencing was performed on an Illumina NextSeq 500 (Illumina, San Diego, CA) using 75 bp paired end reads (2 × 75 bp).

Short tandem repeat (STR) analysis

To eliminate contamination as a confounding factor, we performed STR analysis of all CTC samples and compared it to that of the primary tumor and whole blood with the GenePrint 10 v1.1 system (Promega, Madison, WI) using the manufacturer’s recommended protocol: a 10 ng aliquot of template DNA was added to the amplification master mix and amplified for 30 cycles on a GeneAmp PCR System 9700 thermal cycler (Thermo). Fragments were analyzed on a AB 3500 Genetic Analyzer with POP-4 Polymer (Applied Biosystems, Foster City, CA) and visualized using GeneMapper 5 software (Applied Biosystems).

Array comparative genomic hybridization

Sample DNA (CTC, whole blood, peritumoral normal liver, and tumor tissue) and reference DNA (Agilent, Santa Clara, CA) were differentially labeled with cyanine-3 (CY3) and cyanine-5 (Cy5) dyes using the GenetiSure Amplification and Labeling Kit (Agilent) according to the manufacturer’s protocol. Purified labeled DNA samples were prepared for hybridization, which took place on Agilent 8×60 K CGH microarray slides at 67 °C for 6 h. Following the hybridization, the slides were scanned using the Agilent SureScan Microarray Scanner (Agilent). Microarray images were analyzed using the Agilent CytoGenomics software (Agilent) and the Microarray text files were analyzed using R version 3.3.2 and the packages rCGH, limma, agilp, and snapCGH.

SCNA analysis from WGS data

The low pass WGS data was processed for SCNA analysis using Ginkgo with a variable bin size of 250 kbps and simulating bins of 76 bp reads mapped with bowtie¹⁶. CTC samples demonstrated amplification bias due to GC content and mappability that was corrected using loess smoothing^4,16. To evaluate the ability of CTC SCNA profiles to identify the tumor type of origin, additional lung cancer CTC data from the study by Ni et al. was obtained and processed using the same methodology as for the CTC samples from our study¹⁰. The resultant matrix of binned SCNA values was similarly transformed to a SCNA per gene matrix based on gene–bin overlap using biomaRt⁴⁷.

TCGA copy number analysis and cancer site of origin classification

TCGA copy number profiles for all cancer types listed in Supplementary Table 4 were obtained from the Broad Institute’s Firebrowse TCGA data version 2016_01_28 using FirebrowseR³⁰. A total of 10,478 samples from 32 tumor types were used (Supplementary Table 4). The high dimensional patient × gene dataframe was reduced transformation from a patient × gene dataframe to a patient × cytoband dataframe using biomaRt⁴⁷. The resulting 556 cytobands were then reduced to just 274 cytobands that were previously associated with global and cancer type specific SCNAs based on prior pan-cancer SCNA analysis¹.

Two pan-genomic variables were created to assist with classification. First, a chromosomal instability number (CIN) score, defined as the absolute value of all SCNAs for each sample. Next, two t-SNE variables were created by dimensional transformation using the Rtsne package implementation of the t-distributed Stochastic Neighbor Embedding (t-SNE) algorithm^31,48. Both the CIN score and the two t-SNE values for all samples were added to the final dataset used in the classification model.

Tumor site of origin classification was performed on the TCGA data using a random forest classifier. We eliminated cancer types with poor classification accuracy and grouped similar cancer types into cancer classes (e.g., grouping HCC and choleangiocarcinoma) resulting in a final model with 21 cancer types and 11 cancer classes (Supplementary Table 3)². The resulting 11 cancer classes were then used to train a random forest model (n = 500 trees) based on the TCGA SCNA data. Parameter tuning was performed by a repeated cross-validation approach and the final model verified for overall accuracy using the test set. This final model was then used to classify the CTC samples from both this study as well as the previously published lung cancer CTC study¹⁰.

Statistical analysis

Statistical analysis and visualization were performed in R (version 3.3.2). Categorical variables were summarized as frequencies and percentages while continuous variables were summarized as medians and interquartile ranges (IQR). All CTC numbers are reported as whole numbers in 4-mL of venous blood.

The “gplots” package was used for unsupervised hierarchical clustering of SCNA profiles from CTC and primary tumor samples using complete linkage and Euclidean distance metric⁴⁹. The correlation of the SCNA pattern of the CTCs to that of the matched primary tumor samples was calculated using the Pearson’s correlation.

Data availability

CTC next-generation sequencing data are publicly available in the NCBI Sequence Read Archive (SRA) under the accession number PRJNA630090. The other datasets generated during and/or analyzed during this study are available from the corresponding author on reasonable request.

Code availability

Code for analyzes is available at https://github.com/naranoth/HCC-CTC-SCNA. The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request. Statistical analysis and visualization were performed in R (version 3.3.2).

References

Zack, T. I. et al. Pan-cancer patterns of somatic copy number alteration. Nat. Genet. 45, 1134–1140 (2013).
Article CAS PubMed PubMed Central Google Scholar
Hoadley, K. A. et al. Cell-of-origin patterns dominate the molecular classification of 10,000 tumors from 33 types of cancer. Cell 173, 291–304 e6 (2018).
Article CAS PubMed PubMed Central Google Scholar
Beroukhim, R. et al. The landscape of somatic copy-number alteration across human cancers. Nature 463, 899–905 (2010).
Article CAS PubMed PubMed Central Google Scholar
Baslan, T. et al. Optimizing sparse sequencing of single cells for highly multiplex copy number profiling. Genome Res. 25, 714–724 (2015).
Article CAS PubMed PubMed Central Google Scholar
Court, C. M., Ankeny, J. S., Sho, S. & Tomlinson, J. S. Circulating tumor cells in gastrointestinal cancer: current practices and future directions. In Gastrointestinal Malignancies 345–376 (Springer, 2016).
Alix-Panabieres, C. & Pantel, K. Challenges in circulating tumour cell research. Nat. Rev. Cancer 14, 623–631 (2014).
Article CAS PubMed Google Scholar
Aceto, N. et al. Circulating tumor cell clusters are oligoclonal precursors of breast cancer metastasis. Cell 158, 1110–1122 (2014).
Article CAS PubMed PubMed Central Google Scholar
Miyamoto, D. T., Ting, D. T., Toner, M., Maheswaran, S. & Haber, D. A. Single-cell analysis of circulating tumor cells as a window into tumor heterogeneity. Cold Spring Harb. Symp. Quant. Biol. 81, 269–274 (2016).
Article PubMed Google Scholar
Hata, A. N. et al. Tumor cells can follow distinct evolutionary paths to become resistant to epidermal growth factor receptor inhibition. Nat. Med. 22, 262–269 (2016).
Article CAS PubMed PubMed Central Google Scholar
Ni, X. et al. Reproducible copy number variation patterns among single circulating tumor cells of lung cancer patients. Proc. Natl Acad. Sci. USA 110, 21083–21088 (2013).
Article CAS PubMed PubMed Central Google Scholar
Heitzer, E., Ulz, P., Geigl, J. B. & Speicher, M. R. Non-invasive detection of genome-wide somatic copy number alterations by liquid biopsies. Mol. Oncol. 10, 494–502 (2016).
Article CAS PubMed Google Scholar
Heitzer, E. et al. Complex tumor genomes inferred from single circulating tumor cells by array-CGH and next-generation sequencing. Cancer Res. 73, 2965–2975 (2013).
Article CAS PubMed Google Scholar
Carter, L. et al. Molecular analysis of circulating tumor cells identifies distinct copy-number profiles in patients with chemosensitive and chemorefractory small-cell lung cancer. Nat. Med. 23, 114–119 (2017).
Article CAS PubMed Google Scholar
Ankeny, J. S. et al. Circulating tumour cells as a biomarker for diagnosis and staging in pancreatic cancer. Br. J. Cancer 114, 1367–1375 (2016).
Article CAS PubMed PubMed Central Google Scholar
Court, C. M. et al. A novel multimarker assay for the phenotypic profiling of circulating tumor cells in hepatocellular carcinoma. Liver Transpl. 24, 946–960 (2018).
Article PubMed PubMed Central Google Scholar
Garvin, T. et al. Interactive analysis and assessment of single-cell copy-number variations. Nat. Methods 12, 1058–1060 (2015).
Article CAS PubMed PubMed Central Google Scholar
Guichard, C. et al. Integrated analysis of somatic mutations and focal copy-number changes identifies key genes and pathways in hepatocellular carcinoma. Nat. Genet. 44, 694–698 (2012).
Article CAS PubMed PubMed Central Google Scholar
Kan, Z. et al. Whole-genome sequencing identifies recurrent mutations in hepatocellular carcinoma. Genome Res. 23, 1422–1433 (2013).
Article CAS PubMed PubMed Central Google Scholar
Chiang, D. Y. et al. Focal gains of VEGFA and molecular classification of hepatocellular carcinoma. Cancer Res. 68, 6779–6788 (2008).
Article CAS PubMed PubMed Central Google Scholar
Zhang, J. et al. [Association of chromosome 17q copy number variation with overall survival of patients with hepatocellular carcinoma and screening of potential target genes]. Zhonghua Yi Xue Yi Chuan Xue Za Zhi 32, 615–619 (2015).
CAS PubMed Google Scholar
Kwon, S. M. et al. Genomic copy number alterations with transcriptional deregulation at 6p identify an aggressive HCC phenotype. Carcinogenesis 34, 1543–1550 (2013).
Article CAS PubMed Google Scholar
Roessler, S. et al. Integrative genomic identification of genes on 8p associated with hepatocellular carcinoma progression and patient survival. Gastroenterology 142, 957–966. e12 (2012).
Article CAS PubMed Google Scholar
Woo, H. G. et al. Identification of potential driver genes in human liver carcinoma by genomewide screening. Cancer Res. 69, 4059–4066 (2009).
Article CAS PubMed PubMed Central Google Scholar
Schulze, K., Nault, J. C. & Villanueva, A. Genetic profiling of hepatocellular carcinoma using next-generation sequencing. J. Hepatol. 65, 1031–1042 (2016).
Article CAS PubMed Google Scholar
Xu, Y. et al. Overexpression of transcriptional coactivator AIB1 promotes hepatocellular carcinoma progression by enhancing cell proliferation and invasiveness. Oncogene 29, 3386–3397 (2010).
Article CAS PubMed Google Scholar
Dauch, D. et al. A MYC-aurora kinase A protein complex represents an actionable drug target in p53-altered liver cancer. Nat. Med. 22, 744–753 (2016).
Article CAS PubMed Google Scholar
Tong, Z. et al. Steroid receptor coactivator 1 promotes human hepatocellular carcinoma progression by enhancing Wnt/beta-catenin signaling. J. Biol. Chem. 290, 18596–18608 (2015).
Article CAS PubMed PubMed Central Google Scholar
Lu, L. et al. Aurora kinase A mediates c-Myc’s oncogenic effects in hepatocellular carcinoma. Mol. Carcinog. 54, 1467–1479 (2015).
Article CAS PubMed Google Scholar
Li, M. et al. Downregulation of amplified in breast cancer 1 contributes to the anti-tumor effects of sorafenib on human hepatocellular carcinoma. Oncotarget 7, 29605–29619 (2016).
Article PubMed PubMed Central Google Scholar
Deng, M., Bragelmann, J., Kryukov, I., Saraiva-Agostinho, N. & Perner, S. FirebrowseR: an R client to the Broad Instituteas Firehose Pipeline. Database 2017, baw160 (2017).
Article PubMed Central Google Scholar
van der Maaten, L. Accelerating t-SNE using tree-based algorithms. J. Mach. Learn. Res. 15, 3221–3245 (2014).
Google Scholar
Kumar-Sinha, C. & Chinnaiyan, A. M. Precision oncology in the age of integrative genomics. Nat. Biotechnol. 36, 46–60 (2018).
Article CAS PubMed PubMed Central Google Scholar
Webb, S. The cancer bloodhounds. Nat. Biotechnol. 34, 1090–1094 (2016).
Article CAS PubMed Google Scholar
Sundaresan, T. K. et al. Detection of T790M, the acquired resistance EGFR mutation, by tumor biopsy versus noninvasive blood-based analyses. Clin Cancer Res. 22(5), 1103–1110 (2016).
Article CAS PubMed Google Scholar
Park, S. M. et al. Molecular profiling of single circulating tumor cells from lung cancer patients. Proc. Natl Acad. Sci. USA 113, E8379–E8386 (2016).
Article CAS PubMed PubMed Central Google Scholar
Gao, Y. et al. Single-cell sequencing deciphers a convergent evolution of copy number alterations from primary to circulating tumor cells. Genome Res. 27, 1312–1322 (2017).
Article CAS PubMed PubMed Central Google Scholar
Ciriello, G. et al. Emerging landscape of oncogenic signatures across human cancers. Nat. Genet. 45, 1127–1133 (2013).
Article CAS PubMed PubMed Central Google Scholar
Hieronymus, H. et al. Tumor copy number alteration burden is a pan-cancer prognostic factor associated with recurrence and death. Elife 7, e37294 (2018).
Article PubMed PubMed Central Google Scholar
Xie, L. et al. FGFR2 gene amplification in gastric cancer predicts sensitivity to the selective FGFR inhibitor AZD4547. Clin. Cancer Res. 19, 2572–2583 (2013).
Article CAS PubMed Google Scholar
Bhan, I. et al. Detection and analysis of circulating epithelial cells in liquid biopsies from patients with liver disease. Gastroenterology 155, 2016–2018. e11 (2018).
Article PubMed Google Scholar
de Bourcy, C. F. et al. A quantitative comparison of single-cell whole genome amplification methods. PLoS ONE 9, e105585 (2014).
Article PubMed PubMed Central CAS Google Scholar
Gerlinger, M. et al. Intratumor heterogeneity and branched evolution revealed by multiregion sequencing. N. Engl. J. Med. 366, 883–892 (2012).
Article CAS PubMed PubMed Central Google Scholar
Court, C. M. et al. Determination of hepatocellular carcinoma grade by needle biopsy is unreliable for liver transplant candidate selection. Liver Transplant 23(9), 1123–1132 (2017).
Article Google Scholar
Molparia, B., Nichani, E. & Torkamani, A. Assessment of circulating copy number variant detection for cancer screening. PLoS ONE 12, e0180647 (2017).
Article PubMed PubMed Central CAS Google Scholar
Court, C. M. et al. Reality of single circulating tumor cell sequencing for molecular diagnostics in pancreatic cancer. J. Mol. Diagn. 18, 688–696 (2016).
Article CAS PubMed PubMed Central Google Scholar
Lin, M. et al. Nanostructure embedded microchips for detection, isolation, and characterization of circulating tumor cells. Acc. Chem. Res. 47, 2941–2950 (2014).
Article CAS PubMed PubMed Central Google Scholar
Durinck, S., Spellman, P. T., Birney, E. & Huber, W. Mapping identifiers for the integration of genomic datasets with the R/bioconductor package biomaRt. Nat. Protoc. 4, 1184–1191 (2009).
Article CAS PubMed PubMed Central Google Scholar
Krijthe, J. H. Rtsne: T-distributed stochastic neighbor embedding using a Barnes-Hut implementation. https://github.com/jkrijthe/Rtsne (2015).
Warnes, G. R. gplots: Various R Programming Tools for Plotting Data. http://cran.r-project.org/web/packages/gplots/index.html (2011).

Download references

Acknowledgements

The authors thank the patients and their families, without whom this study would not have been possible. The authors thank the CTC analysis team of Dr. Tseng’s lab for their tireless work. We also thank Kelly Court who conceptualized and produced some of the figures in this manuscript. T.G.G. is supported by the NCI/NIH (P01CA168585, R01CA222877, R01CA227089), UCLA SPORE in Prostate Cancer (NIH P50 CA092131), and the W.M. Keck Foundation. V.G.A. is supported by an American Surgical Association Foundation Fellowship award and by the NIH (R21CA216807-01A1 and R21CA2353450-01).

Author information

Authors and Affiliations

Department of Surgery, University of California, Los Angeles, Los Angeles, CA, USA
Colin M. Court, Shuang Hou, Paul Winograd, Benjamin J. DiPardo, Fady M. Kaldas, Ronald W. Busuttil, James S. Tomlinson & Vatche G. Agopian
Department of Surgery, Veteran’s Health Administration, Greater Los Angeles, Los Angeles, CA, USA
Colin M. Court, Paul Winograd, Benjamin J. DiPardo & James S. Tomlinson
Department of Molecular, Cellular, and Integrative Physiology, University of California, Los Angeles, Los Angeles, CA, USA
Colin M. Court & Thomas G. Graeber
PacGenomics, llc, Los Angeles, CA, USA
Lian Liu
Department of Molecular and Medical Pharmacology, University of California, Los Angeles, Los Angeles, CA, USA
Sean X. Liu, Pin-Jung Chen, Yazhen Zhu, Matthew Smalley, Ryan Zhang, Hsian-Rong Tseng & Thomas G. Graeber
Department of Medicine, Division of Hematology/Oncology, University of California, Los Angeles, Los Angeles, CA, USA
Saeed Sadeghi & Richard S. Finn
California NanoSystems Institute, University of California, Los Angeles, Los Angeles, CA, USA
Ronald W. Busuttil, Hsian-Rong Tseng & Thomas G. Graeber
Department of Pathology and Laboratory Medicine, University of California, Los Angeles, Los Angeles, CA, USA
Xianghong J. Zhou
Jonsson Comprehensive Cancer Center, University of California, Los Angeles, Los Angeles, CA, USA
Hsian-Rong Tseng, James S. Tomlinson, Thomas G. Graeber & Vatche G. Agopian

Authors

Colin M. Court
View author publications
You can also search for this author in PubMed Google Scholar
Shuang Hou
View author publications
You can also search for this author in PubMed Google Scholar
Lian Liu
View author publications
You can also search for this author in PubMed Google Scholar
Paul Winograd
View author publications
You can also search for this author in PubMed Google Scholar
Benjamin J. DiPardo
View author publications
You can also search for this author in PubMed Google Scholar
Sean X. Liu
View author publications
You can also search for this author in PubMed Google Scholar
Pin-Jung Chen
View author publications
You can also search for this author in PubMed Google Scholar
Yazhen Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Matthew Smalley
View author publications
You can also search for this author in PubMed Google Scholar
Ryan Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Saeed Sadeghi
View author publications
You can also search for this author in PubMed Google Scholar
Richard S. Finn
View author publications
You can also search for this author in PubMed Google Scholar
Fady M. Kaldas
View author publications
You can also search for this author in PubMed Google Scholar
Ronald W. Busuttil
View author publications
You can also search for this author in PubMed Google Scholar
Xianghong J. Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Hsian-Rong Tseng
View author publications
You can also search for this author in PubMed Google Scholar
James S. Tomlinson
View author publications
You can also search for this author in PubMed Google Scholar
Thomas G. Graeber
View author publications
You can also search for this author in PubMed Google Scholar
Vatche G. Agopian
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

C.M.C., S.H., H.R.T., T.G.G., V.G.A., J.S.T., S.S., R.W.B. designed and planned the study and developed and optimized experimental protocols. C.M.C., S.H., L.L., P.W. performed experiments. C.M.C., P.W., B.J.D. organized patient enrollment, sample collection, and clinical data curation. C.M.C., P.J.C., T.G.G., V.A.G. analyzed and interpreted data. C.M.C., V.G.A. wrote the manuscript and incorporated feedback from all authors. R.S.F., F.M.K., X.J.Z., J.S.T. provided critical revisions.

Corresponding author

Correspondence to Vatche G. Agopian.

Ethics declarations

Competing interests

H.R.T has ownership in the intellectual property used to isolate circulating tumor cells in this study (NanoVelcro CTC Assay), which has been licensed to CytoLumina Technologies Corp. H.R.T. and S.X.L. have financial interests in CytoLumina Technologies Corp. given their role in the company. All other authors report no conflicts of interest.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplemental Methods, Tables, and Figures

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Court, C.M., Hou, S., Liu, L. et al. Somatic copy number profiling from hepatocellular carcinoma circulating tumor cells. npj Precis. Onc. 4, 16 (2020). https://doi.org/10.1038/s41698-020-0123-0

Download citation

Received: 20 August 2019
Accepted: 07 May 2020
Published: 02 July 2020
DOI: https://doi.org/10.1038/s41698-020-0123-0

This article is cited by

Single-cell low-pass whole genome sequencing accurately detects circulating tumor cells for liquid biopsy-based multi-cancer diagnosis
- Xiaohan Shen
- Jiao Dai
- Qihui Shi
npj Precision Oncology (2024)