Main

Lung cancer is the leading cause of cancer-associated deaths worldwide and has one of the poorest prognoses among all cancer types. Non-small cell lung carcinoma (NSCLC) comprises approximately 80% of lung cancer, and its overall 5-year survival rate is 15%. Early stage tumours are treated primarily by complete surgical resection, yet 30–55% of patients will develop recurrence and die of the disease. Despite significant advances achieved in the chemotherapy and radiation therapy for advanced disease patients, most patients will eventually develop resistance. Thus, there is a need for novel and effective targeted therapies, the development of which requires our greater understanding of the genetic abnormalities in lung cancers.

As the complexity and unique nature of individual human cancers become apparent, the analysis of individual patients' genetic makeup has become important in guiding the development of novel treatments. A striking example of this has recently emerged in therapies using the epidermal growth factor receptor (EGFR) small molecule inhibitors (Shepherd et al, 2005; Tsao et al, 2005; Blackhall et al, 2006). Somatic mutations in the tyrosine kinase domain and copy number changes of the EGFR gene play critical roles in determining the sensitivity and clinical benefit of NSCLC patients treated by EGFR inhibitor drugs.

The discoidin domain receptors (DDRs) are receptor tyrosine kinases (RTKs) belonging to the same enzyme family as EGFR. The DDR1 gene is divided into 19 exons and has a coding sequence of 2742 bp. Discoidin domain receptor 2 is also divided into 19 exons and encodes a 2568 bp mRNA. Discoidin domain receptor 1 and DDR2 share similar structures, consisting of a characteristic discoidin homology domain, stalk region, transmembrane region, juxtamembrane region, and kinase domain. Compared to other RTKs, DDRs are unique because they have native collagens as their ligands. Discoidin domain receptors have been shown to exhibit altered expression patterns in multiple human cancers, including breast, oesophageal, ovarian, brain and lung tumours (Vogel et al, 2006). Of note, DDR1 appears to be preferentially expressed in tumour cells, whereas DDR2 is expressed in tumour stroma (Alves et al, 1995). The mechanism by which these receptors may contribute to oncogenesis is as yet unknown; however, given their important role in transmitting signals from the extracellular matrix (ECM), it is possible that they act as regulators of cell proliferation, adhesion, migration, and subsequent tumour metastasis. Similar to EGFR it is conceivable that altered expression and/or mutation of DDRs trigger abnormal activity, ultimately leading to enhanced proliferation and oncogenic transformation. The mutations in EGFR are all found within the catalytic kinase domain, and mostly consist of single-point mutations or small deletions (Sakurada et al, 2006; Shigematsu and Gazdar, 2006). In a related study, Davies et al (2005) screened for mutations in lung cancer by comprehensively sequencing all 518 kinases in the human genome. Among mutations in several other RTKs, the authors also described novel somatic mutations in DDRs, specifically the mutations A496S and R824W for DDR1 and the mutation R105S for DDR2.

Here, we sought to extensively explore the association of DDRs with human lung cancer by determining their expression levels and the prevalence of the newly reported mutations in a larger cohort of NSCLC primary tumours and tumour cell lines. We demonstrated a significant upregulation of DDR1 and a downregulation of DDR2 in lung tumour tissue compared with matched normal tissues from the same patients; importantly we demonstrated that DDR1 expression was a good prognostic marker for early-stage NSCLC patients. However, we did not identify any of the three previously identified mutations in our cohort of lung cancers or in any of the cell lines tested.

Materials and methods

Patients

A total of 146 primary lung tumours and an independent set of 23 matched tumour and normal lung tissue samples were harvested from NSCLC patients treated by surgical resection without adjuvant chemotherapy at the University Health Network and Mount Sinai Hospital, Toronto, Canada (Table 1A). Tissues were harvested within 30 min after complete resection, and the quality and pathology of tumour tissue was confirmed by the study pathologist (M-ST). Samples with a tumour-cell content of at least 50% were used in the study. The University Health Network Research Ethics Board has approved the use of these samples and their associated clinical information in this study.

Table 1A Comparative clinical and pathological features of the full cohort of NSCLC patients. Patients were dichotomised using the median level of DDR1 expression, whereas patients with undetectable DDR2 were considered low expressors

Cell lines and xenograft models

Human cell lines (ATCC) were cultured and grown in media according to the manufacturer's recommended conditions (Table 2). A lung xenograft model had been generated previously and no additional animal experiments were necessary for the current work (Liu and Tsao, 1993). Briefly, 2 × 106 cells in 70 μl medium (RPMI-1640, 10% fetal bovine serum) containing 10% Matrigel (BD Bioscience, San Jose, CA, USA) were subcutaneously injected into the ventral abdomen of 5-week-old male severe combined immunodeficiency mice (Table 2). Formation of tumours was observed by palpation, and growth measured twice a week until the tumour reached a volume of 1 cm3. After mice were killed, tumour tissue was harvested and snap-frozen in liquid nitrogen. All animals received humane care in compliance with the Guide to the Care and Use of Experimental Animals issued by the Canadian Council of Animal Care. All animal procedures were performed in accordance with a protocol approved by the animal care committee of the University of Toronto.

Table 2 List of cell lines and xenografts used for sequence analysis

RNA extraction

Ribonucleic acid was extracted from patient samples and cell lines using the method of Chomczynski and Sacchi (1987); extracts were purified using the RNeasy kit (Qiagen, Hilden, Netherlands), DNase treated (Ambion, Austin, TX, USA), and quantified via spectrophotometry. Ribonucleic acid quality was assessed by agarose gel electrophoresis.

Quantification and analysis of DDR expression in lung tumour samples

Quantitative real-time PCR (qRT-PCR) was conducted using the SYBR Green assay in the ABI PRISM 7900-HT (Applied Biosystems, Foster city, CA, USA). Each 10 μl qRT-PCR aliquet contained a 2 ng equivalent of cDNA in a 384-well plate. The reactions were activated at 95°C for 3 min followed by 40 cycles of 95°C (15 s), 65°C (15 s) and 72°C (20 s). The amount of transcript per ng cDNA was calculated using standard curves generated using a pool of genomic DNA from 10 normal lung tissue samples as described (Yun et al, 2006). Primer sequences were designed with Primer Express v 2.0 (Applied Biosystems). Primer sequences for DDR1 were forward primer ATGGAGCAACCACAGCTTCTC, reverse primer CTCAGCCGGTCAAACTCAAACT, and for DDR2, forward primer GGAGGTCATGGCATCGAGTT, reverse primer GAGTGCCATCCCGACTGTAATT. Technical replicates displayed high correlation (Ravg=0.95±0.03) and were collapsed by averaging. Expression values were then log2-transformed. Standardisation and normalisation were conducted using the geometric mean of the expression levels of four normaliser genes (ACTB, forward primer TCCTAAAAGCCACCCCACTTCT, reverse primer GGGAGAGGACTGGGCCATT, TBP, forward primer GGGCATTATTTGTGCACTGAGA, reverse primer TAGCAGCACGGTATGAGCAACT, BAT1, forward primer CGGTATCAGCAGTTTAAAGATTTTCA, reverse primer TGCCTCGGCCAAATAGGTT, and B2M, forward primer GAGTGCTGTCTCCATGTTTGATGT, reverse primer AAGTTGCCAGCCCTCCTAGAG). An indepth description of this method was published recently (Barsyte-Lovejoy et al, 2006).

PCR and sequencing analysis

Total RNA was transcribed into cDNA using Superscript II Reverse Transcriptase and oligo-dT (Invitrogen, Carlsbad, CA, USA). Primers were designed to amplify the regions containing previously identified DDR mutations in the juxtamembrane (forward primer GAGCTGACGGTTCACCTCTC, reverse primer AATGTCAGCCTCGGCATAAT), and kinase domains (forward primer GGTGCTGATGCTCTGTAGGG, reverse primer CGTGTTGAGTGCATCCTCTG) of DDR1 and the discoidin domain of DDR2 (forward primer GACTTGCACACCCTCCATTT, reverse primer GAGTGGTCGGTGACTGGAAT). The housekeeping gene, glyceraldehyde-3-phosphate dehydrogenase (forward primer CAATGACCCCTTCATTGACC, reverse primer TGCTGTAGCCAAATTCGTTG), was used as a control. cDNA was subjected to PCR amplification consisting of an initial 2 min denaturation at 94°C, followed by 35 cycles of amplification (94°C, 30 s; 57°C, 30 s; 72°C, 45 s), and a final extension at 72°C for 3 min. Polymerase chain reaction products were subjected to agarose gel electrophoresis with appropriate size markers, and samples producing bands of the correct size were purified using the MiniElute PCR Purification kit (Qiagen). Purified samples were sequenced at The Centre for Applied Genomics (TCAG) facility in Toronto, Canada, using an ABI3730XL DNA sequencer. Samples were sequenced in both directions using the same primers as for amplification, and sequences aligned and analysed using ChromasPro software (Technelysium, Tewantin, Australia). Samples that gave ambiguous sequences were repeated up to three times. Two samples were excluded from the study as clean sequences could not be generated.

Genomic sequencing

Additionally, DNA was extracted from all 12 lung cancer cell lines using standard techniques. Polymerase chain reaction was performed using exon-specific primers spanning putative mutation sites in DDR1 and DDR2, and PCR product sequenced as outlined above.

Statistical analysis

Matched t-tests were used to determine differential expression of DDR1 and DDR2. The Spearman's correlation, Kruskal–Wallis, and Wilcoxon test were used to assess association within and between molecular indices and the pathological or clinical factors. The end points for analyses were overall survival (from date of surgery to date of death) and disease-free survival (from date of surgery to date of disease recurrence). The Cox proportional hazards model was used to test the association of survival and DDR1 expression, where DDR1 expression was treated as a continuous variable. For Kaplan–Meier and multivariate analysis, subjects were dichotomised according to the median expression levels of DDR1 (median=32.6); low DDR1 expressors are defined as patients with expression levels below the median, whereas high DDR1 expressors are those with expression values above the median. Discoidin domain receptor 2 cases were dichotomised based on expressors vs nonexpressors. The Cox proportional hazards model was employed for the multivariate analysis. All statistical analysis was conducted using SAS software (v9.1).

Results

Quantitative real-time PCR analysis of DDR1 and DDR2 expression performed on the 146 NSCLC tumour samples (full data set available as Supplementary Table S1) showed neither DDR1 nor DDR2 were associated with factors such as tumour stage (P=0.38 and P=0.51, respectively), differentiation (P=0.93 and P=0.60, respectively), and age (Spearman's correlation P=0.82 and P=0.48, respectively). Normalised DDR1 expression ranged from 0 to 1184 transcript copies per 2 ng of cDNA, whereas DDR2 expression ranged from 0 to 186 copies. The expression of DDR1 was significantly higher in squamous cell carcinoma than in adenocarcinoma (P=0.007; median expression 40.6 vs 23.1, respectively). In contrast to DDR2 whose expression level was not prognostic (Supplementary Table S2), high expression of DDR1 was associated with significantly favourable overall survival in this cohort of patients (HR=0.43, 95% CI=0.22–0.83, P=0.014) (Figure 1A). Discoidin domain receptor 1 expression was also significantly associated with greater disease-free survival (HR=0.56, 95% CI=0.33–0.94, P=0.029) (Figure 1B). In multivariate analysis, DDR1 expression remained significant in predicting overall survival (HR=0.5, 95% CI=0.26–0.97, P=0.04), but not disease-free survival, after adjustments for stage, differentiation, histology, and age.

Figure 1
figure 1

Kaplan–Meier analysis of (A) overall survival and (B) disease-free survival, according to DDR1 expression levels. Patients were dichotomised based on the median level of DDR1 expression.

Next, we analysed a cohort of 23 samples from lung cancer patients, which included matched normal tissue for each tumour sample. Discoidin domain receptor 1 was shown to be upregulated by 2.15-fold (P=0.0005), and DDR2 downregulated by an equivalent amount (1.94-fold, P=0.0001) in tumour tissue compared with matched normal samples (Table 1B, Figure 2). We then analysed the genetic sequences of DDR1 and DDR2 in this cohort. To our surprise, we failed to detect any of the previously identified mutations in DDR1 and DDR2, nor any new mutations (Davies et al, 2005). We then subjected 12 lung cell lines, six lung xenografts, and 15 other cell lines to sequence analysis. The 12 lung cell lines were also subjected to genomic sequencing. Again, none of the previously reported mutations was found. This group included the lung cell line NCI-H1770 that had previously been reported to harbour the R824W mutation in the DDR1 kinase domain (Davies et al, 2005) (Figure 3A). We did not detect this mutation in either DNA or RNA sequences from this cell line. Sequences were generated for almost 20% of the coding sequence of DDR1, and 10% of DDR2.

Table 1B Cohort of matched tumour and normal lung tissues with relative DDR1 and DDR2 expression levels. Expression values were calculated relative to that of four normalizer genes
Figure 2
figure 2

Accumulated plot of DDR qRT-PCR data demonstrating relative expression of DDR1 and DDR2 in normal and cancerous tissues from lung cancer patients. Discoidin domain receptor 1 was found to be significantly upregulated by 2.15-fold (P=0.0005) and DDR2 significantly downregulated by an equivalent amount (P=0.0001) in tumour tissue compared with normal tissue from lung cancer patients. Expression values were calculated relative to that of four normaliser genes.

Figure 3
figure 3

Results of sequence analyses. (A) Location of S495S synonymous change adjacent to A496S somatic mutation identified previously. (B) Overall prevalence of DDR1 mutations in clinical lung cohort and cell lines. Six patients had the S485S polymorphism in both their normal and tumour tissue.

Previously identified polymorphisms located in these regions were successfully identified, confirming the robust nature and specificity of the sequencing method employed. One previously identified polymorphism in DDR1, located at amino acid 495, was identified in 46% of the clinical samples screened. This polymorphism is directly adjacent to the DDR1 A496S mutation identified by Davies et al (2005) that was not identified in any sample of our cohort (Figure 3B). This polymorphism is a synonymous change, and is therefore unlikely to be of functional importance; however, it is of interest to note that whereas six of 23 samples harboured this change in both their normal and tumour tissues suggestive of a germline polymorphism, another nine samples recorded the change only in their tumour tissue, indicating an acquired or somatic aetiology.

Discussion

We have demonstrated for the first time that the expression of DDRs is significantly deregulated in NSCLC. Furthermore we have shown that DDR1 is an independent favourable prognostic marker for early-stage NSCLC patients, and that mutations in DDR1 and DDR2 appear less frequently than previously reported. The collagen-binding RTKs, DDR1 and DDR2, have previously been linked to various human diseases including fibrosis (Alves et al, 1995; Mao et al, 2002; Lee et al, 2004; Avivi-Green et al, 2006), atherosclerosis (Hou et al, 2001; Hou et al, 2002; Ferri et al, 2004), and cancer (Johnson et al, 1993; Alves et al, 1995; Barker et al, 1995; Nemoto et al, 1997; Weiner et al, 2000; Dejmek et al, 2003; Ongusaha et al, 2003; Heinzelmann-Schwarz et al, 2004; Ram et al, 2006; Vogel et al, 2006). The mechanism by which DDRs may contribute to oncogenesis is as yet unknown.

Although DDR1 has been previously discussed in the context of idiopathic pulmonary fibrosis and inflammation of the lung (Avivi-Green et al, 2006; Matsuyama et al, 2006a, 2006b), a direct link with lung tumorigenesis has not yet been established. In addition, DDR2 has not yet been investigated in the context of lung cancer. This therefore represents the first study to extensively explore and quantify the expression levels of these unique RTKs in a large clinical cohort of lung cancer patients. Discoidin domain receptor 1 was shown to be upregulated in tumour vs normal tissue, whereas DDR2 was downregulated. Intriguingly, our data suggest that higher DDR1 expression can also be a strong independent prognostic indicator for early-stage NSCLC patients. Although DDR2 also appeared to be protective in our cohort, there were ultimately too few cases with quality expression data to make a firm conclusion. These results detailing differences in expression at the mRNA level are also of interest in relation to previous reports that DDR1 exhibits preferential protein expression in tumour epithelial cells, whereas DDR2 is more commonly expressed in the surrounding stroma (Alves et al, 1995). At this stage, it remains unknown as to the mechanisms by which DDRs may contribute to oncogenesis.

We were interested to note the recent study reporting for the first time somatic mutations in DDR1 and DDR2 (Davies et al, 2005). The authors report mutations in two patient samples (one squamous cell carcinoma and one large cell carcinoma), and one lung cancer cell line (NCI-H1770). However, we could not confirm the existence of the DDR1 mutation in the NCI-H1770 neuroendocrine cell line, which we obtained from ATCC and passaged for three times only. Despite repeated amplification, sequencing and analysis of both forward and reverse sequences, the wild-type DDR1 sequence was observed consistently. Davies et al (2005) comment on the unusually large number of mutations in the NCI-H1770 cell line cultured in their laboratory and suggest that this may be indicative of a defect in DNA repair that mimics UV exposure. The apparent discrepancy between the two sets of results may also be owing to differences in clonal strains of the NCI-H1770 cell lines, and passage number at the time of sequence analysis. Furthermore, we amplified and sequenced 33 cell lines (including 12 lung cell lines and six xenografts) and 46 clinical samples (RNA extracted from 23 matched lung tumour and normal tissue) from patients treated at our institution and did not identify any of the three mutations reported by Davies et al (2005).

Several explanations exist for this disparity. Lung cancer encompasses a broad range of clinical subtypes, and the makeup of the two cohorts differed. We cannot conclude that differences were owing to our methodology as our PCR and sequencing methods were robust and results repeatable. Further to this point, we managed to detect both germline and acquired polymorphisms in our samples, thereby verifying our ability to detect mutations if they existed in our population. One synonymous nucleotide change in DDR1 was present in almost half of our clinical cohort. This polymorphism is located at amino acid 495, directly adjacent to amino acid 496, previously identified as the site of a somatic mutation in DDR1. The nucleotide change is a cytosine to a thymine change located at the third nucleotide of the codon, resulting in no alteration in the amino acid serine. The previously identified A496S mutation results from a change from a guanine to a thymine (Davies et al, 2005). The two altered thymine nucleotides are sequential, suggesting that this region of the DDR1 genome may be a hotspot for genetic variation (Glover et al, 2005). We must therefore conclude that if these recently reported DDR1 and DDR2 mutations are valid, then their prevalence must be much lower in the general lung cancer population than expected. This does not however rule out the possibility that they are important mutations contributing clinically to a subset of lung cancers in certain populations. This is analogous to the emerging understanding of the role of EGFR mutations in various populations. As we understand more about the role of RTKs in cancer, it has become important to investigate the presence and prevalence of mutations in these genes, in varied populations. We suggest further research into the presence of mutations in diverse cohorts from varied populations.

This is the first study to show a strong association of the collagen binding RTK, DDR1 with human lung cancer. We have conclusively shown that DDR1 is significantly more (2.15-fold) expressed in tumour as opposed to normal tissues from lung cancer patients, and have demonstrated that it can also be a strong prognostic indicator. Further research is required into the mechanisms by which both DDR1 and DDR2 function in lung carcinogenesis.