Abstract
The role of p53, as a prognostic factor for survival in lung cancer, is controversial and the purpose of the present systematic review of the literature is to determine this effect.
Published studies were identified with the objective to aggregate the available survival results after a methodological assessment using a scale specifically designed by the European Lung Cancer Working Party (ELCWP). To be eligible, a study had to deal with p53 assessment in lung cancer (primary site) only, and to provide a survival comparison according to the p53 status.
Among the 74 eligible papers, 30 identified p53 abnormalities as a univariate statistically significant poor prognostic factor and 56 provided sufficient data to allow survival results aggregation. There was no significant difference between the trials that either showed or did not show a prognostic effect of p53 according to the methodological score or to the laboratory technique used. The studies were categorized by histology, disease stage, treatment and laboratory technique. Combined hazard ratios suggested that an abnormal p53 status had an unfavourable impact on survival: in any stage nonsmall cell lung cancer (NSCLC) the mean (95% confidence interval) was 1.44 (1.20–1.72) (number of studies included in the subgroup was 11), 1.50 (1.32–1.70) in stages I–II NSCLC (n=19), 1.68 (1.23–2.29) in stages I–IIIB NSCLC (n=5), 1.68 (1.30–2.18) in stages III–IV NSCLC (n=9), 1.48 (1.29–1.70) in surgically resected NSCLC (n=20), 1.37 (1.02–1.85) in squamous cell carcinoma (n=9), 2.24 (1.70–2.95) in adenocarcinoma (n=9), 1.57 (1.28–1.91) for a positive immunohistochemistry with antibody 1801 (n=8), 1.25 (1.09–1.43) for a positive immunohistochemistry with antibody DO-;7 (n=16), and 1.65 (1.35–2.00) for an abnormal molecular biology test (n=13). Data were insufficient to determine the prognostic value of p53 in small cell lung cancer.
In each subgroup of nonsmall cell lung cancer, p53 abnormal status was shown to be associated with a poorer survival prognosis.
Lung cancer is the most common cause of cancer death in industrialized countries and its incidence is steadily increasing in females and in many European countries. Despite improvements in diagnosis and therapy, the overall 5-;yr survival is still 15%.
Some independent prognostic factors have been identified for predicting survival and helping in the management of patients with lung cancer 1. They include: for small cell lung cancer (SCLC), extent of disease and performance status (PS) 2; for resectable nonsmall cell lung cancer (NSCLC), PS, tumour, node, metastasis (TNM) stage, and age 3; for advanced NSCLC, PS, TNM stage, age, sex and weight loss 4, 5. Among biological factors, white blood cell count, serum lactate dehydrogenase level, angiogenesis and factors reflecting proliferative state have been shown to significantly predict outcome 6, 7.
Recent progresses in molecular biology have allowed for the extension of the research on prognostic factors to the analysis of proteins and genes involved in cancer development. For example, factors related to growth (e.g. epithelial growth factor, erb-;B2), cell cycle (e.g. retinoblastoma gene), or apoptosis (e.g. p53 and bcl-;2) have been studied in recent publications, in order to correlate them with survival 8–11.
p53 has been the topic of numerous publications in lung cancer patients. It is a 53-;kD nuclear phosphoprotein, produced by the p53 tumour suppressor gene, which is localized on human chromosome 17p13. It binds to double-stranded deoxyribonucleic acid (DNA) and has three main physiological functions: cell cycle regulation, induction of apoptosis, and stabilization of the genome 12. Mutations of the p53 gene are implicated in the carcinogenesis of many cancers and are commonly found in lung cancer 13, 14.
Despite the large number of studies, the prognostic value of p53 for survival in patients with lung cancer is controversial, numerous studies published in this field deal with a small series of patients. Therefore, a systematic review of the literature has been performed to assess the prognostic value of p53 abnormalities for the survival of patients presenting with lung cancer.
Materials and methods
Publications selection
Criteria for eligibility of a study to the present systematic review were: to deal with lung cancer only; to evaluate the correlation between p53 status and patient survival; to analyse p53 (protein, DNA or ribonucleic acid (RNA) in the primary tumour (not in metastatic tissue, or in tumour adjacent tissue) and/or antibodies against p53 in the serum; to be published as a full paper in the English or French language literature.
Articles were identified by an electronic search on MEDLINE using the keywords “lung neoplasm” and “p53”, completed by the personal bibliography of two of the authors. The bibliographies reported in any of the studies identified were also used for the trials of identification. The search ended in July 1999.
When the same author reported results obtained on the same patient population, in several publications, only the most recent or the most complete report was included in the analysis (to avoid overlap between cohorts).
The review was limited to the published studies and no contact was made with the authors to obtain unpublished data.
Methodological assessment
To assess the trial methodology (the Wording trial, to refer to a clinical research study, is used in the present paper), 10 investigators, including nine physicians and one biostatistician, read each publication independently and scored them according to a quality scale as described in the Appendix. Each item was assessed using an ordinal scale (possible values: 2, 1, 0).
The scores were compared and a consensus value for each item was reached in meetings at which at least two-thirds of the investigators needed to be present. The participation of many readers was a guarantee for the correct interpretation of the articles. As the scores were objective, a consensus was always obtainable.
The overall score evaluated several dimensions of the methodology, grouped into four main categories: the scientific design; the description of the methods used to identify an abnormality of the p53 status (abnormality meaning presence of a mutated gene, a detectable protein or anti-;p53 antibodies); the generalizability of the results; the analysis of the study data.
Each category had a maximum score of 10 points, hence the overall maximum score was 40 points. When an item was not applicable in a study, the theoretically attributable points were not taken into account in the total of the concerned category. The final scores were expressed as percentages, ranging 0–100%, higher values indicated a better methodological quality.
The studies eligible for the systematic review were called “eligible” and those providing data for the meta-analysis “evaluable”. Only studies reporting univariate analysis results for survival were considered for the aggregation of the survival data.
Statistical methods
The results of a study were considered significant if the p-;value for the statistical test comparing the survival distribution between the groups with and without abnormalities in the status of p53, was <0.05 in favour of the group without abnormalities. The study was then called “positive” (which meant that abnormalities were a poor prognosis for survival). If authors reported a significant result in a subgroup analysis only, the study was also considered to be positive. In the other situations it was called “negative”, including the situation where a significant survival difference was found in favour of the group of patients with p53 abnormalities.
The correlation between the quality scores, or two continuous variables, was measured by the Spearman rank correlation coefficient. Its significance was assessed by testing the null hypothesis of equality to zero of this coefficient. Nonparametric tests were used to compare the distribution of the quality scores according to the value of a discrete variable (Mann-Whitney U-;tests for binary variables and Kruskal-Wallis tests for multiple class variables).
For the quantitative aggregation of the survival results, the p53 effect was measured by the hazard ratio (HR) between the two survival distributions. For each trial, this HR was estimated by a method depending on the results provided in a previous publication 15. The most accurate method comprised of retrieving the HR estimate and its variance from the reported results, or calculating them directly using parameters given by the authors for the univariate analysis: the O-;E statistic (difference between numbers of observed and expected events), the confidence interval (CI) for the HR, the logrank statistic or its p-;value. If these were not available, the total number of events, the number of patients at risk in each group and the logrank statistic or its p-;value were used to allow for an approximate calculation of the HR estimate. Finally, if the only exploitable data were in the form of graphical representations of the survival distributions, survival rates at some specified times were extracted from them in order to reconstruct the HR estimate and its variance, with the assumption that the rate of patients censored was constant during the study follow-up. If authors reported survival of three or more groups (for example, using several cut-off values for percentage of protein present in the nucleus, or regarding the exons of DNA separately) the results were pooled, making a comparison between two groups feasible.
Survival of the entire patient population, when available, was analysed globally, according to the p53 status. If results in particular subgroups were reported separately, they were treated in the meta-analysis of the corresponding subgroups. Of course, the same patients were never considered more than once in the same analysis. The individual HR estimates were combined into an overall HR using the method published by Yusuf et al. 16, which consists of using a fixed-effects model with the assumption of the homogeneity of the individual HRs. This assumption was tested by performing Chi-;squared tests for heterogeneity, as previously described 16. The possible heterogeneity among the HRs was first taken into account by categorizing the studies according to some of the covariates (histology, stage, treatment, or laboratory method) known to be prognostic factors in lung cancer, rather than considering a random-effects model. However, if the assumption of homogeneity had to be rejected for some defined subgroups, a random-effects model was used in a second stage for these particular subgroups. By convention, an observed HR >1 implied a worse survival for the group with p53 abnormalities. This pejorative impact of p53 on survival was considered as statistically significant if the 95% CI for the overall HR did not overlap 1.
Results
Study selection and characteristics
Seventy-nine trials, published between 1992–1999, were selected [9–11, 17–92]. They all reported the prognostic value for survival of p53 status in lung cancer patients, assessing p53 gene abnormality or protein expression in the primary tumour, or antibody anti-;p53 presence in the serum. Five of these studies were excluded because an identical patient cohort had been used in other selected publications (studies that were excluded and included were 39, 40, 43, 44, 71, 72, 84–87.
The main characteristics of the 74 studies eligible for the systematic review are shown in table 1⇓. Sixty-seven of them dealt with NSCLC, while SCLC, neuroendocrine tumours, and any histological type were the subject of 2, 2 and 3 trials, respectively. The NSCLC studies considered either all subtypes (n=57), adenocarcinoma (n=6), or squamous cell carcinoma (n=4). Fifty of the 67 NSCLC studies assessed locoregional disease, treated by surgery alone in 76% of cases.
Some studies evaluated the prognostic role of p53 gene mutations, identified by molecular biology (n=18), with polymerase chain reaction (PCR), most frequently followed by single-strand conformation polymorphism (SSCP) method, or by denaturing gradient gel electrophoresis (DGGE), or by DNA sequencing. One study assessed the RNA abnormalities by a reverse-transcription PCR-SSCP method. The others evaluated the accumulation of p53 protein either by immunohistochemistry (IHC) detection (n=49), by optic microscopy or flow cytometry, or by an enzyme-linked immunosorbent assay technique (ELISA) on cellular extracts (n=2). Five studies investigated the role of p53 antibodies in sera by ELISA, Western blot or immunoblot test.
Three trials evaluated both p53 DNA and protein; they were considered in the present analysis for DNA results only, the protein accumulation not being strictly correlated with gene mutation. One study assessed both p53 protein accumulation in the primary tumour and presence of antibodies in the sera, and was considered for the protein analysis only.
Amongst the 74 trials eligible for systematic review, it was not possible to evaluate 18 for the HR estimation due to insufficient reported data, and they, therefore, could not be taken into account for the aggregation of the survival results. The reasons for noninclusion of a study into the meta-analysis were: absence of univariate analysis (n=3) 37, 63, 80; no p-;value, HR or CI reported (n=8) 11, 23, 24, 49, 61, 62, 67, 91; no survival curve shown (n=5) 17, 31, 35, 53, 81; no p53 status distribution (n=2) 28, 73.
Study results report
As shown in table 1, 30 of the 74 trials (41%) identified p53 abnormalities as a poor prognostic factor for survival (with 28 evaluated for the meta-analysis), 42 (57%) concluded that p53 is not a prognostic factor for survival (26 evaluated), and two (2%) reported a better prognosis for p53 positivity, both being evaluated 18, 58. In order to compare positive studies (p53 was statistically a poor prognostic factor) and negative studies (not statistically significant), the latter two studies were pooled with the negative ones.
Among the 67 trials on NSCLC, 28 (42%) were positive (26 of which were evaluated). None of the two studies dealing with SCLC reported significant results. One of the two trials assessing p53 in neuroendocrine tumours had significant results. Three studies evaluated lung cancer of any histology and of any stage, of which one was significant. Twenty-two of the 51 trials (43%) evaluating the role of p53 protein, 39% of those evaluating p53 DNA and 20% of those assessing anti-p53 antibodies, were significant.
Concerning the meta-analysis, 18 of the 74 studies were not evaluated. The evaluation status for the meta-analysis was associated with trial positivity; the rate of positive results was 50% for the trials that could be evaluated (28 of 56) compared to 11% (2 of 18) for those that could not (p=0.005).
Quality assessment
Overall, the global quality score ranged 23.9–85.2%, with a median of 54.8% (table 2⇓). The design subscore had the lowest value, with a median value of 4 out of 10. The most poorly described items (<30% of the maximum theoretical score) were the a priori estimate of sample size required to conduct the study (2%), the blinding in evaluation of the results (24%), and the initial workup description (11%).
There was a significant correlation between the global score and the number of patients included in the study (Spearman correlation coefficient rs=0.43, p=0.0015).
Concerning the global score, there was no statistically significant difference between the 30 positive and the 44 negative trials (median of 55.1% versus 54.3%, p=0.24 by the Mann-Whitney U-;test). However, the positive trials had a better score concerning the report of the analysis results: 6.2 of 10 in comparison to 3.7 of 10 for the negative trials (p=0.02).
There was no statistically significant difference between the 56 studies that were evaluated and the 18 studies that were not, either for the global score (54.8% in comparison to 54.6%, p=0.72 by the Mann-Whitney U-;test) or the four subscores.
Concerning the 56 trials evaluated for the meta-analysis, the global score ranged 24.0–85.2%, with a median of 54.8%. The four subscores were superimposable to those of the 74 trials, with the design subscore being the worst described. The same items were poorly described (<30% of maximal theoretical score). There was no significant difference between the positive and the negative trials according to the global score (median of 54.8% in both groups, p=0.61).
The global score was not statistically different between studies assessing p53 status by IHC (n=35) or by molecular biology (n=14), with scores of 56.5% and 52.4%, respectively (p=0.07), but there was a significant difference for the description of the laboratory methodology, IHC being better described than molecular biology (6.4 of 10 compared to 4 of 10, p=0.0002).
Meta-analysis
The absence of a significant difference in the overall methodology score between the positive and negative trials led to the performance of the quantitative aggregation of the individual survival results. Because of the important heterogeneity of the cohorts of the 56 selected trials, only subgroup analyses were performed. Indeed, authors report on different patient cohorts (selected on the basis of histology, stage and/or treatment). Furthermore, the techniques used to identify p53 abnormalities could bring some additional heterogeneity.
The subgroups that were analysed were defined according to the covariates described earlier.
Of the 56 evaluable studies, eight were not used in the meta-analysis for the following reasons. Results aggregation was not performed in SCLC because the approaches used by the two available studies were too different, one assessed the p53 gene directly into the tumour cell 60 and the other assessed circulating anti-;p53 antibodies 82. For neuroendocrine tumours there was only one trial that could be evaluated 77. Of the three studies including any type of lung cancer, one reported results for the subgroups of adenocarcinomas and squamous cell carcinomas 64, which were used in their respective classes. The other two analysed p53 using very different approaches that prevented them from being aggregated together; one assessed the protein into the tumour cell 19 and the other circulating antibodies 54.
The five studies assessing circulating anti-;p53 antibodies that could be evaluated were not aggregated because they all dealt with different patient populations: any stage of NSCLC 18, locoregional squamous cell lung cancer 50, SCLC 82, and any type of lung cancer 54.
The individual HRs of the 48 evaluated studies were calculated by one of the three methods reported in the Materials and methods section. Only five studies reported the data needed to directly calculate the estimated HR. In 14 trials, HR was approximated by the total number of events and the logrank statistic or its p-;value. For the 29 remaining studies, HR had to be extrapolated from the graphical representations of the survival distributions.
Stages I and II NSCLC subgroup included 19 trials (nine positive) analysing 2,580 patients. The aggregation produced a statistically significant HR (1.52; 95% CI 1.35–1.72) with a worse survival prognosis in cases of p53 abnormalities (table 3⇓). The test for heterogeneity gave a significant result (p=0.01), but it was not possible to go further in the categorization of the trials, and namely, to treat papers reporting on Stage I patients separately. The use of a random-effects model did not change the conclusion with a combined HR of 1.78 (95% CI 1.45–2.19).
The surgically resected NSCLC subgroup, defined as a tumour completely surgically resected without any other treatment modality, with 10 positive trials among 20, showed very similar results (HR=1.56; 95% CI 1.37–1.79, Chi-;squared test for heterogeneity p=0.02) (table 4⇓). Once again, the introduction of a random effect did not change the interpretation of the HR (HR=1.66; 95% CI 1.37–2.01).
Studies performed in NSCLC Stages I–IIIB, or Stages III and IV, or all stages (trials 5, 9 and 11 respectively) also showed HR to be significantly >1 by the aggregation: 1.68 (95% CI 1.23–2.29), 1.68 (95% CI 1.30–2.18) and 1.44 (95% CI 1.20–1.72) respectively. No heterogeneity was detectable in any of these subgroups (p>0.05) (tables 5–7⇓⇓⇓).
When considering the NSCLC histological subtypes, nine trials were assessable for adenocarcinoma (table 8⇓) and nine for squamous cell carcinoma (table 9⇓). Results were significant, with an HR of 2.24 (95% CI 1.70–2.95) and 1.37 (95% CI 1.02–1.85) respectively, without any significant heterogeneity.
The methodology used to detect p53 abnormalities was analysed according to the following three aggregations (figs. 1–3⇓⇓⇓, individual data having been already reported in previous tables): IHC with antibody 1801 (n=8), IHC with antibody DO-;7 (n=16) and molecular biology (n=13). HRs were 1.57 (95% CI 1.28–1.91), 1.25 (95% CI 1.09–1.43) and 1.65 (95% CI 1.35–2.00) respectively. The three corresponding Chi-;squared tests for heterogeneity were not statistically significant (p=0.38, p=0.77 and p=0.32, respectively).
Discussion
The systematic review of the literature shows that alteration of the p53 gene in the lung cancer cells is a poor prognostic factor for survival in patients with NSCLC. The analysis revealed similar features for each NSCLC subgroup and helped to clarify the somewhat inconsistent message of individual studies.
This observation is potentially important for prognostic reasons. Identification of independent prognostic factors allowed for the definition of high-risk groups of patients for whom specific therapy could be designed or a stratification in randomized trials performed. So far, for lung cancer, useful prognostic factors are clinical variables, such as disease extent or performance status, or routine histological tests, such as white blood cell count or serum lactate dehydrogenase level. The results of the meta-analysis, which suggested a relationship between p53 and survival but only in a univariate setting, should encourage the development of adequately designed prospective studies, with an appropriate statistical methodology including multivariate analysis, in order to demonstrate the routine utility of a molecular biology marker like p53.
Moreover, the presence of abnormal p53 expression might be a potential therapeutic target. Tumour suppressor gene therapy is currently the topic of preliminary clinical investigations 93, 94.
The aggregation of the survival data was performed by using the results reported in the univariate analysis only. The type of studies available were very heterogeneous, often oriented to particular groups of patients, such as those with NSCLC treated by surgery alone or with a specific histological subtype or a particular tumour stage. For this reason, a global meta-analysis was not performed and the analysis concentrated on more homogeneous subgroups of patients by aggregating the data of studies conducted in similar patient populations or tumours.
Even by doing this, however, the issue of heterogeneity existing among the individual trials could not be addressed completely. Indeed, in two of the 10 subgroups considered, a statistically significant test for heterogeneity was found (Stages I–II and surgical stages). In the subgroup of Stages I–II, the heterogeneity was mostly due to a particular study 76, which was dedicated to pathological Stage I patients only. By dropping this trial, the test for heterogeneity became unsignificant (p=0.51). The selection of patients with the best possible prognosis, such as Stage I disease, might be responsible for the lack of effect of p53 abnormality on survival. However, no other trial provided results specific to Stage I patients, and it was thus impossible to analyse them separately.
In the surgically resected subgroup, the detected heterogeneity was less surprising. Patient candidates with the most favourable prognosis for surgery might belong to Stage I, with a seriously decreased probability of survival for Stage IIIA. The definition of this subgroup, using the treatment covariate, was not optimal, but categorization according to covariates having a better prognostic value was not possible because of the lack of available data in the publications.
When using random-effects models, the authors came to the same conclusion as with fixed-effects models, but such models do not identify the source of heterogeneity, which is the important clinical point. However, the quantitative aggregated HRs were consistent and similar in all the categories tested, with significantly decreased survival in cases of abnormally expressed p53 protein or gene mutation: Stages I–II NSCLC, NSCLC treated by surgery, Stages I–IIIB NSCLC, Stage III–IV NSCLC, any stage NSCLC, adenocarcinoma, squamous cell carcinoma (tables 3–9⇓⇓⇓⇓⇓⇓⇓). Subgroups of studies dealing with neuroendocrine tumours, SCLC, any histology and antibodies anti-;p53 were not meta-analysed because of the lack of similar approaches.
It should be noted that the two eligible studies reporting on SCLC were both negative. The single trial assessing the gene directly 60 included a very small number of patients (n=28) and the information provided therefore has to be considered as preliminary. In fact, further studies are necessary to determine the value of p53 as a prognostic factor for survival in SCLC.
The decision to perform the meta-analysis was based on a prior methodological quality assessment of the publications. A methodology similar to that of previous systematic reviews reported by the present authors on the treatment of lung cancer was used 95–. A specific scale for biological prognostic factors was designed (Appendix) based on the opinion of experts and on prior experience in the field. By comparing the scores of the studies where p53 was a significant prognostic factor and of those where it was not, differences could be identified that suggested biases induced by the trials methodology. In fact, the comparison of the scores of the two groups showed no statistically significant difference, allowing for a meaningful data aggregation. It should be stressed, however, that results analysis was significantly better reported in the positive studies than in the negative studies.
However, the methodology used did not prevent all of the potential biases. The most important one is probably the publication bias. Indeed, studies with nonstatistically significant results are published less often or, if they are, with more concise data (as shown in the methodological analysis). Moreover, there is a language problem. The present review was restricted to articles published in English and French because other languages such as Japanese were not accessible for the readers. This bias could favour the positive studies that are more often published in English, while the negative ones tend to be reported in native languages 99. Another possible source of confusion is the use of the same cohort of patients for different publications 39, 43, 72, 84, 86 and it might be difficult to avoid the fact that the same patients have been included twice or three times in the meta-analysis. Finally, for practical purposes because of their small number, the two trials that showed a significant positive survival effect of the presence of abnormal p53 were included in the negative group.
The diversity in the techniques used to identify alteration of the p53 status can also be a potential source of bias. Some authors search for mutations inside the gene by molecular biology techniques, usually by targeting exons 5–8 where the majority of mutations in lung cancer are situated 100. However, not all mutated proteins lose their physiological function and in the case of p53, the disturbance of the regulation of the cell cycle depends on the site of the mutation 101, 102. Moreover, not all mutations are found by a partial analysis of the gene. Other authors have studied the accumulation of an abnormal form of the protein in the cell nucleus, the normal p53 protein having a half-life that was too short to be detected 103. For that purpose they used an immunohistochemistry technique or, less often, ELISA. Immunohistochemistry is not performed with the same antibody to reveal the protein. Moreover, the cut-off in the number of positive cells defining a tumour with p53 accumulation is often arbitrary and varies according to the investigators, from a few per cent to 50%. An optimal threshold is yet to be defined. Sometimes, the manipulations might be inadequate: for example, the use of the antibody 1801 on a paraffin-embedded tissue, because of its specific immunoreactivity 104, or the absence of prior reaction of epitope unmasking on fixed tissue 105. Finally, immunohistochemistry can produce false-negative results because some mutations are associated with the absence of detectable protein 106, or false-positive results because p53 can be quantitatively increased or stabilized in circumstances other than gene mutations 107–. Thus, there is no absolute correlation between immunohistochemistry and molecular biology. To try to exclude the technique bias, a subgroups analyses was performed according to the most frequently used methods: immunohistochemistry with antibody 1801, immunohistochemistry with antibody DO-;7 and molecular biology (figs. 1–3⇓⇓⇓). In any situation, the results were consistent, with poorer survival in cases where there was an abnormal p53 status, making the techniques improbable as a source of bias.
Some biases can be due to the methodology used to perform the meta-analysis. As already highlighted, the negative studies reported less detailed results, making them unlikely to be evaluated. Of 18 studies excluded for the meta-analysis due to a lack of reported data, 16 were negative, which represents a significantly higher proportion than in the trials evaluable for the meta-analysis. There is thus, a potential bias in favour of positive trials. Another potential source of biases is related to the method of extrapolation of the HR. If they were not reported by the authors, they were calculated from the data available in the article and, if this was not possible, they were extrapolated from the survival curves, hence making assumptions on the censoring process. Moreover, the choice of time intervals for reading mortality rates on the curves is not the object of a consensus. Finally, it should again be emphasized that a global meta-analysis did not appear meaningful because of the heterogeneity of the patient populations. This potential population bias makes it hazardous to generalize the results of the meta-analysis of subgroups of patients that were not included in the data aggregation, such as those with SCLC.
The present results are based on an aggregation of data obtained by univariate survival analysis in a majority of retrospective trials. It should be stressed that an HR of 1.5–2 does not mean a large effect of p53 on survival and, therefore, p53 does not constitute a useful factor for prediction of shorter survival at the level of individual patients. The results need to be confirmed by an adequately designed prospective study and the exact value of an abnormal p53 status needs to be determined by an appropriate multivariate analysis, taking into account the classical well-defined prognostic factors for lung cancer. A meta-analysis based on the individual data of the patients included into the studies 110 would help to define, by multivariate methods, the prognostic role of p53, but it will require the collection of a huge amount of retrospective data, with the potential problem of dealing with a lot of missing data. Moreover, it will not have the definitive value of a well-designed prospective study 111.
To conclude, a systematic review of the lung cancer literature suggested a poor prognostic value for survival of an abnormal p53 status in patients with nonsmall cell lung cancer, whatever the biological test used. Data are insufficient to determine its role in small cell lung cancer. This observation requires further investigation with an adequate statistical methodology to determine its exact place among the other known survival prognostic factors for lung cancer.
Results of the meta-analysis of the studies evaluating p53 status by immunohistochemistry with antibody 1801. Hazard ratio >1 implies a worse survival for the group with p53 abnormalities. ▪: hazard ratio value of each study, designed proportionally to the amount of patients included; -; -; -; : combined hazard ratio of the meta-analysis of the studied subgroup; ♦: the centre of the diamond gives the combined hazard ratio and its extremities the 95% confidence interval.
Results of the meta-analysis of the studies evaluating p53 status by immunohistochemistry with antibody DO-;7. ▪: hazard ratio value of each study, designed proportionally to the amount of patients included; -; -; -; : combined hazard ratio of the meta-analysis of the studied subgroup; ♦: the centre of the diamond gives the combined hazard ratio and its extremities the 95% confidence interval.
Results of the meta-analysis of the studies evaluating p53 status by molecular biology. ▪: hazard ratio value of each study, designed proportionally to the amount of patients included; -; -; -; : combined hazard ratio of the meta-analysis of the studied subgroup; ♦: the centre of the diamond gives the combined hazard ratio and its extremities the 95% confidence interval.
Main characteristics and results of the eligible studies
Results of the methodological assessment by the European Lung Cancer Working Party score
Meta-analysis of the studies including stages I and II nonsmall cell lung cancer (NSCLC), with their characteristics
Meta-analysis of the studies including surgical stage nonsmall cell lung cancer (NSCLC), with their characteristics
Meta-analysis of the studies including Stages I–IIIA nonsmall cell lung cancer (NSCLC), with their characteristics
Meta-analysis of the studies including stages III and IV nonsmall cell lung cancer (NSCLC), with their characteristics
Meta-analysis of studies including nonsmall cell lung cancer (NSCLC) of any stage, with their characteristics
Meta-analysis of the studies assessing adenocarcinoma, with their characteristics
Meta-analysis of the studies assessing squamous cell carcinoma, with their characteristics
Appendix: European Lung Cancer Working Party quality scale for biological prognostic factors for lung cancer
Except when specified, the attributed value per item is 2 points if it is clearly defined in the article, 1 point if its description is incomplete or unclear and 0 point if it is not defined or is inadequate.
Scientific design
Study objective definition.
Study design: prospective (2 points); retrospective or retrolective (1 point); not defined (0 point).
Outcome definition.
Statistical considerations: fully reported with a preliminary assessment of the patient/sample number to be included and/or analysed (2 points); patient/sample number to be included and/or analysed justified by the number of studied variables (minimum 10 patients per variable) (1 point); not defined (0 point).
Statistical methods and tests description.
Laboratory methodology
Blinding in the biological assays performance: double-blind (2 points); simple-blind (1 point); unblinded or not defined (0 point).
Tested factor description: DNA (types of exons analysed), messenger RNA (complete or partial with description of the primers used), protein (nuclear, cytoplasmic or extracted from cellular components), antibodies (type of tissue or liquid sampled).
Tissue sample conservation: either fresh tissue or conservation requiring freezing at ≤−80°C in presence of an anti-RNAase for RNA or freezing at ≤−20°C for DNA, protein and serum, or fixation in formol, alcohol or paraffin.
Description of the revelation test procedure of the biological factor: PCR with mention of primers, polymerase type, general reaction conditions (concentration of the various reagents, cycle number, duration and temperature of the various steps); DNA-;sequencing with the method used, the electrophoresis characteristics (gel composition, duration, temperature and generator voltage) and revelation procedure; reverse transcription (RT) with the transcriptase-reverse type and general conditions of incubation (reagent concentration, temperature and duration); SSCP with the gel composition (acrylamide percentage, glycerol content), other electrophoresis characteristics (duration, temperature and generator voltage), coloration method (the SSCP must be followed by the abnormally migrated fragments sequencing if there is no negative internal control); DGGE with gel composition and gel composition gradient, other electrophoresis characteristics (duration, temperature and generator voltage) and coloration method (the DGGE must be followed by the abnormally migrated fragments sequencing if there is no negative internal control); restricted fragment length polymorphism (RFLP) with the restriction enzyme type, temperature and incubation time, electrophoresis characteristics (gel composition, duration, temperature and generator voltage), coloration method (the RFLP must be followed by the abnormally migrated fragments sequencing if there is no negative internal control); IHC with the first antibody type and clone identification, second antibody type, reaction characteristics (antibodies concentration, duration and temperature of incubation), colouration method (peroxydase, alkaline phosphatase or chromagenic method), epitope unmasking method in case of fixed tissue, endogenous peroxydase activity inhibition method if the colouration method requires peroxydase; ELISA with the type of antibodies/antigens used, general reaction conditions (reagents concentration, temperature, duration), unspecific sites blockage (type of serum used), colouration and reading methods (fluorometry, spectrophotometry, etc.); Western blot with types of antibodies/antigens used, gel composition, other electrophoresis characteristics (duration, temperature, generator voltage), colouration method; immunoblot with the type of antibodies/antigens used, the type of membrane, other electrophoresis characteristics (duration, temperature, generator voltage), colouration method.
Description of the negative and positive control procedures.
Test reproducibility control: between investigator's centres if the study is multicentric, or inside the centre if it is unicentric.
Definition of the level of positivity of the test (only evaluated in case of IHC): validated (2 points); arbitrary (1 point); not described (0 point).
Generalizability
Patient selection criteria, including histological type, disease stage and treatment.
Patients' characteristics, including histology type, disease stage and treatment.
Initial workup.
Treatment description.
Source of samples.
Number of unassessable samples with exclusion causes.
Results analysis
Follow-up description, including the number of events.
Survival analysis according to the biological marker.
Univariate analysis of the prognostic factors for survival: report of the relative risk with the confidence interval (2 points); results without evaluation of the relative risk and its confidence interval (1 point); not reported or inadequate (0 point).
Multivariate analysis of the prognostic factors for survival: report of the relative risk with the confidence interval (2 points); results without evaluation of the relative risk and its confidence interval (1 point); not reported or inadequate (0 point).
Acknowledgments
The authors would like to thank D. Larsimont, P. Martiat, P. Vermylen, B. Martin, D. Gancberg and L. Lespagnard for their assistance in the design of the methodological scale for biological prognostic factors.
- Received July 7, 2000.
- Accepted April 12, 2001.
- © ERS Journals Ltd