Abstract
Contemporary prognostic equations in pulmonary arterial hypertension (PAH) derived from US and French cohorts may not perform as well in the UK as a locally derived scoring scheme. The aim of the study was to develop and validate a UK risk score to predict prognosis in PAH.
Baseline mortality predictors identified by multivariate Cox analysis in 182 incident PAH patients were used to derive the Scottish composite score (SCS). Its prognostic performance in an independent UK cohort was compared with the French registry and Pulmonary Hypertension Connection (PHC) registry equations using Brier scores (BS).
The SCS based on age, sex, aetiology, right atrial pressure, cardiac output and 6-min walk distance predicted survival in the validation cohort (hazard ratio (HR) 1.7 per point increase; p<0.001) and provided further prognostic stratification in World Health Organization (WHO) functional class III patients (HR 1.8 per point increase; p<0.001). It was more accurate than the French registry equation in predicting 1-yr survival (BS: 0.092 versus 0.146; p=0.001) and 2-yr survival (0.131 versus 0.255; p<0.001). There was no significant difference in BS between the SCS and PHC registry equation.
The SCS predicts survival and can be used to supplement WHO functional class in prognostication.
Pulmonary arterial hypertension (PAH) remains an incurable disease associated with high morbidity and mortality despite expansion of effective pharmacological therapy in the last decade [1]. As accurate prognostication is an integral part of disease management, considerable efforts have been made to identify prognostic factors and develop algorithms to predict patient outcome for clinical use. The first prognostic equation based on pulmonary haemodynamics (right atrial pressure (Pra)), mean pulmonary artery pressure (mean Ppa) and cardiac index at diagnosis) was derived from the National Institutes of Health (NIH) registry study of 194 patients with primary pulmonary hypertension (now classified as idiopathic, heritable and anorexigen-associated PAH) from 32 centres across the USA two decades ago before the advent of PAH-specific drug therapy [2]. It has become obsolete but continues to be used as a benchmark for patient outcome without targeted therapeutic intervention. Survival analysis of a contemporary PAH cohort from the Pulmonary Hypertension Connection (PHC) registry treated in a single centre in Chicago (IL, USA), including demographics and functional measures, identified the same haemodynamic variables as the NIH equation and these were used to derive a new equation based on a similar methodological approach [3]. Another prognostic equation based on sex, 6-min walk distance (6MWD) and cardiac output (CO) has been developed from the French registry including incident and prevalent patients diagnosed up to 3 yrs prior to study entry [4, 5]. These two contemporary equations only apply to patients with idiopathic, familial and anorexigen-associated PAH whereas the equation developed from the Registry to Evaluate Early and Long-term Pulmonary Arterial Hypertension Disease Management (REVEAL) included all patients in World Health Organization (WHO) Group I PAH [6]. Due to its large sample size, a greater number of variables were incorporated in the prediction model compared with the French registry and PHC registry equations (13 versus three variables).
These contemporary prognostic algorithms utilise baseline variables to calculate a predicted probability of survival at a certain time point during follow-up. Validation data on these equations published so far support their predictive value. The REVEAL equation and its simplified risk score (a 22-point scoring system) have been shown to have good discriminatory ability in a prospective PAH cohort of newly diagnosed patients [7]. When the PHC registry and French registry equations were applied to a prospective cohort of PAH patients followed up in four randomised controlled clinical trials and their extension studies, there was good agreement between predicted and observed survival [8]. As these equations were developed from patient populations in the USA and France with differing demographics, healthcare systems and treatment approaches to the UK, they may not perform as well in UK PAH populations as a locally derived scoring scheme. The aims of this study were to develop a UK risk score to predict prognosis in PAH patients from a well-defined cohort in Scotland, and to validate its prognostic performance against other published prognostic equations in an independent UK PAH cohort.
METHODS
Study design
A retrospective cohort of incident and treatment naïve patients diagnosed with WHO Group I PAH in the Scottish Pulmonary Vascular Unit (Glasgow, UK) between November 2000 and September 2009 were used to derive the Scottish composite score (SCS). Exclusion criteria were PAH associated with congenital heart disease (CHD-PAH), long-term calcium channel blocker (CCB) responders, pulmonary capillary wedge pressure (Ppcw) >15 mmHg and significant lung disease (defined by forced expiratory volume in 1 s or forced vital capacity or total lung capacity <60% predicted based on the European Coal and Steel Community reference values) [9]. The diagnosis of PAH was based on right heart catheterisation (RHC) in accordance with contemporary guidelines [1, 10]. Patients were subsequently treated with conventional therapy (long-term warfarin, diuretics or supplemental oxygen) and PAH-specific monotherapy (prostacyclin analogues, phosphodiesterase-5 inhibitors or endothelin receptor antagonists). Sequential combination therapy was initiated as clinically indicated. Baseline data on demographics, RHC, lung function, WHO functional class (FC), 6MWD, N-terminal-pro-brain natriuretic peptide (NT-pro-BNP) and Cambridge Pulmonary Hypertension Outcome Review (CAMPHOR) score [11] recorded within 3 months of diagnostic RHC and prior to starting PAH-specific therapy were collected. The variables found to be independent mortality predictors in multivariate Cox analysis were included in the SCS. Weighted points were assigned to the threshold values of each variable identified by exploratory analysis. The SCS was obtained by summation of the points scored for each variable and finalised prior to validation in a cohort of incident and treatment naive idiopathic and heritable PAH patients treated in the Pulmonary Vascular Disease Unit at Papworth Hospital (Cambridge, UK) between January 2001 and December 2009. Patients with significant lung disease (defined by lung function), raised Ppcw and long-term CCB responders were excluded. Baseline data were collected to apply the SCS, French registry and PHC registry equations. The ability of the SCS to predict survival over time and stratify patients into prognostic groups in the whole cohort and in WHO FC III patients was tested. The French registry and PHC registry equations were used to compute survival estimates at 1 and 2 yrs post-diagnosis. Their predictive accuracy was compared with that of the SCS using the Brier score (BS). Patient consent was considered unnecessary by the local research ethics committee in the respective institutions.
Statistical analysis
Statistical analysis was performed using Statview version 5.0.1 (SAS Institute, Cary, NC, USA) and Graphpad Prism version 5.00 (Graphpad Software, La Jolla, CA, USA). Continuous variables were checked for normality using a D'Agostino and Pearson omnibus normality test. Normally distributed variables were described using the mean and standard deviation, and compared using an unpaired t-test. Non-normally distributed variables were described using the median and interquartile range and compared using Mann–Whitney U test. Categorical variables were described using number (percentage) and compared by the Chi-squared test.
Univariate Cox proportional hazards analysis was used to identify baseline mortality predictors in the derivation cohort. Survival time was calculated from the date of RHC to the date of data cut-off (May 1, 2010). Patients who received lung transplantation or were lost to follow-up were censored on the date of procedure or last clinical contact. Significant continuous variables were transformed into categorised variables using threshold values identified in exploratory Cox analysis (see online supplementary data) and were assessed with their corresponding indicator variable coding for missing data (“missing” and “not missing”) in a multivariate Cox model using a backward selection procedure. The use of indicator variables was to allow inclusion of patients with missing data in multivariate analyses. None of the indicator variables for missing data was predictive of survival. Sex was not significant in univariate Cox analysis but was entered in the multivariate model as its prognostic significance has been confirmed in previous studies. Aetiology was divided into two subgroups: connective tissue disease associated pulmonary arterial hypertension due to systemic sclerosis (CTD-PAH-SSc) and WHO Group I PAH others versus idiopathic, heritable, anorexigen-associated and CTD-PAH non-SSc (reference subgroup). Anorexigen-associated PAH was included in the reference subgroup as these patients are regarded as having similar outcomes to those with idiopathic and heritable PAH. “WHO Group I PAH others” included patients with portopulmonary hypertension, HIV and pulmonary veno-occlusive disease. Variables with p<0.1 were retained in the final multivariate Cox model. Weighted points were assigned to each variable subgroup according to their adjusted hazard ratio (HR) in the final model, for example, 0 point was assigned to the reference subgroup with an adjusted HR of 1. For the other subgroups, 1 point was assigned if the adjusted HR was 2, 2 points if the adjusted HR was 3 and 3 points if the adjusted HR was 4.
Kaplan–Meier analysis was used to estimate survival rates and comparison between groups was made by the log-rank test for trend. Cox analysis was used to evaluate the ability of the SCS to predict survival in the validation cohort. The BS [12, 13] was used to assess the accuracy of the SCS, French registry and PHC registry equations in predicting 1- and 2-yr survival in the validation cohort allowing for censoring. It measures the mean squared deviation of predicted probability from the actual outcome. A BS of 0 indicates perfect prediction. A BS of 0.25 indicates that the prediction is equivalent to the outcome occurring by chance alone. Higher prediction accuracy is indicated by a lower BS. To compare the performance of the SCS with other equations, a point estimate of difference in BS between SCS and each equation and its 95% confidence interval was obtained from 200,000 bootstrap re-samples. A difference in BS of >0.02 was considered clinically relevant. A p-value <0.05 was defined as statistically significant.
RESULTS
Derivation of the SCS
Characteristics of the derivation cohort and the aetiology reference subgroup are summarised in table 1. 10% of patients received prostacyclin analogues (7% intravenous epoprostenol, 2% inhaled iloprost, 1% subcutaneous treprostinil), 47% phosphodiesterase-5 inhibitors (45% sildenafil 45% and 2% tadalafil) and 44% endothelin receptor antagonists (29% bosentan, 7% sitaxentan and 8% ambrisentan). After a median follow-up period of 25 months (range 0.1–113 months), 81 patients died from all causes, two patients received lung transplantation, one patient was lost to follow-up and two patients transferred care to another centre. The survival estimates for the whole cohort were 79% at 1 yr, 68% at 2 yrs and 57% at 3 yrs. Out of the 17 baseline variables considered in the univariate Cox model, age, aetiology, carbon monoxide diffusing capacity % pred, Pra, CO, WHO FC, 6MWD, NT-pro-BNP and CAMPHOR score were significant mortality predictors (table 2). When these variables were categorised into two to four subgroups and assessed in a multivariate Cox model, age at diagnosis, sex, aetiology, Pra, CO and 6MWD were retained as independent mortality predictors and were used to construct the SCS (0 to 8) (table 3).
Validation of the SCS
Out of 119 patients in the validation cohort, 99 patients had complete data to calculate the SCS (table 4). 38 patients died and four received lung transplantation during follow-up (median 29 months, range 0.3–102 months). The 1-, 2- and 3-yr survival estimates for patients in whom SCS was available were 87, 74 and 68%, respectively which were not significantly different from those in whom SCS was not available (log-rank p=0.196). The risk of death increased with the SCS (HR 1.7 per point increase, 95% CI 1.4–2.1, p<0.001). When patients were stratified into three SCS risk groups (“high risk”: 4–8 (n=23); “intermediate risk”: 2–3 (n=44); and “low risk”: 0–1 (n=32)), there was a significant difference in survival between groups (p<0.001 by log-rank test for trend) (fig. 1). The SCS further stratified WHO FC III patients (HR 1.8 per point increase, 95% CI 1.4–2.4, p<0.001) (n=66, 30 deaths). When these patients were stratified into three SCS risk groups (“high risk” = 4–8, “intermediate risk” = 2–3, “low risk” = 0–1), there was a significant difference in survival between groups (p<0.001 by log-rank test for trend) (fig. 2).
Kaplan–Meier survival estimates of the validation cohort stratified by three Scottish composite score risk groups.
Kaplan–Meier survival estimates of World Health Organization functional class III patients in the validation cohort stratified by three Scottish composite score risk groups.
Comparison of prognostic equations
When the SCS, French registry and PHC registry equations were applied to the validation cohort to predict 1- and 2-yr survival, the SCS had a lower BS than both equations but only the comparison with the French registry equation reached statistical significance (table 5).
DISCUSSION
The present study is the first to describe the derivation and validation of a risk score to predict prognosis in incident patients with PAH based on UK data. The data analysis showed that the SCS, an 8-point simple scoring system based on age, sex, aetiology, Pra, CO and 6MWD at diagnosis, can be used to categorise patients into prognostic groups and provide further risk stratification in WHO FC III patients. When tested in an independent UK PAH cohort, the predictive accuracy of the SCS was comparable to that of the PHC registry equation and may be greater than the French registry equation. The prognostic significance of sex and aetiology demonstrated by previous studies is also confirmed. These findings add to the growing body of literature on prognostication in PAH and provide further insights into the use of contemporary prognostic equations derived from other registry studies.
The predictive value of the French registry, PHC registry and REVEAL equations has been demonstrated in respective validation studies. However their applicability in other patient populations would be influenced by factors such as study population characteristics, treatment pathways and statistical methodology (table 6). The SCS was designed as a simple point-based risk score to stratify patients into prognostic groups whereas the other equations were developed to compute predicted survival probabilities at certain time points post-diagnosis. Both SCS derivation and validation cohorts were treated in single tertiary centres and this would ensure complete data capture and uniformity of care. In contrast, the French registry and REVEAL included national cohorts from selected centres. This may lead to selection bias, variations in clinical practice and patient outcome. The SCS was derived from a strictly incident and treatment naive patient cohort, whereas French registry, PHC registry and REVEAL equations were derived from mixed incident and prevalent cohorts. The inclusion of prevalent patients would introduce survivor bias as these patients have survived long enough to be recruited in the study and would have a better prognosis than those who die early from severe disease or lack of response to PAH therapy. This was clearly demonstrated by comparing survival between incident and prevalent cohorts in the French registry study [5]. In order to remove this survivor bias, authors of the French registry equation estimated survival from time of diagnosis as opposed to time of enrolment and only considered patients to be at risk from their time of study entry. This adjustment for the time delay between diagnosis and study entry in prevalent patients was not adopted in the development of the PHC registry or REVEAL equations. The study period of the PHC registry spanned both pre- and post-modern treatment eras. Changes in patient prognosis over this period due to treatment advances are probable and this may have affected the survival analysis. These factors, in addition to variations in healthcare systems and treatment approaches, would impact on the performance of these prognostic algorithms in different clinical settings.
It is well recognised that patient outcome differs according to PAH aetiology. Patients with CHD-PAH were excluded from this study as they have a distinct haemodynamic profile and natural history from other PAH patients [14]. Pulmonary arterial hypertension associated with systemic sclerosis is known to confer a worse prognosis than idiopathic PAH despite a similar degree of haemodynamic derangement for reasons that are not fully understood [15, 16]. When PAH associated with different types of connective tissue disease including mixed connective tissue disease, overlap syndromes and rheumatoid arthritis were grouped together and compared with CTD-PAH-SSc in the derivation cohort, CTD-PAH-SSc conferred a worse prognosis. This would be consistent with the findings from a UK-wide epidemiology study [17]. Patients with idiopathic, heritable, anorexigen-associated PAH are regarded as having similar outcomes as they share clinical and pathophysiological features [18]. Hence, the grouping of aetiologies in the SCS would be consistent with findings from previous studies. There is also increasing evidence to support a sex difference in patient outcome and this is confirmed by the present study. More females are affected by PAH than males but they have a better prognosis [4, 19]. Recent analysis of patients enrolled in REVEAL showed that men had a higher mean Ppa and mean Pra at diagnosis and those aged ≥ 60 yrs had lower survival rates compared with females aged ≥60 yrs [20]. Sex hormones are thought to play a role but the precise mechanisms remain to be elucidated. The individual prognostic value of other component variables in the SCS has also been demonstrated in other studies (age [3, 19], 6MWD [4, 6, 21–25], Pra [6, 23–27], CO [4]).
Algorithms advising on the initial choice of drugs and the use of combination therapy advocated by current guidelines on PAH management are primarily based on WHO FC as its prognostic value has been firmly established [1, 28, 29]. However, WHO FC may not be a sufficiently reliable measure of functional status as wide variation in clinicians' assessment has been reported [30]. The differences between WHO FC I or II and IV are clear-cut, but WHO FC III encompasses patients with a wide range of functional capacity. The results of this study showed that the SCS could provide further risk stratification in WHO FC III patients and hence supplement WHO FC in clinical assessment. Variables measured during follow-up may be more predictive of long-term outcome than those measured at baseline evaluation as they would capture the impact of treatment response and disease progression on survival. A recent study has confirmed the prognostic impact of changes in outcome variables during the course of disease and demonstrated the importance of incorporating them into risk assessment [31], but there are currently no published prediction tools specifically developed to address this issue. We propose that the SCS could be used to assess the initial risk of mortality in the one to two year time horizons with a view to develop a risk score incorporating changes in mortality predictors over time for repeated use during follow-up.
This study has several limitations. Missing data were unavoidable due to the retrospective nature of the study. As measurements of NT-pro-BNP and CAMPHOR score were not introduced into clinical practice until 2004, there were fewer data compared with other baseline variables which may have introduced bias against them in multivariate survival analyses despite statistical adjustment. The derivation cohort consisted of patients with heterogeneous aetiologies of PAH with relatively small numbers in some subgroups. However, this simply reflects the relative incidence of different PAH aetiologies in a real-life clinical cohort. The data on vasoreactivity status at diagnosis were incomplete, so the effect of a positive vasodilatory response on survival could not be assessed. Only one patient demonstrated sustained response from CCB and was excluded from the study. This is based on the finding that long-term CCB responders have a better prognosis than non-CCB responders [32]. As the number is small, the bias associated with this exclusion would be insignificant. There were no patients with rarer causes of WHO Group I PAH such as schistosomiasis or chronic haemolytic anaemia in the derivation or validation cohort, and only patients with idiopathic and heritable PAH were included in the validation cohort. Hence, the performance of the SCS in other PAH subgroups is still to be validated. 17% of patients in the validation cohort did not have all the required variables to calculate the SCS. It was not possible to determine how the SCS would perform in these patients compared with other equations, but there was no difference in survival between patients with and without the SCS. Heart rate (HR) and systolic blood pressure (SBP) were measured in each patient at diagnosis but not recorded in the validation database and hence unavailable for this analysis. The REVEAL equation could not be included in the comparison as the systematic omission of HR and SBP data would lead to bias in its performance. This is a limitation of database analysis that does not reflect the potential value of an assessment tool in clinical practice.
In conclusion, the SCS is a simple multidimensional risk score combining the impact of demographics, pulmonary haemodynamics and functional status on survival in incident PAH patients. When validated in an independent UK idiopathic and heritable PAH cohort, it correlated with mortality and provided further risk stratification in WHO FC III patients. It may perform better in UK populations than prognostic equations derived from other registry studies but further validation in wider PAH populations is required before firm conclusions on its clinical utility can be made.
Acknowledgments
The authors would like to thank J. Alsop (Numerus Ltd, Northampton, UK) for his statistical advice and Actelion UK for funding this project.
Footnotes
For editorial comments see page 530.
This article has supplementary material available from www.erj.ersjournals.com
Statement of Interest
Statements of interest for W-T.N. Lee, K. Sheares, J. Pepke-Zaba, A.J. Peacock, M.K. Johnson and for the study itself can be found at www.erj.ersjournals.com/site/misc/statements.xhtml
- Received November 11, 2011.
- Accepted March 24, 2012.
- ©ERS 2012