Article Text

Download PDFPDF

Diagnostic accuracy of B type natriuretic peptide and amino terminal proBNP in the emergency diagnosis of heart failure
  1. T Mueller1,
  2. A Gegenhuber2,
  3. W Poelz3,
  4. M Haltmayer1
  1. 1Department of Laboratory Medicine, Konventhospital Barmherzige Brueder, Linz, Austria
  2. 2Department of Internal Medicine, Konventhospital Barmherzige Brueder
  3. 3Institute for Applied System Sciences and Statistics, University of Linz
  1. Correspondence to:
    Dr Meinhard Haltmayer
    Department of Laboratory Medicine, Konventhospital Barmherzige Brueder, Seilerstaette 2, A-4021 Linz, Austria; meinhard.haltmayerbblinz.at

Abstract

Objective: To compare head to head the diagnostic accuracy of B type natriuretic peptide (BNP) and the amino terminal fragment of its precursor hormone (NT-proBNP) for congestive heart failure (CHF) in an emergency setting.

Methods: 251 consecutive patients presenting to the emergency department with dyspnoea as a chief complaint were prospectively studied. Patients with acute coronary syndromes were excluded. The diagnosis of CHF was based on the Framingham score for CHF plus echocardiographic evidence of systolic or diastolic dysfunction. Blood concentrations of BNP and NT-proBNP were measured by two commercially available assays (Abbott and Roche methods). The diagnostic accuracies of BNP and NT-proBNP were assessed by receiver operating characteristic curve analysis.

Results: Areas under the curve for BNP and NT-proBNP in patients with dyspnoea caused by CHF (n  =  137) and in patients with dyspnoea attributable to other reasons (n  =  114) did not differ significantly (area under the curve 0.916 v 0.903, p  =  0.277, statistical power 94%). Cut off concentrations with the highest diagnostic accuracy were 295 ng/l for BNP (sensitivity 80%, specificity 86%, diagnostic accuracy 83%) and 825 ng/l for NT-proBNP (sensitivity 87%, specificity 81%, diagnostic accuracy 84%). Evaluation of discordant false classifications at these cut off concentrations showed no advantage for either BNP nor NT-proBNP in the biochemical diagnosis of CHF (17 misclassifications by BNP and 14 by NT-proBNP, p  =  0.720). In the population studied, age, sex, and renal function had no impact on the diagnostic utility of both tests when compared by logistic regression models.

Conclusions: BNP and NT-proBNP may be equally useful as an aid in the diagnosis of CHF in short of breath patients presenting to the emergency department.

  • AUC, area under the curve
  • BNP, B type natriuretic peptide
  • BNP study, breathing not properly study
  • CHF, congestive heart failure
  • CV, coefficient of variance
  • eGFR, estimated glomerular filtration rate
  • LVEF, left ventricular ejection fraction
  • NCCLS, National Committee for Clinical Laboratory Standards
  • NT-proBNP, amino terminal fragment of proBNP
  • ROC, receiver operating characteristic
  • STARD, standards for reporting of diagnostic accuracy
  • diagnosis
  • echocardiography
  • heart failure
  • natriuretic peptides

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

B type natriuretic peptide (BNP) is a cardiac neurohormone that derives from the precursor pre-proBNP, containing 134 amino acids and including a signal peptide of 26 amino acids. proBNP, produced by cleavage of the signal peptide, is further split into BNP, which is considered to be the biologically active hormone, and an inactive amino terminal fragment, NT-proBNP.1 Blood measurements of BNP have been shown to be useful in differentiating dyspnoea caused by congestive heart failure (CHF) from dyspnoea related to other causes.2,3 More recently, the BNP (breathing not properly) study showed the usefulness of determining BNP in the emergency department in establishing or excluding CHF in patients with dyspnoea.4,5 The authors of the BNP study showed that the evaluation of dyspnoea was improved by the addition of BNP testing to clinical judgement in the emergency department.5 Another recent publication indicated that measurement of NT-proBNP may also be a diagnostic aid in the verification of CHF in patients with shortness of breath.6 However, potential advantages of either BNP or NT-proBNP for the diagnosis of CHF in an emergency setting are not established.

Thus, the present study was designed to compare head to head the diagnostic accuracy of BNP and NT-proBNP with respect to CHF in patients consulting our emergency department with shortness of breath as a chief complaint. Furthermore, we aimed at assessing appropriate cut off concentrations for this clinical setting by means of two currently developed commercially available assays for BNP and NT-proBNP.

METHODS

Patient recruitment

From 1 October 2003 to 12 February 2004, all patients presenting with dyspnoea to the emergency department of St John of God Hospital Linz, Austria, were eligible for the present prospective clinical evaluation. The inclusion criterion was shortness of breath as a chief complaint at the initial patient examination in our emergency department. Patients with ST elevation myocardial infarction, non-ST elevation myocardial infarction, or acute coronary syndrome troponin positive and trauma patients were excluded. During the initial patient examination in our emergency department a blood sample for the determination of BNP and NT-proBNP had to be collected and analysed within the next four hours. Serum creatinine concentrations were routinely determined by Jaffé’s kinetic method on a COBAS Integra 700 analyser (Roche Diagnostics, Germany) on the day of blood collection. Estimated glomerular filtration rate (eGFR) was calculated by the Cockgroft and Gault formula.7 The complete clinical evaluation including echocardiography and the classification of patients had to be completed within three days after the initial patient examination. For the attending physicians and cardiologists BNP and NT-proBNP results were available in the medical records only after the patients had been completely clinically evaluated and classified. The study protocol was approved by the local ethics committee in accordance with the Declaration of Helsinki. All study participants gave informed consent. The present evaluation was performed and described according to the criteria suggested by the standards for reporting of diagnostic accuracy (STARD) initiative.8

Clinical evaluation of patients

Each patient enrolled in the study was thoroughly surveyed by taking a history, physical examination, 12 lead ECG, and chest X ray at the time of the initial patient examination, as well as sonography of the liver (determination of congestive hepatomegaly or hepatic venous congestion). All study participants underwent subsequent echocardiography within three days after the patients’ first examination. Left ventricular ejection fraction (LVEF) was measured by the quantitative two dimensional (biplane Simpson) method. Patients were defined as having systolic dysfunction if their LVEF was ⩽ 50%. Patients with an LVEF > 50% were further examined for diastolic dysfunction. Each of these patients underwent pulsed wave Doppler examination of mitral inflow before and during Valsalva manoeuvre. Diastolic dysfunction was categorised according to the progression of diastolic dysfunction as normal; mild (grade 1), defined as impaired relaxation without evidence of increased filling pressures; moderate (grade 2), defined as impaired relaxation associated with moderately increased filling pressures or pseudonormal filling; and severe (grades 3 and 4), defined as reversible restrictive or fixed restrictive patterns. In the case of LVEF > 50% and concomitant atrial fibrillation during echocardiography classification of diastolic dysfunction was not possible due to methodological limitations.9

Classification of patients (reference standard)

One of the investigators (GA), also blinded to the BNP and NT-proBNP results, reviewed all medical records pertaining to the patient after completion of the above described clinical evaluation. Thereafter, the final classification was based on the Framingham criteria for the clinical diagnosis of CHF.10 For the diagnosis of CHF both of the following had to be present: Framingham CHF score with two major or one major and two minor criteria; and evidence of systolic or diastolic dysfunction as determined by echocardiography. This procedure is in accordance with the guidelines for the diagnosis of CHF in clinical practice.11 For patients with a diagnosis other than CHF, the diagnosis was verified on the basis of other observations (for example, roentgenographic signs of chronic obstructive lung disease, pneumonia, abnormal pulmonary function tests, or follow up results such as response to treatment with nebulisers, corticosteroids, and antibiotics).

BNP and NT-proBNP determination (index tests)

Blood for measurement of natriuretic peptide concentrations was collected by venepuncture in Vacuette polyethylene terephthalate glycol EDTA tubes (Greiner Bio-One, Kremsmünster, Austria) at the initial patient examination in the emergency department. Blood samples were centrifuged at 3500 g for 10 minutes at 4°C immediately after collection. Both BNP and NT-proBNP were analysed within four hours after blood withdrawal. BNP was assayed on an AxSYM analyser (Abbott Laboratories, Abbott Park, Illinois, USA). The AxSYM BNP assay is a fully automated microparticle enzyme immunoassay with two monoclonal mouse antibodies in a two step sandwich format.12 NT-proBNP was measured on an Elecsys 2010 instrument (Roche Diagnostics). The Elecsys NT-proBNP test is a fully automated two site (sandwich) enzyme immunoassay with electrochemiluminescent technology and is designed for the determination of NT-proBNP in human serum and plasma.13

The precision of the two methods was evaluated according to the National Committee for Clinical Laboratory Standards (NCCLS) guideline EP5-A.14 Three pooled patient plasma samples were aliquoted into 40 tubes of 1.5 ml for each concentration and frozen at −70°C. We analysed these samples in duplicate in two runs every day for 20 days on the two analysers. Total imprecision was calculated by the NCCLS double run precision evaluation test.14 Precision data of the two methods were as follows: the AxSYM BNP assay had a total coefficient of variance (CV) of 8.1% at a mean concentration of 108 ng/l (pool 1), a total CV of 7.5% at a mean concentration of 524 ng/l (pool 2), and a total CV of 10% at a mean concentration of 2117 ng/l (pool 3). The Elecsys NT-proBNP assay had a total CV of 3.8% at a mean concentration of 246 ng/l (pool 1), a total CV of 4.7% at a mean concentration of 891 ng/l (pool 2), and a total CV of 2.2% at a mean concentration of 10 666 ng/l (pool 3).

Statistical analysis

Univariate data on demographic and clinical features were compared between patients with dyspnoea caused by CHF and patients with dyspnoea attributable to other reasons by the non-parametric Mann-Whitney U test or Fisher’s exact test as appropriate (p values were not adjusted for multiple comparisons and are therefore only descriptive). To determine the diagnostic accuracy of BNP and NT-proBNP for CHF, receiver operating characteristic (ROC) plots were analysed and areas under the curve (AUCs) were calculated for both analytes. AUCs were compared according to the method by Hanley and McNeil.15 Cut off concentrations for BNP and NT-proBNP were determined at the 90% and 95% sensitivity criterion derived directly from the ROC curves. Furthermore, cut off concentrations with the highest diagnostic accuracy (that is, optimal cut off concentrations, defined as points of the ROC curves, in which the sum of false negative and false positive results was lowest) and the approved cut points were evaluated. Positive and negative predictive values at these cut off concentrations were calculated by using the ratio of cases in the positive and negative groups (reflecting the prevalence of the disease in our study population). To determine odds ratios (unadjusted and controlling for confounding covariates) for the detection of CHF with respect to BNP and NT-proBNP thresholds of highest diagnostic accuracy, logistic regression analysis without variable selection technique was performed. Dichotomous variables were coded with an indicator variable of 1 for having the disease and 0 for its absence. The diagnostic accuracies of both assays at optimal cut off concentrations as determined by ROC analysis were compared by the McNemar test (comparison of discordant pairs of false biochemical classifications). Spearman’s coefficient of rank correlation was used to asses the relation of BNP and NT-proBNP concentrations in the study population. Data were statistically analysed with SPSS 10.0 software (SPSS Inc, Chicago, Illinois, USA), the MedCalc 7.2.1.0 package (MedCalc Software, Mariakerke, Belgium), and the software N (IDV, Gauting, Germany). All probabilities were two tailed and p < 0.05 was regarded as significant. Sample size calculation for the present study was based on the following assumptions: AUC for BNP approximately 0.900 with an estimated SE of 0.030 and a correlation of BNP and NT-proBNP concentrations of 0.900. Thus, to detect a difference of 5% in AUCs for the two analytes, our calculation showed that a sample size of 250 patients had to be enrolled into this study (α  =  0.050 two sided, β  =  0.100).

RESULTS

Of 293 patients referred with dyspnoea, 276 met the primary inclusion criteria and were enrolled. For 11 patients the clinical evaluation (Framingham CHF classification or echocardiography) could not be completed within three days. Thus, 265 patients with complete clinical evaluation were allocated to CHF classification. In nine patients with LVEF > 50% concomitant atrial fibrillation during echocardiography prohibited an exact classification of diastolic dysfunction. Of the remaining 256 patients, five were excluded because both index tests (BNP and NT-proBNP) were not completed within four hours after blood withdrawal. Finally, 137 patients with dyspnoea due to CHF were compared with 114 patients with dyspnoea attributable to other reasons in the present study. Figure 1 provides comprehensive information. Patients with end stage renal disease on dialysis were not enrolled into the present study, although this condition was not an exclusion criterion.

Figure 1

 Trial profile. ACS, acute coronary syndromes; AF, atrial fibrillation; BNP, B-type natriuretic peptide; CHF, congestive heart failure; NSTEMI, non-ST elevation myocardial infarction; NT-proBNP, amino terminal fragment of proBNP; STEMI, ST elevation myocardial infarction.

Table 1 lists the demographic and clinical characteristics of the study participants (n  =  251). Figure 2 presents box and whisker plots for BNP and NT-proBNP in both study groups. Of the overall study group (n  =  251), 100% had dyspnoea on exertion, 16% had orthopnoea, 30% had paroxysmal nocturnal dyspnoea, 29% had nocturnal cough, 17% had jugular venous distension, 21% had pulmonary rales, 10% had a third heart sound, and 21% had oedema of the legs or feet. As mentioned, 137 met the criteria for dyspnoea caused by CHF, and 114 patients attended our emergency department because of dyspnoea attributable to other causes, including 10 patients with a history of CHF who were asymptomatic for this disorder (NYHA class I). Reasons for dyspnoea in these 114 patients were chronic obstructive pulmonary disease (n  =  41), pneumonia (n  =  22), bronchitis/asthma (n  =  12), malignancy of the lung (n  =  6), musculoskeletal chest pain (n  =  6), hypertension (n  =  6), acute coronary syndrome troponin negative (n  =  6), tachycardia/arrhythmia (n  =  5), and other causes (n  =  11) such as pulmonary embolism, interstitial lung disease, anaemia, or pericardial disease.

Table 1

 Patient characteristics according to CHF classification (n = 251)

Figure 2

 Box and whisker plots of BNP and NT-proBNP for patients with dyspnoea caused by CHF (n  =  137) and for patients with dyspnoea attributable to non-cardiac causes (n  =  114). In the box and whisker plots, the central box spans from the lower to the upper quartile, the middle line is the median, the whiskers extend from the minimum to the maximum concentration, excluding outside and far out concentrations, which are displayed as separate points. Grey boxes, box and whisker plots for BNP; white boxes, box and whisker plots for NT-proBNP.

In distinguishing between patients with dyspnoea caused by CHF (n  =  137) and patients with dyspnoea attributable to other causes (n  =  114), the AUCs were 0.916 (SE  =  0.018, 95% confidence interval (CI) 0.874 to 0.947) for BNP, and 0.903 (SE  =  0.019, 95% CI 0.859 to 0.939) for NT-proBNP (fig 3). Comparison of the ROC curves showed no significant difference between the AUCs for BNP and NT-proBNP (difference of AUCs 0.013, SE  =  0.012, 95% CI −0.011 to 0.037, p  =  0.277); a power calculation showed that the power of this analysis was 94%.

Figure 3

 Receiver operating characteristic (ROC) plot for BNP and NT-proBNP comparing patients with dyspnoea caused by CHF (n  =  137) versus patients with dyspnoea attributable to other reasons (n  =  114). The area under the curve for BNP is 0.916 and for NT-proBNP is 0.903 (difference of 0.013 not significant, p  =  0.277). Solid line, ROC curve for BNP; dashed line, ROC curve for NT-proBNP.

Table 2 gives detailed diagnostic information including the appropriate decision statistics for the biochemical diagnosis of clinical CHF. Using the cut off concentrations with the highest diagnostic accuracy, classification by both BNP and NT-proBNP was correct in 194 patients and incorrect in 26 patients. Comparing the 17 misclassifications by the BNP assay to the 14 by the NT-proBNP assay with the McNemar test, the difference was not significant (p  =  0.720) (table 3).

Table 2

 Diagnostic information for the biochemical diagnosis of CHF

Table 3

 Results of biochemical classification of patients with and without CHF at cut off concentrations with the highest diagnostic accuracy for BNP and NT-proBNP*

Seven patients with normal systolic and diastolic dysfunction were classified as false positive by means of the Framingham criteria for the clinical diagnosis of CHF as table 1 indicates. In six of these patients BNP and NT-proBNP concentrations were low, so these two analytes would have contributed to a correct diagnosis of non-CHF. Only one of these seven patients had increased BNP and NT-proBNP plasma concentrations, which was attributable to a pulmonary embolism.

Univariate odds ratios for the detection of CHF were calculated with BNP or NT-proBNP as independent variables dichotomised according to the above determined thresholds. In addition, odds ratios for CHF with respect to BNP and NT-proBNP adjusted for age, sex, and eGFR were determined (adjusted regression model 1) to estimate the influence of these potential confounders. We further included the history of CHF as well as clinical variables (signs and symptoms of CHF) in another analysis (adjusted regression model 2) to determine the impact of these covariates on the predictive value of the two analytes for CHF. Table 4 specifies results of these analyses. When calculating a logistic regression with CHF as the dependent variable and age, sex, eGFR, history of CHF, and the presence of orthopnoea, paroxysmal nocturnal dyspnoea, nocturnal cough, jugular venous distension, pulmonary rales, third heart sound, and peripheral oedema as independent variables (excluding BNP and NT-proBNP), we found a diagnostic accuracy of 79% for this statistical model.

Table 4

 Results of logistic regression analysis

Figure 4 shows a scatter plot for BNP versus NT-proBNP concentrations in the study population. The range for this analyte comparison was limited to the reportable ranges of the two assays (that is, 4000 ng/l for AxSYM BNP and 35000 ng/l for Elecsys NT-proBNP) to avoid any dilution. Thus, from the 251 patients investigated in the clinical part of the study, only 239 samples were eligible for the correlation analysis because 12 samples had BNP or NT-proBNP concentrations above the measurement ranges of one of the assays. Spearman’s coefficient of rank correlation was 0.914 (95% CI 0.890 to 0.932, p < 0.001).

Figure 4

 Scatter plot for BNP versus NT-proBNP. Spearman’s coefficient of rank correlation (n  =  239): 0.914 (95% CI 0.890 to 0.932, p < 0.001). Since the cumulative sum test for linearity showed a significant deviation from linearity (p < 0.010), results of regression analysis (Passing and Bablok) are not given.

DISCUSSION

BNP measurements have been shown to add important information to clinical judgement in establishing a final diagnosis of CHF.5 Since our evaluation produced similar AUCs and comparable sensitivity, specificity, and diagnostic accuracy at selected cut off concentrations for BNP and NT-proBNP in the biochemical diagnosis of CHF, NT-proBNP measurements should also improve the emergency evaluation of dyspnoeic patients. In addition, comparison of discordant false classifications on the basis of optimal cut off concentrations for BNP and NT-proBNP underlined the similar diagnostic utility of the two analytes. Thus, our findings indicate that BNP and NT-proBNP may be equally useful as an aid for the diagnosis of CHF in patients consulting an emergency department with shortness of breath as a chief complaint. Of course, different cut off concentrations have to be considered for the analytes BNP and NT-proBNP as table 2 shows. This is a consequence of a slower plasma clearance of NT-proBNP than of the biologically active peptide BNP resulting in higher circulating concentrations of NT-proBNP, although both peptides are released by cardiomyocytes on an equimolar basis.1 Furthermore, the different mechanisms of plasma clearance (neutral endopeptidase clearance receptors for BNP versus renal clearance for NT-proBNP)1 result in an only moderate correlation of plasma concentrations of the two analytes (fig 4).

As shown by logistic regression analysis, we observed no relevant influences of age, sex, and renal function (eGFR) on the diagnostic value of BNP and NT-proBNP in the present study. The odds ratios of BNP and NT-proBNP for the prediction of CHF did not change when calculating the model unadjusted for potential confounders and the model controlling for age, sex, and eGFR. In addition, the overall accuracy of the two logistic regression models was equal (that is, 83% and 84% for the models with BNP and NT-proBNP, respectively), indicating that age, sex, and eGFR did not add any relevant diagnostic information (despite significant univariate differences between the study groups in terms of age and eGFR). Although previous studies described an association between natriuretic peptide concentrations and age, sex, and renal function, in particular in patients with cardiac impairment,16–18 these confounders obviously had no impact on the diagnostic utility of BNP and NT-proBNP in the present study. BNP and NT-proBNP concentrations are known to increase with age, female sex, and renal dysfunction, which may not be sufficient to modify the diagnostic performance of BNP and NT-proBNP in an emergency setting. However, regarding renal function, our statement is restricted to eGFR and we must mention that the median of eGFRs was 84 ml/min (25th–75th centile 56–119 ml/min) in the entire study population, with 74 patients having an eGFR < 60 ml/min.

Adding all clinical information available in the emergency department to the above logistic regression model (that is, history of CHF and the presence of orthopnoea, paroxysmal nocturnal dyspnoea, nocturnal cough, jugular venous distension, pulmonary rales, third heart sound, and peripheral oedema) increased the diagnostic accuracy to 87%. Conversely, the odds ratios for BNP and NT-proBNP decreased as expected, indicating the predictive value of the clinical signs and symptoms of CHF. Removing BNP and NT-proBNP, respectively, from this model reduced the diagnostic accuracy of the statistical model (including age, sex, eGFR, and the clinical information described above) to 79%. Thus, BNP and NT-proBNP added about 8% accuracy in the prediction of CHF to the clinical information in an emergency setting. On the other hand, BNP and NT-proBNP alone appeared to have a greater predictive value for symptomatic CHF in an emergency department (accuracies of 83% and 84%, respectively) than taking together all the clinical information (accuracy of 79%).

In the present study the reported decision statistics were mainly related to cut off concentrations with the highest diagnostic accuracy, defined as points of the ROC curves in which the sum of false negative and false positive results is lowest indicating the points of the curves nearest the top left hand corner. Therefore, these points give an idea of the shape of the ROC curve having mainly a mathematical relevance. However, for clinical practice we recommend referring to cut off concentrations with 90% or 95% sensitivity for both BNP and NT-proBNP. These cut off concentrations can be used to rule out a patient with a high probability according to the previously reported high negative predictive values of BNP and NT-proBNP in CHF.11,19 In this context, the currently approved dual cut off concentration for NT-proBNP is not advantageous compared with a single cut point, especially in terms of the overall diagnostic accuracy as table 2 shows.

Compared with other studies in this field the strengths of our trial are the stringent study design reported according to the suggestions of the STARD initiative,8 the performance of echocardiography in each study participant within three days after their enrolment (assessing both systolic and diastolic dysfunction), and the determination of both index tests (BNP and NT-proBNP) directly after blood collection at the initial examination in our emergency department (the stability of these analytes in frozen plasma samples is controversial). However, one limitation of the present study is the preponderance of men. In this context, we must point out that our hospital is traditionally mainly responsible for medical care of male patients in the extended area of Linz, Austria. Another limitation of the present trial may be related to the determination of diastolic dysfunction. We did not use Doppler tissue imaging of mitral annular motion in this study as one of the examinations applied for the assessment of diastolic dysfunction9. Nevertheless, this is the first time that diastolic dysfunction has been determined in a study evaluating the diagnostic accuracy of BNP and NT-proBNP in an emergency setting and, thus, the chosen approach, which is clinically useful, is valid in our opinion.

In conclusion, this study showed that BNP and NT-proBNP may be equally useful as an aid in the diagnosis of CHF in patients presenting to the emergency department with shortness of breath as a chief complaint. We observed no relevant influences of age, sex, and renal function (eGFR) on the diagnostic value of the two analytes. BNP and NT-proBNP alone had a greater predictive value for symptomatic CHF than the clinical information available in the emergency department.

Acknowledgments

This work was supported in part by a grant for scientific research from the Upper Austrian Government. We thank Abbott Diagnostics (Vienna, Austria) and Roche Diagnostics (Vienna, Austria) for providing reagents free of charge. Neither of these companies played a part in (1) the design of the study, (2) data collection, analysis, and interpretation, and (3) preparation of the manuscript.

REFERENCES