Copyright ©ERS Journals Ltd 2008 Impact of a spirometry expert system on general practitioners' decision makingDepts of 1 General Practice, 2 Medical Informatics, and 4 Postgraduate Training, Radboud University Nijmegen Medical Centre, Nijmegen, The Netherlands. 3 Dept of General Practice, Richmond Hill, ON, Canada. CORRESPONDENCE: P. J. P. Poels, Dept of General Practice (117), Radboud University Nijmegen Medical Centre, P.O. Box 9101, 6500 HB, Nijmegen, The Netherlands. Fax: 31 243541862. E-mail: p.j.p.poels{at}hag.umcn.nl Keywords: Computer-assisted diagnosis, expert systems, family practice, spirometry
Received: February 1, 2007
The present study assessed the impact of computerised spirometry interpretation expert support on the diagnostic achievements of general practitioners (GPs), and on GPs' decision making in diagnosing chronic respiratory disease. A cluster-randomised controlled trial was performed in 78 GPs who each completed 10 standardised paper case descriptions. Intervention consisted of support for GPs' spirometry interpretation either by an expert system (expert support group) or by sham information (control group). Agreement of GPs' diagnoses was compared with an expert panel judgement, which served as the primary outcome. Secondary outcomes were: additional diagnostic test rates; width of differential diagnosis; certainty of diagnosis; estimated severity of disease; referral rate; and medication or nonmedication changes. Effects were expressed as odds ratios (ORs) with 95% confidence intervals (CIs). There were no differences between the expert support and control groups in the agreement between GPs and expert panel diagnosis of chronic obstructive pulmonary disease (OR (95% CI) 1.08 (0.70–1.66)), asthma (1.13 (0.70–1.80)), and absence of respiratory disease (1.32 (0.61–2.86)). A higher rate of additional diagnostic tests was observed in the expert support group (2.5 (1.17–5.35)). Computerised spirometry expert support had no detectable benefit on general practitioners' diagnostic achievements and the decision-making process when diagnosing chronic respiratory disease. Although all major chronic obstructive pulmonary disease (COPD) guidelines stress the central role of spirometry in diagnosing and managing chronic respiratory disease 1, 2, this does not guarantee that general practitioners (GPs) will consequently use spirometry in the care of their patients with respiratory symptoms 3, 4. Most common barriers that impede utilisation of spirometry in general practice are: the absence of properly trained staff 5; the lack of time and practice support to fit spirometry into the daily practice routine 6; the absence of a spirometer in the practice 7; and GPs lack of confidence in the ability to interpret the test results 8, 9. A recent survey 4 showed that one third of Australian GPs interpreted less than one spirometry test per week. Due to this low prevalence of test interpretations, it seems difficult for GPs to become experts in this area. The present authors have previously demonstrated the influence of spirometry on GPs' diagnostic achievements and management decisions in a nonrandomised simulation study 10. Other recent nonrandomised studies 11, 12 confirm that spirometry increases diagnostic rates of chronic respiratory disease and may lead to management changes in a general practice population. However, an absolute prerequisite for the use of spirometry is the validity (or reliability) of spirometric tests. In a previous study with patients with COPD, Schermer et al. 13 observed that the most relevant indices, as measured by trained general practice staff, were comparable with those measured in pulmonary function laboratories. Therefore, once GPs have had initial spirometry training and spirometry equipment and test validity are adequate, the next step to improve implementation of spirometry in general practice is to arrange for the possibility to receive continuous advice and support for test interpretation 14. This could be carried out by means of a diagnostic computerised clinical decision support system 15, 16. While there is already such an expert support system available on the market 17 and GPs welcome such type of support 18, empirical studies on the effects of ongoing expert support on the interpretative capacity and self-confidence of GPs are warranted. The objective of the present study was to assess the impact of expert support for the interpretation of spirometry tests on GPs' diagnostic achievements and decision-making processes when diagnosing chronic respiratory disease.
Study design The study was a simulated cluster-randomised trial of GPs' diagnostic acuity of chronic respiratory disease in a process of diagnostic assessment of 10 standardised cases, with an expert system support. A diagnosis of the cases by the expert panel served as the gold standard. Differences in GPs' diagnostic achievements and decision-making processes were compared both between the study groups and within groups.
Ethical approval
Participants
Intervention
The spirometry expert system (SpidaXpert®; Micro Medical Ltd, Rochester, UK) 17 contains a diagnostic algorithm based on pre- and post-bronchodilator forced expiratory volume in one second (FEV1)/forced vital capacity (FVC) and FEV1 values and the accompanying age, sex and ethnicity-specific predicted values. The expert interpretation module in SpidaXpert® had been developed with funding of the Netherlands Asthma Foundation by a group of independent experts 17. The spirometry interpretation is presented as coloured bars that indicate levels of FEV1/FVC and FEV1, and compares the values before and after bronchodilatation. The graphical representation is further elucidated by a textual interpretation, which provides information on and suggestions for additional diagnostic testing and treatment options. GPs in the control group received the volume–time curve as sham information. Sham information was introduced in the control group to be able to compare GPs reassessment of a diagnosis in the control group in the same way as in the expert support group. Sham information has, in fact, a placebo effect, as no new data was being presented to these GPs; earlier data (i.e. the flow–volume curve) was presented in each case again but in another way, i.e. the volume–time curve. Although it is clearly important to evaluate the quality of forced expiratory manoeuvres, i.e. end-of-test criteria 20, the volume–time curve does not add relevant new information from a diagnostic point of view to the information provided by the flow–volume curve and the numerical test results. Prior to the study, participants were informed that they would receive additional information on spirometry and were asked to reconsider their diagnosis. No further specification was given of the nature or the background of that information.
Standardised case descriptions and gold standard
At inclusion, a research assistant visited the participating GPs in their practice. During a 90-min audiotaped session, an example case and 10 standardised cases were presented on a laptop computer using PowerPoint slides. GPs worked through the cases in a random order. GPs first practised on one separate example case to become familiar with case structure. For each case, a concise medical history, the results of physical examination and the medication were presented to the GP first. Subsequently, absolute predicted pre- and post-bronchodilator spirometry test results (including FEV1, FVC, FEV1/FVC and flow–volume curves) were provided. GPs were asked to consider their diagnosis and management before the upcoming intervention. Next, GPs received additional information next to the spirometry test results: either the graphical representation of FEV1, FEV1/FVC together with interpretative notes (expert support group) or the volume–time curve (control group). Again GPs were asked to reconsider their diagnosis and management after the intervention. An example of the case structure is depicted in figure 2
Before their use in the study, the cases were judged by an expert panel consisting of two chest physicians, a GP (P.J.P. Poels) with specific expertise in spirometry and a health scientist (T.R.J. Schermer). The panel consensus diagnoses served as the gold standard in the subsequent evaluation of GPs' diagnostic achievements. The whole approach was piloted in four GPs before the start of the study. Shortly after the first six study visits, the case set was adjusted by switching the example case with a case out of the actual set. As a result, no data of the new introduced case were available for those first six GPs (equally divided over the two groups).
Primary and secondary outcome measures Six predefined secondary outcome measures were assessed using indicators that show the impact of the expert system intervention on the GPs decision-making processes, as follows: 1) probability of ordering additional diagnostic tests (yes/no); 2) width of the differential diagnoses (i.e. the working diagnosis plus the number of alternative diagnoses considered by the GP); 3) a GP's certainty of the working diagnosis (self-scored 0–10, with 0 = uncertain and 10 = certain); 4) a GP's perception of severity of the working diagnosis (self-scored 0–10, with 0 = no severe disease and 10 = severe disease); 5) probability of referral to secondary care (yes/no); and 6) probability of medication and nonmedication changes. Medication change included: stopping or lowering treatment with inhaled corticosteroids or bronchodilators; the commencement of bronchodilator, inhaled or oral corticosteroid treatment; or combination drug treatment. Nonmedication included giving smoking cessation advice.
Sample size
Randomisation
Statistical analysis To detect possible effect modifications before intervention, subgroup analyses were performed for a GP's prior experience with spirometry, a GP's prior experience with expert support and a GP's number of interpreted spirometry tests per week.
Baseline characteristics of GPs Between January and October 2006, 78 GPs were enrolled in the present study; 36 were allocated to the expert support group and 42 to the control group (fig. 1
Primary outcome: diagnostic achievements by GPs GPs assessed a total of 774 cases, 357 cases from the expert support group and 417 cases from the control group. There was no difference between the expert support and control group in agreement on judgement between GPs and the expert panel for presence of COPD, asthma, absence of respiratory disease and incorrect test manoeuvre after intervention (table 2
Although the DORs in the expert support group were consistently higher than in the control group, no significant differences were found between the groups (table 3
Secondary outcomes: indicators of GPs' decision-making process GPs in the expert support group ordered slightly more additional diagnostic tests compared with the control group (OR (95% CI) 2.5 (1.2–5.4); table 4
Subgroup analyses Neither a GP's experience with spirometry (OR (95% CI) 1.02 (0.97–1.06)), nor a GP's prior experience with expert support (0.97 (0.72–1.31)) or a GP's number of interpreted spirometry tests per week (1.02 (0.84–1.23)) was associated with the effectiveness of expert support, as their agreement with the expert panel was not different before intervention. If GPs interpreted more spirometry tests per week and had prior experience of expert support, the probability of agreement with the expert panel before intervention increased; however, this probability decreased if GPs had no prior experience with expert support (interaction effect p = 0.02).
Statement of principal findings Computerised spirometry expert support for the interpretation of spirometry tests by GPs had no detectable benefit over sham information on GPs' diagnostic achievements of chronic respiratory disease. Overall, expert support did not influence GPs' decision-making processes.
Strengths of the study The standardised complex and original method that was used to assess the impact of expert support in the present study has been used before in a nonrandomised design 10. Based on previous information 10, the present authors were able to create a balanced mixture of cases relevant for GPs. The confirmative role of spirometry was more strongly focussed on than the exclusive role of spirometry in primary care. To avoid bias, cases were presented in a (computer-generated) random order and analyses were performed blinded for both the investigators and the statistician. Subgroup analyses showed no difference in baseline diagnostic achievements of GPs with prior experience of spirometry or of the expert support system used, and on the number of spirometry tests a GP interpreted per week. Therefore, the external validity seems quite good, given the fact that the participants were not specifically interested in spirometry.
Possible limitations A large within-group difference was found for ordering additional diagnostic tests, which may be an effect of the study design: GPs barely reassessed their diagnostics after intervention, because they expected the results of their diagnostics to have been already discounted before intervention. However, the objective was to reassess the opinion of GPs when new information, i.e. expert support, was available, regardless of their earlier assessment in the same case. From a methodological point of view, the use of the volume–time curve as sham information could be questioned. Theoretically, such curves do not show new information to GPs after presentation of the flow–volume curves. Additionally, this is not really "usual care", as most GPs in the Netherlands are trained to look at flow–volume curves rather than volume–time curves. Conversely, the volume–time curve is much more intuitive and may have improved unconscious performance of spirometry interpretation in the control group. Furthermore, providing the expert panel and GPs in the present study with a fixed cut-off value of <0.7, instead of the lower limit of normal for the FEV1/FVC ratio in the standardised cases may have led to an overestimation of diagnosed airflow obstruction 22. Further discussion about the pros and cons of using a fixed cut-off value versus the lower limit of normal for FEV1/FVC 23 is beyond the scope of the present paper. Finally, a possible reason why no differences could be demonstrated in diagnostic achievements should be sought in the expert support system used. Although the expert support system used in the present study met the criteria of a good system 15, i.e. involvement of the present authors by development, integration through the computer, and the displaying of specific recommendations at the right place and time, it was not actually tested in the target group, i.e. GPs, before the study. Therefore, it may not optimally comply with the decision-making process of GPs. The information presented by the system to the GP possibly lacked explanation of exactly what the output means. These are known barriers to the adoption of expert support in primary care 24.
Relation to other studies The acuity of GPs' interpretation of test results has been evaluated by others. In 1999, Eaton et al. 5 had already found that 53% of GPs' interpretation of spirometry test results was judged to be correct according to an expert panel. Recently, Raghunath et al. 9 found that the agreement in interpretation of spirometry and peak flow results between nurses, GPs and an expert panel was only 20%. The lower agreement in the latter study could probably be explained by the fact that GPs, as well as nurses, i.e. less-trained professionals, assessed a common diagnosis. Furthermore, contrary to GPs and nurses, the expert panel did not have detailed clinical history information to assess their final diagnosis on and, due to a design artefact, interpretation of their study results was difficult. Results of the present study concur with the results of Eaton et al. 5 and show that, generally, GPs have made progress in the interpretation of test results relevant for respiratory diseases in primary care. The current acuity of GPs interpretation of test results should weaken earlier reported lack of confidence in the ability to interpret the test results 8, 9.
Unanswered questions and future research
The authors wish to thank all participating GPs: Y. Heijdra and J. Molema, chest physicians (Dept of Pulmonology, Radboud University Nijmegen Medical Centre, Nijmegen, the Netherlands), for their role in the expert panel; M. van den Bogart-Jansen and A. Marks (Dept of General Practice, Radboud University Nijmegen Medical Centre, Nijmegen, the Netherlands) who helped with the interviews of the GPs; and J. Grootens-Stekelenburg for data management. The authors also wish to acknowledge the Executive Steering Committee: H. van den Hoogen (Dept of General Practice), A Jacobs (Centre for Quality Care Research), B. Thoonen (Dept of Postgraduate Training; all Radboud University Nijmegen Medical Centre, Nijmegen, the Netherlands) and P. Quanjer (Leiden University, Leiden, the Netherlands).
This article has supplementary material accessible from www.erj.ersjournals.com
For editorial comments see page 8.
This article has been cited by other articles:
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||