Abstract
No current patient-centred instrument captures all dimensions of physical activity in chronic obstructive pulmonary disease (COPD). Our objective was item reduction and initial validation of two instruments to measure physical activity in COPD.
Physical activity was assessed in a 6-week, randomised, two-way cross-over, multicentre study using PROactive draft questionnaires (daily and clinical visit versions) and two activity monitors. Item reduction followed an iterative process including classical and Rasch model analyses, and input from patients and clinical experts.
236 COPD patients from five European centres were included. Results indicated the concept of physical activity in COPD had two domains, labelled “amount” and “difficulty”. After item reduction, the daily PROactive instrument comprised nine items and the clinical visit contained 14. Both demonstrated good model fit (person separation index >0.7). Confirmatory factor analysis supported the bidimensional structure. Both instruments had good internal consistency (Cronbach's α>0.8), test–retest reliability (intraclass correlation coefficient ≥0.9) and exhibited moderate-to-high correlations (r>0.6) with related constructs and very low correlations (r<0.3) with unrelated constructs, providing evidence for construct validity.
Daily and clinical visit “PROactive physical activity in COPD” instruments are hybrid tools combining a short patient-reported outcome questionnaire and two activity monitor variables which provide simple, valid and reliable measures of physical activity in COPD patients.
Abstract
Both PROactive hybrid tools are simple, valid, and reliable measures of physical activity in COPD patients http://ow.ly/LJqP8
Introduction
Physical activity is a key factor in determining health status, and its preventative role in chronic diseases is well accepted [1]. During the past decade, a substantial amount of research on physical activity in patients with chronic obstructive pulmonary disease (COPD) has been published, prompted initially by studies showing that low levels of physical activity are associated with poor prognosis [2, 3]. Research has consistently shown that patients with COPD have lower physical activity levels than their healthy peers [4–7] and many patients limit their physical activity to avoid symptoms [8]. A recent systematic review of the determinants and outcomes of physical activity in patients with COPD showed a consistent effect of regular physical activity in reducing COPD exacerbations and mortality [9]. Hence, physical activity constitutes an important outcome to clinicians and to patients with COPD.
Current trends in the assessment of physical activity focus on the measures of amount and frequency of activities using activity monitors [10]. However, it is unlikely that an activity monitor can fully capture patients' experience of physical activity. Indeed, qualitative research has shown that patients experience symptoms while being active, or have to adapt their lifestyle [11], features that will never be captured using activity monitors alone. Such a patient-centred concept can only be measured through a patient-reported outcome (PRO) questionnaire, which needs to be developed using data directly from patients with COPD [12]. Conversely, a PRO questionnaire alone will also not allow assessment of physical activity patterns or detection of small changes in physical activity on a day-to-day basis. Of note, the currently available PROs that aim to measure physical activity in COPD patients are not based on a conceptual framework [13] and their development does not follow best practice as set out in the guidelines published by the US Food and Drug Administration (FDA) [14]. In developing the PROactive tools we hypothesised that items generated from patient experiences in combination with an activity monitor would capture all relevant dimensions of physical activity in patients with COPD [11].
Within the framework of the European Union Innovative Medicines Initiative PROactive project (www.proactivecopd.com) our group first conducted qualitative research and drafted a conceptual framework to set out the concept of physical activity from patients' experience and to provide the necessary basis to generate an item pool for two PROs, one with a daily recall period and one with a 7-day recall period (daily and clinical visit versions, respectively) to measure the experience of physical activity in patients with COPD [11]. Our group has also evaluated the validity of six commercial physical activity monitors in patients with COPD [15, 16]. The aim of the current study was 1) to identify a comprehensive set of nonredundant items and monitor outputs that can be used to measure physical activity in COPD patients and 2) to perform an initial validation of the two instruments.
Methods
Study design and sample
A 6-week, randomised, two-way cross-over, multicentre study was conducted in five European medical centres (fig. 1, further details in the online supplementary material). We recruited both stable and exacerbated patients with COPD defined by spirometry, to cover the whole range of physical activity. The study was advised and approved by the PROactive ethics and patient advisory boards, approved by the ethics committee at each centre and registered at www.clinicaltrials.gov (NCT01388218). Written consent was obtained from all patients.
Assessments and measures
Figure 1 shows the assessments and timing. In the first 2-week study period patients were randomised to complete either the daily PROactive item pool, consisting of 30 questions asking patients to report their physical activity experience on a daily basis, or the clinical visit PROactive item pool, of 35 questions using a 7-day recall (see online supplementary material). Following a 2-week wash-out patients completed the other questionnaire during the second study period. Both item pools were developed previously using appropriate qualitative research methods and linguistic adaptation for each site [11] in line with current European Medicines Agency [17] and US FDA published standards [14].
During the study periods (fig. 1), patients wore two accelerometers previously identified during a systematic review [10] and further validated for COPD patients against the doubly labelled water method [15] and indirect calirometry [16] (Actigraph G3Tx; Actigraph, Pensacola, FL, USA and Dynaport MoveMonitor; McRoberts BV, The Hague, The Netherlands).
Statistical analysis
The analysis sets and sample size calculations, as well as the statistical analysis plan were defined a priori according to objectives (see online supplementary material).
The item reduction process was conducted separately for each PROactive instrument and followed a systematic iterative process (fig. 2 and online supplementary material). Items were flagged and considered for removal if they exceeded prespecified thresholds of acceptability [18–25]. We hypothesised a priori that PROs alone might not capture information on the amount of physical activity well, so both item pools were analysed alone and in combination with activity data from each monitor (analysed separately).
Confirmatory factor analysis was used to test the fit of the final PROactive Physical Activity in COPD (PPAC) instruments with the dimensionality identified during the item reduction process [20, 26]. Initial validation of the item-reduced PPAC instruments comprised reliability (internal consistency and test–retest reliability) and construct validity (convergent, known-groups and discriminant validity) [23, 27]. All analyses were performed separately for each instrument and dimension, in case the instruments exhibited more than one dimension.
Results
236 patients with COPD were included in the study (fig. 1). Patients were mostly male (68%), with mean±sd age of 67±8 years, forced expiratory volume in 1 s of 57±21% and body mass index of 27±5 kg·m−2. Baseline characteristics are shown in table 1, and in online supplementary table E1 stratified by site. A total of 228 patients (97%) had valid (≥3 days with ≥10 h wearing time) data from activity monitors, showing good compliance and moderate levels of physical activity (online supplementary table E2).
Item reduction
The factor analysis, item-to-total and inter-item correlations, and infit and outfit statistics from the Rasch models suggested that both the daily and clinical visit versions of PPAC (D-PPAC and C-PPAC, respectively) had a bidimensional structure, with a clear distribution of items in two factors (online supplementary tables E3 and E4). Consequently, the process of item reduction was performed separately according to factors 1 “amount of physical activity” and 2 “difficulty with physical activity”.
Tables 2 and 3 detail the reasons for item flagging for exclusion: due to floor or ceiling effects, not fitting unidimensionality, redundancy, not proper ordering, differential item functioning or less relevant for patients and/or experts. For the D-PPAC, a total of five iterations in factor 1 (“amount”) allowed a reduction from nine to four items, including two variables from activity monitors (steps and vector magnitude unit (VMU)); and in factor 2 (“difficulty”), a total of 12 iterations allowed a reduction from 26 to five items. Full information on item removal is detailed in online supplementary table E5. The final version of the D-PPAC (see online supplement), described in figure 3, exhibited good performance and distribution of items, according to the person separation index (PSI) (0.83 in factor 1 with Dynaport, and 0.80 in factor 1 with Actigraph; 0.88 in factor 2) and the person-item map (fig. 4). Confirmatory factor analysis supported the two-factor structure and exhibited appropriate goodness of fit (online supplementary table E6).
For the C-PPAC, a total of four iterations allowed reduction from nine to four items, including two variables from activity monitors for factor 1. For factor 2, a total of 16 iterations allowed reduction from 31 to 10 items. The final version of the C-PPAC (see the online supplement), described in figure 3, also showed good performance and items distribution, as per the PSI (0.75 in factor 1 with Dynaport, 0.74 in factor 1 with Actigraph, and 0.93 in factor 2) and person-item map (fig. 4). Confirmatory factor analysis again supported the two factor structure and exhibited appropriate goodness of fit (table E6).
Several analyses were performed to select cut-off points for activity monitor variables to achieve optimal performance of the PPAC instruments. Because cut-offs were not identical between Actigraph and Dynaport, we determined agreement between the variables from the two monitors (complete methods are described in the online supplement) and observed 91% agreement for steps and 88% for VMU in the daily instrument, and 82% and 96% for steps and VMU, respectively, in the clinical visit version.
Complementary analyses using PRO data alone for factor 1 (i.e. excluding activity monitor data) showed that neither the D-PPAC nor the C-PPAC achieved optimal performance or discrimination. Specifically, all items exhibited low item-to-total correlation, some of them were flagged due to floor effect, redundancy, ordering problems and differential item functioning, and the PSI values decreased from 0.83 to 0.59 and from 0.75 to 0.62 in the daily and clinical visit versions, respectively, when the item reduction process did not include variables from the activity monitors (online supplementary table E7 and fig. E1). Taken together, the data show that the combination of PRO and monitor data resulted in better measurement properties than either PRO or monitor data alone.
Both PPAC instruments can be scored as a simple sum of the scores of the items (mostly from 0 to 4), with lower scores indicating poorer physical activity. The final PROactive scores exhibited no differences by sex, although they demonstrated lower values in the most severe COPD patients (online supplementary table E8).
Reliability and construct validity
Both PPAC instruments showed strong internal consistency and test–retest reliability (table 4 and online supplementary figs E2 and E3). In both PPAC instruments, factor 1 (“amount of physical activity”) exhibited weak correlations with health-related quality of life and moderate correlations with dyspnoea and exercise capacity, while factor 2 (“difficulty with physical activity”) scores showed moderate-to-strong correlations with health-related quality of life, dyspnoea and exercise capacity (table 5). Known-groups validity was good in both instruments, with PPAC scores differentiating across grades of dyspnoea, stable from exacerbated patients at baseline and tertiles of physical activity levels (using variables not included in the PPAC scoring, such as intensity) (online supplementary table E9). All correlations with unrelated constructs such as height, socioeconomic status and baseline heart rate were weak; supporting the discriminant validity of both PPAC instruments (online supplementary table E10).
Discussion
Our analyses resulted in two innovative reliable and valid instruments to fully evaluate the concept of physical activity in patients with COPD: the daily (D-PPAC) and the clinical visit (C-PPAC) versions of the PROactive instruments. The PPAC instruments are hybrid tools that combine PRO and activity monitors to assess physical activity. Interestingly, the C-PPAC turned out to have nearly twice as many items as the D-PPAC. We are not surprised by this fact, because some activities are not performed every day although they are covered in a 1-week period, therefore data about difficulties with such activities may not be captured with a 1-day-recall item but are captured with a 1-week-recall item.
The first important finding of the study confirms that the concept “physical activity” in patients with COPD has two domains, which we labelled “amount of physical activity” and “difficulty with physical activity”. The “amount” domain is covered by two PRO items (amount of walking outside and chores outside) and by two variables from the activity monitor (steps and VMU), in both PPAC instruments. The inclusion of other items or activity monitor outputs did not enrich the final score. The combination of PRO and activity monitor data yields a much more robust measure of “amount” than either of the components, a unique observation from our study. The “difficulty” domain contains only PRO items and includes items about avoiding or pacing activities and breathlessness during activities. Demonstrating that physical activity is structured in two domains concurs with our qualitative research of experience of physical activity where COPD patients universally described three themes: “amount of”, “symptoms during” and “adaptations to” [11]. Based on the analysis of this study, the themes “symptoms” and “adaptations” merged into a single domain (“difficulty”).
The second relevant finding is that the final PPAC instruments are hybrid tools combining PRO and activity monitors for the assessment of amount of physical activity in COPD patients. Our comprehensive analyses clearly showed that neither PRO nor activity monitors alone could discriminate well (PSI<0.4) within the latent patient-centred construct “amount of physical activity”, while the combination of both tools achieved good discrimination at all ranges of the scale. Restricting assessment to either a PRO or an activity monitor does not provide a complete measure of the amount of physical activity. It is possible that activity monitors do not totally capture activities of low intensity, or that relevant activity is not appropriately recalled by patients.
It could be argued that the combination of the PRO with the accelerometer may impair the use of the PPAC instruments. However, the use of activity monitors is becoming more frequent both in clinical practice [10] and in clinical trials [28–30]. In addition, research shows limitations in using only questionnaires for assessing physical activity [13]. The patient-centred development of the PPAC instruments provides comprehensive assessment of physical activity incorporating the patient viewpoint. Importantly, the PPAC instruments will not be limited to the two activity monitors reported in this paper: other activity monitors can be combined with the PRO items after appropriate validation is complete.
It could be argued that some of the excluded items should have been kept in the PPAC instruments due to their clinical relevance. For example, in the “amount” domain, no items about activities inside the house were kept. The item “chores inside the house” did not fit well with the rest of the items according to exploratory factor analysis and showed differential item functioning between males and females (online supplementary table E5). In addition, some male patients did not understand the item during interviews or cognitive debriefings because they considered this item to be more relevant to females. Two items about outside activities were included (“walking outside” and “chores outside”). Although superficially similar, these two items increased discrimination and covered different parts of the range of physical activity, as shown in PSI values and person-item maps. Interestingly, they completely overlapped with the range covered by “chores inside”. Finally, VMU, a variable from the activity monitors, showed the best discriminant ability at low amounts of activity (online supplementary fig. E1), suggesting that it conceptually covers what the item “chores inside” was not able to capture.
The item “difficulty climbing stairs” could seem to be important because climbing stairs represents a substantial effort, and was also identified by patients as a priority during interviews and focus groups [11]. However, Rasch analysis showed improper ordering of the item, suggested a poor fit with the other items, presented a floor effect and showed high correlation with other remaining items in the difficulty domain (online supplementary table E5). Both “chores inside the house” and “difficulty climbing stairs” are examples of items that are intuitively relevant which were removed based on corroborating arguments from different sources (statistics and qualitative studies).
Both PPAC instruments demonstrated construct validity, showing ability to measure the construct that they intended to measure. As expected, “amount” scores exhibited weak correlations with health status, and moderate correlations with dyspnoea and exercise capacity, while “difficulty” scores showed moderate-to-strong correlations with health status, dyspnoea and exercise capacity. Additionally, all correlations were <0.9, demonstrating that the PPAC instruments provide different information than existing scales, i.e. that PPAC instruments indeed measure physical activity and not exercise capacity or health status [23]. Similarly, the instruments showed good known-groups validity when comparing across previously defined groups, detecting differences in physical activity (using variables not included in the PPAC scoring) between them. Finally, the discriminant validity showed that the difficulty and amount domains from the PPAC instruments were unrelated (predefined as correlation <0.3) to the constructs that were expected to be unrelated such as patient height. In comparison with previous studies that aimed to validate questionnaires to measure physical activity in patients with COPD, our instruments exhibited better correlations for convergent validity. For example, correlations of ∼0.37 were found between the several outputs from the Yale Physical Activity Survey and the 6-min walking distance [31], compared to correlations of ∼0.55 in our instruments.
Measuring the experience of physical activity is important for COPD management because physical activity is important from the patient's perspective [11] and thus for their daily life, and no valid instruments are currently available to capture it. In particular, because patients report that treatments (which need not be limited to pharmacological approaches) that improve physical activity are of value to them [11], the availability of PPAC tools which have been developed in line with current guidance will help future clinical trials aiming to include physical activity as an regulatory-accepted outcome [14].
A strength of this study is that the instruments were developed according to current best practice [14, 17], and their psychometric properties were rigorously tested using appropriate methodology and predefined and rigorous statistical procedures [32]. Both the D-PPAC and the C-PPAC item pools were developed through a systematic process of literature reviews [12, 13], input from experts and, as recommended [33–35], input from patients through qualitative research using one-to-one interviews, focus groups and cognitive debriefing interviews. Patients recruited were representative of a wide spectrum of COPD disease severity, comorbidities and different geographical and cultural backgrounds [11]. The information from this qualitative research was also used during the item reduction process when deciding to remove or keep an item. An advisory board provided advice to guide the whole process for the item reduction and validation of the tools. Moreover, the item reduction and initial validation reported here were also based on a heterogeneous sample of patients with COPD that covered a whole range of physical activity and other clinical determinants.
Another strength of the PPAC instruments is that they are based on a conceptual framework. A systematic review identified the absence of a conceptual framework of physical activity in all existing instruments to assess it in patients with COPD [13], which raises doubts about the validity of these instruments. We drafted an initial conceptual framework from extensive patient input and literature review [11], which was refined after item reduction and confirmed using appropriate statistical methods.
Our study has some limitations. Firstly, although the sample size was sufficient and the population was quite heterogeneous to achieve the objective of the study, further validation is needed in specific subgroups of patients with COPD. For example, the person item maps of the “difficulty” domain (factor 2) suggest that our instruments perhaps cover patients with greater difficulty in physical activity than the difficulty shown by patients in our sample, since the person distribution is skewed to the lower end of the scale. Secondly, the PPAC instruments have been validated only in a European population and using only three different languages. Further studies are ongoing to validate the PPAC instruments in patients with both milder and more severe COPD, from other countries and using validated translations of the instruments. In the same way, additional studies are required to confirm the best scoring system for the PPAC instruments, to assess responsiveness to change due to interventions and to define levels of response.
In conclusion, both the D-PPAC and the C-PPAC, combining a short list of PRO items and two activity monitor variables, provide a simple, valid and reliable measure of physical activity in COPD patients. These tools will be further evaluated in different clinical situations and for responsiveness to drug and nondrug interventions.
Acknowledgements
The PROactive Consortium members are as follows. Nathalie Ivanoff: Almirall, Barcelona, Spain; Niklas Karlsson and Solange Corriol-Rohou: AstraZeneca AB, Mölndal, Sweden; Ian Jarrod: British Lung Foundation, London, UK; Damijen Erzen: Boehringer Ingelheim, Nieder-Ingelheim, Germany; Caterina Brindicci, Tim Higenbottam and Mario Scuri: Chiesi Farmaceutici S.A. Parma, Italy; Paul McBride: Choice Healthcare Solutions, Hitchin, UK; Nadia Kamel: European Respiratory Society, Lausanne, Switzerland; Margaret Tabberer: GlaxoSmithKline, Uxbridge, UK; Thierry Troosters and Fabienne Dobbels: Katholieke Universiteit Leuven, Leuven, Belgium; Judith Garcia-Aymerich: Municipal Institute of Medical Research, Barcelona, Spain; Pim de Boer: Netherlands Asthma Foundation, Amersfoort, The Netherlands; Karoly Kulich and Alastair Glendenning: Novartis, Basel, Switzerland; Katja Rudell and Frederick J. Wilson: Pfizer Walton Oaks, UK; Michael I. Polkey and Nick S. Hopkinson: Royal Brompton and Harefield NHS Foundation Trust, London, UK; Ioannis Vogiatzis: Thorax Research Foundation, Athens, Greece; Enkeleida Nikai: UCB, Brussels, Belgium; Thys van der Molen and Corina De Jong: University Medical Center, Groningen, The Netherlands; Roberto A. Rabinovich and Bill MacNee: University of Edinburgh, Edinburgh, UK; Milo A. Puhan and Anja Frei: University of Zurich, Zurich, Switzerland.
We are grateful for the input from the PROactive project ethics board, advisory board and patient input platform (www.proactivecopd.com/about/advisory-boards).
Footnotes
Clinical trial: This study is registered at www.clinicaltrials.gov with identifier number NCT01388218.
This article has supplementary material available from www.erj.ersjournals.com
Support statement: This study was supported by the European Commission Innovative Medicines Initiative Joint Undertaking (IMI JU #115011). There was no involvement of funding sources in study design; the collection, analysis, and interpretation of data; the writing of the report; nor the decision to submit the article for publication. This project was supported by the National Institute for Health Research Respiratory Biomedical Research Unit at the Royal Brompton and Harefield NHS Foundation Trust and Imperial College (London, UK) who part-fund Michael I. Polkey's salary. Funding information for this article has been deposited with FundRef.
Conflict of interest: Disclosures can be found alongside the online version of this article at erj.ersjournals.com
- Received October 3, 2014.
- Accepted March 22, 2015.
- Copyright ©ERS 2015
ERJ Open articles are open access and distributed under the terms of the Creative Commons Attribution Non-Commercial Licence 4.0.