Abstract
There is ongoing debate as to how asthma should be defined in order to forward understanding of the underlying mechanisms. The aim of the present study was to build quantitative scores of asthma and asthma severity and to assess whether refinement of disease phenotypes can facilitate the identification of chromosomal regions harbouring susceptibility genes.
A genome-wide linkage scan was conducted in 110 families with at least two asthmatic siblings (n = 508) from the French Epidemiological study on the Genetics and Environment of Asthma, bronchial hyperresponsiveness and atopy (EGEA). The phenotypes studied were an asthma severity score (assessed among asthmatics by combining clinical data and treatment), forced expiratory volume in one second (FEV1) and an asthma score (including both asthmatics and nonasthmatics and representing the whole disease spectrum).
This analysis showed genome-wide suggestive evidence of linkage of the asthma score to 18p11, a novel region undetected in a previous screen of dichotomous asthma. There was potential linkage of 2p23 to asthma severity score and of three regions (1p36, 2q36 and 6q14) to FEV1. Moreover, FEV1 appeared to have no genetic determinant in common with asthma severity and asthma scores.
Asthma and asthma severity quantitative scores revealed new regions of linkage and thus provide support for considering these phenotypes in future genetic studies.
The general consensus now emerging is that asthma is unlikely to be a single disease, but rather a collection of different phenotypes, as recently pointed out in the Lancet 1. These phenotypes may be defined from various criteria, including clinical and physiological characteristics (e.g. severity, resistance to treatment, chronic airflow restriction, etc.) 2. One strategy for determining whether these phenotypes represent different manifestations of a common underlying pathological process or are distinct disease entities is to refine phenotypic characterisation and search for determinants of these phenotypes. Indeed, improving phenotype definition can increase the power to detect the underlying genetic mechanisms and, conversely, genetic analysis can provide support for new phenotype definitions and new phenotypic entities.
Regarding the asthma phenotype, it has recently been shown that use of quantitative scores based on symptoms of asthma 3, rather than simple dichotomous definitions, can improve the identification of risk factors by reducing misclassification of disease status. Definition of asthma severity depends upon its primary purpose. The 2002 and 2004 Global Initiative for Asthma (GINA) guidelines, established to improve asthma management, combine current symptoms, respiratory function and treatment 4. However, it is now recognised that asthma severity is heterogeneous, being represented by different phenotypes that may result from different risk factors 2. Low lung function is a marker of asthma severity. It antedates the occurrence of persistent wheeze, tracks over the life span, and predicts the severity of asthma in adults in the general population 5 and exacerbations in clinical settings 6. Asthma occurrence, asthma severity and forced expiratory volume in one second (FEV1) may result from common determinants, but also from specific ones. Analysing these phenotypes separately rather than combining them in a single index makes it possible to discriminate between these hypotheses.
Evidence for familial resemblance of asthma severity and the absence of a significant relationship between asthma severity in probands and the occurrence of disease in relatives have suggested that specific familial factors, possibly genetic, are involved in asthma severity 7. Many genome-wide scans, including the Epidemiological study on the Genetics and Environment of Asthma, bronchial hyperresponsiveness and atopy (EGEA), have reported chromosomal regions linked to asthma, based on classical dichotomous definitions 8. Although candidate genes for severe asthma have been investigated 9, limited information is available regarding the localisation of potential new genes. To the present authors’ knowledge, no genome scan has yet been conducted for categorical phenotypes of asthma and asthma severity.
The EGEA study, with detailed data permitting the construction of scores for asthma and asthma severity in family members, and the availability of linkage analysis methods for ordered categorical traits make it possible to conduct a genome-wide screen for the three following phenotypes: 1) a categorical asthma score, in order to reduce potential asthma misclassification and to investigate the whole spectrum of disease expression; 2) an asthma severity score, in order to investigate variation in disease severity; and 3) a measure of ventilatory function, FEV1, an objective marker of asthma severity. This genome screen permits evaluation of whether the examination of more-refined disease phenotypes can help in the identification of novel regions of linkage. It also permits assessment of whether the three phenotypes under study are more likely to share genes or depend upon different genetic factors.
METHODS AND MATERIALS
The protocol for the EGEA data collection has been described elsewhere 7, 8, 10. Subjects answered a detailed questionnaire regarding respiratory symptoms and treatment based on international standardised questionnaires. FEV1 was measured according to American Thoracic Society recommendations.
Among the whole EGEA family sample ascertained through one asthmatic proband, 119 nuclear families including at least two asthmatic siblings were selected for the present study. Asthma was defined using the same criteria as in previous EGEA linkage scans 8, associating report of asthma attacks or attacks of breathlessness with wheezing ever and either airway hyperresponsiveness or reversible obstruction or hospitalisation for asthma or asthma treatment.
Phenotypes analysed
Asthma severity
Based on extensively documented phenotypes in EGEA, two measures of asthma severity were considered in the present study: 1) asthma severity score; and 2) FEV1.
Asthma severity score
The asthma severity score, ranging 1–4, was built in asthmatic siblings in two steps, following the concept of combining treatment and symptoms according to the 2002 and 2004 GINA guidelines 4. In step 1, the sum was computed, in respect of the last 12 months, of the following clinical items: frequency of asthma attacks (<1 attack per month (0), <1 attack per week (1), <1 attack per day (2) and ≥1 attack per day (3)); symptoms between asthma attacks (none (0), wheezing (1), wheezing and dyspnoea (2), and activities limited by dyspnoea (3)); and hospitalisation for asthma (no(0)/yes(1)). This sum of clinical items was grouped into four classes: 0, 1, 2 and ≥3. In step 2, the four-class variable built in step 1 was combined with the type of anti-asthmatic treatment during the past 12 months, as reported in the EGEA questionnaire (no treatment, treatment without inhaled steroids and treatment with inhaled steroids), resulting in an asthma severity score of 1, 2, 3 or 4, as shown in table 1⇓.
Asthma severity score#
Asthma score
An asthma score was built to represent the whole disease spectrum, encompassing nonasthmatics and asthmatics with varying asthma expression. This score ranged 0–4; a score of 0 was assigned to nonasthmatics and the scores of 1–4 represented the four classes of asthma severity defined above.
Genotypes
Genotyping of 396 microsatellites, with a mean spacing of 10 centimorgans (cM), was performed in EGEA families with at least two siblings with available DNA, following a protocol described elsewhere 8. After rigorous genotype quality control, the final sample for the present analysis included 110 families with at least two asthmatic siblings (508 subjects), comprising 218 genotyped parents (99% of all parents) and 288 genotyped siblings.
Linkage analysis
Linkage analyses of the two ordered categorical phenotypes (asthma severity score and asthma score) were performed using the maximum-likelihood-binomial method 11, 12 implemented in MLB-GENEHUNTER 11. This is a unique linkage analysis method for categorical traits. The principle of this approach is to introduce a latent (unobserved) binary variable (Y = {0;1}) that captures the linkage information between the observed categorical phenotype and the marker. For each phenotypic category, the latent variable Y can take a value of 1 (affected), with probability pi, or a value of 0 (unaffected), with probability 1–pi. Linkage is then investigated for all possible sets of Y values within sibships weighted by their probabilities (2s sets for a sibship of size s). Choice of these probabilities is guided by the hypothesis under test. The idea is that the probability of Y = 1 increases as the severity of asthma increases. Different sets of probabilities were used to model the correspondence between the observed category and the unobserved latent variable. When analysing the asthma severity score, the four probabilities assigned to the four classes (1, 2, 3 and 4) were chosen in order to provide maximal power for detecting genes controlling the mildest (classes 1 and 2, having low probabilities of 0.05 and 0.25 for the latent variable Y to be 1) or the most severe forms of asthma (classes 3 and 4, having high probabilities of 0.75 and 0.95 for Y to be 1). When analysing the asthma score, the probabilities modelling the correspondence between this score and the latent variable were chosen to investigate a continuum of disease expression and thus ranged from 0 in nonasthmatics to 1.0 in asthmatics belonging to severity class 4, intermediate probabilities of 0.25, 0.45 and 0.75 being assigned to classes 1, 2 and 3, respectively. The likelihood of the observations is expressed using a binomial distribution of parental marker alleles among offspring according to the value of the binary latent variable and depends upon only one parameter, α, the probability that siblings with Y = 1 receive one of the two marker alleles with the disease allele from his/her heterozygous parent, 1−α being the corresponding probability among siblings with Y = 0. The null hypothesis of no linkage (α = 0.5) against the alternative hypothesis of linkage (α>0.5) was tested using a likelihood-ratio test. It is of note that contribution to linkage information comes from sets of siblings having the same value of the latent variable (concordant siblings for Y = 1 or Y = 0) and from sets of siblings having opposite values of the latent variable (discordant siblings).
Multipoint linkage analysis of the continuous phenotype (FEV1 % pred) was conducted using the variance-components method 13 implemented in MERLIN 14. The variance-components method separates the total variation of a trait into genetic and environmental components and evaluates linkage by comparing a model incorporating both a genetic additive variance at a putative quantitative trait locus and a polygenic component with a purely polygenic model (quantitative trait locus variance set to zero) using a likelihood-ratio test. Since the variance-components method is known to be sensitive to departure from normality assumptions of the trait distributions, a probit transformation was applied prior to linkage analysis to FEV1 (% pred) in order to normalise its distribution.
For either method, minus twice the natural logarithm of this likelihood ratio is assumed to follow a one-sided Chi-squared distribution with one degree of freedom. This Chi-squared distribution divided by 2ln10 can be considered the logarithm of odds (LOD) score.
Correlations among genome-wide LOD scores
In order to assess whether or not the asthma score, asthma severity score and FEV1 (% pred) might share common genetic determinants, the correlation matrix was computed among the standardised genome-wide LOD scores obtained at 378 marker positions for these three phenotypes.
RESULTS
Sample characteristics
The relationship between the asthma severity score and the sum of clinical items is presented in table 1⇑. Taking treatment into account changed the scoring based on clinical items for 91 (42.7%) siblings, three-quarters of these siblings being reclassified as more severe and a quarter as less severe compared with step 1 when treatment was ignored.
The 288 genotyped siblings of the 110 nuclear families considered in the present analysis had a mean age of 15 yrs (range 3–43 yrs) and comprised 82% asthmatics. The principal characteristics of the asthmatic and nonasthmatic siblings are presented in table 2⇓.
Phenotypic characteristics of genotyped siblings in the set of 110 families with at least two asthmatic siblings
Regarding the asthma score, which included asthmatic and nonasthmatic siblings, 18% of these siblings belonged to class 0, 38% to class 1, 19% to class 2, and 12% each to classes 3 and 4.
Genome-wide screen
The genome-wide linkage test results for asthma score, asthma severity score and FEV1 (% pred) are shown in figure 1⇓ and table 3⇓.
Multipoint results of the genome-wide linkage scan for asthma score (------), asthma severity score (––––) and forced expiratory volume in one second percentage of the predicted value (– – – –) conducted in 110 Epidemiological study on the Genetics and Environment of Asthma, bronchial hyperresponsiveness and atopy (EGEA) families. The microsatellite markers were as follows: #: D1S468 (p = 0.004); ¶: D2S165 (p = 0.002); +: D2S126 (p = 0.003); §: D6S460 (p = 0.003); ƒ: D18S53 (p = 0.0004). Numbers above the x-axis represent chromosome number. cM: centimorgan.
Chromosomal regions showing significant(p≤0.01) multipoint logarithm of odds (LOD) scores for at least one phenotype#
The most significant evidence for linkage was found on chromosome 18p11, for asthma score, where the maximum LOD score reached 2.40 at D18S53, with a p-value of 0.0004, lower than the critical threshold of 7×10−4 for genome-wide suggestive evidence of linkage 15. There was modest evidence of linkage to asthma score (p<0.01) in two other regions: 5q15 (LOD 1.22 at D5S428; p = 0.009) and 14q13 (LOD 1.20 at D14S70; p = 0.009).
In respect of the asthma severity score, a maximum LOD score of 1.80 (p = 0.002) was obtained on chromosome 2p23, at marker D2S165. Additional potential linkage was observed in the 7q36 region (LOD 1.00 at D7S2465; p = 0.01).
Three regions were found to be linked to FEV1 (% pred) at p≤0.005: 1p36 (LOD 1.52 at D1S468; p = 0.004), 2q36 (LOD 1.59 at D2S126; p = 0.003) and 6q14 (LOD 1.64 at D6S460; p = 0.003). There were two additional linkage signals at the 1% significance level: 4p14 (LOD 1.21 at D4S405; p = 0.009) and 7p22 (LOD 1.19 at D7S531; p = 0.01).
Correlations among genome-wide LOD scores
Estimation of the correlations among genome-wide LOD scores for the three phenotypes showed nonsignificant correlations that were close to zero between LODs for FEV1 (% pred) and LODs for asthma score (r = -0.03) and asthma severity score (r = -0.07), respectively. Thus FEV1 (% pred) appears to have no genetic factor in common with the asthma score and asthma severity score. There was a significant positive correlation, although not very high, between LODs for the asthma score and asthma severity score (r = 0.13; p = 0.01), suggesting that these phenotypes may share a few genetic factors.
DISCUSSION
There is ongoing debate as to how asthma should be defined in order to progress understanding of the underlying mechanisms involved 1. To date, genetic studies have used dichotomous definitions, often established from questionnaires. The present genome-wide scan is the first to consider varying levels of asthma expression (from unaffected to severely affected). The asthma spectrum phenotype, encompassing nonasthmatics and various classes of asthma, showed genome-wide suggestive evidence for linkage to the 18p11 region. The asthma severity score, combining clinical items and treatment, showed linkage to the 2p23 region, whereas other linkage signals were detected for FEV1 on chromosomes 1p36, 2q36 and 6q14. None of the regions detected for any of these three phenotypes was revealed for the other two. Moreover, estimation of the correlations among genome-wide LOD scores showed no evidence for any common genetic determinant between FEV1 (% pred) and either the asthma score or asthma severity score, whereas there was suggestion of the sharing of a few genetic factors by these two scores. These potential common genetic determinants are likely to have weak effects since no region common to the latter two phenotypes was detected at the 1% significance level.
Phenotype definition and linkage analysis outcomes
The strongest evidence for linkage was found for the asthma score, on 18p11. Previous analysis of asthma in the same EGEA sample, using a dichotomous definition, had revealed a single linkage signal in the 1p31 region (p = 0.005) 8. Further analysis led to increased evidence for linkage to this region when affection status was defined by the presence of both asthma and allergic rhinitis, and showed that 1p31 is likely to contain a gene specific to this comorbid condition 16. Conversely, the 18p11 region, revealed by the present analysis of categorical asthma score, was not detected in a previous analysis of dichotomous asthma and seven other asthma-related phenotypes 8. This shows that considering a more refined phenotype, including various classes of asthma expression, can lead to the detection of new genes influencing variation in disease expression. Finding new regions by considering a categorical definition for asthma raises several hypotheses. First, these results may be due to a reduction in asthma misclassification. Whereas asthmatic probands were defined using a very strict procedure and recruited in chest clinics 10, the definition of asthma in siblings was less specific, although attention was paid to items such as bronchial hyperresponsiveness, treatment or hospitalisation in the confirmation of asthma. In this context, a categorical variable may decrease such potential misclassification bias. Indeed, a second possibility is that asthma is not a dichotomous phenotype, but, as with airflow limitation, may occur on a somewhat continuous scale. A similar approach of building continuous scores for asthma was proposed by Pekkanen et al. 3, and was shown to increase the power of detecting risk factors for disease. The present score for asthma was constructed upon the asthma severity score, itself defined from clinical symptoms and treatment, but other types of continuous score may be built. Interestingly, Pekkanen et al. 3 found that a simple sum of positive answers to eight questions regarding clinical symptoms and asthma medication, i.e. based on the same principles as the present grading of asthma severity, was as effective at detecting risk factors as a score computed from a more sophisticated principal component analysis of 12 questions regarding asthma symptoms. Thus, the present results and those of Pekkanen et al. 3 provide strong support for the usefulness of examining quantitative scores of asthma in aetiological research.
It was decided not to include FEV1 in the severity score, despite the fact that this physiological measure may be helpful in the assessment of asthma severity in clinical settings, since the present aim was to investigate whether ventilatory function and the asthma severity score result from common or distinct genetic determinants. The present results show that FEV1 (% pred) and the asthma severity score do not share any genetic determinant. This is in agreement with other phenotypic studies, which have shown that FEV1 does not systematically strongly correlate with disease symptoms 2, 17. Moreover, none of the published genome-wide screens conducted for lung function in asthmatic families reported linkage to the 2p23 region, detected here for the asthma severity score 18–20. Taken together, these results indicate that FEV1 (% pred) is more likely to be controlled by specific genetic determinants. Thus, in the search for genetic determinants of asthma severity, it may be worthwhile to consider lung function separately from clinical symptoms and treatment. It should also be noted that the 2p23 region found to be linked to asthma severity score was not revealed by a previous scan of dichotomous asthma and seven asthma-associated phenotypes 8, and thus appears as a new and specific region of asthma severity in the present sample. Among the three linkage signals revealed for FEV1, only the 6q14 region was detected by the previous scan of the whole EGEA sample with at least one asthmatic proband, and even more significantly (p = 0.001), suggesting that genes in this region may rather control physiological variation of this phenotype.
Replication of linkage results
It has often been advocated that replication of linkage results across studies provides support for the genuine involvement of linkage regions. An exhaustive compilation of the linkage results of the published genome scans of asthma and asthma-associated phenotypes performed to date, in 16 different populations (not including the EGEA study), was carried out. All previously reported linkage peaks (p≤0.01) spanning a 20-cM region on either side of each of the present five main linkage peaks were considered. Among the 16 asthma genome scans, the 18p11 region, detected for asthma score, was reported once in Australian twins for asthma analysed as a binary trait 18. The linkage of the asthma severity score to 2p23 was detected for eosinophils in Australian twins 18, 21, 22 and for severe asthma in German data 23. It should be noted that the latter study examined dichotomous subtypes of asthma, which were defined, using broader categorisation than the present study, from the three following items: asthma attack frequency, receiving asthma treatment or not, and experience or otherwise of at least one overnight hospital stay. Since FEV1 (% pred) was studied in only four asthma genome screens, linkage scans conducted for lung function phenotypes in families with early-onset chronic obstructive pulmonary disease (COPD) and from the general population were considered 24. Among the three linkage signals detected for FEV1, 1p36 was reported to be linked to asthma in two scans 22, 25 and 6q14 to eosinophils in one scan 26, whereas linkage to 2q36 was observed for atopy phenotypes in two asthma scans 18, 21, 22 and for lung function phenotypes in asthmatic families 19 and early-onset COPD families 27, 28. Regarding this latter result, longitudinal studies have shown that asthma is associated with accelerated lung function decline and is a risk factor for COPD 29. Thus genetic factors on chromosome 2q might contribute to the common determinants of asthma and COPD.
Limitations and strengths
A major strength of the present study was the availability of extensively documented phenotypes for all subjects in the EGEA sample, ascertained through asthmatic cases followed in chest clinics. This permitted the study of a wide range of asthma severity, but it is acknowledged that one limitation of the present study is the small number of severe asthmatics, such as those patients included in specific studies of this asthma sub-phenotype 2. Examination of the families that contributed to linkage to 2p23 came mainly from siblings affected with mild asthma, as well as from discordant sets of siblings with mild and severe asthma, suggesting that this region may harbour genes underlying mild asthma and possibly protecting from severe asthma.
The assessment of asthma severity remains a difficult issue because of the paucity of knowledge and heterogeneity of this condition, as well as rapidly evolving concepts regarding its measurement. Here, it was decided to combine clinical items and treatment in order to follow the same principle as recommended in the 2002 and 2004 GINA guidelines. Despite the fact that the dose of treatment was not available in the present data, it was possible to assess the use of inhaled corticosteroids during the past 12 months. Inhaled corticosteroids are the main controller medication and differentiate intermittent from persistent asthma in the GINA guidelines. Moreover, the asthma severity classification presently used has already been used to examine the relationship between occupational exposure and asthma severity in the EGEA data 30.
The present authors are aware that the latest GINA recommendations (2006) have evolved and shifted from severity to control. However, since this approach is adapted to clinical practices for the management of individual asthmatic patients, it may not be the best approach for epidemiological studies, aimed at determining risk factors for asthma severity in populations. In cross-sectional surveys, the level of asthma symptoms may not be interpreted in the same way in asthmatic populations with and without controller medication. Taking medication into account may provide useful additional information in populations, as underlined by Liard et al. 31, who showed that a classification of asthma severity combining treatment with symptoms and FEV1 showed better correlation with emergency visits or hospitalisation for asthma than the classification without treatment. The degree of airway hyperresponsiveness could be also considered an objective marker of asthma severity, but specific criteria for the test in the EGEA led to a large number of missing data in asthmatics, and thus to too few siblings being available for linkage analysis.
Another strength of the present study was the use of a unique method for categorical phenotypes that is very flexible, since the probabilities modelling the correspondence between the observed scores and the underlying latent variable can be chosen to increase the power for testing a given hypothesis. Moreover, this method takes into account, in the same analytical framework, sets of siblings with similar and opposite forms of the disease spectrum (possibly including those that are unaffected), which both contribute to linkage (concordant siblings sharing an excess and discordant siblings sharing a lower proportion of marker alleles that are identical by descent than the distribution expected under the null hypothesis of no linkage) and can thus increase the power for detecting linkage signals.
Conclusion
In conclusion, the present study shows that the use of a categorical phenotype to represent the whole spectrum of disease expression instead of a simple dichotomous phenotype can increase the power for detecting new genetic regions. It also provides evidence of a specific genetic component involved in asthma severity and that different genetic components underlie the various dimensions of asthma severity represented, on the one hand, by combination of clinical symptoms and treatment and, on the other hand, by forced expiratory volume in one second. These findings have important implication for the consideration of more refined disease phenotypes in future genetic studies of asthma.
Acknowledgments
The members of the Epidemiological study on the Genetics and Environment of Asthma, bronchial hyperresponsiveness and atopy (EGEA) cooperative group are as follows (all France).
Coordination: F. Kauffmann; F. Demenais (genetics); and I. Pin (clinical aspects).
Respiratory epidemiology: V. Siroux (INSERM U578, Grenoble); M. Korobaeff and F. Neukirch (Egea1 (first survey)/INSERM U700, Paris); I. Annesi-Maesano (INSERM U707, Paris); F. Kauffmann, N. Le Moual, R. Nadif and M.P. Oryszczyn (INSERM U780, Villejuif).
Genetics: J. Feingold (INSERM U393, Paris); M.H. Dizier (INSERM U535, Villejuif); E. Bouzigon and F. Demenais (INSERM U794, Evry); I. Gut and M. Lathrop (National Genotyping Centre, Evry).
Clinical centres: I. Pin and C. Pison (Grenoble); D. Ecochard (Egea1); F. Gormand and Y. Pacheco (Lyon); D. Charpin (Egea1) and D. Vervloet (Marseille); J. Bousquet (Montpellier); A. Lockhart (Egea1) and R. Matran (Paris Cochin); E. Paty and P. Scheinmann (Paris Necker); A. Grimfeld and J. Just (Paris-Trousseau).
Data and quality management: J. Hochez (Egea1/INSERM ex-U155); N. Le Moual and C. Ravault (INSERM Unit 780, Villejuif); N. Chateigner (INSERM U794); J. Ferran (Grenoble).
- Received December 12, 2006.
- Accepted April 18, 2007.
- © ERS Journals Ltd