Copyright ©ERS Journals Ltd 2001 Genetics of COPD: present and futureCORRESPONDENCE: J.R. Hoidal, Pulmonary Division, University of Utah, Health Sciences Center, 26 North 1900 East, Salt Lake City, UT, 84132-4701, USA. Fax: 1 8015853355
Chronic obstructive pulmonary disease (COPD) is a devastating clinical problem. In the USA, it affects >16 million people, accounts for 13% of hospitalizations, and is the fourth leading cause of death 1. Similarly, COPD is a major cause of morbidity and mortality worldwide. Over the next half-century, one and a half million deaths·yr1 due to COPD are predicted in China alone 2. Although cigarette smoking is the major risk factor, only 1020% of smokers develop symptomatic COPD. The extent of cigarette smoking (pack-yrs and duration of smoking) only account for
To date, For genetically complex diseases, recent emphasis has been placed on the common disease-common variant hypothesis, which states that common disease susceptibility or resistance variants are expected to be few at each locus, relatively common in the human population and enriched in the coding and regulatory sequence of genes 9. Despite the small effects of such genes individually, the magnitude of their attributable risk (the proportion of people affected by them) may be large because they are quite frequent in the population, making them of public health significance. Thus, in COPD, it is likely that multiple genes are operating and the genetic susceptibility may depend on the coincidence of several gene polymorphisms acting together. Moreover, expression of different gene combinations probably affects the heterogeneous histopathological and clinical profile of COPD seen among individuals.
To date, the genes implicated in the pathogenesis of COPD have been found to be involved in antiproteolysis, metabolism of toxic substances in cigarette smoke, the inflammatory response to cigarette smoke, and the efficiency of mucociliary clearance in the lung. A variety of studies have examined candidate gene loci with association studies, comparing the distribution of variants in genes hypothesized to be involved in the development of COPD in patients and control subjects. Polymorphisms in the genes for One key to establishing that a gene modifies the risk for a disease is replication of the association in different populations. This can be a difficult task. Several factors could contribute to the inconsistent results of case-control genetic association studies in COPD. Genetic heterogeneity or different genetic mechanisms in different populations could play a part in making it difficult to replicate associations between studies. In addition, false-positive or false-negative results could contribute. To be useful, polymorphisms need to be real, not errors or artefacts. A further well-recognized potential problem is ascertainment bias. Case-control association studies are susceptible to supporting associations based purely on population stratification that may result from incomplete matching between cases and controls, including differences in ethnicity and geographical origin. To the authors' knowledge, there are as yet no reported association studies in COPD that use family-based controls, a study design that is immune to population stratification effects. In this issue of the European Respiratory Journal, Hirano et al. 20 suggest for the first time that variants in the tissue inhibitor of metalloproteinases (TIMP)-;2 gene predispose to COPD. The phenotype used is pulmonary function, as measured by forced expiratory volume in one second, which is a strong correlate of COPD and, in population studies, is predictive of all-cause mortality 2123. The association of TIMP-;2 polymorphisms with COPD has good biological plausibility. As referenced by Hirano et al. 20, there are substantial in vitro and in vivo data to suggest that metalloproteinases may be important in the pathobiochemistry of COPD. The authors also indicate that the results are not due to the confounder of population stratification, although this possibility cannot be completely excluded by the information provided. In future studies, it will be important to determine whether the observation can be confirmed in different populations and to ascertain how the observed polymorphisms affect the expression or activity of TIMP-;2. What lies ahead in efforts to uncover the genetic basis of COPD? Risch and Merikangas 24 suggested that the future of genetic studies for complex polygenic diseases such as COPD will require large-scale testing by association analysis, rather than the linkage analysis approach that has been used successfully to find major genes. They indicate that many genetic effects, too weak to identify by linkage, can be effectively detected by genomic association studies. The limitation of association studies is that the actual polymorphisms within the gene (or at least a polymorphism in strong disequilibrium) must be available. A large number of genes and polymorphisms must first be identified, and an extremely large number of such polymorphisms will need to be tested. Testing such a large number of polymorphisms on several hundred individuals has, until recently, seemed unrealistic in scope. However, with the publication of drafts of human genome sequences 25, 26, the human genome projects have provided the first complete sequence of the reference human genome. This landmark feat will accelerate the pace at which the spectrum of genetic variation in the human gene pool and the genetic basis of many complex human phenotypes can be elucidated. There are now numerous ongoing efforts to catalogue sequence variants. Emphasis is being placed on detecting single nucleotide polymorphisms (SNPs), the most common human sequence variation. Two approaches include the targeted resequencing of genomic deoxyribonucleic acid (DNA) from diverse individuals, and random, whole genome approaches that utilize either pre-existing datasets (e.g. expressed sequence tag databases) or collected random sequence samples of the genome. An example of the latter approach is the work spearheaded by the SNP Consortium Ltd, a group of 10 pharmaceutical companies, and the Wellcome Trust. The SNP Consortium is set to produce 300,000 publicly available SNPs within the next year. Improved technologies for scoring SNPs are reducing hands-on time and cost, although truly high-throughput methods are still lacking for genome-wide, population-based studies. For sequence variants to be fully utilized, there must be up-to-date, complete, coherent and organized views of this complex dataset so that investigators are empowered and not overwhelmed by its use. Investigators using the genome sequence must have access to electronic resources that allow them to gather, manipulate and display sequence-based information so that they can derive the intended benefits. Integrated, up-to-date sequence-based views of complementary DNA and genomic sequences are essential for the analysis of common variation in candidate genes, and the prioritization of candidate genes and regions for further analysis. Bioinformatics tools will be at the core of this approach. Powerful new types of bioinformatics will be required to assimilate and interpret the data that will result from various types of genomics research.
The rate of detection and reporting of polymorphisms is increasing logarithmically. In the last decade, A word of caution. The choice of phenotype is critical to the success of gene discovery programmes. If the cases include a heterogeneous collection of aetiologies for a complex disease, the power to detect a significant association will be reduced. Similarly, if the control group includes subjects who are affected but undiagnosed, power will be reduced. This is particularly relevant to gene discovery in COPD because of its variable pathological features and late onset of disease. In summary, establishing molecular variants that may constitute inherited predispositions to chronic obstructive pulmonary disease can be a daunting task. However, it will be far more challenging to establish causation for a chronic obstructive pulmonary disease phenotype; that is, to understand the mechanism by which a genetic factor may predispose to the disorder. Ultimately, the studies of disease gene identification will have to be turned back over to biologists and physiologists because the identification of the functional disease mutation depends upon the demonstration of a biological basis for one or more of the putative genetic variants in the development of disease. As with the tissue inhibitor of metalloproteinases-2, the product of the gene under examination may be highly pleiotropic, due to its involvement with multiple physiological processes in multiple tissues, and determination of the role of a specific variant may require extensive manipulation in intact organisms. References
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||