Abstract
Background Primary ciliary dyskinesia (PCD) is a heterogeneous inherited disorder caused by mutations in approximately 50 cilia-related genes. PCD genotype–phenotype relationships have mostly arisen from small case series because existing statistical approaches to investigating relationships have been unsuitable for rare diseases.
Methods We applied a topological data analysis (TDA) approach to investigate genotype–phenotype relationships in PCD. Data from separate training and validation cohorts included 396 genetically defined individuals carrying pathogenic variants in PCD genes. To develop the TDA models, 12 clinical and diagnostic variables were included. TDA-driven hypotheses were subsequently tested using traditional statistics.
Results Disease severity at diagnosis, measured by forced expiratory volume in 1 s (FEV1) z-score, was significantly worse in individuals with CCDC39 mutations (compared to other gene mutations) and better in those with DNAH11 mutations; the latter also reported less neonatal respiratory distress. Patients without neonatal respiratory distress had better preserved FEV1 at diagnosis. Individuals with DNAH5 mutations were phenotypically diverse. Cilia ultrastructure and beat pattern defects correlated closely to specific causative gene groups, confirming these tests can be used to support a genetic diagnosis.
Conclusions This large scale, multi-national study presents PCD as a syndrome with overlapping symptoms and variations in phenotype according to genotype. TDA modelling confirmed genotype–phenotype relationships reported by smaller studies (e.g. FEV1 worse with CCDC39 mutation) and identified new relationships, including FEV1 preservation with DNAH11 mutations and diversity of severity with DNAH5 mutations.
Abstract
Topological data analysis of 396 primary ciliary dyskinesia patients shows genetic mutations of worse (CCDC39), variable (DNAH5) and milder (DNAH11) effects on lung function, offering the potential for more accurately targeted disease management https://bit.ly/3oL5r64
Introduction
Primary ciliary dyskinesia (PCD) is clinically and genetically heterogeneous. Symptoms relate to dysfunction of multiple motile cilia and can include neonatal respiratory distress syndrome (NRDS), wet cough, recurring upper and lower respiratory tract infections, otitis media, bronchiectasis, infertility, situs inversus and congenital heart disease (CHD) [1]. Mutations in 50 ciliary genes have been described so far [2, 3].
Understanding of genotype–phenotype relationships informs diagnostic decisions and treatment; however, due to the rarity (≈1:10 000) and diversity of PCD, and the constraints of traditional statistical methods, a large patient cohort has never been studied for these relationships. Evidence for clinically relevant genotype–phenotype associations is mostly limited to small case series for a specific gene or clinical characteristic. For example, individuals with variants in HYDIN (a central pair projection gene), or in multiciliogenesis genes like MCIDAS and CCNO, are unlikely to have situs inversus as nodal cilia are not affected [4–7]. Using traditional statistical approaches, cohort studies have been underpowered to investigate by single gene and have instead combined functionally similar genes for analysis. A North American study of 137 children reported worse lung disease in those patients with central apparatus or microtubular disorganisation (MTD) (with inner dynein arm (IDA) ultrastructural defects), most of whom had CCDC39 and CCDC40 variants, than in those patients with outer dynein arm (ODA) defects caused by DNAH5 variants [8, 9].
Topological data analysis (TDA) allows for the visual exploration of data without establishing a priori hypotheses [10]. It can be used to explore the underlying patterns in complex datasets by generating clusters of individuals with similar features in multiple dimensions, in an unsupervised manner, as extensively validated in several clinical studies [11–13]. TDA can be used to highlight small groups of interest in large or complex datasets, which could otherwise be overlooked when applying traditional clustering methods that are typically more constrained by a requirement for pre-selection of parameters (e.g. definition of the number of clusters) to drive data analyses [10, 14]. In doing so, TDA can uncover patient subgroups more likely to benefit from a particular therapeutic intervention [12, 15–17]. It thereby provides a promising approach to investigate genotype–phenotype associations in heterogeneous patients with rare diseases.
Our aim was to investigate relationships between clinical, diagnostic and genetic data, hypothesising that different subgroups of PCD patients, with particular clinical and diagnostic phenotypes, could be identified according to their underlying genotypes.
Methods
Ethics
Local and national research approvals and ethical approvals were obtained and adhered to (NRES Committee South Central Hampshire Ethics 06/Q1702/109, London Bloomsbury Research Ethics Committee 08/H0713/82 and Ile-de-France Ethics Committee CPP07729).
Study Design
Clinical and diagnostic data were retrospectively collected from patients with a confirmed genetic diagnosis of PCD (i.e. carrying autosomal bi-allelic variants or an X-linked variant classified as pathogenic according to international guidelines) [18, 19]. The data coding for the clinical characteristics included in the study is shown in supplementary table E1.
The study design was based on previous TDA studies and is outlined in figure 1 [15]. TDA was performed in order to generate hypotheses, which could then be tested using more traditional statistical methods. TDA was applied to a discovery cohort of 199 patients (cohort details and genetics can be found in supplementary tables E2–E4) and validated using a second cohort of 197 patients (cohort details and genetics can be found in supplementary figure E1 and supplementary tables E5 and E6). An overview of the PCD genes affected by mutations in the full study population is shown in supplementary figure E2.
Topological data analysis
Topological models were developed using a licensed version of TDA software through the Symphony AyasdiAI cloud-based platform, version 2.0 (Ayasdi Inc., Menlo Park, CA, USA; www.ayasdi.com). More details of TDA are available in the supplementary material.
The phenotypic data used for clustering were body mass index (BMI), forced expiratory volume in 1 s (FEV1) z-score, forced vital capacity (FVC) z-score, neonatal respiratory distress (NRDS), wet cough, rhinitis, glue ear, cardiac situs, congenital heart disease (CHD), nasal nitric oxide (nNO), ciliary beat pattern (CBP) and transmission electron microscopy (TEM). Genetic data were not used to generate the topological models, as these were the study's main variable of interest; however, genes of interest were later mapped onto the models to develop hypotheses regarding genotype–phenotype associations.
Models were generated using an automated analysis option. Locally-linear embedding (LLE) is a non-linear dimensionality reduction technique, in which highly complex data are summarised and compressed into smaller representations of their variability. The topological model with the best-defined clusters upon visual inspection used two LLE lenses and the correlation distance as a metric (i.e. a distance function). These identical parameters were applied to develop the discovery and validation models.
The Mapper algorithm was used to identify coherent groups of samples [20]. Each node of the topology model constitutes patients who have combinations of features that are similar between them, with connecting lines (edges) representing data points that are shared between nodes. The size of the node represents the number of subjects with that specific combination of features. Genotypes were mapped onto the model to visualise hypothesised associations between genotype and phenotypic clusters. Validation of hypotheses suggested by the TDA was then performed using standard statistical analysis techniques. Generating hypotheses using TDA prevented the requirement for multiple comparisons and loss of statistical power.
TDA is an effective method to apply in clinical studies as it can allow for missing data [21]. A more detailed explanation of TDA can be found in the supplementary material.
Statistical analysis
Selection of variables for hypothesis testing was guided by the topological models to limit the number of comparisons. Further methodological details are provided in the supplementary material.
The derived hypotheses were tested through statistical analyses of the whole dataset and of the validation dataset alone. Where the same outcome was tested twice, p-values were adjusted using the Bonferroni correction (p≤0.049 was found to be significant). Continuous data were compared using t-tests, ANOVA and Kruskal–Wallis tests, while categorical data were compared using Chi-squared or Fisher's exact tests. Tukey's test was used for pairwise comparisons following ANOVA and Dunn's test with Holm–Sidak adjustment was used following Kruskal–Wallis tests. Multiple regression models were used to model FEV1 z-scores, adjusting for age at diagnosis, history of NRDS and presence of CHD. Normality of residuals was investigated using kernel density estimations, and visual inspection of histograms and residuals versus fits graph plots. Number of observations (n), regression coefficients (r) with 95% confidence intervals (CIs) and a model's goodness-of-fitness (adjusted R2) were reported for each model. Data were analysed in STATA version 14.0 (StataCorp, College Station, TX, USA).
Results
Data-driven genotype–phenotype associations using TDA in a discovery group of 199 PCD patients
Genotype and diagnostic test phenotype associations
TEM defect and CBP visually mapped very closely to the corresponding gene group (figure 2).
Genotype and FEV1 associations
Systematic exploration of each of the features collected for this study showed that patients with defects in the radial spoke/central complex (CC) and nexin–dynein regulatory complex (N-DRC)/molecular ruler gene functional groups had worse FEV1 z-scores at diagnosis (as indicated by dark blue coloured nodes in figure 3b) than those with dynein structural gene mutations (which had higher FEV1 z-scores, as indicated by white coloured nodes in figure 3b). Interestingly, in the cluster with predominantly poor FEV1 (dark blue in figure 3b), which corresponds to N-DRC or molecular ruler genes (CCDC39, CCDC40, CCDC65, DRC1) (figure 3a), there was a defined group showing absence of history of rhinitis (supplementary figure E3b).
The group with predominantly preserved lung function at diagnosis (white in figure 3b) corresponds to a cluster of individuals with absence of NRDS (white in figure 3c) and an area associated with gene defects of dynein structure (blue in figure 3a). Further exploration of the topological model showed that, within this dynein structural defects group, it was predominantly DNAH11 patients that had preserved lung function at diagnosis and absence of NRDS (green in figure 3e).
In contrast, individuals with variants in DNAH5 (the most common genetic cause of PCD and the predominant patient group in the cohort) were a phenotypically diverse group regarding lung function, with no clear cluster observed (figure 3f).
Genotype and other clinical phenotype associations
The model showed a group of patients with CC and N-DRC/molecular ruler gene mutations without situs inversus but with increased likelihood of glue ear (yellow and orange in supplementary figure E3a, as well as red in supplementary figure E3c) [7, 22]. In addition, there was a lack of laterality defects associated with MCIDAS and CCNO in the other function gene group (red in supplementary figure E3d) [6, 23]. Conversely, TDA revealed a cluster of patients with absence of glue ear. This was a genetically diverse group of individuals with dynein structural and assembly defects (blue and green in supplementary figure E3a and white in supplementary figure E3c).
Validation using TDA in a replication group of 197 PCD patients
A topological validation model was generated by analysis of a replication cohort of 197 additional patients (61 from the UK, 28 from The Netherlands and 108 from France) (supplementary tables E5 and E6). This confirmed the discovery group findings, with CCDC39 mutation patients clustering in an area of the structure with lower FEV1 z-scores at diagnosis (dark blue in figure 4b and green in figure 4d) and a higher proportion of reported NRDS (red in figure 4c). Meanwhile, DNAH11 mutation patients clustered in an area with higher FEV1 z-scores (green in figure 4e and light blue and white in figure 4b) and less reported NRDS (red and white in figure 4c). The model also confirmed the absence of a clear cluster of patients with DNAH5 mutations (green in figure 4f). Additional features of the validation cohort are shown in supplementary figure E4.
When analysing gene groups, those with mutations in the dynein regulatory/molecular ruler gene category had worse FEV1 z-scores (orange in figure 4a and dark blue in figure 4b) and less rhinitis (data not shown) at diagnosis, as seen in the discovery model. The cluster with preserved lung function was mostly formed by patients with dynein structure gene variants (light blue and white in figure 4b and blue in figure 4a), particularly DNAH11 (green in figure 4e). However, we could not confirm the inverse association between upper airway disease (rhinitis and glue ear) and lower airway disease (FEV1 and NRDS) observed in the discovery model (supplementary figure E4).
The distribution of gene variants in the sum total of 396 patients from both cohorts, in 31 PCD genes, is shown in figure 5 and the clinical and diagnostic characteristics are given in supplementary tables E7 and E8.
Validation of hypothesis suggested by TDA using standard statistical analysis
Two genes, CCDC39 and DNAH11, fulfilled the criteria for further hypothesis-driven statistical analysis. This required the identification of clearly defined clusters of patients, with mutations in each gene, showing distinct features in both the hypothesis-driving discovery (figure 3) and validation (figure 4) topological models. In addition, sufficient patients were required in each phenotype to allow standard statistical approaches (n=35 and n=48, respectively) (figure 5). These two genes clustered in areas with extreme values of FEV1 z-score in both topological models, leading to the hypothesis that CCDC39 and DNAH11 patients had a distinct respiratory phenotype compared to the rest of the study population.
Testing these hypotheses using traditional statistical analysis techniques, CCDC39 mutation patients had significantly lower FEV1 z-scores at diagnosis compared to all other patient genotypes grouped together when adjusted for age at diagnosis, NRDS and CHD (r=−1.2, 95% CI −1.88 to −0.55; adjusted R2=8.0%; p<0.001; n=205). Conversely, those with DNAH11 mutation had significantly higher FEV1 z-scores at diagnosis (r=0.09, 95% CI 0.27–1.53; adjusted R2=5.8%; p=0.003; n=205) and reported less NRDS compared to patients with mutations in any of the other genes (41.03% versus 63.91%; p=0.008).
In contrast, there were no statistically significant differences in NRDS for patients with CCDC39 mutations (67.86% versus 60.29% for any of the other genes). Nor were there statistically significant differences in upper airway symptoms (i.e. rhinitis and glue ear) for patients with CCDC39 mutations (96.77%) compared to muations in other genes (93.44%) or in DNAH11 mutations (97.67%) compared to other genes (93.18%).
Discussion
This is the first large-scale study to systematically investigate associations between genotype and phenotype in the genetically heterogeneous disorder PCD. It demonstrates the use of a new methodology for the visualisation of data and the generation of hypotheses that complements more traditional statistical approaches, which, where used alone, would not be sufficiently powered even in multinational cohorts. TDA cluster modelling, in nearly 400 individuals from three European countries, identified several previously unknown genotype–phenotype relationships, in addition to confirming previously reported genetic associations [7, 22, 24]. PCD, a disease with many well-defined features and 50 causal genes, leant itself to TDA and machine learning for the identification of distinct phenotypic clusters that might share an underlying genetic mutation. TDA was able to identify clinical patterns amongst relatively small numbers of patients (less than 40) with mutations in a particular gene. We suggest the approach might be beneficial for similar rare diseases, where traditional statistical methods are not suitable.
The TDA model confirmed well-established associations between diagnostic tests (e.g. TEM and CBP) and genetics, as seen by the similar colour patterns in the topological models (figure 2), where TEM defect and CBP mapped very closely visually to corresponding gene group. This confirms a strong association that is in agreement with the published PCD literature [2, 21]. Distinct genetic findings were also associated with disease severity. We found CCDC39 patients had significantly worse lung function at diagnosis (by FEV1 z-score) when compared to all other groups, as has previously been observed in individuals with microtubular defects [8, 9, 25, 26]. Furthermore, modelling identified other findings not previously reported, including that individuals with DNAH11 mutations were significantly less likely to have NRDS and, in turn, that the absence of NRDS is associated with better lung function at diagnosis. These findings were consistent between both the discovery and validation groups, and when using traditional statistical approaches.
The underlying pattern of the discovery group topological model data suggests that patients with compromised lower airways at diagnosis (i.e. decreased lung function and a history of NRDS) reported fewer upper airway symptoms (i.e. a history of glue ear and rhinitis). However, these findings could not be verified in the validation model as they may result from its over-fitting and this requires independent validation in an adequately powered independent dataset.
Comparison to previous literature
Our findings confirm and add to the evidence from other PCD genotype–phenotype studies. The largest of these have been two cross-sectional, longitudinal studies from the USA and Canada (the Genetic Disorders of Mucociliary Clearance Consortium) which also showed, based on ultrastructural phenotype and limited genotype information, that patients with microtubular defects had worse lung function [8, 9]. We also confirmed associations previously described in smaller studies, such as the absence of situs inversus in individuals with radial spoke, CC and N-DRC/molecular ruler gene mutations [4, 5, 22, 27, 28].
A previous study, using lung clearance index as a more sensitive measure of lung function, showed preserved lung function in a small group of patients from our cohort with normal ultrastructure (of which the majority had DHAH11 defects) [26]. We have further confirmed that this genotype is associated with milder lung disease by showing that these patients clustered in an area with higher values of FEV1 z-score. Traditional statistics also showed better preserved lung function in patients with DNAH11 variants compared to those with mutations in any of the other genes.
Notably, patients carrying mutations in DNAH5 were phenotypically diverse. The reasons for this are unclear, but may likely be connected to the variety of different mutations within this large gene. DNAH5 was the gene found to have the widest spectrum of gene variants in our overall cohort. This diversity and high number of different mutations is in line with DNAH5 being the most common overall genetic cause of PCD and most frequently mutated gene in affected individuals (with at least 100 different pathogenic mutations recorded worldwide) [29]. In PCD, it is likely that there will be patient phenotypic differences associated not just with the specific gene, but also the nature and location of the mutations within that gene. These genotype-related differences are already emerging on a smaller scale. For example, diagnostic results in DNAH5 are known to vary somewhat depending on the mutation type (e.g. premature stop codon (nonsense) versus missense) [30]. Differences are also associated with missense versus truncation mutations in CCDC103, where a milder diagnostic and clinical phenotype has been described in individuals with p.His154Pro missense mutations [18].
Strengths and weaknesses
This is the largest study to date investigating genotype–phenotype associations in PCD. Using a new methodology of hypothesis-free TDA to examine underlying patterns in the dataset, genotype–phenotype patterns were identified from relatively few patients, something that would be difficult with usual clustering methods. The use of temporally and geographically distinct training and validation groups is highly recommended for such topological clustering approaches [31]. Initial UK discovery findings were validated in the mixed internal and external dataset, including by replication of several important associations published previously, suggesting that these results are generalisable to other PCD populations.
The major weakness of our study remains the statistical power required to tease out relationships in a heterogeneous rare condition. To avoid problems with multiple comparisons and loss of statistical power, TDA-led hypothesis testing was performed for only two genes (CCDC39 and DNAH11). This required combining the discovery and validation datasets, and a multinational dataset larger than any existing cohort will be required to ascertain further differences, especially to analyse whether variant types (e.g. stop-gain, frameshift, splicing, missense and copy number variants) explain some of the differences seen in the phenotypic data.
Another limitation of our study was potential recall bias for neonatal and early life events, due to a reliance on parental memory to report symptoms at the time of diagnosis. Not all medical records were complete and therefore “missing data” was recorded for some of these variables; however, TDA is particularly robust to missing data (see supplementary material for more information) [14]. Finally, we acknowledge that TDA is not completely hypothesis free, as we chose variables to enter into the models. Furthermore, there may be confounding variables affecting our models that have not yet been identified.
Potential impact for clinical management and research
A better understanding of genotype–phenotype associations from studies such as these should inform education and counselling for PCD patients and their families, and will alter disease management in the future. Identifying patients that may require more aggressive or personalised treatment due to underlying genetics will allow for better and more targeted care (e.g. high-risk groups, such as patients with CCDC39 mutations, might benefit from more intense and targeted therapies).
The identification of mutations in known PCD-causative genes confirms a diagnosis of PCD. The topological models highlighted previously describe links between affected genes, TEM defects and CBPs from high-speed video analysis (HSVA), indicating that diagnostic TEM and HSVA tests can play an important supportive role in the classification of novel gene variants (likely of causal nature) and variants of uncertain clinical significance [2, 19]. These tests can also direct genetic testing to target a specific subset of genes.
By including longitudinal parameters such as lung function in the model, our approach for exploring genotype–phenotype associations might be useful for future longitudinal trials in PCD. It is a model-generating approach that could also usefully be applied to other rare diseases and to more common conditions. More accurate mapping of clinical characteristics, including severity, will allow a more targeted approach to treatment with an associated improvement in patient outcomes.
Overall, these clinically important findings can be useful in counselling parents and when considering prognosis and ongoing therapeutic interventions.
Supplementary material
Supplementary Material
Please note: supplementary material is not edited by the Editorial Office, and is uploaded as it has been supplied by the author.
Supplementary material ERJ-02359-2020.SUPPLEMENT
Shareable PDF
Supplementary Material
This one-page PDF can be shared freely online.
Shareable PDF ERJ-02359-2020.Shareable
Acknowledgements
We thank the patients and their families for participating in the study and acknowledge the PCD Family Support Group. Borislav Dimitrov (University of Southampton) led initial discussions regarding statistical and TDA approaches for exploring genotype–phenotype relationships. He sadly died before the analyses began. We thank the following for their clinical and laboratory contributions to data used in this manuscript: Lucy Jenkins, Thomas Cullup, Alexandros Onoufriadis (University College London and Great Ormond Street, UK), Patricia Goggin, Claire L. Jackson, Janice Coles, James Thompson, Amanda Harris, Amanda Friend (University of Southampton and University Hospital Southampton, UK), Mellisa Dixon, Sarah Ollosson, Andrew V. Rogers, Emily Frost, Charlotte Richardson, Farheen Daudvohra, Paul Griffin and Thomas Burgoyne (Royal Brompton Hospital, UK). The researchers are supported by the Better Evidence to Advance Therapeutic options for PCD (BEAT-PCD) network (COST Action 1407 and European Respiratory Society (ERS) Clinical Research Collaboration). Several authors of this publication are members of the European Reference Network for Rare Respiratory Diseases (ERN-LUNG) (project ID number 739546).
Footnotes
This article has supplementary material available from erj.ersjournals.com
This article has an editorial commentary: https://doi.org/10.1183/13993003.00392-2021
Author contributions: Concept and design of the study: J.S. Lucas, C. Hogg, H.M. Mitchison, A. Shoemark and B. Rubbo. Genotyping: H.M. Mitchison, M.R. Fassad, M.P. Patel, N.S. Thomas, D. Hunt, M. Edwards, D. Morris-Rosendahl and M. Legendre. Clinical characterisation: J.S. Lucas, C. Hogg, W.T. Walker, M. Carroll, S.B. Carr, M.R. Loebinger, R. Wilson, E.G. Haarman, J-F. Papon, B. Maitre, G. Thouvenin, P-R. Burgel and I. Honoré. TDA models: B. Rubbo, J. Brandsma and G. Carlsson. Data collection: B. Rubbo, A. Shoemark, S. Best, W.T. Walker, C. Hogg, J-F. Papon, B. Maitre, G. Thouvenin, P-R. Burgel, I. Honoré, E.G. Haarman, I.C.M. Bon, E. Escudier, G. Jouvion and M. Legendre. Planned and performed the statistical analyses: B. Rubbo, A. Shoemark, C.M. Parsons and J.S. Lucas. Laboratory analyses and data collection: E. Escudier, G. Jouvion, A. Shoemark, M. Legendre, S. Best, H.M. Mitchison and M.R. Fassad. Interpretation of data analyses: B. Rubbo, A. Shoemark, J.S. Lucas, H.M. Mitchison and C. Hogg. Drafted the manuscript: A. Shoemark, B. Rubbo, J.S. Lucas, H.M. Mitchison and C. Hogg. Revised the manuscript: J.S. Lucas, C. Hogg, H.M. Mitchison, A. Shoemark, B. Rubbo, N.S. Thomas, M. Legendre, M.R. Fassad, W.T. Walker, S.B. Carr, I. Honoré and E. Escudier. All authors read and approved the final manuscript. H.M. Michison (h.mitchison@ucl.ac.uk) was principal investigator for the genetics aspect of the work, and J.S. Lucas (jlucas1@soton.ac.uk) was clinical and epidemiological principal investigator. H.M. Mitchison and J.S. Lucas had full access to all data and take final responsibility for the decision to submit for publication.
Conflict of interest: B. Rubbo has nothing to disclose.
Conflict of interest: M. Legendre has nothing to disclose.
Conflict of interest: M.R. Fassad has nothing to disclose.
Conflict of interest: E.G. Haarman has nothing to disclose.
Conflict of interest: S. Best has nothing to disclose.
Conflict of interest: I.C.M. Bon has nothing to disclose.
Conflict of interest: J. Brandsma has nothing to disclose.
Conflict of interest: P-R. Burgel reports personal fees for lectures and advisory board work from AstraZeneca, Boehringer Ingelheim, Chiesi, Novartis, Teva and Vertex, as well as personal fees for lectures from GSK, Pfizer and Zambon, outside the submitted work.
Conflict of interest: G. Carlsson has nothing to disclose.
Conflict of interest: S.B. Carr reports non-financial support and other (advisory board, lecture fee, travel, steering committee) from Vertex Pharmaceuticals, other (advisory board) from Profile Pharmaceuticals and other (lectures) from Teva Pharmaceuticals, as well as non-financial support and other (advisory board, travel, accommodation) from Chiesi Pharmaceuticals, outside the submitted work.
Conflict of interest: M. Carroll has nothing to disclose.
Conflict of interest: M. Edwards has nothing to disclose.
Conflict of interest: E. Escudier has nothing to disclose.
Conflict of interest: I. Honoré has nothing to disclose.
Conflict of interest: D. Hunt has nothing to disclose.
Conflict of interest: G. Jouvion has nothing to disclose.
Conflict of interest: M.R. Loebinger reports personal fees for advisory board work and consultancy from AstraZeneca, Insmed, Polyphor, Bayer and Griffols, outside the submitted work.
Conflict of interest: B. Maitre has nothing to disclose.
Conflict of interest: D. Morris-Rosendahl has nothing to disclose.
Conflict of interest: J-F. Papon has nothing to disclose.
Conflict of interest: C.M. Parsons has nothing to disclose.
Conflict of interest: M.P. Patel has nothing to disclose.
Conflict of interest: N.S. Thomas has nothing to disclose.
Conflict of interest: G. Thouvenin has nothing to disclose.
Conflict of interest: W.T. Walker has nothing to disclose.
Conflict of interest: R. Wilson has nothing to disclose.
Conflict of interest: C. Hogg has nothing to disclose.
Conflict of interest: H.M. Mitchison has nothing to disclose.
Conflict of interest: J.S. Lucas has nothing to disclose.
Conflict of interest: A. Shoemark has nothing to disclose.
Support statement: The PCD Centres in Southampton and London, the Wessex Regional Genetics Laboratory and Wessex Clinical Genetics Service are funded by the National Health Service for England (NHSE). Clinical research in Southampton was supported by the National Institute for Health Research (NIHR) Southampton Respiratory Biomedical Research Centre (BRC) and NIHR Southampton Wellcome Trust Clinical Research Facility. H.M. Mitchison acknowledges support from Action Medical Research, the Great Ormond Street Children's Charity and the NIHR Great Ormond Street Hospital (GOSH) BRC. M.R. Fassad was also supported by the NIHR GOSH BRC and a PhD studentship from the British Council Newton–Mosharafa Fund and the Ministry of Higher Education in Egypt. In France, this work was supported by the Institut National de la Santé et de la Recherche Médicale (INSERM), the RaDiCo funded by the French National Research Agency under the specific programme “Investments for the Future” (cohort grant agreement ANR-10-COHO-0003) and the legs Poix grant from La Chancellerie des Universités of the Sorbonne Universités de Paris. The funders had no role in the writing of the manuscript or the decision to submit it for publication. No payment was received to write this article.
- Received June 16, 2020.
- Accepted December 24, 2020.
- Copyright ©The authors 2021. For reproduction rights and permissions contact permissions{at}ersnet.org