TY - JOUR T1 - Late Breaking Abstract - External validation of exhaled-breath analysis to detect non-small cell lung cancer: a step-wise design to simultaneously develop and validate a prediction model JF - European Respiratory Journal JO - Eur Respir J DO - 10.1183/13993003.congress-2020.4166 VL - 56 IS - suppl 64 SP - 4166 AU - Sharina Kort AU - Marjolein Brusse-Keizer AU - Jan-Willem Gerritsen AU - Hugo Schouwink AU - Emanuel Citgez AU - Frans De Jongh AU - Daiana Stolz AU - Michel Van Den Heuvel AU - Wouter Van Geffen AU - John Van Putten AU - Lisanne Kastelijn AU - Ben Van Den Borne AU - Job Van Der Palen Y1 - 2020/09/07 UR - http://erj.ersjournals.com/content/56/suppl_64/4166.abstract N2 - Introduction: Exhaled-breath analysis of volatile organic compounds (VOCs) has shown the potential to detect lung cancer. Reproducibility of prediction models, especially based on artificial intelligence (AI) algorithms is essential. However, external validation is often lacking as this is time-consuming and meanwhile improved AI models often outperform the “older” model, based on a training set. We aim to simultaneously validate and improve a training model to distinguish non-small cell lung cancer (NSCLC) patients from healthy controls based on AI algorithms.Methods: We obtained exhaled-breath data of > 800 subjects. This new cohort will be used to externally validate our original prediction model to distinguish between NSCLC patients and healthy controls (N=290, AUC-ROC 0.76). In a step-wise design, a set of 50 subjects will be first predicted by the original model, whereupon these data are added to the unblinded data, and a new prediction model will be created based on an increased sample size. This will be repeated 6 times. The remaining 500 subjects will be used to validate the final extended model. Performance will be assessed by Area under the Curve.Results: Despite finishing the inclusions, due to the COVID pandemic, we have not yet been able to validate data of all included subjects in all 7 centres. This will be done before September 2020.Conclusion: We propose a design to simultaneously externally validate an original prediction model based on exhaled-breath data to distinguish NSCLC patients from healthy controls and develop new prediction models based on improved AI techniques.FootnotesCite this article as: European Respiratory Journal 2020; 56: Suppl. 64, 4166.This abstract was presented at the 2020 ERS International Congress, in session “Respiratory viruses in the "pre COVID-19" era”.This is an ERS International Congress abstract. No full-text version is available. Further material to accompany this abstract may be available at www.ers-education.org (ERS member access only). ER -