Cost-Effectiveness of Artificial Intelligence Support in Computed Tomography-Based Lung Cancer Screening

Ziegelmayer, Sebastian; Graf, Markus; Makowski, Marcus; Gawlitza, Joshua; Gassert, Felix

doi:10.3390/cancers14071729

Open AccessArticle

Cost-Effectiveness of Artificial Intelligence Support in Computed Tomography-Based Lung Cancer Screening

by

Sebastian Ziegelmayer

^*,†,

Markus Graf

^†,

Marcus Makowski

,

Joshua Gawlitza

^†

and

Felix Gassert

^†

Institute of Diagnostic and Interventional Radiology, School of Medicine, Klinikum Rechts der Isar, Technical University Munich, Ismaninger Straße 22, 81675 Munich, Germany

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Cancers 2022, 14(7), 1729; https://doi.org/10.3390/cancers14071729

Submission received: 1 March 2022 / Revised: 23 March 2022 / Accepted: 23 March 2022 / Published: 29 March 2022

(This article belongs to the Topic Artificial Intelligence in Cancer Diagnosis and Therapy)

Download

Browse Figures

Versions Notes

Simple Summary

Lung cancer screening with low-dose CT (LDCT) has been shown to significantly reduce cancer-related mortality and is recommended by the United States Preventive Services Task Force (USPSTF). With pending recommendation in Europe and millions of patients enrolling in the program, deep learning algorithms could reduce the number of false positive and negative findings. Therefore, we evaluated the cost-effectiveness of using an AI algorithm for the initial screening scan using a Markov simulation. We found that AI support at initial screening is a cost-effective strategy up to a cost of USD 1240 per patient screening, given a willingness-to-pay of USD 100,000 per quality-adjusted life years (QALYs).

Abstract

Background: Lung cancer screening is already implemented in the USA and strongly recommended by European Radiological and Thoracic societies as well. Upon implementation, the total number of thoracic computed tomographies (CT) is likely to rise significantly. As shown in previous studies, modern artificial intelligence-based algorithms are on-par or even exceed radiologist’s performance in lung nodule detection and classification. Therefore, the aim of this study was to evaluate the cost-effectiveness of an AI-based system in the context of baseline lung cancer screening. Methods: In this retrospective study, a decision model based on Markov simulation was developed to estimate the quality-adjusted life-years (QALYs) and lifetime costs of the diagnostic modalities. Literature research was performed to determine model input parameters. Model uncertainty and possible costs of the AI-system were assessed using deterministic and probabilistic sensitivity analysis. Results: In the base case scenario CT + AI resulted in a negative incremental cost-effectiveness ratio (ICER) as compared to CT only, showing lower costs and higher effectiveness. Threshold analysis showed that the ICER remained negative up to a threshold of USD 68 for the AI support. The willingness-to-pay of USD 100,000 was crossed at a value of USD 1240. Deterministic and probabilistic sensitivity analysis showed model robustness for varying input parameters. Conclusion: Based on our results, the use of an AI-based system in the initial low-dose CT scan of lung cancer screening is a feasible diagnostic strategy from a cost-effectiveness perspective.

Keywords:

lung cancer screening; deep learning; cost-effectiveness analysis; AI-support system

1. Introduction

Based on the findings of the national lung screening trial (NLST), in 2014 the United States Preventive Service task force recommended the annual lung cancer screening of patients between 55 and 80 years with 20 pack years of smoking history [1,2]. In contrast to the high and further increasing incidence of lung cancer globally, the incidence of lung cancer was relatively low in the NLST. Nonetheless, the NLST was able to show a significant reduction in lung cancer related mortality due to the annual screening with low-dose computed tomography (CT). Consequently, a European Position Statement followed in 2017, strongly recommending the CT-based lung cancer screening as well [3]. This recommendation is further supported by the Dutch-Belgian lung-cancer screening trial (Nederlands-Leuvens Longkanker Screenings Onderzoek (NELSON)), which also showed a significant reduction in lung cancer mortality for high-risk patients who participated in the screening [4]. With several ongoing pilot projects in Europe, the widespread introduction of lung cancer screening seems to be only a matter of time.

Nevertheless, the benefits of lung cancer screening are limited by false negative and false positive findings, which not only result in high costs but also affect clinical outcome and quality of life [2,5,6]. Currently, low dose CT-scans in the screening setting are evaluated based on standardized systems like Lung-RADS (Lung imaging reporting and data system), which improve the diagnostic accuracy for radiologists and reduces costs by decreasing the need for further diagnostic tests [7,8]. Even after a recent revision of the reporting system, observer variability will remain a relevant limitation [9,10].

The rapid development of artificial intelligence (AI) in the medical field has shown promising results for cancer screening and recent AI-models may achieve or exceed the diagnostic performance of sub-specialized experts, for example in breast cancer screening [11]. While long-standing CAD (computer aided diagnosis/detection) systems show mixed results for lung cancer detection [12,13,14], novel neural networks, convolutional neural networks (CNN) in particular, seem to have a positive effect on the diagnostic performance of radiologists [15]. Ardila et al. showed that a 3D-CNN outperformed radiologists in low-dose CT screening scans when no prior scans were available, indicating a favorable benefit for screening initiation.

Among other constraints, the health economic impact of AI systems is an important factor in the decision to implement models in routine clinical practice. Despite the imminent deployment of lung cancer screening and the promising results of AI-systems, no study has been performed to evaluate the utilization of neural networks in lung cancer screening compared to the stand-alone low dose CT-scan from an economic point of view. Therefore, the aim of our study was to evaluate the cost effectiveness of an AI-system for the initial scan of annual lung cancer screening and present the first results on identifying a cost margin for a clinical integration.

2. Materials and Methods

2.1. Model Structure

A decision model including the diagnostic strategies of conventional CT and CT augmented by AI was created and used as a decision tree, as shown in Figure 1.

For calculation of costs and benefits in the different iterations a Markov transition state model was created. The model included the stages:

No BC (patients without BC = true negative);
No BC, Suspicious nodule (patients without BC but suspicious nodule = false positive);
BC undetected (patients with undetected BC = false negative);
BC after resection (patients with BC after resection);
BC palliative (patients with BC which is unresectable/palliative);
Dead.

Additionally, for better simulation and understanding of the model, the states “BC delayed detection” and “BC early detection” were created, which only served for transition. The Markov model reflects the different states a patient can be assigned to. Taking into account transition probabilities between the states as well as costs and effectiveness (displayed in Quality of Life) in those states during several iterations, cumulative costs and cumulative effectiveness within a defined time horizon can be calculated by adding those up throughout the iterations.

Analysis of the model was performed using a dedicated decision analysis software (TreeAge Pro Version 19.1.1, Williamstown, MA, USA).

2.2. Input Parameters

There was no requirement for an ethical approval for this analysis based on commonly available data. Model input parameters were based on current literature. Age-specific risk of death was derived from the US life tables [16]. Age at the diagnostic procedure was set to 60 years and willingness-to-pay was set to USD 100,000 per quality adjusted life year (QALY) at a discount rate of 3%, as reported previously [17,18]. The discount rate reflects the loss in economic value or effectiveness when there is a delay in realizing a benefit or incurring costs. The pre-test probability of BC was set to 2.635% for the risk group consisting of female and male smokers risk for an interval of 30 years, according to published data from Jacob et al. [19]. All input parameters and corresponding references are listed in Table 1.

2.3. Diagnostic Test Performances

Sensitivity and specificity values for CT detection of BC with and without AI were derived from the literature (Table 1).

2.4. Costs

From a United States (US) healthcare perspective, costs were estimated based on Medicare data and available literature (Table 1). The long-term costs of the follow up in case of false positive was estimated at USD 2256 including the costs for a follow up CT examination and a possible bronchoscopy and biopsy [21]. The resection costs of BC were set to USD 36,305, according to Cowper et al. [22]. annual costs of palliative BC patients were estimated at USD 60,000 [21].

2.5. Utilities

Utility is measured in the additional quality-adjusted life years (QALY) which are gained through each diagnostic procedure. According to previous studies, quality of life (QOL) for curative BC patients was set to 0.79 for the first year after resection and 0.933 for the following years [24,25]. In accordance with the literature, QOL for palliative BC patients was set to 0.63 [26]. These values were then used for calculations in a Markov model specifically designed as mentioned above.

2.6. Transition Probabilities

Transition probabilities were derived from a systematic review of the recent literature and are shown in Table 1. Probability of successful resection of (early) detected BC was estimated at 75%, according to the national lung screening trial research team [2]. Risk of secondary occurrence of cancer/metastases after resection of the primary tumor was assumed to be 9.80% [29]. Annual mortality rate of curative patients was set to 4.7% and to 36.0% for palliative patients [28,32,33].

2.7. Cost-Effectiveness Analysis

The cost-effectiveness analysis was performed based on Markov simulations with a run time of 20 years (20 iterations) after initial diagnostic procedure. The discount rate was set to 3.0% and willingness-to-pay was set to USD 100,000 per QALY according to current recommendations [18].

In the base-case scenario, cost-effectiveness was determined with costs of CT + AI identical to costs of CT only, meaning costs of USD 0 for additional use of AI. Based on these results, maximum costs for AI were calculated for several willingness-to-pay thresholds. For evaluation of model uncertainty and influence of alteration of each variable on the model, a deterministic sensitivity analysis was performed. Results were visualized in a tornado diagram.

Based on the Markov model, Monte-Carlo simulations were used to perform a probabilistic sensitivity analysis with a total of 30,000 iterations. This method is used to account for the variation of input-parameters among different individuals.

3. Results

3.1. Cost-Effectiveness Analysis

Simulations of a time horizon of 20 years resulted in average cumulative costs of USD 4310.82 for CT + AI and USD 4378.44 for CT if additional diagnostic costs for the use of AI were set to USD 0 in the base case scenario. In this scenario, average cumulative effectiveness was at 13.76 QALYs for CT + AI and at 13.75 QALYs for CT. To better understand the impact of input parameters on the model, costs and effectiveness as well as distribution of the different outcomes are shown in Figure 2. Different overall costs and effectiveness derive from different distribution of the outcomes “true positive”, “false negative”, “true negative”, and “false positive” based on different sensitivity and specificity of the two methods. The incremental cost-effectiveness ratio in the base case scenario was negative, meaning both, lower cost and higher effectiveness for CT + AI.

3.2. Sensitivity Analysis

Probabilistic sensitivity analysis and Monte Carlo simulation was performed to determine the distribution of the resulting ICER-values and is visualized in Figure 3. Monte Carlo simulation reflects the difference between costs (=incremental costs) and effectiveness (=incremental effectiveness) for a certain amount of notional scenarios/iterations. All iterations with an ICER-value below the willingness-to-pay of USD 100,000 per QALY were considered cost-effective.

Deterministic sensitivity analysis was performed to account for variability of input parameters in the base case scenario. Results are displayed as a tornado diagram in Figure 4A.

Applying wide ranges of variation for the different input parameters, ICER stayed below USD 0/QALY for the sensitivities of the diagnostic modalities and the probabilities of resectability in early and delayed diagnosis. Although ICER turned positive when varying the specificity of CT and CT + AI, the willingness-to-pay threshold of USD 100,000/QALY was not crossed in any of the cases.

3.3. Threshold Analysis

To determine the maximum possible costs for the use of AI at a willingness-to-pay of USD 100,000/QALY, a threshold analysis was performed. As shown in Figure 5, ICER remained negative until costs of AI were raised to USD 68.

Raising costs of AI further, the assumed willingness-to-pay threshold of USD 100,000/QALY is only crossed at a value USD 1240. Influence in different input parameters in this second base case scenario setting costs of AI to USD 1240 are shown in Figure 4B. To account for possible variation of the willingness-to-pay, Table 2 displays possible costs for AI depending on different willingness-to-pay thresholds. Due to the cost’s dependency on the ICER, the cost for AI directly is further influenced by the systems performance, resulting in a higher price for a better system due to the increased ICER.

4. Discussion

The widespread integration of lung cancer screening is proving to be a complex and challenging undertaking. Nevertheless, lung cancer screening is a cost-effective method to reduce lung cancer mortality. AI-models for cancer detection and classification have proved to be of benefit in lung cancer screening in several studies [15,34].

In the present study, we show that a state-of-the-art AI-model (3D-convolutional neural network according to Ardila et al.) is a cost-effective method for the baseline screening scan [15]. Despite promising results of AI in the health care sector, studies evaluating the economic impact and cost effectiveness remain sparse [35]. To our knowledge, no study has been conducted to investigate the cost-effectiveness of an AI-system in lung cancer screening. Based on the superior performance of the AI-model without prior imaging, we simulated an implementation for the initial screening scan using input parameters derived from published screening cohorts [2,15,36,37], to ensure comparability to the standard screening setting.

Our base case estimate for screening with an AI system compared to current low-dose CT screening yielded a negative ICER up to costs of USD 68 for the AI system, indicating that using an AI system in the screening setting results in lower cost and higher effectiveness up to these costs per patient scan. Furthermore, the ICER remained below the applied willingness-to-pay up to costs of USD 1240. To account for variations in input parameters, we performed a deterministic sensitivity analysis for the base case scenario and the maximum cost-effective costs (USD 1240). The specificity of the diagnostic strategy had the greatest influence for both scenarios, due to the low lung cancer rate in screening cohorts. For the base case scenario all input variations resulted in an ICER below the willingness-to-pay by a large margin, indicating robust cost-effectiveness. Adding AI support showed a reduced number of false-positives and an increased number of true negatives in our simulation. In particular, the reduction of false-positives highly impacts the value of a screening method, as not only costs in the form of unnecessary follow-up examination and possibly further, partly invasive examinations are reduced, but also patients do not have to experience the psychological distress of a possible cancer diagnosis [38]. Additionally, the false positive rates and the frequency of invasive diagnostic procedures were more frequent at the baseline CT, ranging from 7.9% to 49.3% for the false positive rate and 3.7% for additional invasive procedures [2,39], further emphasizing the benefit of AI support for the initial screening. As shown by Audelan et al., the sensitivity and specificity of AI in lung cancer screening can further be improved, consequently allowing for an additional reduction of costs and increased effectiveness [40].

Despite promising results, our study underlies several limitations. First, the cost-effectiveness was only evaluated for the initial scan in the lung cancer screening. This is due to published literature, focusing on the superiority of AI lung nodule detection and classification in initial CT of the thorax without prior imaging for comparison. According to Ardila et al., deep-learning algorithms are superior to radiologists in lung cancer screening detection, when no prior imaging is available for comparison, but is on-par as soon as previous examinations are available for the reader. Consequently, further research has to be conducted to evaluate the cost-effectiveness of AI-based computer-aided diagnosis systems in longitudinal screening, beyond the initial scan [15]. Further, our evaluation is focused on the sole AI system performance in comparison to the human reader—the radiologist. However, several studies have shown promising results for the collaboration of both, often referred to as the “Centaur model” [33]. Such systems were shown not only to be beneficial in patient care but cost-effective as well [41]. Despite dealing with different challenges compared to lung cancer, for thyroid nodule detection, AI systems outperform thyroid cancer specialized radiologists in nodule classification, but the combination of specialized radiologists with AI-support showed an even higher specificity and positive predictive value when compared to the AI system alone [42]. Therefore, further research is needed to evaluate the combination of AI models and specialized thorax radiologists in lung cancer detection and its cost-effectiveness. Lastly, cost-effectiveness analysis with decision-based models is highly dependent on the input parameters, while deterministic sensitivity analysis may incorporate parameter variation to a certain degree, and recommendations for each individual case cannot be derived from the model.

5. Conclusions

To conclude, in our study we show that screening with an AI-model in the initial screening scan is a cost-effective strategy in low-dose CT lung cancer screening with robustness to variation of input parameters. Defining thresholds for cost of AI results might help faster translate AI systems into clinical use.

Author Contributions

Conceptualization, S.Z. and F.G.; methodology, F.G. and J.G.; validation, M.G., S.Z.; formal analysis, F.G.; investigation, S.Z., M.G. and J.G.; resources, M.M.; data curation M.G.; writing—original draft preparation, S.Z. and J.G.; writing—review and editing, M.G., F.G. and M.M.; visualization, F.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Ethical review and approval were waived for this study due to this analysis is based on commonly available data.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data that support the findings of this study are listed in Table 1.

Conflicts of Interest

The authors declare no conflict of interest.

References

Moyer Virginia, A. On behalf of the us preventive services task force screening for lung cancer: Us preventive services task force recommendation statement. Ann. Intern. Med. 2014, 160, 330–338. [Google Scholar]
National Lung Screening Trial Research Team. Reduced lung-cancer mortality with low-dose computed tomographic screening. N. Engl. J. Med. 2011, 365, 395–409. [Google Scholar] [CrossRef] [PubMed]
Oudkerk, M.; Devaraj, A.; Vliegenthart, R.; Henzler, T.; Prosch, H.; Heussel, C.P.; Bastarrika, G.; Sverzellati, N.; Mascalchi, M.; Delorme, S. European position statement on lung cancer screening. Lancet Oncol. 2017, 18, e754–e766. [Google Scholar] [PubMed]
De Koning, H.J.; van der Aalst, C.M.; de Jong, P.A.; Scholten, E.T.; Nackaerts, K.; Heuvelmans, M.A.; Lammers, J.-W.J.; Weenink, C.; Yousaf-Khan, U.; Horeweg, N. Reduced lung-cancer mortality with volume ct screening in a randomized trial. N. Engl. J. Med. 2020, 382, 503–513. [Google Scholar] [CrossRef] [PubMed]
Rasmussen, J.F.; Siersma, V.; Pedersen, J.H.; Heleno, B.; Saghir, Z.; Brodersen, J. Healthcare costs in the danish randomised controlled lung cancer ct-screening trial: A registry study. Lung Cancer 2014, 83, 347–355. [Google Scholar] [CrossRef]
Wiener, R.S.; Schwartz, L.M.; Woloshin, S.; Welch, H.G. Population-based risk for complications after transthoracic needle lung biopsy of a pulmonary nodule: An analysis of discharge records. Ann. Intern. Med. 2011, 155, 137–144. [Google Scholar] [CrossRef]
Kastner, J.; Hossain, R.; Jeudy, J.; Dako, F.; Mehta, V.; Dalal, S.; Dharaiya, E.; White, C. Lung-rads version 1.0 versus lung-rads version 1.1: Comparison of categories using nodules from the national lung screening trial. Radiology 2021, 300, 203704. [Google Scholar]
McKee, B.J.; Regis, S.M.; McKee, A.B.; Flacke, S.; Wald, C. Performance of acr lung-rads in a clinical ct lung screening program. J. Am. Coll. Radiol. 2016, 13, R25–R29. [Google Scholar] [CrossRef]
Mehta, H.J.; Mohammed, T.-L.; Jantz, M.A. The american college of radiology lung imaging reporting and data system: Potential drawbacks and need for revision. Chest 2017, 151, 539–543. [Google Scholar] [CrossRef]
Singh, S.; Pinsky, P.; Fineberg, N.S.; Gierada, D.S.; Garg, K.; Sun, Y.; Nath, P.H. Evaluation of reader variability in the interpretation of follow-up ct scans at lung cancer screening. Radiology 2011, 259, 263–270. [Google Scholar] [CrossRef]
McKinney, S.M.; Sieniek, M.; Godbole, V.; Godwin, J.; Antropova, N.; Ashrafian, H.; Back, T.; Chesus, M.; Corrado, G.S.; Darzi, A. International evaluation of an ai system for breast cancer screening. Nature 2020, 577, 89–94. [Google Scholar] [CrossRef] [PubMed]
Brown, M.S.; Goldin, J.G.; Rogers, S.; Kim, H.J.; Suh, R.D.; McNitt-Gray, M.F.; Shah, S.K.; Truong, D.; Brown, K.; Sayre, J.W. Computer-aided lung nodule detection in ct: Results of large-scale observer test1. Acad. Radiol. 2005, 12, 681–686. [Google Scholar] [CrossRef] [PubMed]
De Hoop, B.; de Boo, D.W.; Gietema, H.A.; van Hoorn, F.; Mearadji, B.; Schijf, L.; van Ginneken, B.; Prokop, M.; Schaefer-Prokop, C. Computer-aided detection of lung cancer on chest radiographs: Effect on observer performance. Radiology 2010, 257, 532–540. [Google Scholar] [CrossRef] [PubMed]
Jeon, K.N.; Goo, J.M.; Lee, C.H.; Lee, Y.; Choo, J.Y.; Lee, N.K.; Shim, M.-S.; Lee, I.S.; Kim, K.G.; Gierada, D.S. Computer-aided nodule detection and volumetry to reduce variability between radiologists in the interpretation of lung nodules at low-dose screening ct. Investig. Radiol. 2012, 47, 457. [Google Scholar] [CrossRef]
Ardila, D.; Kiraly, A.P.; Bharadwaj, S.; Choi, B.; Reicher, J.J.; Peng, L.; Tse, D.; Etemadi, M.; Ye, W.; Corrado, G. End-to-end lung cancer screening with three-dimensional deep learning on low-dose chest computed tomography. Nat. Med. 2019, 25, 954–961. [Google Scholar] [CrossRef]
Arias, E.; Xu, J.; Kochanek, K.D. United states life tables, 2016. Natl. Vital Stat. Rep. 2019, 68, 4. [Google Scholar]
Cameron, D.; Ubels, J. Norstr öm f: On what basis are medical cost-effectiveness thresholds set. Clashing Opin. Absence Data A Syst. Rev. Glob. Health Action 2018, 11, 1447828. [Google Scholar] [CrossRef]
Sanders, G.D.; Neumann, P.J.; Basu, A.; Brock, D.W.; Feeny, D.; Krahn, M.; Kuntz, K.M.; Meltzer, D.O.; Owens, D.K.; Prosser, L.A. Recommendations for conduct, methodological practices, and reporting of cost-effectiveness analyses: Second panel on cost-effectiveness in health and medicine. JAMA 2016, 316, 1093–1103. [Google Scholar] [CrossRef]
Jacob, L.; Freyn, M.; Kalder, M.; Dinas, K.; Kostev, K. Impact of tobacco smoking on the risk of developing 25 different cancers in the uk: A retrospective study of 422,010 patients followed for up to 30 years. Oncotarget 2018, 9, 17420. [Google Scholar] [CrossRef]
Procedure Price Lookup for Outpatient Services. Medicare.gov 71275. 2021. Available online: https://www.medicare.gov/procedure-price-lookup/cost/71275/ (accessed on 9 January 2022).
Ten Haaf, K.; Tammemägi, M.C.; Bondy, S.J.; van der Aalst, C.M.; Gu, S.; McGregor, S.E.; Nicholas, G.; de Koning, H.J.; Paszat, L.F. Performance and cost-effectiveness of computed tomography lung cancer screening scenarios in a population-based setting: A microsimulation modeling analysis in Ontario, Canada. PLoS Med. 2017, 14, e1002225. [Google Scholar] [CrossRef]
Cowper, P.A.; Feng, L.; Kosinski, A.S.; Tong, B.C.; Habib, R.H.; Putnam, J.B., Jr.; Onaitis, M.W.; Furnary, A.P.; Wright, C.D.; Jacobs, J.P. Initial and longitudinal cost of surgical resection for lung cancer. Ann. Thorac. Surg. 2021, 111, 1827–1833. [Google Scholar] [CrossRef] [PubMed]
Gareen, I.F.; Duan, F.; Greco, E.M.; Snyder, B.S.; Boiselle, P.M.; Park, E.R.; Fryback, D.; Gatsonis, C. Impact of lung cancer screening results on participant health-related quality of life and state anxiety in the national lung screening trial. Cancer 2014, 120, 3401–3409. [Google Scholar] [CrossRef] [PubMed]
Grutters, J.P.; Joore, M.A.; Wiegman, E.M.; Langendijk, J.A.; de Ruysscher, D.; Hochstenbag, M.; Botterweck, A.; Lambin, P.; Pijls-Johannesma, M. Health-related quality of life in patients surviving non-small cell lung cancer. Thorax 2010, 65, 903–907. [Google Scholar] [CrossRef]
Möller, A.; Sartipy, U. Long-term health-related quality of life following surgery for lung cancer. Eur. J. Cardio-Thorac. Surg. 2012, 41, 362–367. [Google Scholar] [CrossRef] [PubMed]
Doyle, S.; Lloyd, A.; Walker, M. Health state utility scores in advanced non-small cell lung cancer. Lung Cancer 2008, 62, 374–380. [Google Scholar] [CrossRef] [PubMed]
Green, A.; Hauge, J.; Iachina, M.; Jakobsen, E. The mortality after surgery in primary lung cancer: Results from the danish lung cancer registry. Eur. J. Cardio-Thorac. Surg. 2016, 49, 589–594. [Google Scholar] [CrossRef]
Toker, A.; Dilege, S.; Ziyade, S.; Eroglu, O.; Tanju, S.; Yilmazbayhan, D.; Kilicarslan, Z.; Kalayci, G. Causes of death within 1 year of resection for lung cancer. Early mortality after resection. Eur. J. Cardio-Thorac. Surg. 2004, 25, 515–519. [Google Scholar] [CrossRef][Green Version]
Lou, F.; Huang, J.; Sima, C.S.; Dycoco, J.; Rusch, V.; Bach, P.B. Patterns of recurrence and second primary lung cancer in early-stage lung cancer survivors followed with routine computed tomography surveillance. J. Thorac. Cardiovasc. Surg. 2013, 145, 75–82. [Google Scholar] [CrossRef] [PubMed]
Scholten, E.T.; Horeweg, N.; de Koning, H.J.; Vliegenthart, R.; Oudkerk, M.; Willem, P.T.M.; de Jong, P.A. Computed tomographic characteristics of interval and post screen carcinomas in lung cancer screening. Eur. Radiol. 2015, 25, 81–88. [Google Scholar] [CrossRef]
Thorsteinsson, H.; Alexandersson, A.; Oskarsdottir, G.N.; Skuladottir, R.; Isaksson, H.J.; Jonsson, S.; Gudbjartsson, T. Resection rate and outcome of pulmonary resections for non–small-cell lung cancer: A nationwide study from iceland. J. Thorac. Oncol. 2012, 7, 1164–1169. [Google Scholar] [CrossRef]
Cancer Stat Facts: Lung and Bronchus Cancer. 2021. Available online: https://seer.cancer.gov/statfacts/html/lungb.html (accessed on 9 January 2022).
Goldstein, I.M.; Lawrence, J.; Miner, A.S. Human-machine collaboration in cancer and beyond: The centaur care model. JAMA Oncol. 2017, 3, 1303–1304. [Google Scholar] [CrossRef] [PubMed]
Liang, M.; Tang, W.; Xu, D.M.; Jirapatnakul, A.C.; Reeves, A.P.; Henschke, C.I.; Yankelevitz, D. Low-dose ct screening for lung cancer: Computer-aided detection of missed lung cancers. Radiology 2016, 281, 279–288. [Google Scholar] [CrossRef] [PubMed]
Wolff, J.; Pauling, J.; Keck, A.; Baumbach, J. Systematic review of economic impact studies of artificial intelligence in health care. J. Med. Internet Res. 2020, 22, e16866. [Google Scholar] [CrossRef] [PubMed]
Pastorino, U.; Rossi, M.; Rosato, V.; Marchianò, A.; Sverzellati, N.; Morosi, C.; Fabbri, A.; Galeone, C.; Negri, E.; Sozzi, G. Annual or biennial ct screening versus observation in heavy smokers: 5-year results of the mild trial. Eur. J. Cancer Prev. 2012, 21, 308–315. [Google Scholar] [CrossRef]
Van Klaveren, R.J.; Oudkerk, M.; Prokop, M.; Scholten, E.T.; Nackaerts, K.; Vernhout, R.; van Iersel, C.A.; van den Bergh, K.A.; Westeinde, S.V.; van der Aalst, C. Management of lung nodules detected by volume ct scanning. N. Engl. J. Med. 2009, 361, 2221–2229. [Google Scholar] [CrossRef]
Van den Bergh, K.A.; Essink-Bot, M.-L.; Borsboom, G.J.; Scholten, E.T.; Prokop, M.; de Koning, H.J.; van Klaveren, R.J. Short-term health-related quality of life consequences in a lung cancer ct screening trial (nelson). Br. J. Cancer 2010, 102, 27–34. [Google Scholar] [CrossRef]
Jonas, D.E.; Reuland, D.S.; Reddy, S.M.; Nagle, M.; Clark, S.D.; Weber, R.P.; Enyioha, C.; Malo, T.L.; Brenner, A.T.; Armstrong, C. Screening for lung cancer with low-dose computed tomography: Updated evidence report and systematic review for the us preventive services task force. JAMA 2021, 325, 971–987. [Google Scholar] [CrossRef]
Audelan, B.; Lopez, S.; Fillard, P.; Diascorn, Y.; Padovani, B.; Delingette, H. Validation of Lung Nodule Detection a Year before Diagnosis in Nlst Dataset Based on a Deep Learning System; European Respiratory Society: Lausanne, Switzerland, 2021. [Google Scholar]
Hoverman, J.R.; Klein, I.; Harrison, D.W.; Hayes, J.E.; Garey, J.S.; Harrell, R.; Sipala, M.; Houldin, S.; Jameson, M.D.; Abdullahpour, M. Opening the black box: The impact of an oncology management program consisting of level i pathways and an outbound nurse call system. J. Oncol. Pract. 2014, 10, 63–67. [Google Scholar] [CrossRef]
Peng, S.; Liu, Y.; Lv, W.; Liu, L.; Zhou, Q.; Yang, H.; Ren, J.; Liu, G.; Wang, X.; Zhang, X. Deep learning-based artificial intelligence model to assist thyroid nodule diagnosis and management: A multicentre diagnostic study. Lancet Digit. Health 2021, 3, e250–e259. [Google Scholar] [CrossRef]

Figure 1. Markov model with possible states of disease and transition probabilities between states. BC = bronchial cancer; LT = life tables.

Figure 2. Roll-back of the economic model showing costs and effectiveness of the different outcomes. Distributions leading to overall costs and effectiveness are different for CT and CT + AI depending on sensitivity and specificity of the two methods and indicated as probabilities. BC = bronchial cancer; CT = computed tomography; TP = true positive; TN = true negative; FP = false positive; FN = false negative; Prob = probability.

Figure 3. Probabilistic sensitivity analysis utilizing Monte-Carlo simulations (30,000 iterations). Incremental cost-effectiveness scatter plot for CT + AI vs. CT. iterations with an ICER-value below the willingness-to-pay of USD 100,000 per QALY are shown as green crosses. WTP = willingness-to-pay.

Figure 4. (A) Tornado diagram showing the impact of input parameters on incremental cost-effectiveness ratio (ICER) in the base case scenario. Assuming a willingness-to-pay threshold of USD 100,000 per QALY, CT + AI remained cost-effective in all cases. (B) Tornado diagram showing the impact of input parameters on incremental cost-effectiveness ratio (ICER) when costs of AI were set to USD 1240 with an expected value of USD 100,000 per QALY. Blue bars show changes when decreasing the value of an input parameter as compared to the base case scenario and red bars when increasing the respective value. Sens = sensitivity; Spec = specificity; CT = computed tomography; AI = artificial intelligence; P = probability.

Figure 5. One-way sensitivity analysis for costs of AI (USD) and the corresponding incremental cost effectiveness ratio (ICER in USD/QALY). Thresholds indicate values at an ICER of USD 0/QALY and USD 100,000/QALY. ICER = incremental cost-effectiveness ratio; AI = artificial intelligence; QALY = quality adjusted life year.

Table 1. Input parameters.

Pre-test-Probability of BC	2.635	Jacob et al. [19]
Age at diagnostic procedure	60 years	US Preventive Services Task Force [1]
Assumed WTP	USD 100,000,00	Assumption
Discount rate	3.00%	Assumption
Markov model time	20 years	Assumption
Diagnostic Test Performances
Sensitivity for BC CT	77.9%	Ardila et al. [15]
Specificity for BC CT	87.7%	Ardila et al. [15]
Sensitivity for BC CT + AI	97.7%	Ardila et al. [15]
Specificity for BC CT + AI	98.4%	Ardila et al. [15]
Costs (Acute)
CT	USD 161.00	Medicare (71,250) [20]
Costs (Long Term)
No BC	USD 0.00
Follow up if false positive	USD 2256.00	ten Haaf et al. [21]
Curative therapy BC/resection cost	USD 36,305.00	Cowper et al. [22]
BC undetected	USD 0	Assumption
BC after resection	USD 4283.00	ten Haaf et al. [21]
Therapy BC, palliative	USD 60,000.00	ten Haaf et al. [21]
Dead	USD 0	Assumption
Utilities
No BC	1	Assumption
Follow up if false positive	0.98	Gareen et al. [23]
Curative therapy BC/resection	0.79	Grutters et al. [24]
BC undetected	1	Assumption
BC after resection	0.933	Möller et al. [25]
BC palliative	0.63	Doyle et al. [26]
Dead	0	Assumption
Transition Probabilities
Verification of suspicious nodule as no BC	100%	Assumption
Death if no BC but suspicious nodule	0.001 (invasive diagnostics) + life tables	The National Lung Screening Trial Research Team [2]
Resection rate of BC after early detection	75%	The National Lung Screening Trial Research Team [2]
Death after curative resection	4.70%	Green et al./Toker et al. [27,28]
Recurrence after resection	9.80%	Lou et al. [29]
Detection of initially undetected BC	15% 1st, 40% 2nd, 100% 3rd year	Scholten et al. [30]
Death with undetected BC	life tables
Resection rate of BC after delayed detection	26%	Hunbogi et al. [31]
Death with palliative care	36%	Cancer Stat Facts: Lung and Bronchus Cancer, National Cancer Institute [32]
Death without BC	life tables

AI = artificial intelligence; BC = bronchial cancer; CT = computed tomography; QALY = quality adjusted life year; WTP = willingness-to-pay.

Table 2. Cost of AI at different WTP-thresholds.

WTP (USD/QALY)	0	20,000	40,000	60,000	80,000	100,000	120,000	150,000	200,000
Cost of AI (USD)	68	302	537	771	1006	1240	1475	1826	2412

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Ziegelmayer, S.; Graf, M.; Makowski, M.; Gawlitza, J.; Gassert, F. Cost-Effectiveness of Artificial Intelligence Support in Computed Tomography-Based Lung Cancer Screening. Cancers 2022, 14, 1729. https://doi.org/10.3390/cancers14071729

AMA Style

Ziegelmayer S, Graf M, Makowski M, Gawlitza J, Gassert F. Cost-Effectiveness of Artificial Intelligence Support in Computed Tomography-Based Lung Cancer Screening. Cancers. 2022; 14(7):1729. https://doi.org/10.3390/cancers14071729

Chicago/Turabian Style

Ziegelmayer, Sebastian, Markus Graf, Marcus Makowski, Joshua Gawlitza, and Felix Gassert. 2022. "Cost-Effectiveness of Artificial Intelligence Support in Computed Tomography-Based Lung Cancer Screening" Cancers 14, no. 7: 1729. https://doi.org/10.3390/cancers14071729

APA Style

Ziegelmayer, S., Graf, M., Makowski, M., Gawlitza, J., & Gassert, F. (2022). Cost-Effectiveness of Artificial Intelligence Support in Computed Tomography-Based Lung Cancer Screening. Cancers, 14(7), 1729. https://doi.org/10.3390/cancers14071729

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Cost-Effectiveness of Artificial Intelligence Support in Computed Tomography-Based Lung Cancer Screening

Simple Summary

Abstract

1. Introduction

2. Materials and Methods

2.1. Model Structure

2.2. Input Parameters

2.3. Diagnostic Test Performances

2.4. Costs

2.5. Utilities

2.6. Transition Probabilities

2.7. Cost-Effectiveness Analysis

3. Results

3.1. Cost-Effectiveness Analysis

3.2. Sensitivity Analysis

3.3. Threshold Analysis

4. Discussion

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI