Next Article in Journal
Comparison of Short Uncemented Metaphyseal Stem and Long-Stem Reverse Shoulder Arthroplasty in Proximal Humerus Fractures: Preliminary Study at 2-Year Follow-Up
Previous Article in Journal
The Impact of Navigation in Lumbar Spine Surgery: A Study of Historical Aspects, Current Techniques and Future Directions
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Systematic Review

Measurement of Cardiothoracic Ratio on Chest X-rays Using Artificial Intelligence—A Systematic Review and Meta-Analysis

by
Jakub Kufel
1,*,
Łukasz Czogalik
2,3,
Michał Bielówka
2,3,
Mikołaj Magiera
2,3,
Adam Mitręga
2,3,
Piotr Dudek
2,3,
Katarzyna Bargieł-Łączek
4,
Magdalena Stencel
2,3,
Wiktoria Bartnikowska
5,
Sylwia Mielcarska
6,
Sandra Modlińska
1,
Zbigniew Nawrat
7,8,
Maciej Cebula
9 and
Katarzyna Gruszczyńska
1
1
Department of Radiology and Nuclear Medicine, Faculty of Medical Sciences in Katowice, Medical University of Silesia, Medyków 14, 40-752 Katowice, Poland
2
Students’ Scientific Association of Computer Analysis and Artificial Intelligence, Department of Radiology and Nuclear Medicine, Medical University of Silesia in Katowice, 40-752 Katowice, Poland
3
Professor Zbigniew Religa Student Scientific Association, Department of Biophysic, Faculty of Medical Sciences in Zabrze, Medical University of Silesia, Jordana 19, 41-808 Zabrze, Poland
4
Department of Diagnostic Imaging, Szpital Specjalistyczny im. Sz. Starkiewicza, 41-300 Dąbrowa Górnicza, Poland
5
Faculty of Medical Sciences in Katowice, Medical University of Silesia, 40-752 Katowice, Poland
6
Department of Medical and Molecular Biology, Faculty of Medical Sciences, Medical University of Silesia, 41-808 Zabrze, Poland
7
Foundation of Cardiac Surgery Development, 41-800 Zabrze, Poland
8
Department of Biophysics, Faculty of Medical Sciences in Zabrze, Medical University of Silesia, Jordana 19, 41-808 Zabrze, Poland
9
Individual Medical Practice Maciej Cebula, 40-754 Katowice, Poland
*
Author to whom correspondence should be addressed.
J. Clin. Med. 2024, 13(16), 4659; https://doi.org/10.3390/jcm13164659
Submission received: 26 May 2024 / Revised: 21 July 2024 / Accepted: 6 August 2024 / Published: 8 August 2024
(This article belongs to the Section Nuclear Medicine & Radiology)

Abstract

:
Background: Chest X-rays (CXRs) are pivotal in clinical diagnostics, particularly in assessing cardiomegaly through the cardiothoracic ratio (CTR). This systematic review and meta-analysis evaluate the efficacy of artificial intelligence (AI) in automating CTR determination to enhance patient care and streamline diagnostic processes. They are concentrated on comparing the performance of AI models in determining the CTR against human assessments, identifying the most effective models for potential clinical implementation. This study was registered with PROSPERO (no. CRD42023437459). No funding was received. Methods: A comprehensive search of medical databases was conducted in June 2023. The search strategy adhered to the PICO framework. Inclusion criteria encompassed original articles from the last decade focusing on AI-assisted CTR assessment from standing-position CXRs. Exclusion criteria included systematic reviews, meta-analyses, conference abstracts, paediatric studies, non-original articles, and studies using imaging techniques other than X-rays. After initial screening, 117 articles were reviewed, with 14 studies meeting the final inclusion criteria. Data extraction was performed by three independent investigators, and quality assessment followed PRISMA 2020 guidelines, using tools such as the JBI Checklist, AMSTAR 2, and CASP Diagnostic Study Checklist. Risk of bias was assessed according to the Cochrane Handbook guidelines. Results: Fourteen studies, comprising a total of 70,472 CXR images, met the inclusion criteria. Various AI models were evaluated, with differences in dataset characteristics and AI technology used. Common preprocessing techniques included resizing and normalization. The pooled AUC for cardiomegaly detection was 0.959 (95% CI 0.944–0.975). The pooled standardized mean difference for CTR measurement was 0.0353 (95% CI 0.147–0.0760). Significant heterogeneity was found between studies (I2 89.97%, p < 0.0001), with no publication bias detected. Conclusions: Standardizing methodologies is crucial to avoid interpretational errors and advance AI in medical imaging diagnostics. Uniform reporting standards are essential for the further development of AI in CTR measurement and broader medical imaging applications.

1. Introduction

Chest radiographs (X-rays) are widely used in clinical diagnostics. They are a significant part of imaging procedures worldwide due to their numerous advantages—first and foremost, their low cost, high availability, and widespread use in the imaging diagnosis of many lung and heart diseases [1,2,3]. Chest X-rays account for 40% of the 3.6 billion imaging procedures performed annually [4]. One of the important components of a chest X-ray is the cardiothoracic ratio (CTR), which is the ratio of the width of the heart silhouette (transverse cardiac diameter—TCD) to the widest dimension of the chest (transverse thoracic diameter—TTD) [5]. CTR values of 0.42–0.50 are considered normal, and values above 0.5 indicate enlargement of the cardiac silhouette—cardiomegaly, which is considered a pathology [6]. The cut-off value for a CTR may vary depending on the projection and the specific group of patients studied. According to the literature, a CTR should be determined in a posterior–anterior (PA) projection [7,8], which provides the closest representation of the true dimensions of the heart. Nevertheless, some studies report appropriate CTR values in the anteroposterior (AP) position [6], which proves important when diagnosing patients who are lying down and diagnosing young children. There are many causes of cardiomegaly, which always require additional diagnosis due to the high risk of patient deterioration and even death. The development of heart failure results in a 50% survival rate [9], and conditions involving cardiomegaly, such as myocardial infarction, coronary artery disease, cardiomyopathies, or valvular regurgitation lead precisely to heart failure [10]. Artificial intelligence (AI) has emerged as a promising tool for CTR measurement, offering potential improvements in efficiency and consistency. Despite the technical validation of AI methods, there remains a need for clinical evaluation on large datasets to ensure their reliability and effectiveness. Recent studies, such as the one by Saiviroonporn et al., have explored the utility of AI in assisting radiologists by reducing observer variation and operation time. Their findings indicate that, while AI alone may not yet be suitable for automated CTR measurement due to high variation, AI-assisted methods show promise in improving measurement accuracy and speed [11]. In addition to these advances, evaluating AI performance in medical applications requires rigorous and standardized evaluation tools. For example, the Artificial Intelligence Performance Instrument (AIPI) has been developed and tested for its reliability and validity in evaluating AI performance in clinical settings. A study by Lechien et al. evaluated the AIPI using the medical records of otolaryngology patients, comparing AI-generated and physician-generated scores for differential diagnosis, management, and treatment. The AIPI demonstrated adequate internal consistency and moderate-to-strong test–retest reliability, supporting its potential as a robust tool for assessing AI performance in medicine [12]. In this article, we will present a review and meta-analysis of studies using AI to determine the CTR. This study is concentrated on comparing the performance of AI models in determining the CTR against human assessments, identifying the most effective models for potential clinical implementation. The findings from this analysis will assess whether AI will help improve patient care during the diagnosis of cardiomegaly-related diseases, as well as evaluate the potential benefits for radiology and diagnostic imaging specialists by automating the process and speeding up their work.

2. Protocol Registration

This study was registered in PROSPERO no. CRD42023437459 [13].
Study protocol supplement: study protocol.

3. Materials and Methods

3.1. Search Strategy and Selection Criteria

In June 2023, a search of medical databases such as MEDLINE via. PubMed, Scopus, Web of Science, Embase, and Cochrane Library was conducted.
The search strategy was developed in accordance with PICO [14]. A detailed search strategy and PICO schema are presented in (Supplement S1: PICO).
Initially, a total of 173 articles were found in all databases. After applying filters, 117 articles remained. The articles were then entered into the Rayyan tool [15], and 62 duplicates were automatically detected, of which 40 were removed.
Next, 77 articles were analysed by two independent researchers. The analysis looked at titles, abstracts, and keywords. A total of 63 articles were excluded for the following reasons: different form of study from the article (n = 3), wrong study design (n = 35), paediatric population (n = 3), article had no abstract (n = 1), article was published more than 10 years ago (n = 1), and concerning computed tomography (CT) instead of CXRs (n = 19). During the review of articles, one more article was rejected because it used a database of paediatric patients.
Fourteen articles were included in the review (Supplement S2: PRISMAflow diagram). All decisions made by the investigators were recorded—Researcher 1: 62 excluded, 14 included, perhaps 1; Researcher 2: 61 excluded, 14 included, perhaps 2. Conflicts were resolved by a third independent researcher. Inter-investigator concordance was assessed using Cohen’s Kappa test [16], which was 0.87 (percent concordance: 96.15), which is interpreted as near-perfect concordance.

Inclusion and Exclusion Criteria

Inclusion criteria: original articles, articles having an abstract, and articles from the last 10 years. Concerning assessment of the cardiothoracic ratio (CTR) on CXRs in the standing position in posterior–anterior (PA) projection using AI.
Exclusion criteria: systematic reviews, meta-analyses, congress abstracts, conference abstracts, paediatric population articles, and other forms of scientific publications that are not original articles. Studies using imaging techniques other than X-rays, e.g., CT, ultrasound, MRI. All studies older than 10 years. Studies that did not use AI. Studies not involving the chest. Studies in which the CTR was not measured using AI.

3.2. Data Extraction

All articles meeting the inclusion criteria were subjected to data extraction by three independent investigators. The following information was extracted from each study:
  • General characteristics (authors, country, publication date, journal, IF, number of citations, general purpose of article, keywords);
  • Dataset (number of examinations, no. of unique patients (if available, male/female), number of CXRs allocated to training, validation, and test with percentage division);
  • Characteristics of the AI technology (model’s name, image preprocessing, number of layers, training device);
  • Algorithm performance and validation;
  • Cardiothoracic ratio (CTR) (average values);
  • Results (sensitivity, specificity, AUROC, accuracy, negative predictive value (NPV), positive predictive value (PPV));
  • Evidence generation: results (technological, clinical, economic, side effects), comparators/gold standard;
  • Ethical, legal, and social considerations.

3.3. Quality Assessment

The review article and meta-analysis were conducted in accordance with the PRISMA 2020 Statement (Supplement S3: PRISMA_2020_checklist.docx) and PRISMA 2020 for abstract (Supplement S4: PRISMA_2020_abstract_checklist.docx) [17]. The systematic review was assessed using the JBI Checklist (Supplement S5: JBI_checklist) [18], AMSTAR 2 (Supplement S6: AMSTAR2), an assessment scale for systematic reviews of health interventions [19], and the Critical Appraisal Skills Program, CASP (Supplement S7: CASP checklist), a diagnostic study checklist systematic review [20].

3.4. Risk of Bias Assessment

All articles were assessed for risk of bias according to the Cochrane Handbook guidelines for randomized and non-randomized controlled trials [21] by two independent reviewers. Discrepancies were resolved by consensus. The Cochrane risk of bias assessment tool covers bias in sample selection, study design, detection of results, data loss, and reporting of results for randomized trials. The assessment is based on three options: low risk, high risk, and unclear risk (Supplement S8: Risk of bias assessment).

3.5. Statistical Analysis

The AUC of AI models in cardiomegaly detection was compared between included studies, and the pooled measure was evaluated according to a random effect model due to the high heterogeneity of results (I2 > 50%). Additionally, mean difference in CTR measurements between AI models and reference methods were also compared and pooled using the random effect model as standardized mean difference. Q-test and I2 values were applied to assess heterogeneity between studies. I2 values were interpreted as follows: 25–50%—low heterogeneity, 50–75%—moderate heterogeneity, and above 75%—high heterogeneity. The AUC of each study and pooled AUC was shown in the forest plot. Additionally, publication bias was evaluated using a funnel plot. MedCalc (20.116) was used to perform all statistical analyses. p values ≤ 0.05 were considered significant.

3.6. Systematic Literature Review Process

The systematic literature review process is presented collectively in (Supplement S9: SLR illustration).

4. Results

4.1. Aim of This Study

The aim of this study was to compare the performance of CTR determination and segmentation of the heart and lungs by a deep learning model compared to humans [11,22,23,24,25,26,27,28]. This study was conducted to determine the potential of the model to automatically determine the CTR and detect cardiomegaly [29,30,31]. A study by Saivioonporn et al. compared the performance of four deep learning models to determine the most effective one for clinical implementation [32]. A study by Ajmera et al. focused on evaluating the effectiveness of a radiologist in diagnosing cardiomegaly with and without AI [33]. Que et al.’s study focused on demonstrating that a combination of segmentation and deep learning, using the CardioXNet algorithm proposed by the researchers, allows for an automatic diagnosis of heart disease [34]. The characteristics of the studies included in this article are presented in Table 1.

4.2. Database

In the 14 studies described by the researchers, the total number of CXR images was 70,742. Only seven studies [11,24,26,28,29,30,32] reported the number of unique patients; thus, it is not possible to determine the total number of unique patients for the entire study. The study with the smallest number of CXR images included 103 of them [34], and the largest included 16,903 [32]. When describing the datasets, the authors of four papers [22,23,28,29] provided complete information on the division of these studies into individual groups (training, validation, test). The authors of 8 papers [22,23,24,28,29,31,32,34] used preexisting datasets, of which the JSRT dataset was used most frequently (6/8 using preexisting datasets).
The full characteristics of the datasets used in this study are shown in Table 2.

4.3. Characteristics of the AI Technology

When describing the characteristics of the AI technology, the authors focused on identifying the model name and software, as well as describing the training device and training. Each researcher included information on the model name [11,22,23,24,25,26,27,28,29,30,31,32,33,34]. Only four researchers provided information on the name of the software [11,27,28,29]. Seven researchers did not describe the training device [11,24,25,27,30,33,34]; also, training was not described by seven authors [11,24,25,26,27,28,34].
All characteristics regarding the AI technology used are included in Table 3.

4.4. Preprocessing CXRs

More than half of the studies included in this review have used some form of image data preprocessing or image augmentation. Research conducted by Saiviroonporn [11], Lee [29], Chou [30], and Que [34] did not include or did not provide any information on preprocessing methods. Also, Saiviroonporn [32] and Ajmera [33] reported in their papers that only augmentation was used, without any further explanation. Among the preprocessing methods, the most often used were resizing, normalization (grey scale, brightness, and contrast), format conversion, and ROI selection. The augmentation process mostly included cropping, inverting, horizontal flip, contrast scaling, brightness shifting, noise addition, rotation, and scaling.
The complete list of methods used in the included articles is presented in Table 4.

4.5. Results

One of the authors described the sensitivity and specificity of cardiomegaly classification [11], while five other researchers described the sensitivity and specificity of CTR calculation [23,24,25,27,34]. Four authors included information on the accuracy of cardiomegaly classification [11,26,28,34], while five researchers described the accuracy of CTR measurement [23,25,27,30,33]. AURC of cardiomegaly classification was included by two investigators [11,34], as was the AUROC of CTR calculation [23,24]. Three investigators described the F1 of cardiomegaly classification [11,28,34], with one describing the F1 of CTR calculation [33]. Four investigators reported the mean error I standard deviation of CTR calculation [23,25,29,33].
The exact results obtained by the researchers are included in Table 5.

4.6. Technological, Clinical, and Economic Side Effects

The primary and most significant effect of automating the measurement of CTR mentioned by all authors is the reduction in the duration of this procedure and the increase in accuracy compared to manual measurements. This opens up the possibility to use this test on a routine screening basis. The CTR is a clinically useful indicator for experienced clinicians, so its determination is not intrinsic to every CXR analysis; the gain/loss balance of CTR determination largely depends on the specific clinical situation. The popularization of CTR determination by AI methods will provide a useful diagnostic factor for the CXRs described, without additional financial or time investment.

4.7. Comparators/Gold Standard

The studies described in this systematic review unanimously describe manual measurement as the primary and most widely used method for determining the CTR. The most obvious limitation of this method is its time-consuming nature, especially when determining on large datasets. Furthermore, the use of the CTR is highly dependent on individual expertise. A CTR > 0.55 is recognised as cardiomegaly [11]. The first studies using AI to determine the CTR have been conducted between 2018 and 2020, but nowadays, in the clinical setting, a radiologist’s assessment is required to evaluate the compatibility of the measurement with the manual method [11,32,33]. The advantage of AI methods is their time- and labour-saving capabilities [27]. CTR determination methods using AI can be divided into classification-based and segmentation-based methods. Among these, methods using segmentation are more accurate—these are a solution for providing information about the location or extent of the cardiomegaly region, rather than just a binary decision about its presence or absence. These also provide a numerical estimate of the CTR. The downside of this solution is the need to perform segmentation of the X-ray image before starting the network training process [23,35]. The accuracy of the measurement can be further confirmed by performing other high-resolution imaging studies—CMRI and CCT. One study deserves special attention because, in addition to radiologist determination of the CTR on the X-ray, it was confirmed by an ultrasound performed on the same day. The cardiac dimension component was therefore confirmed by another method to increase the reliability of the measurement. However, the AI algorithm can also reflect the inadequacies of an echocardiographic examination when provided with data from it [24].

4.8. Meta-Analysis

The pooled AUC for cardiomegaly detection was 0.959 (95% CI 0.944–0.975, (Table 5). Heterogeneity tests indicated high heterogeneity between studies; it yielded an I2 value of 83.76%, indicating significant variability among the studies (Table 6). No publication bias was found, which was evaluated by Egger’s test (p = 0.16) and Begg’s test (p = 0.07), (Figure 1 and Figure 2).
Absolute value of the pooled standardized mean difference for CTR measurement was 0.0353 (95% CI 0.147–0.0760, Table 7, Figure 3), which is classified as a small effect according to convention by Cohen (2002). Heterogeneity tests revealed substantial variation among studies, with an I² value of 89.97%, indicating significant differences in their findings (Table 8 and Table 9). No publication bias was found, which was evaluated by Egger’s test (p = 0.16) and Begg’s test (p = 0.07), (Figure 4).

5. Discussion

Assessment of the cardiothoracic ratio (CTR) based on chest radiographs is a pivotal diagnostic tool in clinical practice. Often, it sheds initial light on cardiac conditions in patients, providing valuable insights for further diagnostic directions to physicians worldwide. According to Celik et al., early changes in heart failure can be tracked using artificial intelligence through sequential chest radiography over time, potentially playing a significant role in patient monitoring and providing essential data for further research to ultimately determine the application of this method [34,36]. Despite its widespread use and clinical significance, manual CTR measurement remains time-consuming and susceptible to human errors, especially in repeated studies at regular intervals. The advent of artificial intelligence has initiated promising progress in automating CTR determination, offering an alternative to manual measurements.
This systematic review and meta-analysis aimed to evaluate studies employing artificial intelligence for CTR determination and segmentation. The analysis sought to assess whether implementing artificial intelligence could enhance patient care in diagnosing conditions related to cardiomegaly and provide potential benefits to radiologists and imaging specialists by streamlining the diagnostic process.
The studies included in this work underscore the significance of artificial intelligence in automating CTR determination. The findings suggest that artificial intelligence models hold promise for accurately identifying the CTR and diagnosing cardiomegaly, potentially revolutionizing the diagnostic and therapeutic process. Among the analysed studies, the most favourable outcomes were observed in works conducted by Chou et al.. and Lee et al. (method 2—PX), with AI assistance demonstrating high specificity and sensitivity in diagnosing cardiomegaly [23,29].
Integrating artificial intelligence technology with CTR determination brings numerous benefits, particularly in reducing the time required for study evaluation and increasing diagnostic accuracy compared to manual measurements. This automation could facilitate routine screening for heart diseases, providing clinicians with valuable diagnostic information without imposing additional financial and time burdens. Kim et al. suggest that the Emergency Department could be an excellent example of AI utilization for CTR measurements, where a rapidly acting AI model could prioritize patients for cardiac examination based on initial assessments [37].
Furthermore, our analysis highlighted the importance of artificial intelligence-based segmentation methods in enhancing diagnostic accuracy. By providing detailed information on the location and extent of cardiomegaly, segmentation approaches enable numerical CTR assessment, enhancing diagnostic precision. However, it is worth noting that segmentation methods may require the preprocessing of CXR images and additional computational resources.
Additionally, it is possible to further integrate the work of various artificial intelligence models to improve patient care. The integration of Support Vector Machine (SVM) models with AI systems that automatically assess the CTR on frontal chest radiographs can significantly enhance clinical and radiologic management. Initially, an AI model measures CTR, providing accurate detection of cardiac enlargement. The SVM model then classifies patients based on the severity of cardiomegaly, facilitating personalized treatment plans and supporting clinical decisions. This approach not only improves diagnostic precision but also tailors therapeutic strategies, demonstrating the synergistic potential of combining advanced AI technologies in optimizing patient outcomes and clinical workflows [38].
This meta-analysis provides significant insights into the utility of artificial intelligence in CTR measurement, yet certain limitations must be acknowledged. Our limitations on the selection of articles included in the review and meta-analysis have been described above, but we also want to specify other relevant research issues.
Unfortunately, not all studies could be included due to glaring deficiencies in result reporting in some articles, precluding their fair comparison and inclusion in our study. In some publications, we could observe the omission of important data or the lack of a detailed description of the methodology, which could lead to inaccuracies. Another issue, also stemming from the lack of consistent reporting guidelines, was the differences in measurement units used by individual authors, leading to measurement errors.
Therefore, it is so important that scientists conducting research publish detailed data about the study group and precisely describe the steps in the methodology. Creating generalized reporting criteria for researchers, which would include detailed guidance, would be a huge help in this matter. In our opinion, this is a necessary step to standardize future publications. Transparency in reporting, standardized methods, and awareness of potential bias are essential for obtaining credible meta-analytical results.
Implementing uniform criteria in this field will undoubtedly contribute to future research endeavours similar to ours but encompassing a broader range of studies.
Another challenge, also resulting from the lack of coherent reporting guidelines, was the differences in measurement units used by individual authors; hence, in comparing individual data, we opted to use SMD (standardized mean difference), allowing for a comparison of measures used in studies despite discrepancies in applied units.
The Cochrane Q test yielded a statistically significant result (p < 0.0001), indicating substantial heterogeneity among the studies included in the meta-analysis. The I2 statistic further confirmed a high degree of inconsistency (83.76%) in the results of different studies, suggesting considerable variability beyond what could be expected by chance alone. This high level of heterogeneity may arise from differences in study designs, populations, interventions, or outcome measures among the included studies.
Given the observed significant heterogeneity, it is important to interpret the results of the meta-analysis cautiously. Future studies should aim to identify and address potential sources of heterogeneity to improve the credibility and generalizability of results. Sensitivity analyses or subgroup analyses may be warranted to examine the impact of specific study characteristics on observed heterogeneity and assess the reliability of meta-analytical results. Furthermore, comparing AI-assisted CTR determination with manual measurements highlights the potential for discrepancies and the need for standard validation protocols. Future research efforts should focus on refining algorithms, optimizing preprocessing techniques, and conducting rigorous clinical evaluations to establish the reliability and effectiveness of AI-assisted CTR determination.
In summary, our systematic review and meta-analysis underscore the transformative potential of artificial intelligence in automating CTR determination and enhancing diagnostic accuracy in cardiomegaly. By harnessing artificial intelligence technologies, physicians can streamline the diagnostic process, improve patient care, and mitigate challenges associated with manual measurements. However, further research and clinical validation are essential to ensure the smooth integration of artificial intelligence into clinical practice and maximize its utility in diagnosing cardiovascular diseases and treatment.

6. Conclusions

The recent progress in AI within the field of medicine has been impressive, garnering increasing attention from researchers. This paper highlights the promising role of artificial intelligence in measuring the cardiothoracic ratio (CTR).
Our findings indicate that the application of AI in assessing the CTR on chest X-rays (CXRs) can significantly shorten the diagnosis time for patients and could be integrated routinely into imaging studies. This integration is likely to alleviate the workload of radiologists, improving efficiency and accuracy in clinical settings. Furthermore, AI methods have shown potential in reducing observer variation and operation time, aligning with the goals of improving consistency and reliability in CTR measurement. However, further research is needed, particularly based on standardized and coherent methodologies. Inconsistencies in reporting methods and differences in measurement units between studies may lead to interpretational errors, hindering the advancement of AI in this direction due to the inability to leverage existing results and compare them. Adopting uniform reporting standards and methodologies is crucial and indispensable for future research in this field, facilitating and accelerating the further development of AI methods not only in determining CTR but also in all aspects of medical imaging diagnostics.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/jcm13164659/s1, Supplement: S0–S9, Supplement: Risk of bias assessment [39,40].

Author Contributions

Conceptualization, J.K.; methodology, J.K., K.B.-Ł. and S.M. (Sandra Modlińska); software, M.B. and A.M.; validation, J.K., Ł.C., M.B. and A.M.; formal analysis, J.K, K.B.-Ł., W.B., S.M. (Sylwia Mielcarska), S.M. (Sandra Modlińska), M.C.; investigation, M.B., M.M., A.M., P.D., M.S., W.B.; resources, J.K., Ł.C., M.B. and A.M.; data curation, Ł.C., M.B., M.M., A.M., P.D. and M.S.; writing—original draft preparation, J.K., Ł.C., M.B., M.M., A.M., P.D., M.S., K.B.-Ł., W.B., S.M. (Sylwia Mielcarska) and S.M. (Sandra Modlińska); writing—review and editing, J.K., Ł.C., M.B., M.M., A.M., P.D., M.S., K.B.-Ł., S.M. (Sylwia Mielcarska), S.M. (Sandra Modlińska) and M.C.; visualization, Ł.C. and S.M. (Sylwia Mielcarska); supervision, J.K., S.M. (Sandra Modlińska), Z.N., K.G. and M.C.; project administration, J.K. and Ł.C.; funding acquisition, J.K., K.G. and M.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The original contributions presented in this study are included in the article/Supplementary Materials; further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Speets, A.M.; van der Graaf, Y.; Hoes, A.W.; Kalmijn, S.; Sachs, A.P.; Rutten, M.J.; Gratama, J.W.C.; Montauban van Swijndregt, A.D.; Mali, W.P. Chest Radiography in General Practice: Indications, Diagnostic Yield and Consequences for Patient Management. Br. J. Gen. Pract. 2006, 56, 574–578. [Google Scholar] [PubMed]
  2. Tsakok, M.T.; Gleeson, F.V. The Chest Radiograph in Heart Disease. Medicine 2018, 46, 453–457. [Google Scholar] [CrossRef]
  3. Arzhaeva, Y.; Wang, D.; Devnath, L.; Amirgholipour, S.; McBean, R.; Hillhouse, J.; Luo, S.; Meredith, D.; Newbigin, K.; Yates, D. Development of Automated Diagnostic Tools for Pneumoconiosis Detection from Chest X-ray Radiographs; Coal Services Health and Safety Trust Project No. 20647, CSIRO Report No. EP192938; CSIRO: Canberra, Australia, 2019; Available online: https://www.coalservices.com.au/wp-content/uploads/2023/06/Project-No.-20647-Final-Report.pdf (accessed on 19 July 2024).
  4. Murphy, K. How Data Will Improve Healthcare without Adding Staff or Beds. In The Global Innovation Index 2019; World Intellectual Property Organization: Geneva, Switzerland, 2019; pp. 129–132. Available online: https://www.wipo.int/edocs/pubdocs/en/wipo_pub_gii_2019-chapter8.pdf (accessed on 24 January 2024).
  5. Sharma, R.; Gaillard, F. Cardiothoracic Ratio. Radiopaedia. Available online: https://radiopaedia.org/articles/cardiothoracic-ratio?lang=us (accessed on 24 January 2024).
  6. Van Der Jagt, E.J.; Smits, H.J. Cardiac Size in the Supine Chestfilm. Eur. J. Radiol. 1992, 14, 173–177. [Google Scholar] [CrossRef]
  7. Truszkiewicz, K.; Poręba, R.; Gać, P. Radiological Cardiothoracic Ratio in Evidence-Based Medicine. J. Clin. Med. 2021, 10, 2016. [Google Scholar] [CrossRef]
  8. Simkus, P.; Gutierrez Gimeno, M.; Banisauskaite, A.; Noreikaite, J.; McCreavy, D.; Penha, D.; Arzanauskaite, M. Limitations of Cardiothoracic Ratio Derived from Chest Radiographs to Predict Real Heart Size: Comparison with Magnetic Resonance Imaging. Insights Imaging 2021, 12, 158. [Google Scholar] [CrossRef]
  9. Bui, A.L.; Horwich, T.B.; Fonarow, G.C. Epidemiology and Risk Profile of Heart Failure. Nat. Rev. Cardiol. 2011, 8, 30–41. [Google Scholar] [CrossRef]
  10. Amin, H.; Siddiqui, W.J. Cardiomegaly. In StatPearls; StatPearls Publishing: Treasure Island, FL, USA, 2024. [Google Scholar]
  11. Saiviroonporn, P.; Rodbangyang, K.; Tongdee, T.; Chaisangmongkon, W.; Yodprom, P.; Siriapisith, T.; Wonglaksanapimon, S.; Thiravit, P. Cardiothoracic Ratio Measurement Using Artificial Intelligence: Observer and Method Validation Studies. BMC Med. Imaging 2021, 21, 95. [Google Scholar] [CrossRef]
  12. Lechien, J.R.; Maniaci, A.; Gengler, I.; Hans, S.; Chiesa-Estomba, C.M.; Vaira, L.A. Validity and Reliability of an Instrument Evaluating the Performance of Intelligent Chatbot: The Artificial Intelligence Performance Instrument (AIPI). Eur. Arch. Otorhinolaryngol. 2024, 281, 2063–2079. [Google Scholar] [CrossRef] [PubMed]
  13. Kufel, J.; Bielówka, M.; Mitręga, A. Detection and Measurement of Cardiothoracic Ratio on Chest X-rays Using Artificial Intelligence—A Systematic Review and Meta-Analysis. PROSPERO 2023 CRD42023437459. Available online: https://www.crd.york.ac.uk/prospero/display_record.php?ID=CRD42023437459 (accessed on 4 February 2024).
  14. Schardt, C.; Adams, M.B.; Owens, T.; Keitz, S.; Fontelo, P. Utilization of the PICO Framework to Improve Searching PubMed for Clinical Questions. BMC Med. Inform. Decis. Mak. 2007, 7, 16. [Google Scholar] [CrossRef]
  15. Ouzzani, M.; Hammady, H.; Fedorowicz, Z.; Elmagarmid, A. Rayyan—A Web and Mobile App for Systematic Reviews. Syst. Rev. 2016, 5, 210. [Google Scholar] [CrossRef]
  16. McHugh, M.L. Interrater Reliability: The Kappa Statistic. Biochem. Med. 2012, 22, 276–282. [Google Scholar] [CrossRef]
  17. Page, M.J.; Moher, D.; Bossuyt, P.M.; Boutron, I.; Hoffmann, T.C.; Mulrow, C.D.; Shamseer, L.; Tetzlaff, J.M.; Akl, E.A.; Brennan, S.E.; et al. PRISMA 2020 Explanation and Elaboration: Updated Guidance and Exemplars for Reporting Systematic Reviews. BMJ 2021, 372, n160. [Google Scholar] [CrossRef] [PubMed]
  18. Aromataris, E.; Fernandez, R.; Godfrey, C.M.; Holly, C.; Khalil, H.; Tungpunkom, P. Summarizing Systematic Reviews: Methodological Development, Conduct and Reporting of an Umbrella Review Approach. Int. J. Evid. -Based Healthc. 2015, 13, 132–140. [Google Scholar] [CrossRef] [PubMed]
  19. Shea, B.J.; Reeves, B.C.; Wells, G.; Thuku, M.; Hamel, C.; Moran, J.; Moher, D.; Tugwell, P.; Welch, V.; Kristjansson, E.; et al. AMSTAR 2: A Critical Appraisal Tool for Systematic Reviews That Include Randomised or Non-Randomised Studies of Healthcare Interventions, or Both. BMJ 2017, 358, j4008. [Google Scholar] [CrossRef] [PubMed]
  20. Long, H.A.; French, D.P.; Brooks, J.M. Optimising the Value of the Critical Appraisal Skills Programme (CASP) Tool for Quality Appraisal in Qualitative Evidence Synthesis. Res. Methods Med. Health Sci. 2020, 1, 31–42. [Google Scholar] [CrossRef]
  21. Higgins, J.P.T.; Altman, D.G.; Gotzsche, P.C.; Juni, P.; Moher, D.; Oxman, A.D.; Savovic, J.; Schulz, K.F.; Weeks, L.; Sterne, J.A.C.; et al. The Cochrane Collaboration’s Tool for Assessing Risk of Bias in Randomised Trials. BMJ 2011, 343, d5928. [Google Scholar] [CrossRef] [PubMed]
  22. Arsalan, M.; Owais, M.; Mahmood, T.; Choi, J.; Park, K.R. Artificial Intelligence-Based Diagnosis of Cardiac and Related Diseases. J. Clin. Med. 2020, 9, 871. [Google Scholar] [CrossRef] [PubMed]
  23. Chou, H.-H.; Lin, J.-Y.; Shen, G.-T.; Huang, C.-Y. Validation of an Automated Cardiothoracic Ratio Calculation for Hemodialysis Patients. Diagnostics 2023, 13, 1376. [Google Scholar] [CrossRef]
  24. Nam, J.G.; Kim, J.; Noh, K.; Choi, H.; Kim, D.S.; Yoo, S.-J.; Yang, H.-L.; Hwang, E.J.; Goo, J.M.; Park, E.-A.; et al. Automatic Prediction of Left Cardiac Chamber Enlargement from Chest Radiographs Using Convolutional Neural Network. Eur. Radiol. 2021, 31, 8130–8140. [Google Scholar] [CrossRef]
  25. Chaisangmongkon, W.; Chamveha, I.; Promwiset, T.; Saiviroonporn, P.; Tongdee, T. External Validation of Deep Learning Algorithms for Cardiothoracic Ratio Measurement. IEEE Access 2021, 9, 110287–110298. [Google Scholar] [CrossRef]
  26. Zhou, L.; Yin, X.; Zhang, T.; Feng, Y.; Zhao, Y.; Jin, M.; Peng, M.; Xing, C.; Li, F.; Wang, Z.; et al. Detection and Semiquantitative Analysis of Cardiomegaly, Pneumothorax, and Pleural Effusion on Chest Radiographs. Radiol. Artif. Intell. 2021, 3, e200172. [Google Scholar] [CrossRef] [PubMed]
  27. Li, Z.; Hou, Z.; Chen, C.; Hao, Z.; An, Y.; Liang, S.; Lu, B. Automatic Cardiothoracic Ratio Calculation with Deep Learning. IEEE Access 2019, 7, 37749–37756. [Google Scholar] [CrossRef]
  28. Wu, J.-X.; Pai, C.-C.; Kan, C.-D.; Chen, P.-Y.; Chen, W.-L.; Lin, C.-H. Chest X-Ray Image Analysis with Combining 2D and 1D Convolutional Neural Network Based Classifier for Rapid Cardiomegaly Screening. IEEE Access 2022, 10, 47824–47836. [Google Scholar] [CrossRef]
  29. Lee, M.S.; Kim, Y.S.; Kim, M.; Usman, M.; Byon, S.S.; Kim, S.H.; Lee, B.I.; Lee, B.-D. Evaluation of the Feasibility of Explainable Computer-Aided Detection of Cardiomegaly on Chest Radiographs Using Deep Learning. Sci. Rep. 2021, 11, 16885. [Google Scholar] [CrossRef] [PubMed]
  30. Chou, C.-Y.; Wang, C.C.N.; Chiang, H.-Y.; Huang, C.-F.; Hsiao, Y.-L.; Sun, C.-H.; Hu, C.-S.; Wu, M.-Y.; Chen, S.-H.; Chang, C.-M.; et al. Cardiothoracic Ratio Values and Trajectories Are Associated with Risk of Requiring Dialysis and Mortality in Chronic Kidney Disease. Commun. Med. 2023, 3, 19. [Google Scholar] [CrossRef] [PubMed]
  31. Jafar, A.; Hameed, M.T.; Akram, N.; Waqas, U.; Kim, H.S.; Naqvi, R.A. CardioNet: Automatic Semantic Segmentation to Calculate the Cardiothoracic Ratio for Cardiomegaly and Other Chest Diseases. J. Pers. Med. 2022, 12, 988. [Google Scholar] [CrossRef] [PubMed]
  32. Saiviroonporn, P.; Wonglaksanapimon, S.; Chaisangmongkon, W.; Chamveha, I.; Yodprom, P.; Butnian, K.; Siriapisith, T.; Tongdee, T. A Clinical Evaluation Study of Cardiothoracic Ratio Measurement Using Artificial Intelligence. BMC Med. Imaging 2022, 22, 46. [Google Scholar] [CrossRef]
  33. Ajmera, P.; Kharat, A.; Gupte, T.; Pant, R.; Kulkarni, V.; Duddalwar, V.; Lamghare, P. Observer Performance Evaluation of the Feasibility of a Deep Learning Model to Detect Cardiomegaly on Chest Radiographs. Acta Radiol. Open 2022, 11, 205846012211073. [Google Scholar] [CrossRef]
  34. Que, Q.; Tang, Z.; Wang, R.; Zeng, Z.; Wang, J.; Chua, M.; Gee, T.S.; Yang, X.; Veeravalli, B. CardioXNet: Automated Detection for Cardiomegaly Based on Deep Learning. In Proceedings of the 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Honolulu, HI, USA, 18–21 July 2018; pp. 612–615. [Google Scholar]
  35. Candemir, S.; Rajaraman, S.; Thoma, G.; Antani, S. Deep Learning for Grading Cardiomegaly Severity in Chest X-Rays: An Investigation. In Proceedings of the 2018 IEEE Life Sciences Conference (LSC), Montreal, QC, Canada, 28–30 October 2018; pp. 109–113. [Google Scholar]
  36. Celik, A.; Surmeli, A.O.; Demir, M.; Esen, K.; Camsari, A. The Diagnostic Value of Chest X-ray Scanning by the Help of Artificial Intelligence in Heart Failure (ART-IN-HF). Clin. Cardiol. 2023, 46, 1562–1568. [Google Scholar] [CrossRef]
  37. Kim, D.; Lee, J.H.; Jang, M.; Park, J.; Hong, W.; Lee, C.S.; Yang, S.Y.; Park, C.M. The Performance of a Deep Learning-Based Automatic Measurement Model for Measuring the Cardiothoracic Ratio on Chest Radiographs. Bioengineering 2023, 10, 1077. [Google Scholar] [CrossRef]
  38. Maniaci, A.; Riela, P.M.; Iannella, G.; Lechien, J.R.; La Mantia, I.; De Vincentiis, M.; Cammaroto, G.; Calvo-Henriquez, C.; Di Luca, M.; Chiesa Estomba, C.; et al. Machine Learning Identification of Obstructive Sleep Apnea Severity through the Patient Clinical Features: A Retrospective Study. Life 2023, 13, 702. [Google Scholar] [CrossRef] [PubMed]
  39. Whiting, P.; Rutjes, A.W.S.; Reitsma, J.B.; Bossuyt, P.M.M.; Kleijnen, J. The development of QUADAS: A tool for the quality assessment of studies of diagnostic accuracy included in systematic reviews. BMC Med. Res. Methodol. 2003, 3, 25. [Google Scholar] [CrossRef] [PubMed]
  40. Page, M.J.; McKenzie, J.E.; Bossuyt, P.M.; Boutron, I.; Hoffmann, T.C.; Mulrow, C.D.; Shamseer, L.; Tetzlaff, J.M.; Akl, E.A.; Brennan, S.E.; et al. The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. BMJ 2021, 372, n71. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Forest plot of AUC meta-analysis [11,23,29].
Figure 1. Forest plot of AUC meta-analysis [11,23,29].
Jcm 13 04659 g001
Figure 2. Funnel plot of AUC meta-analysis. Egger’s test (p = 0.16), Begg’s test (p = 0.07).
Figure 2. Funnel plot of AUC meta-analysis. Egger’s test (p = 0.16), Begg’s test (p = 0.07).
Jcm 13 04659 g002
Figure 3. Forest plot of standardized mean difference meta-analysis [11,27,29].
Figure 3. Forest plot of standardized mean difference meta-analysis [11,27,29].
Jcm 13 04659 g003
Figure 4. Funnel plot of mean difference meta-analysis using Egger’s test (p = 0.43) and Begg’s test (p = 0.58).
Figure 4. Funnel plot of mean difference meta-analysis using Egger’s test (p = 0.43) and Begg’s test (p = 0.58).
Jcm 13 04659 g004
Table 1. Characteristics of type of study.
Table 1. Characteristics of type of study.
Study No.AuthorsType of Study (Classification, Segmentation, Both)
1Saiviroonporn P et al. [11]classification
2Saiviroonporn P et al. [32]classification
3Arsalan M et al. [22]both
4Lee MS et al. [29]both
5Chou CY et al. [30]classification
6Chou HH et al. [23]both
7Nam JG et al. [24]both
8Ajmera P et al. [33]both
9Que Q et al. [34]both
10Jafar A et al. [31]both
11Chaisangmongkon W et al. [25]both
12Zhou L et al. [26]both
13Li Z et al. [27]both
14Wu JX et al. [28]both
Table 2. Characteristics of included studies—databases.
Table 2. Characteristics of included studies—databases.
Study No.AuthorsNo. of ExaminationsTrainingValidationTestPreexisting Dataset Name
No. of Unique PatientsNo. of CXRsNo. of CXRsNo. of CXRsNo. of CXRs
1Saiviroonporn P et al. [11]73527352----
2Saiviroonporn P et al. [32]16,90316,903-75179386JSRT dataset, Montgomery County dataset, ChestX-ray14 dataset, CheXpert dataset
3Arsalan M et al. [22]-4919433120568JSRT dataset, Montgomery County dataset, Shenzhen X-Ray
4Lee MS et al. [29]179317936341591000JSRT dataset
5Chou CY et al. [30]247412,391--537-
6Chou HH et al. [23]-92746054413JSRT dataset
7Nam JG et al. [24]52775384-107--
8Ajmera P et al. [33]-35993416-183-
9Que Q et al. [34]-103- -ChestX-ray8 Database
10Jafar A et al. [31]-247---JSRT dataset
11Chaisangmongkon W et al. [25]-8316---JSRT dataset, The Montgomery County dataset, ChestX-ray14 dataset, CheXpert dataset, Proprietary dataset
12Zhou L et al. [26]263828382554284--
13Li Z et al. [27]5000550040001000500-
14Wu JX et al. [28]-200---NIH CXR dataset
Table 3. Characteristics of AI technology.
Table 3. Characteristics of AI technology.
Study No.AuthorsModels NameSoftwareTraining DeviceTraining
1Saiviroonporn P et al. [11]VGG-16 U-NetMATLAB software (R2019a, MathWorks, Inc., Natick, MA, USA)--
2Saiviroonporn P et al. [32]AlbuNet, SegNet, VGG-11, and VGG-16-Nvidia Tesla V100 GPU 32 GBImageNet
3Arsalan M et al. [22]X-RayNet-1 and X-RayNet-2-Intel Core™ i7-3770K CPU with the clock speed of 3.50 GHz (4 cores). The system RAM was 28 GB with NVIDIA GeForce GTX Titan X GPU (3072 Cuda cores with a graphics memory of 12 GB)MATLAB 2019a
4Lee MS et al. [29]U-Net, XLSorPyTorch framework (2.4)GTX Titan XP; NVIDIA, Santa Clara, CA, USAImageNet pre-trained ResNet-101
5Chou CY et al. [30]U-Net--iCTR Assessment System
6Chou HH et al. [23]AlbuNet-34, ResNet34-NVIDIA RTX 3090ImageNet
7Nam JG et al. [24]DensNet, ResNet152---
8Ajmera P et al. [33]Attention U-Net DL--ImageNet
9Que Q et al. [34]U-Net, Dense-Net, CardioXNet---
10Jafar A et al. [31]CardioNet-B-Intel®® Core™ i7-8700 CPUMATLAB R2021
11Chaisangmongkon W et al. [25]VGG-11 U-Net, VGG-16 U-Net, SegNet i AlbuNet---
12Zhou L et al. [26]U-Net-Intel Xeon Processor E5–2620 v4 2.10 GHz central processing unit and four 12GB memory NVIDIA GeForce GTX 1080 Ti-
13Li Z et al. [27]U-NetITK-SNAP (4.2.0)--
14Wu JX et al. [28]DenseNet, ResNet, FC-ResNets, UNetMATLAB software, LabVIEW (NITM, Austin, TX,
USA)
NVIDIA ® GeForce ® RTX™ 2080 Ti, 1755 MHz, 11 GB GDDR6-
Table 4. Preprocessing and augmentation.
Table 4. Preprocessing and augmentation.
Study No.AuthorsPreprocessingAugmentation
1Saiviroonporn P et al. [11]No dataNo data
2Saiviroonporn P et al. [32]No dataExpanded augmentation repertoire
3Arsalan M et al. [22]Not requiredCropping, resizing, and horizontal flipping with interpolation
4Lee MS et al. [29]No dataNot applied
5Chou CY et al. [30]No dataNo data
6Chou HH et al. [23]resizing to 512 × 512 pxlRandomResizedCrop, ShiftScaleRotate, RandomBrightnessContrast, InverImg, ElasticTransform, GridDistortion, and OpticalDistorsion
7Nam JG et al. [24]resizing to 224 × 224 or 299 × 299 pxlInversion, Fast Autoaugment
8Ajmera P et al. [33]No dataImage augmentation
9Que Q et al. [34]No dataNo data
10Jafar A et al. [31]Resizing to 350 × 350 pxlCropping, horizontal flipping, translation, and horizontal flipping with translation
11Chaisangmongkon W et al. [25]Transformation into 8-bit grayscale PNG images, resizing to 512 × 512 pxl, histogram equalization to normalize brightness and contrastAugmentation process where each image has a 60% chance of becoming transformed: arithmetic change (contrast normalization),
contrast adjustments (gamma and linear contrast scaling), brightness shift, geometric alterations
(elastic, perspective, and affine transformations), noise addition (Gaussian blur and additive Gaussian noise), and horizontal flipping
12Zhou L et al. [26]Pixel intensity converted into [0, 4095], linear intensity range compressed to [0, 255](Conducted on training dataset) Randomized image flipping, contrast and brightness adjusting, rotation with angles between −15° and 15°,
optimalisation performed by stochastic gradient descent optimizer
13Li Z et al. [27]Normalisation by grey-scale transformation, cropping, resizing to 800 × 800 pxlNot considered
14Wu JX et al. [28]Conversion to TIF, resizing to 420 × 420 pxl, ROI selectionNo data
Table 5. Results.
Table 5. Results.
Study No.AuthorsSensitivitySpecificityAccuracyAUROCF1Mean Error of AIStandard Deviation of AIMean Error of Manual MeasurementSD of Manual Measurement
1Saiviroonporn P et al. [11]97.5% 182.8% 187.9% 10.962 10.849 1----
2Saiviroonporn P et al. [32]---------
3Arsalan M et al. [22]---------
4Lee MS et al. [29]--94.97% 2------
5Chou CY et al. [30]--94.97% 2------
6Chou HH et al. [23]96.4% 291% 294.9% 20.992 2-0.83% 21.52% 21.46% 2-
7Nam JG et al. [24]69.6% 265.8% 2-0.706 2-----
8Ajmera P et al. [33]80% 299% 294.96% 2-0.8820.0254 20.006 2--
9Que Q et al. [34]--93.75% 10.934810.9434 1
10Jafar A et al. [31]---------
11Chaisangmongkon W et al. [25]94.21% 298.02% 296.32% 2 0.002 20.0157 2−0.0076 20.0136 2
12Zhou L et al. [26]--98% 1------
13Li Z et al. [27]97.2% 292.7% 295.3% 2--0.0004 30.0133 3−0.0083 30.00187 3
14Wu JX et al. [28]--98.4% 1-0.9838 1----
1 Cardiomegaly classification; 2 CTR measurement. 3 CTR measurement between deep learning and reference standard.
Table 6. Results of AUC meta-analysis.
Table 6. Results of AUC meta-analysis.
StudyROC AreaStandard Error95% CIzPWeight (%)
FixedRandom
Saiviroonporn P et al. [11]0.9620.002770.957 to 0.967--51.1815.26
Lee MS et al.—method 1—PX [29]0.9600.01790.925 to 0.995--1.238.63
Lee MS et al.—method 1—PE [29]0.9200.02810.865 to 0.975--0.55.22
Lee MS et al.—method 1—lesions obscuring the heart [29]0.8600.05610.750 to 0.970--0.121.74
Lee MS et al.—method 1—lesions obscuring the diaphragm [29]0.9300.03830.855 to 1.00--0.273.32
Lee MS et al.—method 1-cardiomegaly only [29]0.9700.007650.955 to 0.985--6.7113.56
Lee MS et al.—method 2—PX [29]0.9800.01020.960 to 1.000--3.7812.33
Lee MS et al.—method 2—PE [29]0.8900.02810.835 to 0.945--0.505.22
Lee MS et al.—method 2—lesions obscuring the heart [29]0.8100.06120.690 to 0.930--0.101.49
Lee MS et al.—method 2—lesions obscuring the diaphragm [29]0.9500.03060.890 to 1.000--0.424.64
Lee MS et al.—method 2—cardiomegaly only [29]0.9700.007650.955 to 0.985--6.7113.56
Chou HH et al. [30]0.9920.003720.985 to 0.999--28.4715.03
Total (fixed effects)0.9710.001980.967 to 0.975489.843<0.001100.00100.00
Total (random effects)0.9590.007870.944 to 0.975121.924<0.001100.00100.00
Table 7. Test of heterogeneity; AUC meta-analysis.
Table 7. Test of heterogeneity; AUC meta-analysis.
QDFSignificance LevelI2 (Inconsistency)95% CI for I2
67.732211p < 0.000183.76%73.09 to 90.20
Table 8. Results of meta-analysis comparing mean difference in CTR measurement between AI models and reference methods. SMD—standardized mean difference, SE—standard error.
Table 8. Results of meta-analysis comparing mean difference in CTR measurement between AI models and reference methods. SMD—standardized mean difference, SE—standard error.
StudyN1N2TotalSMDSE95% CItpWeight %)
FixedRandom
Saiviroonporn P et al.—normal group [11]499349939866−0.1570.0202−0.196 to −0.117 48.8510.29
Saiviroonporn P et al.—cardiomegaly group [11]241924194838−0.02130.0288−0.0776 to 0.0351 24.0310.16
Lee MS et al.—method 1—PX [29]123123246−0.08730.127−0.338 to 0.163 1.236.85
Lee MS et al.—method 1—PE [29]2022024040.09110.0994−0.104 to 0.286 2.017.91
Lee MS et al.—method 1—lesions obscuring the heart [29]8383166−0.2330.155−0.539 to 0.0730 0.835.87
Lee MS et al.—method 1—lesions obscuring the diaphragm [29]63631260.0000.177−0.351 to 0.351 0.635.18
Lee MS et al.—method 1—cardiomegaly only [29]6526521304−0.04000.0554−0.149 to 0.0686 6.489.49
Lee MS et al.—method 2—PX [29]1231232460.6630.1310.406 to 0.921 1.166.72
Lee MS et al.—method 2—PE [29]202202404−0.8110.103−1.014 to −0.608 1.867.75
Lee MS et al.—method 2—lesions obscuring the heart [29]83831660.1040.155−0.201 to 0.410 0.835.88
Lee MS et al.—method 2—lesions obscuring the diaphragm [29]63631260.0000.177−0.351 to 0.351 0.635.18
Lee MS et al.—method 2—cardiomegaly only [29]65265213040.0000.0554−0.109 to 0.109 6.489.49
Li Z et al. [27]50050010000.1230.0633−0.000914 to 0.247 4.989.23
Total (fixed effects)100981009820196−0.08580.0141−0.113 to −0.0582−6.087<0.001100.00100.00
Total (random effects)100981009820196−0.03530.0568−0.147 to 0.0760−0.6220.534100.00100.00
Table 9. Test of heterogeneity; mean difference meta-analysis.
Table 9. Test of heterogeneity; mean difference meta-analysis.
QDFSignificance LevelI2 (Inconsistency)95% CI for I2
119.626312p < 0.000189.97%84.70 to 93.42
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Kufel, J.; Czogalik, Ł.; Bielówka, M.; Magiera, M.; Mitręga, A.; Dudek, P.; Bargieł-Łączek, K.; Stencel, M.; Bartnikowska, W.; Mielcarska, S.; et al. Measurement of Cardiothoracic Ratio on Chest X-rays Using Artificial Intelligence—A Systematic Review and Meta-Analysis. J. Clin. Med. 2024, 13, 4659. https://doi.org/10.3390/jcm13164659

AMA Style

Kufel J, Czogalik Ł, Bielówka M, Magiera M, Mitręga A, Dudek P, Bargieł-Łączek K, Stencel M, Bartnikowska W, Mielcarska S, et al. Measurement of Cardiothoracic Ratio on Chest X-rays Using Artificial Intelligence—A Systematic Review and Meta-Analysis. Journal of Clinical Medicine. 2024; 13(16):4659. https://doi.org/10.3390/jcm13164659

Chicago/Turabian Style

Kufel, Jakub, Łukasz Czogalik, Michał Bielówka, Mikołaj Magiera, Adam Mitręga, Piotr Dudek, Katarzyna Bargieł-Łączek, Magdalena Stencel, Wiktoria Bartnikowska, Sylwia Mielcarska, and et al. 2024. "Measurement of Cardiothoracic Ratio on Chest X-rays Using Artificial Intelligence—A Systematic Review and Meta-Analysis" Journal of Clinical Medicine 13, no. 16: 4659. https://doi.org/10.3390/jcm13164659

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop