1. Introduction
Kidney or renal disease can gradually harm the human body by impairing essential renal functions such as filtration, re-absorption, secretion, and excretion. If this condition worsens, the kidney may fail, leading to Chronic Kidney Disease (CKD). Globally, the mortality rate of CKD increased
, resulting in a total of 1.2 million deaths [
1]. According to the Ministry of Health and Welfare (MHW) in Taiwan, nephritis, nephrotic syndrome, and nephrosis were the ninth leading cause of death in 2018 [
2]. Autosomal Dominant Polycystic Kidney Disease (ADPKD) ranks as the fourth leading cause of CKD worldwide [
3]. The occurrence of ADPKD is primarily attributed to genetic abnormalities that are often inherited from parents. Typically, ADPKD begins to develop asymptomatically in both kidneys. For this reason, its progression is often observed in middle to late adulthood. Thus, it is crucial to predict the progressive loss of renal function at an early stage.
Glomerular Filtration Rate (GFR) [
4] is recognized as an important biomarker for predicting the progressive loss of renal function. GFR is measured using blood tests by analyzing changes in serum creatinine levels and estimating GFR (eGFR) values. However, studies have shown that GFR measurements do not reflect changes in serum creatine levels until around the fourth or fifth decade of life [
5]. As a result, Total Kidney Volume (TKV) has been included as a second key biomarker alongside GFR. TKV can be calculated using commonly available medical imaging techniques such as Magnetic Resonance Imaging (MRI) and Computed Tomography (CT). Both techniques provide images in three planes, Axial (Transverse), Coronal, and Sagittal, which are stored in the Picture Archiving and Communication System (PACS) in Digital Imaging and Communication in Medicine (DICOM) format. These images typically consist of multiple slices (e.g., ≈100–200 slices). However, MRI is known for being costly and time-consuming. In contrast, CT is a faster and more cost-effective technique, making it highly preferable. CT imaging is categorized into two types: Contrast-enhanced Computed Tomography (CCT) and Non-enhanced Computed Tomography (NCCT). The term “Contrast” refers to a contrast material injected into the patient’s body, which enhances the visibility of specific organs under investigation. However, CCT is not always feasible for ADPKD patients due to the potential side effects of the injected contrast material. As a result, NCCT, which does not require the use of contrast material, is considered as the most practical and widely available medical imaging technique for ADPKD patients, even though some organs may be more difficult to observe.
TKV calculation on CT involves localization and segmentation tasks, which require experienced radiologists to manually localize and segment the kidneys by outlining them slice by slice in the patient’s CT data. Several conventional methods have been applied to this process, including Polyline tracing [
6], Livewire [
7], Freehand drawing [
8], Stereology [
9], Mid-Slice [
10], and Ellipsoid [
11]. However, these methods are reported to be labor-intensive, time-consuming, and prone to human error [
12]. For instance, Polyline tracing requires ≈30 min, Livewire takes ≈20–26 min, and Freehand drawing demands 8 min for a single kidney. Moreover, methods like Stereology, Mid-Slice, and Ellipsoid rely on specific values, such as the total number of grids, mid-slice, length, width, and depth, which must be derived using specialized software such as ImageJ [
6], OsiriX [
8], and others. Despite these tools, the accuracy and efficiency of TKV calculation through localization and segmentation largely depend on a radiologist’s experience and expertise.
To optimize TKV calculation, an automatic segmentation model has been developed using a Computer-Aided Diagnosis (CAD) approach, which is based on Image Preprocessing (IP). A semi-automated segmentation method utilizing IP has been reported, where a T2-weighted MRI was employed to design an ADPKD segmentation model [
13]. The method relies on active contours and sub-voxel morphology and considers data from 17 patients. Similarly, an automated segmentation approach based on IP was designed, using a Spatial Prior Probability Map (SPPM) and Propagated Shape Constraint (PSC) techniques, by employing T2-weighted MRI data from 60 patients [
14]. However, the IP-based approach has limitations, particularly in terms of low accuracy, which can result in a high error rate. This issue arises from the manual extraction of features, where the low quality of extracted features negatively impacts the performance and accuracy of the segmentation model.
To address these limitations, Artificial Intelligence (AI) techniques have rapidly evolved into two major approaches: Machine Learning (ML) and Deep Learning (DL) [
15,
16]. Initially, ML techniques were employed to enhance the localization and segmentation capabilities of IP-based approaches. For example, a segmentation method using geodesic distance volume and Random Forest (RF) algorithms was developed to segment ADPKD kidneys on 55 CCT image datasets [
17]. Similarly, a preliminary study on 20 NCCT image datasets applied Histogram and K-means algorithms for ADPKD kidney segmentation [
18]. Although various ML techniques addressed some limitations of IP-based approaches, their effectiveness heavily depends on the quantity and quality of the training data. The high variability in the shape, intensity, and size of ADPKD kidneys makes small training datasets insufficient for developing robust segmentation models. Additionally, ML techniques rely on optimal handcrafted feature extraction, which can be challenging to achieve consistently. In recent years, DL has achieved tremendous success in handling complex medical image data. Unlike ML, DL introduces automatic feature extraction through the Convolution Neural Network (CNN) methodologies, eliminating the need for handcrafted features. Several DL-based approaches have been designed for image segmentation tasks in ADPKD, utilizing a variety of medical imaging datasets and architectures. For example, Fully Convolutional Network (FCN) and 244 CCT [
19], AlexNet and 448 CCT [
20], U-Net and 2000 T2-weighted MRI [
21], U-Net and 3D T2-weighted [
22], FCN and 22 NCCT [
23], Region-based CNN (R-CNN) and 32 T2-weighted MRI [
24], CNN and 526 T2-weighted MRI [
25], Volumetric Medical Image Segmentation (V-Net) with 182 NCCT and 32 CCT [
26], and multiple architectures such as FCN, Unet, SegNet, Deeplab, and pspNet and breast 309 Ultrasound image [
27].
Based on the existing segmentation techniques, T2-weighted MRI [
28] and CCT image data are the most commonly utilized medical imaging modalities for designing automated segmentation models of ADPKD, which use IP, ML, and DL approaches. While two existing ADPKD segmentation methods have involved 20 [
24] and 22 [
23] NCCT image datasets, the total number of training samples is insufficient for establishing the robustness of the segmentation models. Similarly, 182 NCCT and 32 CCT datasets have been utilized for volumetric analysis [
26]. However, the imbalance in the number of NCCT and CCT cases makes it challenging to validate the robustness of the developed models for both modalities.
It is also found that NCCT images pose significant challenges for localization and segmentation due to several factors. First, the intensity of NCCT images, as shown in
Figure 1a, is lower as compared to the CCT images, as illustrated in
Figure 1b. Second, the intensity of liver cysts is similar to that of ADPKD kidneys, as shown in
Figure 1c. Third, the intensity of ADPKD kidneys can be similar to adjacent organs such as the liver and spleen, as depicted in
Figure 1c. This similarity makes it difficult to differentiate the boundaries of the ADPKD kidneys from neighboring organs during delineation. These limitations of NCCT in capturing detailed structural and functional characteristics of the kidneys can make tasks such as localization, segmentation, and TKV estimations difficult to perform accurately. As ADPKD is a progressive disease, cysts are often smaller and fewer in number during its early stage, which makes diagnosis through NCCT challenging. This difficulty arises because NCCT has limitations in accurately detecting and calculating the total number of cysts in the kidneys [
29]. This is crucial in clinical practice as early diagnosis of ADPKD can lead to better patient prognosis. Therefore, incorporating NCCT and CCT for kidney localization, segmentation, and TKV estimation, using AI-based methods, can enhance diagnostic accuracy, improve early risk predictions, and ultimately optimize treatment strategies for better outcomes.
Furthermore, a review of existing studies reveals a lack of investigations into developing an end-to-end model that integrates localization, segmentation, and TKV estimation while addressing the imbalance between NCCT and CCT cases. These challenges have motivated us to design an integrated, robust end-to-end model for ADPKD kidney localization, segmentation, and TKV estimation, which performs effectively on both NCCT and CCT.
In this paper, we propose an automatic approach for the localization, segmentation, and TKV estimation of ADPKD, using a balanced dataset of 100 NCCT and 100 CCT images. Our methodology integrates IP techniques and state-of-the-art DL architectures. Specifically, we adopted the Single Shot Detector (SSD) Inception V2 for the localization model, DeepLab V3+ Xception65 for the segmentation model, and a Decision Tree Regression (DTR) ML model for the TKV estimation model. The main contributions of this paper are summarized as follows:
Design an automatic localization and segmentation model for ADPKD kidneys, which can effectively work with both NCCT and CCT image data.
Develop a TKV estimation model, utilizing the outputs of the derived segmentation model.
Facilitate radiologists’ work by providing automated ADPKD localization, segmentation, and TKV estimation models, thereby reducing the labor involved in analyzing the progressive loss of renal function.
This paper is organized as follows.
Section 1.1 reviews existing works related to ADPKD localization and segmentation.
Section 2 presents the description of the proposed methods.
Section 3 describes the experimental setup and results.
Section 4 provides the discussion, while
Section 5 concludes the work.
1.1. Related Work
This section discusses the evolution of state-of-the-art methodologies for ADPKD kidney localization and segmentation. These methodologies can be categorized into two main groups: without AI (
Section 1.1.1), which includes traditional methods and IP-based approaches, and with AI (
Section 1.1.2), which encompasses ML and DL approaches.
1.1.1. Without Artificial Intelligence
In clinical practice, traditional methods such as Polyline tracing [
6], Livewire [
7], Freehand drawing [
8], Stereology [
9], Mid-Slice [
10], and Ellipsoid [
11], have been used to delineate and segment the ADPKD kidneys. However, the accuracy of these methods heavily relies on the radiologist’s expertise in manually delineating the kidneys. As a result, high error rates in ADPKD kidney localization and segmentation are inevitable, which can lead to inaccurate TKV calculation. To address the limitations of these traditional methods, IP techniques have been utilized [
13,
14]. Due to the challenges in delineating ADPKD kidneys on T2-weighted MRI, active contours and sub-voxel morphology were used to automate the delineation process [
13]. Specifically, Geodesic Active Contours, Region Competition techniques, and Bridge Burner algorithms were used for the sub-voxel morphology. In [
14], instead of using shape model, which are not well-suited for the variable shapes of ADPKD kidneys, a Spatial Prior Probability Map (SPPM) was applied. The process included three steps: SPPM construction, region mapping, and boundary refinement, with the results compared to manual segmentation [
14]. While IP-based approaches mitigate some of the challenges of traditional methods, their performance still depends on the quantity and quality of training data. The non-uniform morphology and intensity of ADPKD kidneys make it difficult to build robust localization and segmentation models with insufficient training data. Additionally, IP-based approaches often require handcrafted features, which can be influenced by the radiologist’s expertise in extracting relevant features from the highly variable ADPKD kidney image.
1.1.2. With Artificial Intelligence
To fully automate ADPKD kidney localization and segmentation, the used of AI has rapidly increased. In the realm of ML, the authors [
17] applied RF and geodesic distance volume on mid-slice CCT images. Before generating the forest training, feature selection was performed by selecting box features, represented as a single vector with 11 elements. In a preliminary study using a small number of NCCT cases, the authors proposed the first segmentation model for ADPKD kidneys using Histogram analysis, and K-means clustering [
18]. By applying ML algorithms, the performance of ADPKD kidney localization and segmentation models can be more accurate compared to IP-based approaches. This is because the model is designed through a learning process using extracted features. However, the performance of the derived ML model depends on the feature extraction and selection methods. If inappropriate features are selected, the localization and segmentation accuracy will be poor. Moreover, during feature extraction and selection, important features might be overlooked. To overcome the limitations of handcrafted feature extraction in IP and ML approaches, DL is increasingly utilized. DL automates the feature extraction process, which is otherwise manually performed in traditional methods. For example, in [
19], the authors designed an ADPKD segmentation model using FCN with a Visual Geometry Group (VGG-16) backbone on CCT images. Due to the presence of liver cysts, the localization and segmentation model could potentially overestimate TKV. To address this, the authors [
20] proposed a method using AlexNet architecture and Marginal Space Learning (MSL) to improve ADPKD kidney localization and segmentation. Their method classified patches into two classes: abdomen and kidney localization. However, the division of predefined abdomen classes was not analyzed, which could affect the optimal range values derived from the model. The kidney localization model in their approach was created by manually cropping the kidney area and then dividing the cropped images into positive and negative patches. In contrast, the authors [
21] designed an ADPKD segmentation model using the U-Net architecture on T2-weighted MRI. They introduced a multi-observer concept, where several iterations were required to find the optimal networks. The final segmentation result was determined by applying a voting scheme among the optimal networks. While this method is robust, it requires a time-consuming training phase.
The U-Net architecture has gained widespread popularity and has been applied in various studies using 2D MRI and 3D CT imaging [
30,
31,
32]. However, 2D segmentation using MRI is limited by the small number of samples used in the architectures [
30,
31]. In contrast, the authors employed NCCT and CCT for 3D segmentation [
32]. Nevertheless, the proportion of NCCT and CCT used in their experiments is imbalanced, with approximately
allocated to NCCT and
to CCT. Using NCCT image data, an ADPKD kidney segmentation model was developed with an IP technique, where the kidney area was manually cropped, and FCN architecture was applied with two data augmentation techniques, rotation and scaling [
23]. However, the proposed method was based on a limited number of NCCT images. For ADPKD volumetry, 182 NCCT and 32 CCT image data were used with V-Net [
26]. However, to ensure the robustness of the derived ADPKD kidney segmentation model, balanced data from both CT types is required. To solve this limitation, in this paper, we propose an automatic ADPKD localization and segmentation model by utilizing a balanced number of NCCT (100 patients) and CCT (i.e., 100 samples) images, specifically from ADPKD patients associated with liver cysts.
4. Discussion
Analyzing the progressive loss of renal function in ADPKD patients through medical image data is crucial. Therefore, accurate TKV measurement on NCCT and CCT scans is both essential and challenging. In this paper, we propose an AI-based framework for ADPKD kidney localization, segmentation, and TKV measurement for both NCCT and CCT. The proposed method integrates traditional IP techniques with DL architectures, such as SSD Inception V2 for localization, DeepLab V3+ Xception65 for segmentation, and an ML approach using the DTR algorithm for TKV estimation.
In the first part of this work, we proposed an automatic ADPKD localization model for both NCCT and CCT. The localization model was trained and validated using the training set for over
k rounds, which was followed by testing on an independent testing set. Based on the evaluation results, our model outperformed SSD MobileNet V1 and Faster R-CNN Inception ResNet V2 architectures, achieving
ACC,
PR,
RE,
DS, and
mAP. Our proposed localization model demonstrates a higher mAP than previous works such as [
24,
46], where R-CNN-based detection resulted in a high false positive, leading to low AP =
when MRI was used as input. As shown in
Figure 7 and
Figure 9, our localization model accurately localizes the left kidney (i.e., blue box) and right kidney (i.e., green box) with a high confidence score. Despite the presence of liver cysts, our model successfully localizes the ADPKD kidneys by predicting the bounding boxes, although some parts of adjacent organs, like the liver and spleen, may also be included within the detected box. To address this limitation, we further proposed an automatic ADPKD segmentation model.
The main aim of our second proposed idea is to extract the precise shape of ADPKD kidneys from both NCCT and CCT. Similar to the first localization idea, we trained and validated the segmentation model over
k rounds and then tested the model using a separate testing set. As shown in
Table 7 and
Table 9, the proposed model achieved an average of
PR,
RE,
DS, and
mIoU on both NCCT and CCT. With an mIoU =
, our method outperforms other ADPKD segmentation architectures [
24,
46], which achieved an IoU of
using MRI images. When using CCT as input, our proposed segmentation model achieved a higher DS =
compared to [
19], which reported as DS of
. These results are reflected in the segmented images shown in
Figure 11 and
Figure 13 for NCCT and CCT, respectively. It can be observed that our segmentation model robustly segments ADPKD kidneys of varying sizes and shapes, even in the presence of liver cysts. The outputs of this segmentation model are then used to design the TKV estimation model. To complete our proposed idea, we introduce the third component: the TKV estimation model. As discussed in the performance evaluation results, our proposed method using DTR achieved high
for NCCT and
for CCT. Using k-fold cross validation, the TKV estimation model achieved
=
with an SD of
for NCCT and
=
with an SD of
for CCT.
One of the limitations of our study is the small dataset size, which can affect the generalization of our models. Although our results show high mIoU, some segmentation errors do occur in certain cases, such as when cysts overlap with neighboring organs or have homogeneous intensity. These errors may impact the subsequent task of TKV estimation, and as a result, inaccurate TKV could affect treatment decisions or patient outcomes. To address the small dataset size and segmentation errors, we plan to conduct future experiments with a large dataset and additional external validation. By doing so, we aim to improve the generalizability and reliability of our models. Despite strong performance and lower sensitivity to outliers compared to other regression algorithms, outliers can still influence how the decision tree splits the data. As a result, the tree may overfit to extreme outliers, causing splits that fail to reflect the true distribution of the data. Consequently, this could result in less precise TKV estimations. In the future, we plan to mitigate the influence of outliers by employing outlier removal techniques.
To make the proposed method feasible in real-world settings, we could integrate the automatic ADPKD kidney localization, segmentation, and TKV estimation into a single, user-friendly desktop-based software pipeline in the future. This software could be installed in hospital systems, allowing clinicians and radiologists to compare their conventional methods with our software. This integration can enhance diagnostic accuracy and decision-making for treatment. Additionally, through this comparison, we can validate our model’s results against conventional methods, ensuring the reliability and accuracy of the developed software.