Next Article in Journal
Social Introversion Personality Trait as Predictor of Internalizing Symptoms in Female Adolescents with Gender Dysphoria
Next Article in Special Issue
Outcomes of the Next In Vitro Fertilization Cycle in Women with Polycystic Ovary Syndrome after a Failed In Vitro Maturation Attempt
Previous Article in Journal
Association of aPTT-Guided Anticoagulation Monitoring with Thromboembolic Events in Patients Receiving V-A ECMO Support: A Systematic Review and Meta-Analysis
Previous Article in Special Issue
Could Inhibiting the DNA Damage Repair Checkpoint Rescue Immune-Checkpoint-Inhibitor-Resistant Endometrial Cancer?
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

High-Resolution Single-Shot Fast Spin-Echo MR Imaging with Deep Learning Reconstruction Algorithm Can Improve Repeatability and Reproducibility of Follicle Counting

1
Department of Radiology, Renmin Hospital of Wuhan University, Wuhan 430060, China
2
Reproductive Medicine Center, Renmin Hospital of Wuhan University, Wuhan 430060, China
3
MR Research, GE Healthcare, Beijing 100080, China
4
First School of Clinical Medicine of Wuhan University, Wuhan 430060, China
5
Department of Obstetrics, Renmin Hospital of Wuhan University, Wuhan 430060, China
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
J. Clin. Med. 2023, 12(9), 3234; https://doi.org/10.3390/jcm12093234
Submission received: 11 February 2023 / Revised: 30 March 2023 / Accepted: 28 April 2023 / Published: 30 April 2023
(This article belongs to the Special Issue New Advances in Clinical Reproductive Medicine)

Abstract

:
Objective: To investigate the diagnostic performance of high-resolution single-shot fast spin-echo (SSFSE) imaging with deep learning (DL) reconstruction algorithm on follicle counting and compare it with original SSFSE images and conventional fast spin-echo (FSE) images. Methods: This study included 20 participants (40 ovaries) with clinically confirmed polycystic ovary syndrome (PCOS) who underwent high-resolution ovary MRI, including three-plane T2-weighted FSE sequences and slice-matched T2-weighted SSFSE sequences. A DL reconstruction algorithm was applied to the SSFSE sequences to generate SSFSE-DL images, and the original SSFSE images were also saved. Subjective evaluations such as the blurring artifacts, subjective noise, and clarity of the follicles on the SSFSE-DL, SSFSE, and conventional FSE images were independently conducted by two observers. Intra-class correlation coefficients and Bland–Altman plots were used to present the repeatability and reproducibility of the follicle number per ovary (FNPO) based on the three types of images. Results: SSFSE-DL images showed less blurring artifact, subjective noise, and better clarity of the follicles than SSFSE and FSE (p < 0.05). For the repeatability of the FNPO, SSFSE-DL showed the highest intra-observer (ICC = 0.930; 95% CI: 0.878–0.962) and inter-observer (ICC = 0.914; 95% CI: 0.843–0.953) agreements. The inter-observer 95% limits of agreement (LOA) for SSFSE-DL, SSFSE, and FSE ranged from −3.7 to 4.5, −4.4 to 7.0, and −7.1 to 7.6, respectively. The intra-observer 95% LOA for SSFSE-DL, SSFSE, and FSE ranged from −3.5 to 4.0, −5.1 to 6.1, and −5.7 to 4.2, respectively. The absolute values of intra-observer and inter-observer differences for SSFSE-DL were significantly lower than those for SSFSE and FSE (p < 0.05). Conclusions: Compared with the original SSFSE images and the conventional FSE images, high-resolution SSFSE images with DL reconstruction algorithm can better display follicles, thus improving FNPO assessment.

1. Introduction

Polycystic ovary syndrome (PCOS) is the most common endocrine disorder in women of reproductive age, affecting 6–15% of this population [1]. The diagnosis of PCOS needs to meet the following two of three criteria: clinically or biochemically confirmed hyperandrogenism, irregular menstrual cycles, and polycystic ovary morphology (PCOM) [2]. Therefore, the determination of PCOM is of great value when the diagnosis of PCOS is uncertain according to the clinical manifestations and laboratory findings. The main markers of PCOM are an increase in ovarian volume (OV) and an excess of follicles, with the latter being more diagnostic than the former [3].
According to the revised criteria from the 2003 Rotterdam Consensus Workshop, PCOM was defined as 12 or more follicles at the size of 2–9 mm per ovary and/or OV greater than 10 cm3 in adults by using transvaginal ultrasonography (TVUS) [2]. However, the Androgen Excess and Polycystic Ovary Syndrome Society Task Force recommended increasing the threshold value of follicle number per ovary (FNPO) to 25 for PCOM determination when using advanced ultrasound scanners with improved resolution [3]. The latest international evidence-based guideline has recommended that the threshold value of FNPO should be set at 20 follicles when using newer technology (i.e., transducer frequency ≥ 8 MHz) [4]. The used thresholds of follicle counting using TVUS in adult women varied [3,4,5,6], not to mention age-specific diagnostic criteria for adolescents, in whom TVUS is generally contraindicated. However, TVUS still remains the most widely used imaging measure due to its convenience, safety, and low costs [3,4,7]. The accuracy of follicle count estimation is also low due to its inherent disadvantages, such as operator dependence and limited field-of-view. Furthermore, the poor resolution of the transabdominal pelvic ultrasound in adolescents and women with no sexual history makes it more difficult to perform reliable follicle counting.
Not limited by the observation field-of-view, patient size, or sexual history, magnetic resonance imaging (MRI) is increasingly used as an alternative to TVUS in the characterization of the ovarian morphology in both adult women and adolescents with suspected PCOS [5,7,8]. It has been confirmed that two-dimensional (2D) MRI sequences are more effective than TVUS for detecting small follicles and thus have been recommended as a better method for follicle counting [9,10]. Fast spin-echo (FSE) based T2-weighted imaging (T2WI) is currently the mainstay of ovary MRI for the evaluation of polycystic ovary [5,11,12]. Due to the lack of standard imaging protocols, the repeatability of follicle counting is still controversial. To obtain a reliable assessment of follicle counting, MR images with high spatial resolution and signal-to-noise ratio (SNR) could better delineate the follicles with various sizes.
FSE imaging is relatively easy to achieve high SNR and spatial resolution at the expense of acquisition time, which may lead to increased motion artifacts. Single-shot FSE (SSFSE), as a fast imaging sequence, is not sensitive to motion, but it is not a preferred sequence for the ovary MRI due to its blurring and relatively low SNR, especially when high-resolution imaging is pursued [13,14]. Recently, deep learning (DL) reconstruction algorithm has been widely used in medical imaging and shows great potential in improving SNR for high-resolution imaging without increasing scanning time [15,16]. As far as we know, no study has investigated the diagnostic efficacy of the DL-reconstructed SSFSE (SSFSE-DL) for follicle counting. In this study, we aimed to establish an MR imaging protocol with higher reproducibility of follicle number via comparisons of the repeatability of follicle counts based on original SSFSE, SSFSE-DL, and conventional FSE images.

2. Materials and Methods

2.1. Subjects

This prospective single-center study was approved by the Clinical Ethics Committee of Renmin Hospital of Wuhan University (WDRY2021-K028) and followed the principles of the Declaration of Helsinki. We performed a prior analysis using G. power to determine the required sample size for the comparison of subjective scores of SSFSE with DL or without DL and FSE images. The analysis indicated that a minimum of 34 follicle counts were needed to ensure a test efficiency of ≥0.8. We prospectively recruited a total of 22 clinically confirmed PCOS cases, including 6 adolescent girls and 16 adult women, from the departments of gynecology, endocrinology, reproductive medicine, and pediatrics at Renmin Hospital of Wuhan University between June 2022 and November 2022. The diagnosis of PCOS in adult women met at least two of the following signs: clinically or biochemically confirmed hyperandrogenism, irregular menstrual cycles, and PCOM [2]. For adolescents, both the criteria of hyperandrogenism and irregular cycles are needed. Informed consent was obtained from all participants aged ≥ 18 years. For participants under the age of 18, consent of the parents and participants were both required. Patients aged 14–35 years with clinically confirmed PCOS would be considered to be recruited. Concurrently, patients who (a) were pregnant, (b) had ever received ovarian surgery, or (c) had contraindications for MRI, or (d) could not tolerate the MRI examination would be excluded.

2.2. Magnetic Resonance Imaging

All participants underwent ovary MR imaging on a 3-T MR imaging unit (SIGNA Architect; GE Medical Systems, Milwaukee, WI, USA) with a 30-channel phased-array coil. The feet-first supine position was adopted for all the participants during the MRI examination. Sedation or injection of contrast agents is not used for good patient compliance during the examination. All MRI protocols consisted of three-plane (axial, coronal, sagittal) T2-weighted FSE sequences (TR/TE = 4214–5478 ms/104 ms; receiver bandwidth = ±50 kHz; slice thickness/spacing = 3 mm/0 mm; ETL = 22; number of excitations = 2; FOV = 18 cm; acquisition matrix (frequency × phase) = 352 × 352 and total scan time = 6 min 30 s–7 min 15 s) and matched three-plane T2-weighted SSFSE sequences (TR/TE = 1700 ms/84 ms; receiver bandwidth = ±62.5 kHz; slice thickness/spacing = 3 mm/0 mm; number of excitations = 1; FOV = 18 cm; acquisition matrix (frequency × phase) = 352 × 352 and total scan time = 1 min 30 s–1 min 54 s). A commercially available DL reconstruction algorithm (AIRTM Recon DL, GE Healthcare) was used in the SSFSE sequences (abbreviated as SSFSE-DL), and the original images (abbreviated as SSFSE) were also saved. There were finally three image types, including SSFSE, SSFSE-DL, and FSE, for each participant.

2.3. Qualitative Analysis

Three types of axial T2-weighted images for each participant were evaluated for the display performance of the follicles. A radiologist with 5 years of diagnostic experience in pelvic MRI randomized the 60 image sets before analyses. Two other radiologists who were blinded to the imaging and clinical information assessed bilateral ovaries on each participant independently and subjectively. The evaluation items were as follows: blurring artifacts on a 3-point scale (3 = almost no blurring artifacts, 2 = mild blurring artifacts, and 1 = severe blurring artifacts); subjective noise on a 3-point scale (3 = almost no noise, 2 = mild noise, 1 = severe noise); clarity of the follicles on a 5-point scale (5 = excellent, 4 = good, 3 = fair, 2 = poor, 1 = uninterpretable).

2.4. Assessment of Follicle Number Per Ovary (FNPO)

The FNPO was determined by counting the number of follicles measuring ≥ 1 mm long axis based on each image type. To avoid a repeat count of the follicles in different slices, the follicles were counted on the axial images, and the corresponding sagittal and coronal images would be referred to at the same time. The assessments of FNPO for both ovaries on each participant were performed twice by observer 1 (L.L., a board-certified radiologist with more than 10 years of experience in pelvic MRI) and once by observer 2 (Z.W., a third-year radiology resident with 6 months of training in pelvic MRI). To avoid any potential recall bias, there was an interval of 4 weeks between two assessments by observer 1.

2.5. Statistical Analysis

The subjective scoring data were reported as mean ± standard deviation (SD). Qualitative results were compared between SSFSE-DL and SSFSE and between SSFSE-DL and FSE by using Wilcoxon signed-rank tests. Cohen’s weighted kappa was used to assess inter-observer agreement. Kappa values of 0.00–0.20, 0.20–0.40, 0.40–0.60, 0.60–0.80, and 0.80–1.00 indicated poor, fair, moderate, good, and excellent agreement, respectively [17]. Confidence intervals (CIs) for the kappa values were calculated.
Intra-class correlation coefficients (ICCs) were calculated to show the intra-observer and inter-observer agreements for FNPO assessment, including FNPO assessment by reviewer 1 for the first and second session and reviewer 1 and reviewer 2 for the first session. The agreement levels were as follows: ICC greater than 0.80, almost perfect agreement; 0.61–0.80, substantial agreement; 0.41–0.60, moderate agreement; 0.21–0.40, fair agreement; 0.0–0.2, slight agreement; and less than 0.0, poor agreement. Intra-observer and inter-observer variabilities were assessed by Bland–Altman plots [18]. The mean difference and 95% limits of agreement (LOA), which are defined as the mean difference ± 1.96 SDs, were calculated. To further assess the significance of the intra-observer and inter-observer differences between SSFSE-DL and SSFSE and between SSFSE-DL and FSE, the absolute values of the intra-observer and inter-observer differences were compared by using a paired t test. A paired t test was also used to compare the repeatability and reproducibility within each image group.
Statistical analyses were performed using MedCalc Software (v. 11.6, Nariakerke, Belgium). p < 0.05 was considered statistically significant.

3. Results

3.1. Study Participants

Among the recruited 22 participants, one adult woman was excluded because of the surgery of the right ovary for ovarian teratoma, and one adolescent girl was excluded due to her intolerance during the MRI examination. Finally, this study included 20 Chinese women (5 adolescent girls and 15 adult women) with a mean age of 22.5 ± 4.7 years (range: 15–31 years), mean gynecological age of 10.5 ± 4.9 years (range: 3–20 years), and mean body mass index (BMI) of 24.2 ± 3.6 kg/m2 (range: 16.3–32.2 years). Four adolescent girls admitted no sexual history, and 1 adolescent girl and all 15 adult women had a sexual history. A total of 40 ovaries were included for further evaluation.

3.2. Qualitative Image Analysis

Two independent observers conducted the subjective assessment on a total of 40 ovaries in 60 images. Table 1 summarizes the comparison results of the qualitative image analyses between SSFSE-DL and SSFSE as well as between SSFSE-DL and FSE for each observer. Each subjective evaluation item was presented as mean ± SD. Significantly better scores of the blurring artifacts, subjective noise, and clarity of the follicles rated by observers were found between SSFSE-DL and SSFSE (p < 0.05) as well as FSE (p < 0.05). The use of SSFSE helped to reduce the blurring artifacts compared to the FSE sequences, and the application of DL reconstruction helped to reduce the noise of the high-resolution SSFSE images. Two typical cases are shown in Figure 1 and Figure 2. The inter-observer agreements of the three subjective aspects were good for SSFSE-DL (Kappa = 0.671–0.789), moderate to good for SSFSE (Kappa = 0.580–0.636), and FSE (Kappa = 0.474–0.664).

3.3. Repeatability and Reproducibility of FNPO Assessment

Table 2 presents the mean and range of FNPO based on each image type across all 40 ovaries according to each observer. The FNPO assessments for all 40 ovaries were performed twice by observer 1 and once by observer 2. For two assessment sessions conducted by observer 1 and for one session performed by observer 2, a range of 12 to 38 and 14 to 35 follicles on SSFSE-DL images, 12 to 32 and 14 to 31 follicles on SSFSE images, 11 to 32 and 12 to 34 follicles on FSE images were, respectively, reported. Table 3 summarizes the ICCs for the FNPO assessments according to each image type. SSFSE-DL showed the best intra-observer agreement as a higher ICC of 0.930 compared to SSFSE (0.825) and FSE (0.839). In addition, SSFSE-DL showed the best inter-observer agreement with a higher ICC of 0.914 compared to SSFSE (0.798) and FSE (0.669).
Bland–Altman plots in the representation of the intra-observer and inter-observer variability of FNPO assessments for each image type are presented in Figure 3. The mean differences of intra-observer were 0.3, 0.5, and −0.7 for SSFSE-DL, SSFSE, and FSE, respectively, and the mean differences for inter-observer were 0.4, 1.3, and 0.3 for SSFSE-DL, SSFSE and FSE, respectively. The 95% LOA of inter-observer for SSFSE-DL, SSFSE, and FSE ranged from −3.7 to 4.5, −4.4 to 7.0, and −7.1 to 7.6, respectively, and the 95% LOA of intra-observer for those ranged from −3.5 to 4.0, −5.1 to 6.1, and −5.7 to 4.2, respectively. SSFSE-DL showed the narrowest inter-observer and intra-observer 95% LOA compared to SSFSE and FSE. The absolute values of intra-observer and inter-observer differences were significantly lower for SSFSE-DL than for SSFSE and FSE (p < 0.05). SSFSE-DL showed significantly reduced intra-observer and inter-observer variability for the FNPO assessments. In addition, there were no significant differences between the absolute values of intra-observer and inter-observer differences neither for SSFSE-DL (p = 0.071) nor for SSFSE (p = 0.143). However, for FSE, the absolute values of intra-observer differences were significantly lower than those of inter-observer (p = 0.001) (Table 4).

4. Discussion

To the best of our knowledge, this is the first study evaluating the diagnostic performance of DL-reconstructed SSFSE on the antral follicle counts up to the present. Our data demonstrated that SSFSE-DL significantly improved image quality (i.e., reduced blurring artifacts and image noise). The counts of the follicles assessed by SSFSE-DL had the best intra-observer repeatability and inter-observer reproducibility for the FNPO assessment.
TVUS is currently the most widely used imaging measure to identify PCOM [19], and thus, the criteria for PCOM determination have been largely based on TVUS. Additionally, with the advancement of ultrasound scanners, the threshold value for the identification of PCOM has increased from 12 to 20 [2,3,4], highlighting the substantial impact of the resolution of TVUS on the accuracy of follicle counting. Nevertheless, when compared to 2D MRI, the follicle number is still likely to be underestimated. Wang et al. [10] compared 2D TVUS and 2D MRI for the estimation of follicle count in 84 adult women with infertility. They reported that the follicle counts at 1–9 mm and 1–3 mm assessed by 2D TVUS was about 8 and 13 follicles smaller than those evaluated by 2D MRI, respectively. Nylander et al. [20] compared 2D TVUS, 3D TVUS, and 2D MRI for the estimation of follicle count on 66 overweight women with PCOS and reported that the follicle count assessed by 2D TVUS was 18% and 16% smaller than those evaluated by 3D TVUS and 2D MRI, respectively. In a study conducted by Leonhardt et al. [9], the estimation of follicle count using 3D TVUS and 2D MRI in 99 women aged 21–37 years showed that the total follicle count by 2D MRI was approximately 14 follicles higher than that by 3D TVUS, in particular, the 1–3 mm follicle count by 2D MRI was 22 follicles more than that by 3D TVUS. In view of these above-mentioned findings, 2D MRI is an effective alternative to TVUS for detecting a greater number of small follicles. However, the inter-observer agreement for the small follicles by 2D MRI was found to be poor to moderate [9,10]. The spatial resolution of the 2D MR images is a key factor that affects the repeatability of the follicle count assessment.
To achieve high-resolution imaging of the ovary, an identical acquisition voxel of 0.5 × 0.5 × 3 mm3 was set for both SSFSE and FSE T2WI sequences in our study. The noise was well resolved in the FSE images at the expense of an acquisition time of approximately 2 min for each sequence. Severe image noise appeared in the SSFSE images on account of the excessively small voxel, let alone its inherently low SNR. The reason is that there is a trade-off between resolution and SNR when using the conventional inverse Fourier transform (iFT) reconstruction method [21]. In a previous study of ovarian MRI, there was significant image noise on the conventional SSFSE images, even with a slice thickness of 6 mm and larger FOV [12]. However, SSFSE-DL images showed significantly improved image noise compared to the conventionally reconstructed SSFSE images, and the noise was even less than the FSE images, indicating that the image noise was adequately removed in the images reconstructed by a deep convolutional neural network-based AIRTM Recon DL. This neural network model embedded in the MR image reconstruction pipeline employs a cascade of over 100 thousand unique pattern recognitions for images with noise and low resolution to reconstruct high-resolution images with high SNR [22]. Better SNR on DL reconstructed images enhanced the signal strength of the long-T2-value follicles and increased the contrast between follicles and the surrounding ovarian stroma, thus facilitating the count of the follicles. Moreover, SSFSE-DL achieved a superior contrast between pelvic fluid collection and surrounding fat tissue than FSE T2WI without fat saturation.
In this study, SSFSE-DL images showed fewer blurring artifacts when compared with the original SSFSE and FSE images using conventional reconstruction. The embedded AIRTM Recon DL ahead of conventional reconstruction was one of the main reasons for our result. A conventional image-based linear filter can partially suppress ringing artifacts and noise caused by the iFT reconstruction method, while portions of the acquired k-space data are reduced for consistency with the unsampled high-frequency data, only partially removing these artifacts but broadening the point spread function, thus degrading resolution and blurring images [23]. However, AIRTM Recon DL can use truncation artifacts as indicators of missing information rather than general suppression of truncation artifacts so as to make the target edge clear to the collected data [22]. In addition to the modified reconstruction pipeline, the blurring on the FSE images is mainly attributed to the inevitable motion during the relatively long acquisition time, such as the respiratory and bowel motility. In other words, imaging speed is indeed the decisive factor in reducing motion artifacts. In this study, the acquisition time for a single slice using the SSFSE sequence was less than one second; thus, both respiratory motion and bowel peristalsis were easily frozen, which significantly reduced the blurring effect related to motion. Therefore, the follicles with various sizes were well delineated on the SSFSE-DL images, making the follicle counting more easily and accurately. Similarly, the overall image quality of DL-reconstructed SSFSE for the uterus MRI was equal to or better than that of the periodically rotated overlapping parallel lines with enhanced reconstruction (PROPELLER) sequence [14]. In addition to the female pelvis, the combination of DL reconstruction and SSFSE sequence was also validated in the liver MRI depending on its superior image quality relative to that of the standard FSE sequence, despite its inferior image quality relative to that of the PROPELLER sequence [24,25].
The accurate assessment of follicle count plays an important role in the identification of PCOM for patients with suspected PCOS [26,27] and in the evaluation of ovarian reserve for infertile women [28]. Many studies have reported thresholds to identify PCOM using conventional FSE T2WI. The follicle counts of 20.5 on 0.7 × 0.7 × 6.0 mm3 images were recommended to identify adolescents and young adults with PCOS [29]. The maximum specificity for follicle number per section of 17 could differentiate PCOS adolescents from matched controls on images with a slice thickness of 3–4 mm [5]. The FNPO of 28 (≤9 mm) showed the best sensitivity and specificity on the images of 3.5 mm slice thickness [11]. These variable thresholds in these studies demonstrate that repeatability of the follicle count estimation dominates the superiority of the identification criteria for PCOM.
To evaluate the repeatability of follicle counting using the conventional FSE T2WI sequences, various scanning parameters were examined. In an ovarian MRI study with an acquisition voxel of about 0.7 × 0.7 × 4 mm3 on 1.5T, consistencies for the total follicle count were good, while the inter-observer consistency in evaluating follicles 1 to 3 mm in size was poor to moderate [9]. In another study with an acquisition voxel of about 0.7 × 0.7 × 3 mm3 on 3T, the intra- and inter-observer consistencies for the total follicle count were substantial, while the inter-observer consistency for follicles 1 to 6 mm in size was moderate [10]. We speculated that the mediocre performance of the conventional FSE images on the repeatability of follicle count was mainly caused by the blurring artifacts and limited spatial resolution, which hindered the discrimination of adjacent follicles, especially those of small sizes. For DL-reconstructed SSFSE in our study, the intra- and inter-observer agreements for the FNPO assessment were significantly improved and benefited from the better clarity of individual ovarian follicles compared to the widely-used FSE sequences. Moreover, the inter-observer variability comparable to the intra-observer’s variability implied that reliable follicle counting based on the SSFSE-DL images could be achieved by two different radiologists, while the inter-observer’s variability for the follicle counting using the FSE images was higher than the intra-observer’s. Therefore, SSFSE-DL images were shown to be more effective and independent of the experience of radiologists in the identification of polycystic ovaries than the conventional FSE images.
Our study had several limitations. First, only adolescent girls and adult women with clinically confirmed PCOS were included, and no control subjects were included. Moreover, it should be noted that seven of the participants had already received oral hormonal contraceptive agents for treatment prior to imaging, making it infeasible to determine thresholds for PCOM identification in our study. However, our study placed emphasis on the repeatability of the FNPO assessment that was not affected by the inclusion criterion. The identification of PCOM, especially in adolescent girls, using the SSFSE-DL sequence will be explored. Second, the follicles were not classified by size in this study. It was because the blurring artifacts may lead to the unclear delineation of the follicles, resulting in a false measurement of a single follicle or multiple follicles being considered as one. Thus, the classification of follicle size may reduce the reliability of the repeatability assessment of the follicle count, especially when there are severe artifacts. Third, although the follicle counting was performed on the axial images and with reference lines on the other two imaging planes, the respiratory movement and intestinal peristalsis may still lead to a mismatch of the images on the three different planes, potentially resulting in repeated or missed follicle counts. To mitigate the effects of slice mismatch, the incorporation of breath-hold acquisition mode and gastrointestinal motility inhibitors would be an optimal option if possible. Fourth, although the follicles were more clearly displayed on the SSFSE-DL images, the long T2 relaxation time may affect the display of other pelvic structures and the detection rate of lesions, which warrants further investigation.
In conclusion, compared with the original SSFSE images and conventional FSE images, high-resolution SSFSE images with DL reconstruction can significantly improve the repeatability and reproducibility of FNPO evaluation, benefiting from the better display of the follicles, which indicates that SSFSE-DL is a more stable and reliable method for follicle count assessment.

Author Contributions

Conceptualization, Y.Z. (Yunfei Zha), R.Y. and C.L.; data acquisition, R.Y., C.L. and Z.W.; funding acquisition, M.H. and Y.Z. (Yujie Zou); methodology, Y.Z. (Yunfei Zha), Y.Z. (Yujie Zou), R.Y. and C.S.; data analysis, L.L., C.S. and Z.W.; writing—original draft preparation, R.Y., Y.Z. (Yujie Zou) and W.L.; writing—review and editing, Y.Z. (Yunfei Zha), W.L., M.H. and L.L. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Key Laboratory Project of Hubei Province (No. 2021KFY005) and the Interdisciplinary Innovative Talents Foundation from Renmin Hospital of Wuhan University (No.JCRCFZ-2022-011).

Institutional Review Board Statement

This work was approved by the Clinical Ethics Committee of Renmin Hospital of Wuhan University (WDRY2021-K028).

Informed Consent Statement

Informed consent was obtained from all participants aged ≥ 18 years. For participants aged < 18 years, informed consent from parents and participants were both obtained.

Data Availability Statement

The datasets analyzed during the current study are available from the corresponding author on reasonable request.

Acknowledgments

The authors would like to thank the technical staff of the GE Healthcare.

Conflicts of Interest

One of the authors (Weiyin Vivian Liu) is an employee of GE Healthcare. The remaining authors declare no competing interests.

References

  1. Fauser, B.C.; Tarlatzis, B.C.; Rebar, R.W.; Legro, R.S.; Balen, A.H.; Lobo, R.; Carmina, E.; Chang, J.; Yildiz, B.O.; Laven, J.S.; et al. Consensus on women’s health aspects of polycystic ovary syndrome (PCOS): The Amsterdam ESHRE/ASRM-Sponsored 3rd PCOS Consensus Workshop Group. Fertil. Steril. 2012, 97, 28–38.e25. [Google Scholar] [CrossRef] [PubMed]
  2. Rotterdam ESHRE/ASRM-Sponsored PCOS Consensus Workshop Group. Revised 2003 consensus on diagnostic criteria and long-term health risks related to polycystic ovary syndrome (PCOS). Hum. Reprod. 2004, 19, 41–47. [Google Scholar] [CrossRef] [PubMed]
  3. Dewailly, D.; Lujan, M.E.; Carmina, E.; Cedars, M.I.; Laven, J.; Norman, R.J.; Escobar-Morreale, H.F. Definition and significance of polycystic ovarian morphology: A task force report from the Androgen Excess and Polycystic Ovary Syndrome Society. Hum. Reprod. Update 2014, 20, 334–352. [Google Scholar] [CrossRef]
  4. Teede, H.J.; Misso, M.L.; Costello, M.F.; Dokras, A.; Laven, J.; Moran, L.; Piltonen, T.; Norman, R.J.; International PCOS Network. Recommendations from the international evidence-based guideline for the assessment and management of polycystic ovary syndrome. Hum. Reprod. 2018, 33, 1602–1618. [Google Scholar] [CrossRef] [PubMed]
  5. Kenigsberg, L.E.; Agarwal, C.; Sin, S.; Shifteh, K.; Isasi, C.R.; Crespi, R.; Ivanova, J.; Coupey, S.M.; Heptulla, R.A.; Arens, R. Clinical utility of magnetic resonance imaging and ultrasonography for diagnosis of polycystic ovary syndrome in adolescent girls. Fertil. Steril. 2015, 104, 1302–1309.e94. [Google Scholar] [CrossRef] [PubMed]
  6. Lujan, M.E.; Jarrett, B.Y.; Brooks, E.D.; Reines, J.K.; Peppin, A.K.; Muhn, N.; Haider, E.; Pierson, R.A.; Chizen, D.R. Updated ultrasound criteria for polycystic ovary syndrome: Reliable thresholds for elevated follicle population and ovarian volume. Hum. Reprod. 2013, 28, 1361–1368. [Google Scholar] [CrossRef]
  7. Kayemba-Kay’s, S.; Pambou, A.; Heron, A.; Benosman, S.M. Polycystic ovary syndrome: Pelvic MRI as alternative to pelvic ultrasound for the diagnosis in overweight and obese adolescent girls. Int. J. Pediatr. Adolesc. Med. 2017, 4, 147–152. [Google Scholar] [CrossRef]
  8. Barber, T.M.; Alvey, C.; Greenslade, T.; Gooding, M.; Barber, D.; Smith, R.; Marland, A.; Wass, J.A.; Child, T.; McCarthy, M.I.; et al. Patterns of ovarian morphology in polycystic ovary syndrome: A study utilising magnetic resonance imaging. Eur. Radiol. 2010, 20, 1207–1213. [Google Scholar] [CrossRef]
  9. Leonhardt, H.; Gull, B.; Stener-Victorin, E.; Hellström, M. Ovarian volume and antral follicle count assessed by MRI and transvaginal ultrasonography: A methodological study. Acta Radiol. 2014, 55, 248–256. [Google Scholar] [CrossRef]
  10. Wang, S.J.; Zhang, M.M.; Duan, N.; Hu, X.Y.; Ren, S.; Cao, Y.Y.; Zhang, Y.P.; Wang, Z.Q. Using transvaginal ultrasonography and MRI to evaluate ovarian volume and follicle count of infertile women: A comparative study. Clin. Radiol. 2022, 77, 621–627. [Google Scholar] [CrossRef]
  11. Fondin, M.; Rachas, A.; Huynh, V.; Franchi-Abella, S.; Teglas, J.P.; Duranteau, L.; Adamsbaum, C. Polycystic Ovary Syndrome in Adolescents: Which MR Imaging-based Diagnostic Criteria? Radiology 2017, 285, 961–970. [Google Scholar] [CrossRef] [PubMed]
  12. Brown, M.; Park, A.S.; Shayya, R.F.; Wolfson, T.; Su, H.I.; Chang, R.J. Ovarian imaging by magnetic resonance in adolescent girls with polycystic ovary syndrome and age-matched controls. J. Magn. Reson. Imaging 2013, 38, 689–693. [Google Scholar] [CrossRef] [PubMed]
  13. Yamashita, Y.; Tang, Y.; Abe, Y.; Mitsuzaki, K.; Takahashi, M. Comparison of ultrafast half-Fourier single-shot turbo spin-echo sequence with turbo spin-echo sequences for T2-weighted imaging of the female pelvis. J. Magn. Reson. Imaging 1998, 8, 1207–1212. [Google Scholar] [CrossRef] [PubMed]
  14. Tsuboyama, T.; Takei, O.; Okada, A.; Honda, T.; Kuriyama, K. Comparison of HASTE with multiple signal averaging versus conventional turbo spin echo sequence: A new option for T2-weighted MRI of the female pelvis. Eur. Radiol. 2020, 30, 3245–3253. [Google Scholar] [CrossRef] [PubMed]
  15. Kidoh, M.; Shinoda, K.; Kitajima, M.; Isogawa, K.; Nambu, M.; Uetani, H.; Morita, K.; Nakaura, T.; Tateishi, M.; Yamashita, Y.; et al. Deep Learning Based Noise Reduction for Brain MR Imaging: Tests on Phantoms and Healthy Volunteers. Magn. Reson. Med. Sci. 2020, 19, 195–206. [Google Scholar] [CrossRef]
  16. Chaudhari, A.S.; Fang, Z.; Kogan, F.; Wood, J.; Stevens, K.J.; Gibbons, E.K.; Lee, J.H.; Gold, G.E.; Hargreaves, B.A. Super-resolution musculoskeletal MRI using deep learning. Magn. Reson. Med. 2018, 80, 2139–2154. [Google Scholar] [CrossRef]
  17. Cohen, J. Weighted kappa: Nominal scale agreement with provision for scaled disagreement or partial credit. Psychol. Bull. 1968, 70, 213–220. [Google Scholar] [CrossRef]
  18. Bland, J.M.; Altman, D.G. Applying the right statistics: Analyses of measurement studies. Ultrasound. Obstet. Gynecol. 2003, 22, 85–93. [Google Scholar] [CrossRef]
  19. Campbell, S. Ultrasound Evaluation in Female Infertility: Part 1, the Ovary and the Follicle. Obstet. Gynecol. Clin. N. Am. 2019, 46, 683–696. [Google Scholar] [CrossRef]
  20. Nylander, M.; Frøssing, S.; Bjerre, A.H.; Chabanova, E.; Clausen, H.V.; Faber, J.; Skouby, S.O. Ovarian morphology in polycystic ovary syndrome: Estimates from 2D and 3D ultrasound and magnetic resonance imaging and their correlation to anti-Müllerian hormone. Acta Radiol. 2017, 58, 997–1004. [Google Scholar] [CrossRef]
  21. Gassenmaier, S.; Herrmann, J.; Nickel, D.; Kannengiesser, S.; Afat, S.; Seith, F.; Hoffmann, R.; Othman, A.E. Image Quality Improvement of Dynamic Contrast-Enhanced Gradient Echo Magnetic Resonance Imaging by Iterative Denoising and Edge Enhancement. Investig. Radiol. 2021, 56, 465–470. [Google Scholar] [CrossRef] [PubMed]
  22. Lebel, R.M. Performance characterization of a novel deep learning-based MR image reconstruction pipeline. arXiv 2020, arXiv:2008.06559. [Google Scholar] [CrossRef]
  23. van der Velde, N.; Hassing, H.C.; Bakker, B.J.; Wielopolski, P.A.; Lebel, R.M.; Janich, M.A.; Kardys, I.; Budde, R.P.J.; Hirsch, A. Improvement of late gadolinium enhancement image quality using a deep learning-based reconstruction algorithm and its influence on myocardial scar quantification. Eur. Radiol. 2021, 31, 3846–3855. [Google Scholar] [CrossRef] [PubMed]
  24. Shanbhogue, K.; Tong, A.; Smereka, P.; Nickel, D.; Arberet, S.; Anthopolos, R.; Chandarana, H. Accelerated single-shot T2-weighted fat-suppressed (FS) MRI of the liver with deep learning-based image reconstruction: Qualitative and quantitative comparison of image quality with conventional T2-weighted FS sequence. Eur. Radiol. 2021, 31, 8447–8457. [Google Scholar] [CrossRef] [PubMed]
  25. Herrmann, J.; Gassenmaier, S.; Nickel, D.; Arberet, S.; Afat, S.; Lingg, A.; Kündel, M.; Othman, A.E. Diagnostic confidence and feasibility of a deep learning accelerated HASTE sequence of the abdomen in a single breath-hold. Investig. Radiol. 2021, 56, 313–319. [Google Scholar] [CrossRef]
  26. Goodman, N.F.; Cobin, R.H.; Futterweit, W.; Glueck, J.S.; Legro, R.S.; Carmina, E. American Association of Clinical Endocrinologists, American College of Endocrinology, and Androgen Excess and PCOS Society disease state clinical review: Guide to the best practices in the evaluation and treatment of polycystic ovary syndrome—Part 1. Endocr. Pract. 2015, 21, 1291–1300. [Google Scholar] [CrossRef]
  27. Azziz, R.; Carmina, E.; Dewailly, D.; Diamanti-Kandarakis, E.; Escobar-Morreale, H.F.; Futterweit, W.; Janssen, O.E.; Legro, R.S.; Norman, R.J.; Taylor, A.E.; et al. Androgen Excess Society. Positions statement: Criteria for defining polycystic ovary syndrome as a predominantly hyperandrogenic syndrome: An Androgen Excess Society guideline. J. Clin. Endocrinol. Metab. 2006, 91, 4237–4245. [Google Scholar] [CrossRef]
  28. Zhou, S.J.; Zhao, M.J.; Li, C.; Su, X. The comparison of evaluative effectiveness between antral follicle count/age ratio and ovarian response prediction index for the ovarian reserve and response functions in infertile women. Medicine 2020, 99, e21979. [Google Scholar] [CrossRef]
  29. Pereira-Eshraghi, C.F.; Tao, R.; Chiuzan, C.C.; Zhang, Y.; Shen, W.; Lerner, J.P.; Oberfield, S.E.; Sopher, A.B. Ovarian follicle count by magnetic resonance imaging is greater in adolescents and young adults with polycystic ovary syndrome than in controls. F. S. Rep. 2022, 3, 102–109. [Google Scholar] [CrossRef]
Figure 1. Ovarian MRI in a 16-year-old adolescent girl with confirmed PCOS. SSFSE-DL images (upper row) show the lowest noise, blurring artifacts, and the bilateral enlarged ovaries with a dominant follicle and many small follicles. The pelvic fluid collection (arrow) is easily detected on the SSFSE-DL images. Noise is prominent on the SSFSE images (middle row). The FSE images (lower row) mainly showed blurring artifacts caused by respiratory and bowel motility.
Figure 1. Ovarian MRI in a 16-year-old adolescent girl with confirmed PCOS. SSFSE-DL images (upper row) show the lowest noise, blurring artifacts, and the bilateral enlarged ovaries with a dominant follicle and many small follicles. The pelvic fluid collection (arrow) is easily detected on the SSFSE-DL images. Noise is prominent on the SSFSE images (middle row). The FSE images (lower row) mainly showed blurring artifacts caused by respiratory and bowel motility.
Jcm 12 03234 g001
Figure 2. Ovarian MRI in a 28-year-old woman with confirmed PCOS. SSFSE-DL images (upper row) show the least noise and blurring artifacts. The bilateral enlarged ovaries with many small peripheral follicles are clearly delineated on the SSFSE-DL images. The display of the follicles is impaired by the noise on the SSFSE images (middle row) and by the blurring artifacts in the FSE images (lower row).
Figure 2. Ovarian MRI in a 28-year-old woman with confirmed PCOS. SSFSE-DL images (upper row) show the least noise and blurring artifacts. The bilateral enlarged ovaries with many small peripheral follicles are clearly delineated on the SSFSE-DL images. The display of the follicles is impaired by the noise on the SSFSE images (middle row) and by the blurring artifacts in the FSE images (lower row).
Jcm 12 03234 g002
Figure 3. Bland–Altman plots show intra-observer (left column) and inter-observer (right column) variability of FNPO assessments based on SSFSE-DL (upper row), SSFSE (middle row) and FSE (lower row) images. The x-axis represents the mean of two measurements, and the y-axis represents the differences between the two measurements. The solid lines (blue) indicate the mean differences between all measurements. The upper dashed lines (red) indicate a mean difference of + 1.96 SD, and the lower dashed lines (red) indicate a mean difference of −1.96 SD. The limits of agreement (mean difference ± 1.96 SDs) are expected that the limits include 95% of the differences between the two measurements.
Figure 3. Bland–Altman plots show intra-observer (left column) and inter-observer (right column) variability of FNPO assessments based on SSFSE-DL (upper row), SSFSE (middle row) and FSE (lower row) images. The x-axis represents the mean of two measurements, and the y-axis represents the differences between the two measurements. The solid lines (blue) indicate the mean differences between all measurements. The upper dashed lines (red) indicate a mean difference of + 1.96 SD, and the lower dashed lines (red) indicate a mean difference of −1.96 SD. The limits of agreement (mean difference ± 1.96 SDs) are expected that the limits include 95% of the differences between the two measurements.
Jcm 12 03234 g003
Table 1. Image quality scores for comparison between SSFSE-DL and SSFSE and between SSFSE-DL and FSE.
Table 1. Image quality scores for comparison between SSFSE-DL and SSFSE and between SSFSE-DL and FSE.
Rated FeatureObserverSSFSE-DLSSFSEFSESSFSE-DL vs.
SSFSE
SSFSE-DL vs.
FSE
Blurring artifacts12.600 ± 0.5982.200 ± 0.8341.950 ± 0.826p = 0.0117p = 0.0015
22.550 ± 0.5102.050 ± 0.7591.900 ± 0.788p = 0.0051p = 0.0022
Inter-observer agreement 0.720 (0.463–0.976)0.591
(0.338–0.843)
0.474
(0.165–0.783)
Subjective noise12.750 ± 0.5501.500 ± 0.5132.450 ± 0.605p = 0.0001p = 0.0277
22.650 ± 0.5871.700 ± 0.5712.250 ± 0.550p = 0.0001p = 0.0051
Inter-observer agreement 0.789
(0.498–1.000)
0.636
(0.352–0.921)
0.664
(0.366–0.961)
Clarity of the follicles14.150 ± 0.7453.700 ± 0.6573.150 ± 0.875p = 0.0164p = 0.0003
24.400 ± 0.6813.800 ± 0.6963.200 ± 1.005p = 0.0051p = 0.0007
Inter-observer agreement 0.671
(0.423–0.919)
0.565
(0.245–0.885)
0.536
(0.260–0.812)
Note. Data are presented as mean ± SD. Comparisons of the subjective scoring were performed using the Wilcoxon signed-rank test. Cohen’s weighted kappa (95% confidence interval) values are shown for inter-observer agreement.
Table 2. The descriptive statistics for follicle number per ovary.
Table 2. The descriptive statistics for follicle number per ovary.
SequenceObserverSessionMinimumMaximumMeanSD
SSFSE-DL11123824.75.4
22143524.45.0
13143624.34.8
SSFSE11123222.15.0
22143121.64.8
13133220.94.2
FSE11133220.84.1
22123421.64.7
13113020.65.0
Table 3. The intra-observer and inter-observer agreements for FNPO assessment based on each image type.
Table 3. The intra-observer and inter-observer agreements for FNPO assessment based on each image type.
SequenceIntra-Observer
ICC (95% CI)
Inter-Observer
ICC (95% CI)
SSFSE-DL0.930 (0.878–0.962)0.914 (0.843–0.953)
SSFSE0.825 (0.693–0.903)0.798 (0.650–0.888)
FSE0.839 (0.716–0.912)0.669 (0.454–0.810)
Note. Comparisons of the intra-observer and inter-observer agreements were performed using the intra-class correlation coefficients. Intra-observer and inter-observer ICC were calculated based on the results of FNPO evaluation, respectively, of the first and second session by observer 1 and of the first session by viewer 1 and reviewer 2.
Table 4. Comparison of the absolute values of intra-observer and inter-observer differences.
Table 4. Comparison of the absolute values of intra-observer and inter-observer differences.
ParameterSSFSE-DLSSFSEFSESSFSE-DL
vs.
SSFSE
SSFSE-DL
vs.
FSE
Intra-observer difference1.475 ± 1.0622.300 ± 1.7722.325 ± 1.163p = 0.0185p = 0.0001
Inter-observer difference1.900 ± 0.955 NS2.825 ± 1.412 NS3.325 ± 1.655 *p = 0.0004p < 0.0001
Note. Comparisons were performed using a paired t test. NS p > 0.05 for comparison between intra-observer and inter-observer differences. * p < 0.05 for comparison between intra-observer and inter-observer differences.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Yang, R.; Zou, Y.; Liu, W.; Liu, C.; Wen, Z.; Li, L.; Sun, C.; Hu, M.; Zha, Y. High-Resolution Single-Shot Fast Spin-Echo MR Imaging with Deep Learning Reconstruction Algorithm Can Improve Repeatability and Reproducibility of Follicle Counting. J. Clin. Med. 2023, 12, 3234. https://doi.org/10.3390/jcm12093234

AMA Style

Yang R, Zou Y, Liu W, Liu C, Wen Z, Li L, Sun C, Hu M, Zha Y. High-Resolution Single-Shot Fast Spin-Echo MR Imaging with Deep Learning Reconstruction Algorithm Can Improve Repeatability and Reproducibility of Follicle Counting. Journal of Clinical Medicine. 2023; 12(9):3234. https://doi.org/10.3390/jcm12093234

Chicago/Turabian Style

Yang, Renjie, Yujie Zou, Weiyin (Vivian) Liu, Changsheng Liu, Zhi Wen, Liang Li, Chenyu Sun, Min Hu, and Yunfei Zha. 2023. "High-Resolution Single-Shot Fast Spin-Echo MR Imaging with Deep Learning Reconstruction Algorithm Can Improve Repeatability and Reproducibility of Follicle Counting" Journal of Clinical Medicine 12, no. 9: 3234. https://doi.org/10.3390/jcm12093234

APA Style

Yang, R., Zou, Y., Liu, W., Liu, C., Wen, Z., Li, L., Sun, C., Hu, M., & Zha, Y. (2023). High-Resolution Single-Shot Fast Spin-Echo MR Imaging with Deep Learning Reconstruction Algorithm Can Improve Repeatability and Reproducibility of Follicle Counting. Journal of Clinical Medicine, 12(9), 3234. https://doi.org/10.3390/jcm12093234

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop