Article

Development of a Novel Evaluation Method for Endoscopic Ultrasound-Guided Fine-Needle Biopsy in Pancreatic Diseases Using Artificial Intelligence

Takuya Ishikawa, Masato Hayakawa, Hirotaka Suzuki, Eizaburo Ohno, Yasuyuki Mizutani, Tadashi Iida, Mitsuhiro Fujishiro, Hiroki Kawashima and Kazuhiro Hotta
1. Department of Gastroenterology and Hepatology, Graduate School of Medicine, Nagoya University, 65 Tsurumai-cho, Showa-ku, Nagoya 4668550, Japan
2. Department of Electrical and Electronic Engineering, Faculty of Science and Technology, Meijo University, Nagoya 4688502, Japan
3. Department of Gastroenterology, Toyohashi Municipal Hospital, Toyohashi 4418570, Japan
4. Department of Gastroenterology and Hepatology, Graduate School of Medicine, The University of Tokyo, Tokyo 1138655, Japan
5. Department of Endoscopy, Nagoya University Hospital, Nagoya 4668550, Japan
* Author to whom correspondence should be addressed.
Diagnostics 2022, 12(2), 434; https://doi.org/10.3390/diagnostics12020434
Submission received: 17 January 2022 / Revised: 28 January 2022 / Accepted: 7 February 2022 / Published: 8 February 2022
(This article belongs to the Special Issue Endoscopic Ultrasound-Guided Fine-Needle Aspiration (EUS-FNA))

Abstract

We aimed to develop a new artificial intelligence (AI)-based method for evaluating endoscopic ultrasound-guided fine-needle biopsy (EUS-FNB) specimens in pancreatic diseases using deep learning and contrastive learning. We analysed a total of 173 specimens from 96 patients who underwent EUS-FNB with a 22 G Franseen needle for pancreatic diseases. In the initial study, a deep learning method based on stereomicroscopic images of 98 EUS-FNB specimens from 63 patients showed an accuracy of 71.8% for predicting the histological diagnosis, which was lower than that of macroscopic on-site evaluation (MOSE) performed by EUS experts (81.6%). We then used image analysis software to mark the core tissues in photomicrographs of EUS-FNB specimens after haematoxylin and eosin staining and verified whether the diagnostic performance could be improved by applying contrastive learning to the features of the stereomicroscopic images and the stained images. The sensitivity, specificity, and accuracy of MOSE were 88.9%, 53.5%, and 83.2%, respectively, while those of the AI-based diagnostic method using contrastive learning were 90.3%, 53.5%, and 84.4%, respectively. The AI-based evaluation method using contrastive learning was comparable to MOSE performed by EUS experts and could serve as a novel, objective evaluation method for EUS-FNB.

1. Introduction

Endoscopic ultrasound-guided fine-needle aspiration (EUS-FNA) has been widely used as a technique to collect pancreatic tissue [1]. With the development of treatment options, including neoadjuvant chemotherapy, immune checkpoint inhibitors, and gene panel tests, the importance of tissue collection in pancreatic cancer treatment has been increasing. Recently, several needles with novel tip shapes have been developed, making it possible to collect a larger amount of tissue [2,3]. These new core needles are used in a technique called endoscopic ultrasound-guided fine-needle biopsy (EUS-FNB). Various attempts have been made to evaluate whether adequate specimens are being collected under EUS guidance. In 2011, the usefulness of rapid on-site evaluation (ROSE) [4] was reported as a specimen evaluation method during EUS-FNA. Recently, a touch imprint cytology technique was reported for EUS-FNB specimens, which allows both cytological and histological specimens to be obtained at the same time with the same needle and also enables ROSE [5]. This technique provided samples comparable to those of standard EUS-FNA cytology and combined the benefits of cytology and histology for the evaluation. However, the number of facilities that can perform ROSE is limited, and this approach does not necessarily reduce procedure time. In addition, the advent of core needles has increased the amount of tissue that can be obtained with fewer needle passes, and some reports suggest that ROSE may not be needed to reduce the number of needle passes in the era of EUS-FNB [6,7,8]. Moreover, the usefulness of macroscopic on-site evaluation (MOSE) has been reported as an alternative to ROSE [9]. Chong et al. [10] reported that MOSE provided the same diagnostic performance as EUS-FNA in the absence of ROSE and reduced the number of punctures. In general, MOSE results are considered positive when whitish tissue (core tissue) can be seen macroscopically in the obtained specimen. However, the judgement is typically made by the endosonographer and is largely subjective, depending on experience level.
Recently, artificial intelligence (AI) based on convolutional neural networks (CNNs) [11], a deep learning method, has made remarkable progress in medical image recognition [12,13]. This technology has also been applied to the field of gastroenterology and endoscopic procedures, including the diagnosis of oesophageal cancer [14], gastric cancer [15], and colorectal polyps [16], and the EUS-based diagnosis of pancreatic disease [17].
A previous study reported the diagnosis of pancreatic ductal adenocarcinoma (PDAC) using a deep learning model based on pathological specimens obtained by EUS-FNB [18]. However, to the best of our knowledge, no studies have examined the usefulness of AI in predicting, from fresh specimens, whether diagnosable material for histology has been obtained. In this study, we aimed to develop a new AI-based method, using deep learning and contrastive learning, that can serve as an alternative to MOSE for evaluating EUS-FNB specimens in pancreatic diseases.

2. Materials and Methods

2.1. Study Design

This was a retrospective study conducted as a medical-industrial collaborative project between the Department of Gastroenterology and Hepatology at Nagoya University Hospital and the Faculty of Science and Technology at Meijo University, performed with the approval of the ethics committee of each institution. The content of the research was described, and contact information was provided, in an opt-out format on the website of our hospital for patients who did not wish to participate (approval number: 2019-0310). The study was performed in accordance with the ethical standards stated in the 1964 Declaration of Helsinki and its later amendments or comparable ethical standards, and was supported by a Grant-in-Aid for Scientific Research (grant no. JP20K12689).

2.2. Patients

This study consisted of two stages: the first used deep learning, and the second used contrastive learning. For the deep learning stage, we reviewed 98 specimens from 63 patients in whom EUS-FNB was performed for pancreatic diseases using a 22 G Franseen needle (Acquire, Boston Scientific Co., Natick, MA, USA) between September 2019 and October 2020 at Nagoya University Hospital; all specimens obtained were photographed with a stereomicroscope (SZX12, Olympus Co., Ltd., Tokyo, Japan) immediately after collection. No formal sample size calculation could be made because no previous studies have examined the same issue; we therefore first used the specimens obtained by EUS-FNB performed at our hospital during a one-year period. We then added cases up to January 2021, because AI performance is expected to improve as the number of samples increases, and a total of 173 specimens from 96 patients were reviewed for the subsequent deep learning and contrastive learning analyses.

2.3. EUS-FNB Procedure

The EUS procedures were performed by two experts (EO and TIs) with more than 10 years of experience, or by trainees supervised by these experts, using a linear-array endoscope (GF-UCT260, Olympus Co., Ltd., Tokyo, Japan) and an ultrasound system (EU-ME2, Olympus Co., Ltd., Tokyo, Japan). While the patient was under conscious sedation, the EUS scope was inserted orally. The lesion was carefully observed in B-mode first and then in colour Doppler mode before the puncture was performed to confirm that no major vessels were in the needle pathway. After the needle was inserted into the lesion, the stylet was slowly withdrawn (dry slow-pull technique) as the sample was obtained; this was repeated for all needle passes. The number of passes was determined based on whether ROSE confirmed the presence of tumour cells, with a maximum of three passes.

2.4. Specimen Processing for EUS-FNB

The specimen was extruded from the needle onto a Petri dish using saline. The liquid components around the specimen were then aspirated with a syringe and prepared for ROSE and cytology. The aspirated liquid component was diluted to 6 mL with saline. Smears were prepared by processing with Autosmear (Sakura Finetech Japan, Co., Ltd., Tokyo, Japan) at 1500 rpm for 5 s, promptly spray fixed (Melcofix®, Merck KGaA, Darmstadt, Germany), and stained with the ultrafast Papanicolaou (UFP) method. Microscopic evaluation was performed inside the endoscopy suite by an experienced cytologist, and the presence or absence of cell components was immediately reported to the endosonographer. The remaining solid specimens were immediately observed under a stereomicroscope, and images were photographed. The specimens were then placed in formalin solution for histological examination. All specimens were processed per needle pass, and slides with haematoxylin and eosin (HE) staining were also made per needle pass.

2.5. MOSE and Imaging EUS-FNB Specimens

A high-end zoom stereomicroscope (SZX12) was used for stereomicroscopic observation. The magnification range was 7× to 90× (zoom ratio 12.86), with an aperture mechanism that allowed a deeper depth of field. To evaluate the specimens under the same conditions, the observation screen was set up so that the vertical width was 2 cm, with a scale in 1-mm increments underneath (Figure 1). The specimen was then observed with nothing in the background, and a single expert endosonographer (TIs), who had performed more than 300 EUS-FNB procedures, performed MOSE. A specimen was defined as MOSE-positive if it contained a portion recognisable as whitish core tissue, with or without reddish blood clots, and as MOSE-negative if it contained little or no core tissue or only reddish blood clots. The images used for MOSE were captured as JPEG files by the digital camera attached to the stereomicroscope and were sent for AI-based analysis. To assess the reliability of MOSE, the stereomicroscopic images were independently reviewed after the EUS-FNB procedures by another expert endosonographer (HS), who had performed more than 200 EUS-FNB procedures and was blinded to the patient history and clinical, radiologic, and histologic information.

2.6. Histology Evaluation

After being fixed in formalin, the specimens were embedded in paraffin, sectioned, and subjected to HE staining and appropriate immunostaining according to the suspected diagnosis. Histological diagnosis was performed by two pathologists specialising in the pancreatobiliary field at Nagoya University Hospital. The final diagnosis was based on the surgical specimen or on a clinical course consistent with the EUS-FNB results over a minimum 6-month follow-up. The histological diagnoses were classified into one of the following five categories: malignant, suspicious, atypical, benign, and inadequate. Tumorous lesions such as pancreatic cancer were considered correctly diagnosed if classified as malignant or suspicious. For benign lesions such as mass-forming pancreatitis, a correct diagnosis required atypical or benign histology together with confirmation that the lesion did not worsen during the 6-month follow-up period. To evaluate the amount of core tissue obtained for each specimen, the total area of the core tissue was marked and measured under a photomicroscope using imaging software (CellSense, Olympus Co., Ltd., Tokyo, Japan), based on our previous reports [2,19,20]. The microscope was connected to a computer with the CellSense software installed, and a digitised image of the HE-stained slide optimised for quantitative analysis could be captured directly. Each specimen was imaged at low magnification so that the whole specimen fit in a single image, and the area of the intact core tissue was measured manually. The investigator who performed the measurements was blinded to all clinical characteristics and histological results (Figure 2).
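The area computation itself is simple arithmetic: the number of pixels marked as core tissue multiplied by the physical area of one pixel. The Python sketch below illustrates this conversion; it is not the CellSense workflow used in the study, and the 2 µm/pixel pitch and the pixel count are hypothetical values chosen only so that the result matches the median tissue area reported in Section 3.5.

```python
import numpy as np

def core_tissue_area_mm2(mask: np.ndarray, um_per_pixel: float) -> float:
    """Convert a binary core-tissue mask (1 = tissue, 0 = background)
    into an area in mm^2, given the pixel pitch of the digitised slide."""
    pixel_area_mm2 = (um_per_pixel / 1000.0) ** 2  # area of one pixel in mm^2
    return float(mask.sum()) * pixel_area_mm2

# Hypothetical example: a slide digitised at 2 um/pixel with 425,000
# pixels marked as core tissue yields 1.7 mm^2.
mask = np.zeros((3000, 4000), dtype=np.uint8)
mask[:425, :1000] = 1  # 425,000 marked pixels
print(f"{core_tissue_area_mm2(mask, um_per_pixel=2.0):.2f} mm^2")  # 1.70 mm^2
```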

2.7. AI-Based Evaluation Using Deep Learning

For deep learning, we used AlexNet [21], an image recognition neural network. AlexNet consists of five convolutional layers, three pooling layers, and three fully connected layers. The size of the input image was 224 × 224 pixels, and the output was a two-element vector corresponding to the two classes. In AlexNet, the network is split into two parallel branches, which are merged in the final fully connected layers. Of the 98 specimens included in this stage, 81 were classified as histologically diagnosable and 17 as histologically nondiagnosable, and we examined whether deep learning could predict the presence of diagnosable material for histology from the stereomicroscopic images (Figure 3). To compensate for the small amount of imaging data, augmentation was performed by flipping and rotating the existing images. The 81 histologically diagnosable images were augmented to 162 images by 90-degree rotation, the 17 nondiagnosable images were augmented to 136 images by flipping plus 3-way rotation, and a total of 298 images were used for analysis (Figure 4) [22]. For the deep learning analysis, we divided the dataset into three folds so that every one of the small number of histologically nondiagnosable images was evaluated. Training and validation were performed for each fold, and the final accuracy was calculated by averaging the accuracies across the three folds.
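As an illustration of the setup described above, the following PyTorch sketch implements the deterministic augmentation scheme (original plus a 90-degree rotation for diagnosable images; four rotations times two flips, i.e., eight views, for nondiagnosable images) and an AlexNet whose final layer is replaced for the two-class task. This is a minimal sketch based on our description, not the authors' actual code; the use of torchvision's AlexNet and the untrained weights are assumptions.

```python
import torch
import torch.nn as nn
from torchvision import models

def augment(img: torch.Tensor, flip_and_rotate: bool) -> list[torch.Tensor]:
    """Deterministic augmentation: x2 views for the diagnosable class,
    x8 views (4 rotations x 2 flips) for the nondiagnosable class."""
    if not flip_and_rotate:
        return [img, torch.rot90(img, k=1, dims=(1, 2))]
    views = [torch.rot90(img, k, dims=(1, 2)) for k in range(4)]
    return views + [torch.flip(v, dims=(2,)) for v in views]

# AlexNet with its 1000-class head replaced for the binary task
# (histologically diagnosable vs. nondiagnosable).
model = models.alexnet(weights=None)
model.classifier[6] = nn.Linear(4096, 2)

x = torch.randn(1, 3, 224, 224)  # one 224 x 224 stereomicroscopic image
logits = model(x)                # shape (1, 2): scores for the two classes
```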

2.8. AI-Based Evaluation Using Contrastive Learning

We used SimCLR [23], an unsupervised representation learning method, for contrastive learning. Unsupervised representation learning requires drastically less labelling by humans and allows the AI to learn more on its own. Contrastive learning pulls together the features of differently processed versions of the same image and pushes apart the features of different images. We trained several networks simultaneously. In one such network, the AI learned the correspondence between the HE-stained image and the image with the marked core tissue and detected which pixels corresponded to tissue. As a result, we achieved an overlap rate (intersection over union, IoU) of 89.6% between the HE-stained images and the marked images. Moreover, the tissue area was predicted based on the segmentation of the core tissue. The contrastive learning model was also trained to bring the features of the stereomicroscopic images and the HE-stained images closer together, and we investigated whether linking the two image types would improve the AI-based prediction of diagnosable material for histology (Figure 5). ResNet-34 [24], an image recognition model, was used as the image feature extractor. The 173 specimens were classified into 145 that were histologically diagnosable and 28 that were histologically nondiagnosable, and we conducted an 8-fold cross-validation (13 h of training and 11 min of inference) on the 173 images. For training, we used the 3 networks surrounded by a square, and for inference, we used only the lower network surrounded by a dotted square, as shown in Figure 5.
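The snippet below sketches the cross-modal contrastive idea under our reading of Figure 5: the stereomicroscopic image and the HE-stained image of the same specimen form a positive pair, other specimens in the batch act as negatives, and a SimCLR-style (one-directional InfoNCE) loss is applied to ResNet-34 features. The segmentation branch and the loss weights of the full network are omitted, and the batch size, projection dimension, and temperature are assumptions; an IoU helper of the kind used to report the 89.6% overlap is included.

```python
import torch
import torch.nn.functional as F
from torchvision import models

def nt_xent(z1: torch.Tensor, z2: torch.Tensor, tau: float = 0.1) -> torch.Tensor:
    """SimCLR-style loss: z1[i] (stereomicroscopic) and z2[i] (HE-stained)
    of the same specimen are positives; other specimens are negatives."""
    z1, z2 = F.normalize(z1, dim=1), F.normalize(z2, dim=1)
    logits = z1 @ z2.t() / tau                             # cosine similarities
    targets = torch.arange(z1.size(0), device=z1.device)   # positives on the diagonal
    return F.cross_entropy(logits, targets)

def encoder(dim: int = 128) -> torch.nn.Module:
    """ResNet-34 feature extractor with its head replaced by a projection."""
    net = models.resnet34(weights=None)
    net.fc = torch.nn.Linear(net.fc.in_features, dim)
    return net

def iou(pred: torch.Tensor, gt: torch.Tensor) -> float:
    """Intersection over union between two boolean segmentation masks."""
    return ((pred & gt).sum() / (pred | gt).sum()).item()

enc_stereo, enc_he = encoder(), encoder()  # one encoder per modality
stereo = torch.randn(8, 3, 224, 224)       # batch of stereomicroscopic images
he = torch.randn(8, 3, 224, 224)           # matching HE-stained images
loss = nt_xent(enc_stereo(stereo), enc_he(he))
loss.backward()
```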

2.9. Evaluation Items

The evaluation items were as follows: (1) comparison of AI-based methods using deep learning and MOSE by EUS experts for predicting diagnosable material for histology based on the stereomicroscopic images of EUS-FNB specimens, (2) comparison of diagnoses obtained by AI-based methods after contrastive learning and MOSE, (3) interobserver agreement among the endosonographers performing MOSE, (4) association between lesion features and MOSE positivity, and (5) evaluation of the tissue sample area and diagnostic accuracy.

2.10. Statistical Analysis

The sensitivity, specificity, accuracy, positive predictive value (PPV), and negative predictive value (NPV) were calculated to compare the performance of the histological diagnoses predicted by the AI-based method and by MOSE. Continuous variables are expressed as medians and interquartile ranges (IQRs). Categorical variables are expressed as percentages. The χ2 test was used to compare categorical parameters, and the Mann–Whitney U test was used to compare continuous variables. A p-value less than 0.05 was considered statistically significant. The cut-off value of the amount of specimen required for histological diagnosis was assessed by receiver operating characteristic (ROC) curve analysis, and the area under the ROC curve (AUC) was calculated. Interobserver agreement among the endosonographers performing MOSE was assessed using kappa statistics and defined as low (kappa coefficient, 0.01–0.20), fair (0.21–0.40), moderate (0.41–0.60), good (0.61–0.80), or excellent (0.81–1.00). All statistical analyses were performed using SPSS Statistics 27.0 (SPSS, Inc., Chicago, IL, USA).
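The 2 × 2-table metrics above can be reproduced directly from the confusion matrices reported in Section 3. As an illustration, the sketch below recomputes the MOSE values of Table 3 and shows how a Youden-index cut-off is read off a ROC curve; scikit-learn is used here purely for demonstration, whereas the study's analyses were performed in SPSS.

```python
import numpy as np
from sklearn.metrics import roc_curve, cohen_kappa_score

def diagnostic_metrics(tp: int, fp: int, fn: int, tn: int) -> dict:
    """Standard 2 x 2-table metrics used to compare the AI method and MOSE."""
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "accuracy": (tp + tn) / (tp + fp + fn + tn),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
    }

# MOSE in Table 3: 129 true positives, 13 false positives,
# 16 false negatives, 15 true negatives.
print(diagnostic_metrics(tp=129, fp=13, fn=16, tn=15))
# -> sensitivity 0.890, specificity 0.536, accuracy 0.832, ppv 0.908, npv 0.484

# Interobserver agreement: cohen_kappa_score(reader1_labels, reader2_labels)

def youden_cutoff(y_true: np.ndarray, score: np.ndarray) -> float:
    """Threshold maximising sensitivity + specificity - 1 on the ROC curve."""
    fpr, tpr, thresholds = roc_curve(y_true, score)
    return float(thresholds[np.argmax(tpr - fpr)])
```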

3. Results

3.1. AI-Based Evaluation Using Deep Learning

The median age of the 63 patients was 65 years (IQR 58–72), 66.7% were male, and the median lesion size was 24 mm (IQR 20–35.5). The final diagnosis was PDAC in 41 cases, mass-forming pancreatitis (MFP) in 11 cases, autoimmune pancreatitis (AIP) in 8 cases, a pancreatic neuroendocrine tumour in 1 case, a metastatic pancreatic tumour in 1 case, and intraductal papillary mucinous carcinoma (IPMC) in 1 case (Table 1). The accuracy of the histological diagnoses with EUS-FNB was 82.5% (52/63) per patient and 82.7% (81/98) per specimen. The mean sensitivity, specificity, accuracy, PPV, and NPV of AlexNet with 3-fold cross-validation for obtaining a histological diagnosis were 85.8%, 55.2%, 71.8%, 69.5%, and 76.5%, respectively. In contrast, the corresponding values for MOSE were 88.9%, 47.1%, 81.6%, 88.9%, and 47.1%, respectively, showing that the diagnostic accuracy of the AI-based evaluation method using deep learning was not as high as that of MOSE performed by an EUS expert (Table 2). After a re-evaluation with the number of specimens increased to 173, the accuracy of the AI-based evaluation using deep learning rose slightly to 74.5% but remained below that of MOSE performed by a human.

3.2. AI-Based Evaluation Using Contrastive Learning

The median age of the 96 patients was 68 years (IQR 60–74.75), 61.5% were male, and the median lesion size was 25 mm (IQR 20–35). The final diagnosis was PDAC in 66 cases, MFP in 13 cases, AIP in 11 cases, a pancreatic neuroendocrine tumour in 3 cases, a metastatic pancreatic tumour in 2 cases, and IPMC in 1 case (Table 1).
As shown in Figure 6, contrastive learning succeeded in bringing the features of the stereomicroscopic images and the HE-stained images close together in the feature space. The sensitivity, specificity, accuracy, PPV, and NPV of MOSE for obtaining a histological diagnosis were 88.9%, 53.5%, 83.2%, 90.8%, and 48.4%, respectively, whereas those of the AI-based diagnostic method using contrastive learning were 90.3%, 53.5%, 84.4%, 90.9%, and 51.7%, respectively (Table 3).

3.3. Interobserver Agreement among the Endosonographers Performing MOSE

In the macroscopic evaluation performed by the second expert endosonographer (HS) based on the stereomicroscopic images, the accuracy for obtaining a histological diagnosis was 82.1% (141/173), similar to the 83.2% (144/173) achieved by the on-site endosonographer (TIs). The interobserver agreement between the two readers was good (kappa coefficient = 0.612), although the macroscopic evaluations were discrepant in 19 cases (11%).

3.4. Association between Lesion Features and MOSE Positivity

The median lesion size was significantly larger in cases judged as MOSE-positive (25 mm (IQR 22–34) vs. 21 mm (IQR 15–25), p < 0.001), and when the final diagnosis was divided into PDAC and non-PDAC, the proportion of PDAC was significantly higher in cases judged as MOSE-positive (69.7% (99/142) vs. 48.4% (15/31), p = 0.023).

3.5. Evaluation of Tissue Sample Area and Diagnostic Accuracy

The median tissue area of all 173 FNB specimens was 1.7 mm² (IQR 0.8–2.92), and the tissue area was significantly larger in patients who were correctly diagnosed by histology (2.12 mm² (IQR 1.11–3.18) vs. 0.465 mm² (IQR 0.17–0.81), p < 0.001). In the ROC curve analysis, the AUC was 0.879, and the cut-off value of the tissue area for histological diagnosis calculated based on Youden’s index was 1.05 mm², with a sensitivity of 78.6% and a specificity of 89.3%.

4. Discussion

In this study, we aimed to develop a new evaluation method for EUS-FNB specimens in pancreatic diseases, and AI-based evaluation using contrastive learning showed a diagnostic performance as good as that of MOSE performed by EUS experts.
EUS-FNA for the pancreas was first reported by Vilmann et al. in 1992 [1], and many reports have since shown its usefulness and safety in obtaining tissue from the pancreas [4,25,26,27]. The major challenge of EUS-FNA/FNB is how accurately one can judge, at the time of the procedure, whether an appropriate specimen has been obtained, since the specimen is collected through a thin needle. MOSE is a recently introduced method for estimating the quality of the obtained specimen. Iwashita et al. [9] first assessed the efficacy of MOSE in estimating the adequacy of histologic core specimens obtained by EUS-FNA with a standard 19-gauge needle for solid lesions and concluded that a macroscopically visible core of ≥4 mm on MOSE could be an indicator of specimen adequacy and lead to an improved diagnostic yield. Since then, several reports have shown the usefulness of MOSE in EUS-FNB using core needles [28,29,30,31]. However, since there is no established method for MOSE and the judgement is subjective, the diagnostic ability of this approach may vary depending on the facility and the endosonographer. In the present study, the accuracy of MOSE performed independently by two EUS experts was 83.2% and 82.1%, showing a high performance for both endosonographers, and the interobserver agreement was good (kappa coefficient = 0.612). However, 19 of 173 specimens (11%) showed discrepant results. These 19 specimens tended to contain abundant blood components, and even when whitish core tissue was present, it was difficult to distinguish because of its small amount or its overlap with blood clots. We therefore aimed to develop a more objective and reproducible evaluation method for EUS-FNB specimens using AI.
This study is unique in that we used stereomicroscopic images of EUS-FNB specimens for MOSE and AI-based diagnosis. We believe that the use of a stereomicroscope has several advantages. First, compared with normal macroscopic observation, simply magnifying the image makes it easier to recognise blood clots and core tissue. In addition, all specimens can be evaluated under the same conditions by aligning the magnification degree using a scale under stereomicroscope observation. This increases the objectivity of the evaluation and leads to image-based analyses using AI.
We first examined an AI-based diagnostic method using deep learning with AlexNet, simply using images of fresh specimens obtained by EUS-FNB. AlexNet was the first architecture to incorporate the concepts of deep learning and CNNs for object recognition [21], achieving breakthrough performance in the ImageNet Large Scale Visual Recognition Challenge (ILSVRC). Before 2012, image classification in such contests relied on hand-designed features extracted from images, such as colour, brightness, and object shape, so performance depended on how effectively those features could be designed. AlexNet showed that, given enough data, the machine itself can find features without human input. Several concepts first introduced in AlexNet influenced subsequent image classification architectures and have become standard techniques: AlexNet incorporated deep learning for image classification by stacking convolutional layers and fully connected neural networks, and the ReLU function was adopted as the activation function. In addition, to address the small amount of imaging data, augmentation was performed by flipping and rotating the existing images. In the present study, the AI-based diagnostic method using AlexNet showed an acceptable accuracy of 71.8% for obtaining a histological diagnosis; however, it was not as good as MOSE performed by endosonographers. Although the accuracy increased slightly as the number of specimens increased, it still did not reach that of MOSE by a human, suggesting that deep learning based only on stereomicroscopic images, with the small and unbalanced sample of diagnosable and undiagnosable cases used in this study, cannot achieve sufficient accuracy.
In reviewing the images used for the deep learning analysis, we found that images with whitish areas but a small amount of specimen tended to be incorrectly predicted as undiagnosable for histology, while images with large amounts of blood components tended to be incorrectly predicted as diagnosable. Therefore, we attempted to reduce the attention given to the amount of specimen by avoiding image downsizing within the network and by training the network to separate, in the feature space, images that could be correctly predicted from those that could not. In addition, since it is relatively easy to estimate the amount of tissue in HE-stained specimens, we aimed to improve the diagnostic performance of the AI system by linking stereomicroscopic images and HE-stained images using a contrastive learning method, SimCLR [23]. SimCLR simplifies the self-supervised learning algorithms proposed in recent years without requiring special architectures or memory banks. Most current AI systems are based on supervised learning, in which a human prepares a large amount of labelled data before the AI starts learning; unsupervised learning, in contrast, requires drastically less human input and allows the AI to learn more on its own. As a result, we succeeded in bringing the features of the stereomicroscopic and HE-stained images close to each other and obtained a diagnostic performance equivalent to that of MOSE performed by EUS experts.
Previous studies have reported a positive correlation between the tissue volume obtained by EUS-FNB and the performance of the histological diagnosis [9,28,32]. Consistent with these reports, we found a positive correlation between tissue area and histological diagnostic performance, with 1.05 mm² as the cut-off value. In the future, we therefore plan to use a multistep discrimination method, in which the amount of tissue is estimated first and the possibility of histological diagnosis is then evaluated within the group with a small amount of tissue, to surpass the accuracy of macroscopic evaluation performed by humans.
This study has several limitations. First, the number of patients was small, and further studies with larger cohorts are necessary to validate our results. Second, there might be discrepancies between the pathological diagnosis of EUS-FNB specimens and the actual pathological diagnosis of tumours that were not surgically resected; however, in these cases, immunohistochemical analyses and/or periodic follow-up examinations may have minimised such discrepancies. Finally, all images, including those in the test datasets, were obtained retrospectively from a single hospital without randomisation, making it difficult to exclude selection bias.

5. Conclusions

In conclusion, the AI-based method using contrastive learning to evaluate stereomicroscopic images of EUS-FNB specimens showed a diagnostic performance comparable to that of MOSE performed by EUS experts. However, it cannot be determined at this time whether our method can really be beneficial as an alternative to MOSE, mainly due to the insufficient number of subjects. Further studies with a better design and a larger sample size are necessary to validate our results. Incorporating information on the amount of tissue into the current AI system may help further improve the diagnostic performance of this approach and establish a new evaluation method for EUS-FNB specimens that can be an alternative to MOSE.

Author Contributions

Conception and design, T.I. (Takuya Ishikawa), M.H., H.S. and K.H.; analysis and interpretation of the data, T.I. (Takuya Ishikawa), M.H., H.S. and K.H.; drafting of the article, T.I. (Takuya Ishikawa) and M.H.; critical revision of the article for important intellectual content, E.O., Y.M., T.I. (Tadashi Iida), M.F., H.K. and K.H. All authors have read and agreed to the published version of the manuscript.

Funding

This study was supported by a Grant-in-Aid for Scientific Research (grant no. JP20K12689).

Institutional Review Board Statement

The study was conducted according to the guidelines of the Declaration of Helsinki and approved by the Ethics Committee of Nagoya University Hospital (approval number: 2019-0310, date of approval: 29 October 2019).

Informed Consent Statement

The content of the research was described, and contact information was provided in an opt-out format on the website of our hospital for patients who did not wish to participate.

Data Availability Statement

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that they have no conflict of interest.

References

1. Vilmann, P.; Jacobsen, G.K.; Henriksen, F.W.; Hancke, S. Endoscopic ultrasonography with guided fine needle aspiration biopsy in pancreatic disease. Gastrointest. Endosc. 1992, 38, 172–173.
2. Ishikawa, T.; Kawashima, H.; Ohno, E.; Tanaka, H.; Sakai, D.; Iida, T.; Nishio, R.; Yamamura, T.; Furukawa, K.; Nakamura, M.; et al. Clinical Impact of EUS-Guided Fine Needle Biopsy Using a Novel Franseen Needle for Histological Assessment of Pancreatic Diseases. Can. J. Gastroenterol. Hepatol. 2019, 2019, 8581743.
3. Ishikawa, T.; Mohamed, R.; Heitman, S.J.; Turbide, C.; Kumar, P.R.; Goto, H.; Hirooka, Y.; Belletrutti, P.J. Diagnostic yield of small histological cores obtained with a new EUS-guided fine needle biopsy system. Surg. Endosc. 2017, 31, 5143–5149.
4. Iglesias-Garcia, J.; Dominguez-Munoz, J.E.; Abdulkader, I.; Larino-Noia, J.; Eugenyeva, E.; Lozano-Leon, A.; Forteza-Vila, J. Influence of on-site cytopathology evaluation on the diagnostic accuracy of endoscopic ultrasound-guided fine needle aspiration (EUS-FNA) of solid pancreatic masses. Am. J. Gastroenterol. 2011, 106, 1705–1710.
5. Crino, S.F.; Larghi, A.; Bernardoni, L.; Parisi, A.; Frulloni, L.; Gabbrielli, A.; Parcesepe, P.; Scarpa, A.; Manfrin, E. Touch imprint cytology on endoscopic ultrasound fine-needle biopsy provides comparable sample quality and diagnostic yield to standard endoscopic ultrasound fine-needle aspiration specimens in the evaluation of solid pancreatic lesions. Cytopathology 2019, 30, 179–186.
6. Aadam, A.A.; Wani, S.; Amick, A.; Shah, J.N.; Bhat, Y.M.; Hamerski, C.M.; Klapman, J.B.; Muthusamy, V.R.; Watson, R.R.; Rademaker, A.W.; et al. A randomized controlled cross-over trial and cost analysis comparing endoscopic ultrasound fine needle aspiration and fine needle biopsy. Endosc. Int. Open 2016, 4, E497–E505.
7. Bang, J.Y.; Hawes, R.; Varadarajulu, S. A meta-analysis comparing ProCore and standard fine-needle aspiration needles for endoscopic ultrasound-guided tissue acquisition. Endoscopy 2016, 48, 339–349.
8. Mohamadnejad, M.; Mullady, D.; Early, D.S.; Collins, B.; Marshall, C.; Sams, S.; Yen, R.; Rizeq, M.; Romanas, M.; Nawaz, S.; et al. Increasing Number of Passes Beyond 4 Does Not Increase Sensitivity of Detection of Pancreatic Malignancy by Endoscopic Ultrasound-Guided Fine-Needle Aspiration. Clin. Gastroenterol. Hepatol. 2017, 15, 1071–1078.
9. Iwashita, T.; Yasuda, I.; Mukai, T.; Doi, S.; Nakashima, M.; Uemura, S.; Mabuchi, M.; Shimizu, M.; Hatano, Y.; Hara, A.; et al. Macroscopic on-site quality evaluation of biopsy specimens to improve the diagnostic accuracy during EUS-guided FNA using a 19-gauge needle for solid lesions: A single-center prospective pilot study (MOSE study). Gastrointest. Endosc. 2015, 81, 177–185.
10. Chong, C.C.N.; Lakhtakia, S.; Nguyen, N.; Hara, K.; Chan, W.K.; Puri, R.; Almadi, M.A.; Ang, T.L.; Kwek, A.; Yasuda, I.; et al. Endoscopic ultrasound-guided tissue acquisition with or without macroscopic on-site evaluation: Randomized controlled trial. Endoscopy 2020, 52, 856–863.
11. LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature 2015, 521, 436–444.
12. Fujioka, T.; Kubota, K.; Mori, M.; Kikuchi, Y.; Katsuta, L.; Kasahara, M.; Oda, G.; Ishiba, T.; Nakagawa, T.; Tateishi, U. Distinction between benign and malignant breast masses at breast ultrasound using deep learning method with convolutional neural network. Jpn. J. Radiol. 2019, 37, 466–472.
13. Ting, D.S.W.; Cheung, C.Y.; Lim, G.; Tan, G.S.W.; Quang, N.D.; Gan, A.; Hamzah, H.; Garcia-Franco, R.; San Yeo, I.Y.; Lee, S.Y.; et al. Development and Validation of a Deep Learning System for Diabetic Retinopathy and Related Eye Diseases Using Retinal Images From Multiethnic Populations With Diabetes. JAMA 2017, 318, 2211–2223.
14. Horie, Y.; Yoshio, T.; Aoyama, K.; Yoshimizu, S.; Horiuchi, Y.; Ishiyama, A.; Hirasawa, T.; Tsuchida, T.; Ozawa, T.; Ishihara, S.; et al. Diagnostic outcomes of esophageal cancer by artificial intelligence using convolutional neural networks. Gastrointest. Endosc. 2019, 89, 25–32.
15. Hirasawa, T.; Aoyama, K.; Tanimoto, T.; Ishihara, S.; Shichijo, S.; Ozawa, T.; Ohnishi, T.; Fujishiro, M.; Matsuo, K.; Fujisaki, J.; et al. Application of artificial intelligence using a convolutional neural network for detecting gastric cancer in endoscopic images. Gastric Cancer 2018, 21, 653–660.
16. Byrne, M.F.; Chapados, N.; Soudan, F.; Oertel, C.; Linares Perez, M.; Kelly, R.; Iqbal, N.; Chandelier, F.; Rex, D.K. Real-time differentiation of adenomatous and hyperplastic diminutive colorectal polyps during analysis of unaltered videos of standard colonoscopy using a deep learning model. Gut 2019, 68, 94–100.
17. Kuwahara, T.; Hara, K.; Mizuno, N.; Okuno, N.; Matsumoto, S.; Obata, M.; Kurita, Y.; Koda, H.; Toriyama, K.; Onishi, S.; et al. Usefulness of Deep Learning Analysis for the Diagnosis of Malignancy in Intraductal Papillary Mucinous Neoplasms of the Pancreas. Clin. Transl. Gastroenterol. 2019, 10, 1–8.
18. Naito, Y.; Tsuneki, M.; Fukushima, N.; Koga, Y.; Higashi, M.; Notohara, K.; Aishima, S.; Ohike, N.; Tajiri, T.; Yamaguchi, H.; et al. A deep learning model to detect pancreatic ductal adenocarcinoma on endoscopic ultrasound-guided fine-needle biopsy. Sci. Rep. 2021, 11, 8454.
19. Matsuzaki, I.; Miyahara, R.; Hirooka, Y.; Funasaka, K.; Ohno, E.; Nakamura, M.; Kawashima, H.; Nukaga, A.; Shimoyama, Y.; Goto, H. Forward-viewing versus oblique-viewing echoendoscopes in the diagnosis of upper GI subepithelial lesions with EUS-guided FNA: A prospective, randomized, crossover study. Gastrointest. Endosc. 2015, 82, 287–295.
20. Ishikawa, T.; Kawashima, H.; Ohno, E.; Suhara, H.; Hayashi, D.; Hiramatsu, T.; Matsubara, H.; Suzuki, T.; Kuwahara, T.; Ishikawa, E.; et al. Usefulness of endoscopic ultrasound-guided fine-needle biopsy for the diagnosis of autoimmune pancreatitis using a 22-gauge Franseen needle: A prospective multicenter study. Endoscopy 2020, 52, 978–985.
21. Krizhevsky, A.; Sutskever, I.; Hinton, G.E. ImageNet Classification with Deep Convolutional Neural Networks. Commun. ACM 2017, 60, 84–90.
22. Shorten, C.; Khoshgoftaar, T.M. A survey on Image Data Augmentation for Deep Learning. J. Big Data 2019, 6, 1–48.
23. Chen, T.; Kornblith, S.; Norouzi, M.; Hinton, G. A Simple Framework for Contrastive Learning of Visual Representations. Proc. Mach. Learn. Res. 2020, 119, 1597–1607.
24. He, K.M.; Zhang, X.Y.; Ren, S.Q.; Sun, J. Deep Residual Learning for Image Recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016.
25. Chang, K.J.; Albers, C.G.; Erickson, R.A.; Butler, J.A.; Wuerker, R.B.; Lin, F. Endoscopic ultrasound-guided fine needle aspiration of pancreatic carcinoma. Am. J. Gastroenterol. 1994, 89, 263–266.
26. Cleveland, P.; Gill, K.R.; Coe, S.G.; Woodward, T.A.; Raimondo, M.; Jamil, L.; Gross, S.A.; Heckman, M.G.; Crook, J.E.; Wallace, M.B. An evaluation of risk factors for inadequate cytology in EUS-guided FNA of pancreatic tumors and lymph nodes. Gastrointest. Endosc. 2010, 71, 1194–1199.
27. Klapman, J.B.; Logrono, R.; Dye, C.E.; Waxman, I. Clinical impact of on-site cytopathology interpretation on endoscopic ultrasound-guided fine needle aspiration. Am. J. Gastroenterol. 2003, 98, 1289–1294.
28. Kaneko, J.; Ishiwatari, H.; Sasaki, K.; Satoh, T.; Sato, J.; Matsubayashi, H.; Yabuuchi, Y.; Kishida, Y.; Yoshida, M.; Ito, S.; et al. Macroscopic on-site evaluation of biopsy specimens for accurate pathological diagnosis during EUS-guided fine needle biopsy using 22-G Franseen needle. Endosc. Ultrasound 2020, 9, 385–391.
29. Ki, E.L.L.; Lemaistre, A.I.; Fumex, F.; Gincul, R.; Lefort, C.; Lepilliez, V.; Pujol, B.; Napoleon, B. Macroscopic onsite evaluation using endoscopic ultrasound fine needle biopsy as an alternative to rapid onsite evaluation. Endosc. Int. Open 2019, 7, E189–E194.
30. Oh, D.; Seo, D.W.; Hong, S.M.; Song, T.J.; Park, D.H.; Lee, S.S.; Lee, S.K.; Kim, M.H. The impact of macroscopic on-site evaluation using filter paper in EUS-guided fine-needle biopsy. Endosc. Ultrasound 2019, 8, 342–347.
31. So, H.; Seo, D.W.; Hwang, J.S.; Ko, S.W.; Oh, D.; Song, T.J.; Park, D.H.; Lee, S.K.; Kim, M.H. Macroscopic on-site evaluation after EUS-guided fine needle biopsy may replace rapid on-site evaluation. Endosc. Ultrasound 2021, 10, 111–115.
32. Okuwaki, K.; Masutani, H.; Kida, M.; Yamauchi, H.; Iwai, T.; Miyata, E.; Hasegawa, R.; Kaneko, T.; Imaizumi, H.; Watanabe, M.; et al. Diagnostic efficacy of white core cutoff lengths obtained by EUS-guided fine-needle biopsy using a novel 22G Franseen biopsy needle and sample isolation processing by stereomicroscopy for subepithelial lesions. Endosc. Ultrasound 2020, 9, 187–192.
Figure 1. Macroscopic on-site evaluation (MOSE) using a stereomicroscope. The observation screen was set up with a vertical width of 2 cm with a scale showing 1-mm increments in the background. The quality of the specimen was evaluated without anything in the background, and the positivity of the MOSE results was judged based on the presence of whitish core tissue. (A,B): A specimen judged as MOSE-positive with (A) and without (B) a scale in the background. (C,D): A specimen judged as MOSE-negative with (C) and without (D) a scale in the background.
Figure 2. Evaluation of the tissue area using imaging software. (A): A stereomicroscopic image of an endoscopic ultrasound-guided fine-needle biopsy specimen. (B): Haematoxylin and eosin staining of the specimen, viewed in a low-power field. (C): Measuring the area of the tissue specimen, excluding blood clots, using imaging software (CellSense).
Figure 3. Schema of the deep learning network used for this study. Artificial intelligence (AI) extracted imaging features from stereomicroscopic images, and we examined whether AI could predict the possible histological diagnosis based on stereomicroscopic images.
Figure 4. Data augmentation of the images for deep learning. The histologically diagnosable images were augmented through a 90-degree rotation, and the nondiagnosable images were augmented by flipping plus 3-way rotation.
Figure 5. Schema of the contrastive learning network used for this study. Artificial intelligence (AI) learned the relationships between the haematoxylin and eosin (HE)-stained image and the image with the marked core tissue and detected which pixels corresponded to the tissue. The contrastive learning method was also trained to approach the features of the stereomicroscopic images and HE-stained images, and we investigated whether linking the two images would improve the prediction rate of the AI-based diagnostic method for a positive histological diagnosis. For training, we used the 3 networks surrounded by a square, and for inference, we used only the lower network surrounded by a dotted square. MAE: mean absolute error. The numbers in parentheses represent the weight of each loss.
Figure 6. Visualisation of the feature space by contrastive learning. The stereomicroscopic/haematoxylin and eosin (HE)-stained images with positive histological diagnoses indicated by blue/purple points are close to each other in the feature space. Blue points: diagnosable stereomicroscopic images for training; orange points: diagnosable stereomicroscopic images for validation; green points: undiagnosable stereomicroscopic images for training; red points: undiagnosable stereomicroscopic images for validation; purple points: diagnosable HE-stained images; and brown points: undiagnosable HE-stained images.
Table 1. Patient Characteristics.
                                             Deep Learning Group   Contrastive Learning Group
                                             (N = 63)              (N = 96)
Age, median (IQR)                            65 (58–72)            68 (60–74.75)
Sex, male, N (%)                             42 (66.7)             59 (61.5)
Lesion size, median (IQR), mm                24 (20–35.5)          25 (20–35)
Final diagnosis, N (%)
  Pancreatic ductal adenocarcinoma           41 (65.1)             66 (68.8)
  Mass-forming pancreatitis                  11 (17.5)             13 (13.5)
  Autoimmune pancreatitis                    8 (12.7)              11 (11.5)
  Pancreatic neuroendocrine tumour           1 (1.6)               3 (3.1)
  Pancreatic metastasis                      1 (1.6)               2 (2.1)
  Intraductal papillary mucinous carcinoma   1 (1.6)               1 (1.0)
IQR: interquartile range.
Table 2. Diagnostic performances of the AI-based evaluation method using deep learning and MOSE for obtaining a histological diagnosis.
AI               Histology Diagnosable   Histology Undiagnosable   Total
Diagnosable      139                     23                        162
Undiagnosable    61                      75                        136
Total            200                     98                        298
Sensitivity 85.8%; Specificity 55.2%; Accuracy 71.8%; PPV 69.5%; NPV 76.5%

MOSE             Histology Diagnosable   Histology Undiagnosable   Total
Diagnosable      72                      9                         81
Undiagnosable    9                       8                         17
Total            81                      17                        98
Sensitivity 88.9%; Specificity 47.1%; Accuracy 81.6%; PPV 88.9%; NPV 47.1%
AI: artificial intelligence, MOSE: macroscopic on-site evaluation, PPV: positive predictive value, NPV: negative predictive value.
Table 3. Diagnostic performances of the AI-based evaluation method using contrastive learning and MOSE for obtaining a histological diagnosis.
AI               Histology Diagnosable   Histology Undiagnosable   Total
Diagnosable      131                     13                        144
Undiagnosable    14                      15                        29
Total            145                     28                        173
Sensitivity 90.3%; Specificity 53.5%; Accuracy 84.4%; PPV 90.9%; NPV 51.7%

MOSE             Histology Diagnosable   Histology Undiagnosable   Total
Diagnosable      129                     13                        142
Undiagnosable    16                      15                        31
Total            145                     28                        173
Sensitivity 88.9%; Specificity 53.5%; Accuracy 83.2%; PPV 90.8%; NPV 48.4%
AI: artificial intelligence, MOSE: macroscopic on-site evaluation, PPV: positive predictive value, NPV: negative predictive value.

