Next Article in Journal
Cyclist Effort Features: A Novel Technique for Image Texture Characterization Applied to Larynx Cancer Classification in Contact Endoscopy—Narrow Band Imaging
Previous Article in Journal
The Rapidly Expanding Group of RB1-Deleted Soft Tissue Tumors: An Updated Review
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Interobserver Reliability of the Endoscopic Ultrasound Criteria for the Diagnosis of Early Chronic Pancreatitis: Comparison between the 2009 and 2019 Japanese Diagnostic Criteria

by
Akira Yamamiya
1,
Atsushi Irisawa
1,*,
Keiichi Tominaga
1,
Kohei Tsuchida
1,
Takeshi Sugaya
1,
Misako Tsunemi
1,
Koki Hoshi
1,
Hidehito Jinnai
1,
Akane Yamabe
1,
Naoya Izawa
1,
Mari Iwasaki
1,
Yoichi Takimoto
1,
Akira Kanamori
1,
Kazunori Nagashima
1,
Takahito Minaguchi
1,
Ken Kashima
1,
Yasuhito Kunogi
1,
Ai Sato
2,
Kenichi Goda
1,
Makoto Iijima
1 and
Yasuo Haruyama
3
add Show full author list remove Hide full author list
1
Department of Gastroenterology, Dokkyo Medical University School of Medicine, 880 Kitakobayashi Mibu, Tochigi 321-0293, Japan
2
Department of Gastroenterology, Dokkyo Medical University Nikko Medical Center, 632 Takatoku Nikko, Tochigi 321-2593, Japan
3
Department of Public Health, Dokkyo Medical University School of Medicine, 880 Kitakobayashi Mibu, Tochigi 321-0293, Japan
*
Author to whom correspondence should be addressed.
Diagnostics 2021, 11(3), 431; https://doi.org/10.3390/diagnostics11030431
Submission received: 9 February 2021 / Revised: 1 March 2021 / Accepted: 1 March 2021 / Published: 3 March 2021
(This article belongs to the Section Medical Imaging and Theranostics)

Abstract

:
In 2009, diagnostic criteria for early chronic pancreatitis (DCECP2009) were proposed by the Japan Pancreas Society. This study aimed to evaluate the interobserver reliability (IOR) of endoscopic ultrasound (EUS) criteria for diagnosis of early chronic pancreatitis (ECP) between DCECP2009 and 2019 diagnostic criteria for ECP (DCECP2019) to assess the validity of the revision from the perspective of EUS findings. Among patients who underwent a detailed observation of the pancreas by EUS at our institution between January 2018 and March 2019, EUS images of 97 patients were extracted. Images were reviewed by 12 gastrointestinal endoscopy experts (eight pancreatologists, group A and four nonpancreatologists, group B). The overall kappa (K)-values for the IOR of the DCECP2009 EUS criteria were 0.424 and 0.563:0.231 for groups A:B, whereas the overall K-values for the DCECP2019 criteria were 0.618, and 0.733:0.442 for groups A:B. Regarding changes in the final diagnosis of ECP based on clinical features and EUS findings, 20 cases were definite ECP, 53 were probable ECP, and 24 were normal according to DCECP2009. In contrast, seven were definite ECP, 19 were probable ECP, and 71 were normal according to DECEP2019. IOR of DCECP2019 was higher than that of DCECP2009, which indicates an improvement in precision.

1. Introduction

Chronic pancreatitis (CP) is an irreversible and progressive inflammation of the pancreas. CP is characterized by extensive fibrosis in the pancreatic glands due to persistent and recurrent inflammation, which eventually leads to pancreatic exocrine and endocrine secretion disorders [1,2,3]. Its incidence in Japan is 52/100,000 individuals/year, which is continuing to increase [4]. CP is associated with a high incidence of pancreatic cancer, and early diagnosis and intervention can aid in preventing disease progression. In 2009, diagnostic criteria for early chronic pancreatitis (DCECP2009) were proposed by the Japan Pancreas Society [5]. The pathology and prognosis of early CP patient groups based on diagnostic criteria have been partially elucidated, but further clarification is needed [6,7]. According to DCECP2009 and DCECP2019, the diagnosis of early CP (ECP) requires Endoscopic ultrasound (EUS). The usefulness of EUSs was first reported in 1980 [8,9]. Since then, EUSs have been applied to the diagnosis and treatment of not only pancreatic diseases but also diseases of various organs [10,11,12,13,14,15]. EUSs are excellent for detecting small lesions due to their superior image resolution, and tissue sampling is also possible by applying this technique. According to DCECP2009, patients with two of four clinical signs (recurrent abdominal pain, blood and urinary pancreatic enzyme abnormalities, pancreatic exocrine disorder, and persistent heavy alcohol consumption equivalent to at least 80 g/day of pure ethanol (EtOH 80 g/day)), and those with more than two EUS-specific findings should be diagnosed with ECP (Table S1). Ten years after DCECP2009 was established, the ECP criteria were revised in 2019 (DCECP2019) [16,17]. In this revision, based on the accumulated data and expert opinion, EUS findings were grouped into confounding findings (hyperechoic foci without shadowing and stranding and lobularity with and without honeycombing), and the number decreased from seven features to four. Clinical features were also slightly modified (Table S2). DCECP2019 aimed to increase the diagnostic specificity of DCECP2009 (Tables S1 and S2). However, the interobserver reliability (IOR) of the EUS findings in the diagnosis of CP has been previously discussed, and the IOR of the EUS findings based on the Rosemont classification is not necessarily high [18,19]. We observed similar results in the literature review of the IOR of the EUS findings in the diagnosis of CP [20]. The use of diagnostic criteria that have been shown to have high IOR may lead to a highly accurate final diagnosis. Thus, regardless of the diagnostic criteria, if the IOR is high enough, the diagnostic validity cannot be disputed. DCECP2019 summarizes the findings from DCECP2009, and the IOR is expected to increase, but evidence for this has not been examined. In this study, we evaluated the IOR to determine the diagnostic accuracy of DCECP2009 and DCECP2019 in terms of EUS findings and confirmed that DCECP2019 has an increased IOR compared with DCECP2009. Finally, we evaluated the validity of the revision.

2. Materials and Methods

2.1. Study Design

This study was performed at Dokkyo Medical University and was approved by the Medical Ethics Committee of our institution (R-25-6J). The study was registered with the University Hospital Medical Information Network Clinical Trials Registry (000040676). In place of omitting informed consent, a means to opt out was provided, which allowed research subjects to be notified and enabled the publication of research information on our website.
The primary endpoint was the comparison of IOR of the DCECP2009 and DCECP2019 EUS criteria for the diagnosis of ECP. Secondary endpoints were: (1) IOR validation for each EUS finding in DCECP2009 and DCECP2019, (2) differences in kappa (K)-values between pancreatologists and nonpancreatologists, and (3) diagnostic changes due to the revised criteria (DCECP2019).

2.2. Observers for IOR Evaluation

EUS images were reviewed by 12 gastrointestinal endoscopy experts (board-certified fellows of the Japan Gastroenterological Endoscopy Society: A.I., K.G., A.S., K.To., K.Ts., T.S., A.Yamab., K.H., N.I., M.I., A.K., and T.M.), while A.Yamam, selected the EUS images. Of the 12 experts, eight were pancreatologists (group A) and four were nonpancreatologists (group B) based on DCECP2009 and DCECP2019. A pancreatologist was defined as an expert who has been engaged in clinical practice with expertise in pancreatic diseases for at least 5 years. Nonpancreatologists were defined as experts who have not been engaged in clinical practice with expertise in pancreatic disease for more than 5 years, but they have the skills and ability to interpret EUS images.

2.3. Collection of EUS Images and IOR Evaluation

In all, 773 patients underwent detailed observation of the pancreas by EUS at our institution between January 2018 and March 2019. When patients were diagnosed, all cases were finalized by an expert. One hundred consecutive patients with one or more EUS diagnostic findings of ECP based on DCECP2009 were extracted from the medical records by A.Ym. The examinations were performed using the GF-UE290 and GF-UCT260 (Olympus Co., Tokyo, Japan) and EG580UR and EG580UT (Fuji Film Co., Tokyo, Japan) electronic scanners. Therefore, in this study, 97 patients who were examined by GF-UE290 and GF-UCT260 were selected (Figure 1). According to the Rosemont criteria, three EUS images of the pancreatic body and tail were extracted per case [21]. Patients with pancreatic neoplasms, definite CP, and probable CP based on DCECP2009 and DCECP2019 at the time of diagnosis were excluded [5,8].
To first evaluate the IOR for each EUS finding of ECP, EUS images representing the following ECP findings were randomly selected: parenchymal features, such as lobularity with honeycombing, lobularity without honeycombing, hyperechoic foci without shadowing, stranding, and cysts, as well as ductal features, such as hyperechoic main pancreatic duct (MPD) margins and dilated side branches. EUS images of normal pancreas were also reviewed. The presence or absence of findings was evaluated by examiners who were blinded to the conditions. Each EUS finding was analyzed for IOR (multiple K statistics). Next, to evaluate the IOR of DCECP2009 and DCECP2019, three images were selected from each of the 97 selected cases, and a collection of images for evaluation was created for each case. Cases were diagnosed as ECP by observers according to the DCECP2009 and DCECP2019 and were analyzed for IOR. The standard of EUS diagnostic findings was agreed upon by most pancreatologists with the exception of A.Ym, who selected the EUS images.

2.4. DCECP2009

The definition of DCECP2009 is described in Supplementary Table S1. Clinical findings include (1) recurrent upper abdominal pain, (2) abnormal pancreatic enzyme levels in the serum or urine, (3) abnormal pancreatic exocrine function, and (4) continuous heavy alcohol consumption equivalent to at least 80 g/day of pure ethanol (EtOH 80 g/day). ECP is defined when a case has two or more features from (1) to (4) and when imaging findings of ECP are confirmed. Cases with only one finding (either (1) or (2)) and those that exhibit findings indicative of ECP are diagnosed with probable ECP.
EUS diagnostic findings (Figure 2) include (A) lobularity with honeycombing, (B) lobularity without honeycombing, (C) hyperechoic foci without shadowing, (D) stranding, (E) cysts, (F) dilated side branches, and (G) hyperechoic MPD margins. ECP is diagnosed by imaging when two or more findings, including any feature listed from (A) to (D), are observed. In addition, irregular dilatation in three or more branched pancreatic ducts, as seen by endoscopic retrograde cholangiopancreatography (ERCP), also served as imaging findings for ECP.

2.5. DCECP2019

The definition of DCECP2009 is described in Supplementary Table S2. Clinical findings include (1) recurrent upper abdominal pain or back pain, (2) abnormal pancreatic enzyme levels in the serum or urine, (3) abnormal pancreatic exocrine function, (4) continuous heavy alcohol consumption equivalent to at least 60 g/day of pure EtOH or pancreatitis-related genetic factors associated with susceptibility, and (5) a history of acute pancreatitis (AP). ECP is defined when three or more features from (1) to (5) are observed and when imaging findings of ECP are confirmed. Cases with two findings from (1) to (5) and that show the findings of ECP are diagnosed with probable ECP.
EUS diagnostic findings (Figure 3) include (A) hyperechoic foci with nonshadowing or stranding, (B) lobularity (nonhoneycombing or honeycombing type), (C) hyperechoic MPD margins, and (D) dilated side branches. ECP is diagnosed by imaging when two or more findings, including (A) or (B) or both, are observed. In addition, irregular dilatation in three or more branched pancreatic ducts, as seen on ERCP or magnetic resonance cholangiopancreatography, also served as imaging findings for ECP.

2.6. Equipment

The echoendoscope and universal ultrasound processors used were the GF-UE290, GF-UCT260, and EU-ME2 electronic scanners (Olympus Co., Tokyo, Japan).

2.7. Statistical Analysis

Data are shown as the mean ± standard deviation. Statistical analyses were performed using SPSS v. 27.0 statistical analysis software (SPSS Inc., Chicago, IL, USA). In this study, we used Fleiss’ K, which is a generalization of Scott’s pi (π) statistic, a statistical measure of inter-rater reliability [22,23]. This is also related to Cohen’s K statistic and Youden’s J statistic, which may be more appropriate in certain instances [24]. While Scott’s π and Cohen’s K are applicable to only two raters, Fleiss’ K works for any number of raters giving categorical ratings to a fixed number of items. The K statistic can be interpreted as expressing the extent to which the observed amount of agreement among raters exceeds what would be expected if all ratings were completely random. Notably, while Cohen’s K assumes that the same two raters have rated a set of items, when Fleiss’ kappa is used, although there is a fixed number of raters (e.g., three), different items may be rated by different individuals [22].
To evaluate IOR, the K-values were defined as follows: <0, no agreement; 0–0.20, slight; 0.21–0.40, fair; 0.41–0.60, moderate; 0.61–0.80, substantial; 0.81–1.00, almost perfect (Table S3) [25].
The statistics calculated in this study were evaluated by a statistician.

3. Results

3.1. Patient Characteristics

The baseline characteristics of the study population are shown in Table 1. Of the 100 consecutive patients with a median age of 64 years (±12 years), 63 (65%) were male and 27 (28%) had a history of AP. The reason for the EUS observation included alcohol abuse in 32 (36%), biliary stones in eight (8%), idiopathic causes in 47 (48%), and some other etiology in six (6%).
The positivity of each clinical finding in DCECP2009 and DCECP2019 was as follows: recurrent upper abdominal pain in 76 patients (78%), recurrent back pain in 27 patients (28%), abnormal pancreatic enzyme levels in the serum or urine in 36 patients (37%),abdominal pancreatic exocrine function in 0 patients (0%), pancreatitis-related susceptibility genes in 0 patients (0%), continuous heavy alcohol consumption equivalent to at least 80 g/day of pure EtOH in 18 patients (19%), and continuous heavy alcohol consumption equivalent to at least 60 g/day of pure EtOH in 32 patients (36%) (Table 1).
In DCECP2009, the number of cases per clinical finding was 0/1/2/3 (number of items) = 6/67/23/1 (number of cases). In contrast, in DCECP2019, the number of items was 0/1/2/3/4 (number of items) = 1/45/37/13/1 (number of cases).

3.2. IOR of Each EUS Finding in DCECP2009 and DCECP2019

The overall IOR for each EUS finding in DCECP2009 was as follows: (1) lobularity with honeycombing—K = 0.700, (2) lobularity without honeycombing—K = 0.661, (3) hyperechoic foci without shadowing—K = 0.570, (4) stranding—K = 0.524, (5) cysts—K = 0.816, (6) hyperechoic MPD margins—K = 0.771, and (7) dilated side branches—K = 0.552 (Table 2). The IOR for each EUS feature in DCECP2019 was as follows: (1) hyperechoic foci, nonshadowing/stranding—K = 0.741, (2) lobularity (nonhoneycombing/honeycombing type)—K = 0.706, (3) hyperechoic MPD margins—K = 0.771, and (4) dilated side branches—K = 0.552 (Table 3).

3.3. IOR of the DCECP2009 and DCECP2019 EUS Criteria

The overall K-value for the IOR of EUS criteria in DCECP2009 was 0.424 and was 0.563:0.231 for groups A:B. In DCECP2019, the K-value was 0.618 and was 0.733:0.442 for groups A:B (Table 4).

3.4. ECP Diagnosis between Pancreatologists and Nonpancreatologists

The ratio of ECP to normal cases according to the DCECP2009 EUS criteria was 77:72 cases, while that according to the DCECP2009 EUS criteria in group B was 72:25 cases. In contrast, the ratio of ECP to normal cases according to the DCECP2019 EUS criteria in group A was 49:48 cases, while that according to DCECP2019 in group B was 50:47 cases (Table 5).

3.5. Changes in the Final Diagnosis of ECP Based on Clinical and EUS Findings in DCECP2009 and DCECP2019

In group A, K > 0.5 for both DCECP2009 and DCECP2019, and IOR was moderate. The EUS diagnostic findings in group A served as the final diagnosis. DCECP2009 included 20 cases of definite ECP, 53 cases of probable ECP, and 24 normal cases. In contrast, DCECP2019 included seven cases of definite ECP, 19 cases of probable ECP, and 71 normal cases (Figure 4).

4. Discussion

Recently, “ECP”, which describes the early stages of CP as opposed to later, more advanced stages, has been increasingly recognized. Research on the concept of ECP includes its early diagnosis and therapeutic intervention before CP becomes irreversible, which may aid in avoiding late-stage complication and improving clinical outcomes [26].
The first diagnostic criteria for ECP were introduced in the 2009 Japanese clinical diagnostic criteria for CP [27]. According to these criteria, ECP can be diagnosed based on clinical and imaging features. Particularly, EUS, which reveals specific features of ECP, was positioned as a very important imaging modality in DCECP2009, and the EUS findings adopted in these criteria were selected from the EUS diagnostic criteria (Rosemont classification) [21]. A previous report showed that approximately 5% of cases diagnosed as ECP according to the DCECP2009 have been shown to progress to definite CP, [6] and thus, the reliability of this diagnostic classification system has to some extent been accepted. However, the clinical features of ECP differ from those of definite and probable CP, such that ECP is associated with a high proportion of women, an idiopathic nature, and an older age at onset. In recent reports where DCECP2009 was used, the differences in the clinical features between CP and ECP posed a problem [6,7,28]. In addition, an international movement has begun to define CP mechanically, and a new definition of CP (a mechanistic definition) that incorporates the concept of ECP was proposed [3]. Consequently, the criteria for ECP were revised in 2019 (DCECP2019). Although the EUS criteria in DCECP2009 were established based on the Rosemont classification, it was reported that the Rosemont classification did not significantly increase IOR for the EUS-based diagnosis of CP compared with conventional scoring [18,29,30,31,32,33]. The reason for this is that it is difficult to interpret EUS findings of ECP. In DCECP2019, of the seven EUS features established in DCECP2009, the confounding features (hyperechoic foci and strands and lobularity with and without honeycomb) were grouped together into four categories. Thus, this revision is expected to simplify diagnosis and improve diagnostic specificity.
With this revision, if the diagnostic specificity of ECP increases, it is assumed that it would be useful for the early treatment intervention in patients with CP. However, these newly proposed criteria have never been validated or demonstrated to have better IOR than previous criteria. Furthermore, IOR of EUS findings for CP diagnosis is not always highly based on previous literature [18,19], and therefore, we performed this study to assess the significance of the new diagnostic criteria, DCECP2019, from the viewpoint of IOR of the EUS findings, which are crucial for diagnosis. In this study, K-values for each EUS finding were over 0.5 (moderate), and a high IOR was confirmed for each finding. Therefore, both sets of diagnostic criteria (DCECP2009 and DCECP2019), which were established according to EUS findings with high IOR, can be considered to have high reliability as diagnostic imaging features. Consequently, we compared the IOR of EUS criteria for an ECP diagnosis between DCECP2009 and DCECP2019. Our results indicated that the IOR of the DCECP2019 EUS criteria was higher than that of DCECP2009. This suggests that the reliability of the diagnostic criteria for ECP has been further improved in this revision.
In this study, we also investigated the IOR of DCECP by dividing the observers into pancreatologists and nonpancreatologists. The differences in IOR were associated with differences in specialty even among gastrointestinal endoscopy experts. In the pancreatologist group, K is >0.5 for both DCECP2009 and DCECP2019, which indicates moderate IOR of EUS criteria for both. However, the IOR of EUS criteria in the nonpancreatologist group was low for both criteria, which suggests that IOR of DCECP among nonpancreatologists is insufficient. Therefore, while the medical treatment of patients with advanced CP should be entrusted to a pancreatologist for early-stage CP diagnosis, various gastroenterologists should be involved in the diagnosis. Therefore, it is desirable to further educate all gastrointestinal endoscopists on the EUS diagnostic findings of ECP to promote and improve diagnosis.
We also examined how the diagnosis by pancreatologists based on DCECP2009 would change with DCECP2019. With the revision of the diagnostic items, the prevalence of definite and probable ECP and the prevalence of non-CP (normal) have increased.In DCECP2019, it is now possible to identify cases with risk factors for ECP by decreasing the alcohol consumption threshold and incorporating a history of AP in clinical diagnoses. On the contrary, the number of positive clinical findings increased from two to three, which suggests that DCECP2019 may be more suitable for identifying ECP patients with greater certainty than DCECP2009. This suggests that changes in both the ECP findings and the clinical diagnosis may have contributed to the change in final diagnosis. Therefore, the intended revision, DCECP2019 can better indicate “true” ECP.
The limitations of this study are the lack of a gold standard for diagnostic imaging of ECP, the single-center and retrospective design, the small number of patients, and that diagnosis being based on only three EUS images. However, this study only evaluated the IOR of each EUS finding for ECP and the diagnostic criteria, and thus, the lack of a gold standard is irrelevant. Nevertheless, a gold standard for ECP is still required to discuss the validity of these diagnostic criteria, and alternatively, positive follow-up may be necessary. Although the EUS was performed by various endosonographers, all cases were finalized/certified by experts. Therefore, although reliability of the EUS images is guaranteed, selection bias may be involved in the selection process. As a statistical limitation, the number of categories and subjects affects the magnitude of the value, and therefore, the K-value may be higher with fewer categories [34].
In the future, a prospective multicenter study that addresses these limitations will be needed.

5. Conclusions

IOR of the DCECP2019 EUS diagnostic criteria was higher than that of DCECP2009. Although the IOR of the EUS diagnostic criteria is considered a limitation of ECP diagnosis, our results suggest that DCECP2019 has improved precision and higher reliability than DCECP2009.

Supplementary Materials

The following are available online at https://www.mdpi.com/2075-4418/11/3/431/s1: Supplementary Table S1. Clinical diagnostic criteria for early chronic pancreatitis 2009; Supplementary Table S2. Clinical diagnostic criteria for early chronic pancreatitis 2019; Supplementary Table S3. Interobserver reliability (K statistic).

Author Contributions

A.Y. (Akira Yamamiya) and A.I. designed this study, collected and analyzed the data, and drafted the manuscript; A.I. checked the manuscript and approved the final version; A.Y. (Akira Yamamiya), A.I., A.K., T.M., K.N., N.I., M.I., Y.T., H.J., A.Y. (Akane Yamabe), K.H., M.T., T.S., K.T. (Keiichi Tominaga), K.T. (Kohei Tsuchida), A.S., M.I., and K.G. analyzed the data. A.Y. (Akira Yamamiya), K.K. and Y.K. created the figures. Y.H. analyzed the data as an expert in statistics. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

The Medical Ethics Committee of Dokkyo Medical University (R-25-6J).

Informed Consent Statement

A means to opt out was provided instead of omitting informed consent, which is a way to guarantee the opportunity for research subjects to be notified and publish research information on our website.

Data Availability Statement

No new data were created or analyzed in this study. Data sharing is not applicable to this article.

Acknowledgments

We wish to thank Ishida M, Iizuka S, and Nishimoto M (Department of Gas tro enterology, Dokkyo Medical University) for their technical assistance.

Conflicts of Interest

The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

References

  1. Witt, H.; Apte, M.V.; Keim, V.; Wilson, J.S. Chronic Pancreatitis: Challenges and Advances in Pathogenesis, Genetics, Diagnosis, and Therapy. Gastroenterology 2007, 132, 1557–1573. [Google Scholar] [CrossRef]
  2. Braganza, J.M.; Lee, S.H.; McCloy, R.F.; McMahon, M.J. Chronic pancreatitis. Lancet 2011, 377, 1184–1197. [Google Scholar] [CrossRef]
  3. Whitcomb, D.C.; Frulloni, L.; Garg, P.; Greer, J.B.; Schneider, A.; Yadav, D.; Shimosegawa, T. Chronic pancreatitis: An international draft consensus proposal for a new mechanistic definition. Pancreatology 2016, 16, 218–224. [Google Scholar] [CrossRef] [PubMed]
  4. Hirota, M.; Shimosegawa, T.; Masamune, A.; Kikuta, K.; Kume, K.; Hamada, S.; Kanno, A.; Kimura, K.; Tsuji, I.; Kuriyama, S. The seventh nationwide epidemiological survey for chronic pancreatitis in Japan: Clinical significance of smoking habit in Japanese patients. Pancreatology 2014, 14, 490–496. [Google Scholar] [CrossRef]
  5. Shimosegawa, T.; Kataoka, K.; Kamisawa, T.; Miyakawa, H.; Ohara, H.; Ito, T.; Naruse, S.; Sata, N.; Suda, K.; Hirota, M.; et al. The revised Japanese clinical diagnostic criteria for chronic pancreatitis. J. Gastroenterol. 2010, 45, 584–591. [Google Scholar] [CrossRef]
  6. Masamune, A.; Nabeshima, T.; Kikuta, K.; Hamada, S.; Nakano, E.; Kume, K.; Kanno, A.; Sato, A.; Tachibana, Y.; Inatomi, O.; et al. Prospective study of early chronic pancreatitis diagnosed based on the Jap-anese diagnostic criteria. J. Gastroenterol. 2019, 54, 928–935. [Google Scholar] [CrossRef] [PubMed]
  7. Masamune, A.; Kikuta, K.; Nabeshima, T.; Nakano, E.; Hirota, M.; Kanno, A.; Kume, K.; Hamada, S.; Ito, T.; Fujita, M.; et al. Nationwide epidemiological survey of early chronic pancreatitis in Japan. J. Gastroenterol. 2017, 52, 992–1000. [Google Scholar] [CrossRef] [PubMed]
  8. Dimagno, E.; Regan, P.; Wilson, D.; Buxton, J.; Hattery, R.; Suarez, J.; Green, P. ULTRASONIC ENDOSCOPE. Lancet 1980, 315, 629–631. [Google Scholar] [CrossRef]
  9. Strohm, W.D.; Phillip, J.; Hagenmüller, F.; Classen, M. Ultrasonic Tomography by Means of an Ultrasonic Fiberendoscope. Endosc. 1980, 12, 241–244. [Google Scholar] [CrossRef] [PubMed]
  10. Nagashima, K.; Irisawa, A.; Tominaga, K.; Kashima, K.; Kunogi, Y.; Minaguchi, T.; Izawa, N.; Yamamiya, A.; Yamabe, A.; Hoshi, K.; et al. The Role of Endoscopic Ultrasound for Esophageal Varices. Diagnostics 2020, 10, 1007. [Google Scholar] [CrossRef] [PubMed]
  11. Takasaki, Y.; Ishii, S.; Fujisawa, T.; Ushio, M.; Takahashi, S.; Yamagata, W.; Ito, K.; Suzuki, A.; Ochiai, K.; Tomishima, K.; et al. Endoscopic Ultrasonography Findings of Early and Suspected Early Chronic Pancreatitis. Diagnostics 2020, 10, 1018. [Google Scholar] [CrossRef]
  12. Kanno, A.; Ikeda, E.; Ando, K.; Nagai, H.; Miwata, T.; Kawasaki, Y.; Tada, Y.; Yokoyama, K.; Numao, N.; Ushio, J.; et al. The Diagnosis of Autoimmune Pancreatitis Using Endoscopic Ultrasonography. Diagnostics 2020, 10, 1005. [Google Scholar] [CrossRef]
  13. Khalid, A.; Nodit, L.; Zahid, M.; Bauer, K.; Brody, D.; Finkelstein, S.D.; McGrath, K.M. Endoscopic Ultrasound Fine Needle Aspirate DNA Analysis to Differentiate Malignant and Benign Pancreatic Masses. Am. J. Gastroenterol. 2006, 101, 2493–2500. [Google Scholar] [CrossRef]
  14. Johnson, K.D.; Laoveeravat, P.; Yee, E.U.; Perisetti, A.; Thandassery, R.B.; Tharian, B. Endoscopic ultrasound guided liver biopsy: Recent evidence. World J. Gastrointest. Endosc. 2020, 12, 83–97. [Google Scholar] [CrossRef]
  15. George, N.; Perisetti, A.; Raghavapuram, S.; Banerjee, D.; Kuo, E.; Tharian, B. Metastatic duodenal germ cell tumor diagnosed with endoscopic ultrasound. Endoscopy 2018, 41, E27–E28. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  16. Masamune, A.; Irisawa, A.; Kikuta, K.; Hamada, S.; Nakano, E.; Kume, K.; Kanno, A.; Sato, A.; Tachibana, Y.; Inatomi, O.; et al. Background and summary of the clinical diagnostic criteria for chronic pan-creatitis 2019. J. Jpn. Pancreas Soc. 2019, 34, 282–292, (In Japanese, English abstract). [Google Scholar] [CrossRef]
  17. Masamune, A.; Society, F.T.J.P.; Kikuta, K.; Kume, K.; Hamada, S.; Tsuji, I.; Takeyama, Y.; Shimosegawa, T.; Okazaki, K. Nationwide epidemiological survey of chronic pancreatitis in Japan: Introduction and validation of the new Japanese diagnostic criteria 2019. J. Gastroenterol. 2020, 55, 1062–1071. [Google Scholar] [CrossRef] [PubMed]
  18. Stevens, T.; Lopez, R.; Adler, D.G.; Al-Haddad, M.A.; Conway, J.; DeWitt, J.M.; Forsmark, C.E.; Kahaleh, M.; Lee, L.S.; Levy, M.J.; et al. Multicenter comparison of the interobserver agreement of standard EUS scoring and Rosemont classification scoring for diagnosis of chronic pancreatitis. Gastrointest. Endosc. 2010, 71, 519–526. [Google Scholar] [CrossRef] [PubMed]
  19. Del Pozo, D.; Poves, E.; Tabernero, S.; Beceiro, I.; Maral, I.; Villafruela, M.; Sanz, C.; Borrego, G. Conventional versus Rosemont endoscopic ultrasound criteria for chronic pan-creatitis: Interobserver agreement in same day back-to-back procedures. Pancreatology 2012, 12, 284–287. [Google Scholar] [CrossRef]
  20. Yamamiya, A.; Irisawa, A.; Kashima, K.; Kunogi, Y.; Nagashima, K.; Minaguchi, T.; Izawa, N.; Yamabe, A.; Hoshi, K.; Tominaga, K.; et al. Interobserver Reliability of Endoscopic Ultrasonography: Literature Review. Diagnostics 2020, 10, 953. [Google Scholar] [CrossRef]
  21. Catalano, M.F.; Sahai, A.; Levy, M.; Romagnuolo, J.; Wiersema, M.; Brugge, W.; Freeman, M.; Yamao, K.; Canto, M.; Hernandez, L.H. EUS-based criteria for the diagnosis of chronic pancreatitis: The Rosemont classifi-cation. Gastrointest. Endosc. 2009, 69, 1251–1261. [Google Scholar] [CrossRef] [PubMed]
  22. Fleiss, J.L. Measuring nominal scale agreement among many raters. Psychol. Bull. 1971, 76, 378–382. [Google Scholar] [CrossRef]
  23. Scott, W.A. Reliability of Content Analysis: The Case of Nominal Scale Coding. Public Opin. Q. 1955, 19, 321. [Google Scholar] [CrossRef]
  24. Powers, D.M.W. Evaluation: From precision, recall and F-measure to ROC, informedness, markedness & correlation. J. Mach. Learn. Res. 2011, 2, 37–63. [Google Scholar]
  25. Landis, J.R.; Koch, G.G. The Measurement of Observer Agreement for Categorical Data. Biometrics 1977, 33, 159–174. [Google Scholar] [CrossRef] [Green Version]
  26. Raimondi, S.; Lowenfels, A.B.; Morselli-Labate, A.M.; Maisonneuve, P.; Pezzilli, R. Pancreatic cancer in chronic pancreatitis; aetiology, incidence, and early detection. Best Pr. Res. Clin. Gastroenterol. 2010, 24, 349–358. [Google Scholar] [CrossRef] [PubMed]
  27. Ueda, J.; Tanaka, M.; Ohtsuka, T.; Tokunaga, S.; Shimosegawa, T. Surgery for chronic pancreatitis decreases the risk for pancreatic cancer: A multicenter retrospective analysis. Surgery 2013, 153, 357–364. [Google Scholar] [CrossRef] [PubMed]
  28. Lee, L.S.; Tabak, Y.P.; Kadiyala, V.; Sun, X.; Suleiman, S.; Johannes, R.S.; Banks, P.A.; Conwell, D.L. Diagnosis of Chronic Pancreatitis Incorporating Endosonographic Features, Demographics, and Behavioral Risk. Pancreas 2017, 46, 405–409. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  29. Sato, A.; Irisawa, A.; Bhutani, M.S.; Shibukawa, G.; Yamabe, A.; Fujisawa, M.; Igarashi, R.; Arakawa, N.; Oshida, Y.; Abe, Y.; et al. Significance of normal appearance on endoscopic ultrasonography in the diagnosis of early chronic pancreatitis. Endosc. Ultrasound 2018, 7, 110–118. [Google Scholar]
  30. Sahai, A.V.; Zimmerman, M.; Aabakken, L.; Tarnasky, P.R.; Cunningham, J.T.; Van Velse, A.; Hawes, R.H.; Hoffman, B.J. Prospective assessment of the ability of endoscopic ultrasound to diagnose, exclude, or establish the severity of chronic pancreatitis found by endoscopic retrograde cholangiopancreatography. Gastrointest. Endosc. 1998, 48, 18–25. [Google Scholar] [CrossRef]
  31. Wiersema, M.J.; Hawes, R.H.; Lehman, G.A.; Kochman, M.L.; Sherman, S.; Kopecky, K.K. Prospective Evaluation of Endoscopic Ultrasonography and Endoscopic Retrograde Cholangiopancreatography in Patients with Chronic Abdominal Pain of Suspected Pancreatic Origin. Endoscopy 1993, 25, 555–564. [Google Scholar] [CrossRef] [PubMed]
  32. Gardner, T.B.; Gordon, S.R. Interobserver Agreement for Pancreatic Endoscopic Ultrasonography Determined by Same Day Back-to-Back Examinations. J. Clin. Gastroenterol. 2011, 45, 542–545. [Google Scholar] [CrossRef] [PubMed]
  33. Wallace, M.B.; Hawes, R.H.; Durkalski, V.; Chak, A.; Mallery, S.; Catalano, M.F.; Wiersema, M.J.; Bhutani, M.S.; Ciaccia, D.; Kochman, M.L.; et al. The reliability of EUS for the diagnosis of chronic pancreatitis: Interobserver agreement among experienced endosonographers. Gastrointest. Endosc. 2001, 53, 294–299. [Google Scholar] [CrossRef]
  34. Sim, J.; Wright, C.C. The Kappa Statistic in Reliability Studies: Use, Interpretation, and Sample Size Requirements. Phys. Ther. 2005, 85, 257–268. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Figure 1. Flow diagram of this study.EUS, endoscopic ultrasound; ECP, early chronic pancreatitis.
Figure 1. Flow diagram of this study.EUS, endoscopic ultrasound; ECP, early chronic pancreatitis.
Diagnostics 11 00431 g001
Figure 2. Each endoscopic ultrasound (EUS) finding in diagnostic criteria for early chronic pancreatitis 2009 (DCECP2009). (A) Lobularity with honeycombing; (B) lobularity without honeycombing; (C) hyperechoic foci without shadowing; (D) stranding; (E) cysts; (F) dilated side branches; (G) hyperechoic main pancreatic duct (MPD) margins.
Figure 2. Each endoscopic ultrasound (EUS) finding in diagnostic criteria for early chronic pancreatitis 2009 (DCECP2009). (A) Lobularity with honeycombing; (B) lobularity without honeycombing; (C) hyperechoic foci without shadowing; (D) stranding; (E) cysts; (F) dilated side branches; (G) hyperechoic main pancreatic duct (MPD) margins.
Diagnostics 11 00431 g002
Figure 3. Each EUS finding in DCECP2019; (A1,A2) hyperechoic foci; nonshadowing/stranding; (B1,B2) lobularity (nonhoneycombing/honeycombing type); (C1,C2) hyperechoic MPD margins; (C3,C4) dilated side branches.
Figure 3. Each EUS finding in DCECP2019; (A1,A2) hyperechoic foci; nonshadowing/stranding; (B1,B2) lobularity (nonhoneycombing/honeycombing type); (C1,C2) hyperechoic MPD margins; (C3,C4) dilated side branches.
Diagnostics 11 00431 g003
Figure 4. Changes in final diagnosis between DCECP2009 and DCECP2019, DCECP, diagnostic criteria for early chronic pancreatitis; ECP, early chronic pancreatitis.
Figure 4. Changes in final diagnosis between DCECP2009 and DCECP2019, DCECP, diagnostic criteria for early chronic pancreatitis; ECP, early chronic pancreatitis.
Diagnostics 11 00431 g004
Table 1. Patient characteristics and clinical features in DCECP2009 and DCECP2019.
Table 1. Patient characteristics and clinical features in DCECP2009 and DCECP2019.
Age, Mean ± SD (Range)64 ± 12 (21–84)
Sex, male, n (%)63 (65)
Reason for EUS observation, n (%)
Alcohol abuse32 (36)
Stone in biliary system8 (8)
Recurrent upper abdominal pain, n (%)76 (78)
Recurrent back pain, n (%)27 (28)
Abnormal pancreatic enzyme levels in the serum or urine, n (%)36 (37)
Abnormal pancreatic exocrine function, n (%)0(0)
Pancreatitis-related susceptibility genes, n (%)0(0)
Continuous heavy alcohol consumption equivalent to at least 80 g/day of pure ethanol, n (%)18 (19)
Continuous heavy alcohol consumption equivalent to at least 60 g/day of pure ethanol, n (%)32 (36)
History of AP, n (%)27 (28)
Number of cases per clinical finding (number of items)
DCECP2009 (0/1/2/3)6/67/23/1
DCECP2019 (0/1/2/3/4)1/45/37/13/1/
DCECP, diagnostic criteria for early chronic pancreatitis; SD, standard deviation; AP, acute pancreatitis.
Table 2. Interobserver reliability of each EUS finding in DCECP2009.
Table 2. Interobserver reliability of each EUS finding in DCECP2009.
EUS FindingsOverall
Kappa (95% CI)
A Group
(Pancreatologists)
Kappa (95% CI)
B Group
(Nonpancreatologists)
Kappa (95% CI)
Lobularity with honeycombing0.700
(0.624–0.776)
0.736
(0.619–0.854)
0.583
(0.330–0.836)
Lobularity without honeycombing0.661
(0.584–0.737)
0.730
(0.621–0.847)
0.467
(0.214–0.720)
Hyperechoic foci without shadowing0.570
(0.494–0.647)
0.576
(0.459–0.693)
0.524
(0.271–0.777)
Stranding0.524
(0.448–0.600)
0.545
(0.428–0.662)
0.457
(0.204–0.710)
Cysts0.816
(0.740–0.893)
0.912
(0.795–1.029)
0.600
(0.347–0.853)
Dilated side branches0.552
(0.476–0.628)
0.576
(0.459–0.693)
0.430
(0.177–0.683)
Hyperechoic MPD margins0.771
(0.695–0.995)
0.812
(0.695–0.930)
0.734
(0.481–0.987)
MPD, main pancreatic duct.
Table 3. Interobserver reliability of each EUS finding in DCECP2019.
Table 3. Interobserver reliability of each EUS finding in DCECP2019.
EUS FindingsOverall
Kappa (95% CI)
A Group
(Pancreatologists)
Kappa (95% CI)
B Group
(Nonpancreatologists)
Kappa (95% CI)
Hyperechoic foci; nonshadowing/stranding0.741
(0.687–0.795)
0.827
(0.744–0.910)
0.541
(0.362–0.720)
Lobularity0.706
(0.652–0.760)
0.862
(0.779–0.945)
0.695
(0.516–0.874)
Hyperechoic MPD margins0.771
(0.695–0.995)
0.812
(0.695–0.930)
0.734
(0.481–0.987)
Dilated side branches 0.552
(0.476–0.628)
0.576
(0.459–0.693)
0.430
(0.177–0.683)
MPD, main pancreatic duct.
Table 4. Interobserver reliability of EUS criteria in DCECP2009 and DCECP2019.
Table 4. Interobserver reliability of EUS criteria in DCECP2009 and DCECP2019.
ObserversDCECP2009
Kappa (95% CI)
DCECP2019
Kappa (95% CI)
Overall0.424
(0.399–0.448)
0.618
(0.594–0.643)
A group
(pancreatologists)
0.563
(0.525–0.601)
0.733
(0.696–0.771)
B group
(nonpancreatologists)
0.231
(0.150–0.313)
0.442
(0.361–0.523)
DCECP, diagnostic criteria for early chronic pancreatitis.
Table 5. Early chronic pancreatitis (ECP) diagnosis between pancreatologists and nonpancreatologists.
Table 5. Early chronic pancreatitis (ECP) diagnosis between pancreatologists and nonpancreatologists.
ObserversDCECP2009
ECP
NormalDCECP2019
ECP
Normal
Overall75225344
A group
(pancreatologists)
77204948
B group
(nonpancreatologists)
72255047
DCECP, diagnostic criteria for early chronic pancreatitis.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Yamamiya, A.; Irisawa, A.; Tominaga, K.; Tsuchida, K.; Sugaya, T.; Tsunemi, M.; Hoshi, K.; Jinnai, H.; Yamabe, A.; Izawa, N.; et al. Interobserver Reliability of the Endoscopic Ultrasound Criteria for the Diagnosis of Early Chronic Pancreatitis: Comparison between the 2009 and 2019 Japanese Diagnostic Criteria. Diagnostics 2021, 11, 431. https://doi.org/10.3390/diagnostics11030431

AMA Style

Yamamiya A, Irisawa A, Tominaga K, Tsuchida K, Sugaya T, Tsunemi M, Hoshi K, Jinnai H, Yamabe A, Izawa N, et al. Interobserver Reliability of the Endoscopic Ultrasound Criteria for the Diagnosis of Early Chronic Pancreatitis: Comparison between the 2009 and 2019 Japanese Diagnostic Criteria. Diagnostics. 2021; 11(3):431. https://doi.org/10.3390/diagnostics11030431

Chicago/Turabian Style

Yamamiya, Akira, Atsushi Irisawa, Keiichi Tominaga, Kohei Tsuchida, Takeshi Sugaya, Misako Tsunemi, Koki Hoshi, Hidehito Jinnai, Akane Yamabe, Naoya Izawa, and et al. 2021. "Interobserver Reliability of the Endoscopic Ultrasound Criteria for the Diagnosis of Early Chronic Pancreatitis: Comparison between the 2009 and 2019 Japanese Diagnostic Criteria" Diagnostics 11, no. 3: 431. https://doi.org/10.3390/diagnostics11030431

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop