Next Article in Journal
Prediction of Fetal Death in Preterm Preeclampsia Using Fetal Sex, Placental Growth Factor and Gestational Age
Previous Article in Journal
Myocarditis: Differences in Clinical Expression between Patients with ST-Segment Elevation in Electrocardiogram vs. Patients without ST-Segment Elevation
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A 5-Year Mortality Prediction Model for Prostate Cancer Patients Based on the Korean Nationwide Health Insurance Claims Database

1
Department of Artificial Intelligence, University of Seoul, Seoul 02540, Republic of Korea
2
Department of Biostatistics and Computing, Yonsei University Graduate School, Seoul 03722, Republic of Korea
3
Department of Urology, Chungbuk National University Hospital, Cheongju 28644, Republic of Korea
4
Department of Urology, College of Medicine, Chungbuk National University, Cheongju 28644, Republic of Korea
5
Department of Family Medicine, Severance Hospital, Yonsei University College of Medicine, Seoul 03722, Republic of Korea
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
These authors contributed equally to this work.
J. Pers. Med. 2024, 14(10), 1058; https://doi.org/10.3390/jpm14101058
Submission received: 9 September 2024 / Revised: 1 October 2024 / Accepted: 9 October 2024 / Published: 13 October 2024
(This article belongs to the Section Epidemiology)

Abstract

:
Background: Prostate cancer is the fourth most common cancer and eighth leading cause of cancer-related mortality worldwide. Its incidence is increasing in South Korea. This study aimed to investigate a predictive model for the 5-year survival probability of prostate cancer patients in a Korean primary care setting. Method: This retrospective study used data from the nationwide insurance claims database. The main outcome was survival probability 5 years after the initial diagnosis of prostate cancer. Potential confounding factors such as age, body mass index (BMI), blood pressure, laboratory results, lifestyle behaviors, household income, and comorbidity index were considered. These variables were available in the national health check-up information. A Cox proportional hazards regression model was used to develop the predictive model. The predictive performance was calculated based on the mean area under the receiver operating characteristic curve (AUC) after 10-fold cross-validation. Results: The mean 5-year survival probability was 82.0%. Age, fasting glucose and gamma-glutamyl transferase levels, current smoking, and multiple comorbidities were positively associated with mortality, whereas BMI, alkaline phosphatase levels, total cholesterol levels, alcohol intake, physical activity, and household income were inversely associated with mortality. The mean AUC after 10-fold cross-validation was 0.71. Conclusions: The 5-year survival probability model showed a moderately good predictive performance. This may be useful in predicting the survival probability of prostate cancer patients in primary care settings. When interpreting these results, potential limitations, such as selection or healthy user biases, should be considered.

1. Introduction

Cancer statistics in South Korea (hereinafter, Korea) reported that malignant neoplasms are the leading cause of death in Korea, even though mortality rates due to cancers have declined since 2002 [1]. Globally, the disease burden of malignant neoplasms has increased [2]. Prostate cancer is the fourth most common cancer and eighth leading cause of cancer-related mortality worldwide [3]. In 2021, the age-standardized prostate cancer incidence and mortality per 100,000 individuals in Korea were 34.5 and 4.0, respectively [4]. Age-standardized prostate cancer incidence and mortality ranked fourth and seventh in 2021 [4]. The incidence and mortality of prostate cancer has steadily increased since 1999. The National Cancer Screening Programs provided by the National Health Insurance Service (NHIS) include screening for five types of cancer (stomach, colorectum, liver, breast, and uterine cervix) but do not include prostate cancer screening [5]. The NHIS does not provide routine screening for prostate cancer because it is unclear whether prostate-specific antigen (PSA)-based screening can reduce prostate cancer-related or overall mortality and be cost-effective [6,7].
The 5-year relative survival rate of prostate cancer patients in Korea increased from 59.1% in 1993–1995 to 96.0% in 2017–2021 [4]. The mortality of prostate cancer is significantly affected by pathological stage and Gleason score [8]. In addition, age at diagnosis, comorbidities, body mass index (BMI), economic status, and lifestyle behaviors such as cigarette smoking, alcohol intake, and physical activity are associated with the deaths of patients with various types of cancer including prostate cancer [9,10,11,12,13]. If the survival probability of prostate cancer patients can be estimated based on health check-up information and personal health records, primary care doctors and prostate cancer specialists could provide them with appropriate health care and resources according to their life expectancy.
Therefore, this study aimed to develop a survival prediction model for patients with prostate cancer using national health check-up data that are easily available in outpatient clinics. Thus, this model may be useful for predicting the 5-year survival probability in primary care settings, where information regarding the pathological stage, Gleason score, or anti-cancer treatment details is difficult to obtain.

2. Materials and Methods

2.1. Data Source

This retrospective cohort study was designed using the NHIS Health Screening cohort from the Korean National Health Information Database (NHID). Almost all Koreans are subscribed to the NHIS (approximately 97%), with the exception of Medicaid beneficiaries (approximately 3%) who have very low household incomes. The NHIS collects insurance information (type and premium based on annual household income); information regarding medical utilization, including hospital visits, medical records (such as prescriptions, treatments, and diagnostic codes), and medical expenses; death information (date and cause of death) from death certificates; and data from health check-ups. The NHIS provides adults over 40 years of age with a biennial national health screening program during which the following details are collected: anthropometric data (height and body weight); blood pressure; lifestyle behaviors, such as cigarette smoking, alcohol intake, and physical activity; personal medical history; and laboratory test results, including fasting glucose, alanine aminotransferase (ALT), gamma-glutamyl transferase (GGT), and total cholesterol concentrations. The NHIS collects and provides these data to researchers for research purposes only. The last date that data were accessed for this study was 1 April 2024.
The institutional review board of Severance Hospital, Yonsei University Health System approved this study on 28 November 2023 (IRB No. 4-2023-1318). This study complied with the tenets of the 1964 Helsinki Declaration.

2.2. Study Population and Definition of Malignant Neoplasms

Figure 1 shows the flowchart of participants according to the inclusion and exclusion criteria. This cohort included patients with a prostate cancer diagnosis between 2002 and 2021. To enroll patients newly diagnosed with prostate cancer and rule out false positives for prostate cancer, those diagnosed before September 2005 when the special code system began were excluded from the study. Initially, 120,984 individuals diagnosed with prostate cancer between 2005 and 2016 were included in the study. The following exclusion criteria were applied: (1) patients who had been already diagnosed with prostate cancer between January 2002 and August 2005 before enrollment (n = 16,071); (2) individuals aged ≥ 80 years at the time of enrollment (n = 17,192); (3) patients with a history of malignant neoplasms other than prostate cancer (n = 12,059); (4) individuals who did not undergo a national health check-up within one year of being newly diagnosed with prostate cancer (n = 18,972); (5) individuals with missing data (n = 42,422); and (6) individuals with follow-up data of less than 30 days after diagnosis (n = 40). In total, 14,228 men were included in the final analysis.
Patients with malignant neoplasms were defined using the special codes V193 or V194 in their medical records. Special codes are normally assigned for severe refractory diseases, such as cardiovascular diseases or end-stage kidney diseases, and rare diseases, such as congenital anomalies or severe metabolic diseases, in addition to malignant neoplasms [14]. Patients with special diseases are more strictly monitored than patients with more common or less refractory diseases because the NHIS covers more medical expenditure for these special diseases. Among the special codes, V193 and V194 indicate malignant neoplasms. The types of malignant neoplasms are defined according to the combination of the V193 or V194 codes and the main diagnostic codes, which are based on the International Classification of Diseases (ICD)-10th edition codes for cancer (C00–C99). Prostate cancer is defined as both the C61 and the V193 or V194. This strict definition minimizes selection bias against false positives or negatives.

2.3. Prediction Variables

Variables obtained from health check-ups and personal medical records were selected as potential predictors of mortality in prostate cancer patients. Age, BMI, socioeconomic status, and lifestyle factors such as smoking, alcohol intake, and physical activity are associated with death in patients with various types of cancer, including prostate cancer [9,10,11,12,13]. Patients with prostate cancer often die from non-prostate cancer-related causes, such as cardiovascular disease [15]. Thus, the risk factors for cardiovascular diseases should be considered when developing a prognostic model. Age, BMI, blood pressure, fasting glucose, ALT, GGT, total cholesterol, and lifestyle factors were controlled in the present study [16,17]. These variables are available in the health check-up database from the NHID and commonly measured in primary care settings.
Continuous variables included age, BMI, systolic blood pressure (SBP), and laboratory test results (fasting glucose, ALT, GGT, and total cholesterol levels). Categorical variables included health behaviors (cigarette smoking, alcohol intake, and physical activity), economic status (based on annual household income), and the Charlson Comorbidity Index (CCI) score. Patients were categorized as never (those who had never smoked cigarettes), former (those who had smoked cigarettes in the past, but not at the time of enrollment), or current smokers (those who currently smoked cigarettes at the time of enrollment). Patients were classified as rare, moderate, or heavy drinkers. Rare drinkers were individuals who consumed alcohol ≤ three times per month, moderate drinkers were those who consumed alcohol one to four times per week, and heavy drinkers were those who consumed alcohol five or more times per week. Physical activity was categorized into low, moderate, and high levels. A low level was defined as engaging in physical activity < two times per week; moderate, three or four times per week; and high, five or more times per week. In terms of annual household income, patients were categorized into five groups according to insurance premiums.
The CCI is used to categorize patients according to their comorbidities based on the ICD codes recorded in administrative data [18]. Each comorbidity has weighted values from 1 to 6 based on the adjusted risk of death or associated resource use. The sum of all the weights provides a single comorbidity score for each patient. A score of zero means that no serious comorbidities are detected. A higher CCI score indicates a greater likelihood that a condition will result in death or higher medical resource use [19]. In this study, patients were categorized into four groups based on the CCI score (0, 1, 2, and 3 or more).

2.4. Statistical Analysis

Continuous variables (age, BMI, SBP, fasting glucose, ALT, GGT, and total cholesterol levels) are presented as the mean ± standard deviation. Categorical variables (cigarette smoking, alcohol intake, physical activity, annual household income, and CCI score) are expressed as the number of patients (percentage).
A Cox proportional hazards regression model was used to develop a prediction model for the 5-year mortality of prostate cancer patients [20,21]. Patients were censored on the date of their last clinic visit if they did not die during the study period. Potential risk factors, such as age, BMI, SBP, laboratory results (fasting glucose, total cholesterol, ALT, and GGT levels), lifestyle behaviors (smoking, alcohol intake, and physical activity), economic status, and CCI score, were adjusted in the Cox proportional hazards regression model. Except for the CCI score, these variables were collected from the National Health Check-Up Program. The proportional hazards assumption was verified by investigating the Schoenfeld residuals. Additionally, the logarithm of the cumulative hazard function was estimated based on Kaplan–Meier curves.
The 5-year mortality probability for a patient with prostate cancer can be estimated using the following equation:
P s u r v i v a l   t i m e t = 1 S 0 ( t ) e x p β 1 x 1 M 1 + β 2 x 2 M 2 + + β p ( x p M p )
where S0(t) is the survival rate at the mean value of the risk factor at time t. In this study, the 5-year survival rate, S0(5), was estimated to be 0.820. βj and Mj (j = 1, 2, … p = 12) are the regression coefficients in the Cox proportional hazards model and the sample mean of the j-th risk factor, respectively.
We evaluated the performance of the model using 10-fold cross-validation. K-fold cross-validation was used to divide the data into K-folds of the same size, fit the prediction model with K − 1 folds as the training set, and measure the predictive performance of the fitted model with the remaining folds as the test set. Every fold is used as a test set once, and in this study, the predictive performance was measured using the average area under the receiver operating characteristic curve (AUC) [22].

3. Results

Table 1 presents the baseline patient characteristics. The number of 1000 person-years of follow-up was 49.16, with a median follow-up duration of 7.42 years. The mean age and BMI were 69.13 years and 23.92 kg/m2, respectively. The percentage of current smokers and heavy drinkers was 21.0% and 21.5%, respectively. The proportions of individuals who were engaged in low, moderate, and high physical activity were 71.9%, 5.2%, and 22.9%, respectively. Patients with multiple comorbid diseases (CCI 2 and ≥3) comprised 6.8% and 4.9% of the study population, respectively.
Table 2 summarizes the survival rates of prostate cancer patients after the initial diagnosis. The mean survival probabilities 1, 3, and 5 years after prostate cancer diagnosis were 0.955, 0.882, and 0.820, respectively.
The β-coefficients in the survival model function (1) and the hazard ratios in the prostate cancer survival models are presented in Table 3. Age, fasting glucose and GGT levels, current smoking, and CCI score were positively associated with mortality, whereas BMI, ALT level, total cholesterol level, alcohol intake, physical activity, and household income were inversely associated with mortality. Prediction models for the 5-year survival probability were constructed from the mean values in Table 1, the 5-year survival rate in Table 2, and the β-coefficients in Table 3. The risk prediction formulae for the 5-year survival of prostate cancer patients are as follows:
(1) S u r v i v a l   m o d e l   f u n c t i o n   f o r   m e n   ( S M F M ) = 0.0777 × ( a g e 69.13 ) 0.0477 × ( B M I 23.92 ) + 0.0009 × ( S B P 127.89 ) + 0.0023 × ( f a s t i n g   g l u c o s e   c o n c e n t r a t i o n 105.09 ) 0.0020 × ( A L T   c o n c e n t r a t i o n 25.09 ) + 0.0022 × ( G G T   c o n c e n t r a t i o n 44.68 ) 0.0023 × ( t o t a l   c h o l e s t e r o l   c o n c e n t r a t i o n 186.67 ) 0.0082 × ( f o r m e r   s m o k e r 0.352 ) + 0.4020 × ( c u r r e n t   s m o k e r 0.210 ) 0.1563 × ( m o d e r a t e   d r i n k e r 0.268 ) 0.1189 × ( h e a v y   d r i n k e r 0.215 ) 0.2592 × ( m o d e r a t e   e x e r c i s e r 0.052 ) 0.2165 × ( h e a v y   e x e r c i s e r 0.229 ) 0.0505 × ( l o w   i n c o m e 0.161 ) 0.1126 × ( m i d l o w 0.202 ) 0.1753 × ( m i d h i g h   i n c o m e 0.240 ) 0.2905 × ( h i g h   i n c o m e 0.219 ) + 0.1556 × ( C C I 1 0.212 ) + 0.5532 × ( C C I 2 0.068 ) + 1.0092 × ( C C I 3 0.049 )
(2) SMFM1 = exp (SMFM)
(3) 5-year survival probability = 1 − 0.820SMFM1
The performance of the prediction model was measured via 10-fold cross-validation. Figure 2 displays the 10 receiver operating characteristic curves and the mean curve. The mean AUC was 0.71, suggesting that the model had moderately good predictive performance.

4. Discussion

In this study, a 5-year survival probability model for patients newly diagnosed with prostate cancer was developed using a nationwide insurance claims database. This model had good predictive performance based on the national health check-up data available in the primary care setting.
The age-standardized incidence rate of prostate cancer in Western and African countries is higher than that in Asian countries (62.1 in Europe, 73.7 in North America, 26.6 in Africa, and 11.5 in Asia per 100,000 individuals) [23]. Prostate cancer is the most common cancer in Japan, the United States (except skin cancer), and the United Kingdom, but ranks fourth in Korea [24]. Although the incidence rate in Korea is lower than that in other countries, it has steadily increased [4]. As many Koreans adopt westernized lifestyles and foods and grow older, the incidence will increase. The American Urology Association recommends PSA-based prostate cancer screening in combination with shared decision making for early detection [25]. However, the NHIS does not provide prostate cancer screening, as the National Cancer Screening Programs contain five types of cancer (stomach, liver, colorectum, breast, and uterine cervix) [5]. PSA-based prostate cancer screening can significantly increase the number of undiagnosed prostate cancer patients. If a PSA-based prostate cancer screening program is included in the National Cancer Screening Programs, the incidence of prostate cancer will increase significantly.
The recent 5-year relative survival rate for prostate cancer is 96.0% in Korea [4]. For these reasons, prostate cancer survivors will considerably increase in the near future.
Prostate cancer patients are more likely to die from cardiac, cerebrovascular, and chronic obstructive pulmonary diseases than from prostate cancer [26,27]. This likelihood is higher in patients with early-stage prostate cancer or in those who have survived a long time after diagnosis. Predicting the survival probability of prostate cancer patients is important for planning long-term care. If prostate cancer patients are expected to survive for a long time, their primary doctors should focus on preventing cardio-cerebrovascular diseases and second primary cancers as well as relieving their symptoms.
Although the 5-year relative survival rate among Korean prostate cancer patients diagnosed in 2017–2021 reached 96.0%, the survival rates 1, 3, and 5 years after the initial diagnosis were 95.5%, 88.2%, and 82.0%, respectively, in this cohort. This survival rate is comparable to that reported in another Korean study by Park et al. (2021) [28]. One reason for the possible underestimation in the present study is that patients diagnosed with prostate cancer between 2005 and 2016 were included. Other Korean studies reported a prognostic model [28,29]. Woo et al. developed a 5-year survival probability model for stomach cancer patients who underwent a gastrectomy [30]. This model is strong due to the inclusion of histopathological data, information about lymph node involvement, and surgical details; however, the reliance on detailed medical information makes it difficult for patients and primary care doctors to apply this model. Park et al. (2021) calculated the 5-year conditional relative survival and demonstrated competing mortality among patients with prostate cancer using a large cohort of >81,000 patients [28]. However, it is inconvenient to use this model in primary clinics because a numerical prognostic model was not developed [28]. Koo et al. (2020) developed an individualized survival prognostic model for patients newly diagnosed with prostate cancer [29]. The model was developed based on an artificial neural network model and included pathological stage, Gleason score, PSA levels, and anti-cancer treatment details such as surgery, radiation, androgen deprivation therapy, and performance scales [29]. While this model may more accurately predict survival outcomes compared to our model, it does not account for laboratory results, economic status, blood pressure, and lifestyle behaviors. In contrast, our model included other variables such as age, BMI, laboratory results, lifestyle behaviors, and economic status. These variables are easily available to nonmedical personnel or primary care doctors. In addition, these are commonly associated with cancer incidence, cardiovascular events, and mortality. Thus, patients with prostate cancer who are not medical personnel, their families, or primary care doctors are able to predict the 5-year survival probability, even without access to pathological and anti-cancer treatment details.
This study had several strengths. This retrospective cohort study included all patients newly diagnosed with prostate cancer between 2005 and 2016 based on nationwide insurance claims data. The Korea National Health Insurance Corporation (NHIC) is a non-profit organization and the single insurer that operates the NHIS to which all citizens must subscribe. The NHIC collects information from the NHIS and saves it in the NHID. The NHID contains nationwide claims and health check-up results, including anthropometric data and personal and familial medical histories. All data used in this study were obtained from the NHID. Thus, the study population is representative of Korean men with prostate cancer. Various confounders that may influence survival but are useful in primary care settings were included. Economic status, a major determinant of medical service use and accessibility, was considered when a patient’s survival probability was determined. Economic status was considered in our formula. Economic deprivation is associated with a higher cancer incidence and mortality [31,32,33]. Comorbidities, which were calculated as the CCI, were also reflected in the risk prediction formula, as the CCI is a measure of comorbid conditions related to patient death or medical resource utilization [19]. In addition, anthropometric data, laboratory results, and lifestyle behaviors were included in the formulae. Thus, this prognostic model for predicting the 5-year survival probability offsets the effects of unexpected confounders on the survival probability of prostate cancer patients.
However, limitations of this study should be considered when interpreting the results. First, histopathology, stage of prostate cancer, and treatment details, such as surgery, chemotherapy, hormonal therapy, and radiation therapy, were not available in the NHID. Although the inclusion of this information would allow for a more accurate prognosis, it would be difficult to use in primary care settings. Second, the survival rate relative to the time of cancer diagnosis was not exactly reflected because of the long inclusion period (2005–2016) and time-dependent improvement in the 5-year relative survival rate of prostate cancer (81.0% in 2001–2005 and 96.0% in 2016–2020) [4]. Third, no validation cohort was analyzed because the population size of prostate cancer patients was small. However, to mitigate the disadvantage of the lack of a validation cohort, we used 10-fold cross-validation to fit the model. Fourth, there may be imperfections in predictive modeling-based studies due to inaccurate variable measurements, insufficient clinical information, such as psychiatric problems, and a lack of supporting systems from residential areas and families. Continuous research using various population cohorts is needed to identify the most important variables related to death and to improve the accuracy of predictive models. Fifth, selection or healthy user bias should be considered. A combination of special codes for malignant neoplasms (V193 and V194) and prostate cancer based on the ICD-10 (C61) was used to minimize false positives in cancer diagnosis. However, patients with prostate cancer who underwent a national health check-up within 1 year before enrollment were included. The inclusion and exclusion criteria may have introduced bias.

5. Conclusions

A 5-year survival probability model for Korean men with prostate cancer was developed based on real-world data. This model may be useful for predicting the 5-year survival probability of prostate cancer patients in a primary care setting without the knowledge of the cancer stage, histological results, or treatment details.

Author Contributions

Conceptualization, H.-T.K. and Y.-J.K.; methodology, J.K.; software, J.K. and Y.-H.K.; validation, J.K., Y.-J.K., and H.-T.K.; formal analysis, Y.-H.K.; investigation, Y.-H.K. and H.-T.K.; data curation, Y.-H.K.; writing—original draft preparation, Y.-H.K. and H.-T.K.; supervision, H.-T.K.; funding acquisition, Y.-J.K. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by a grant of the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI), funded by the Ministry of Health & Welfare, Republic of Korea (grant number: RS-2022-KH129294). This work was supported by the 2024 Advanced Facility Fund of the University of Seoul.

Institutional Review Board Statement

This study was conducted in accordance with the Declaration of Helsinki and approved by the Institutional Review Board of Severance Hospital (IRB No. 4-2023-1318) Approval date: 28.11.2023.

Informed Consent Statement

Patient consent was waived due to not including information identifying individuals using previously collected data released to the general public.

Data Availability Statement

The data supporting this article are accessible from the National Health Insurance Service’s Open Data Portal.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

  1. Kang, M.J.; Jung, K.W.; Bang, S.H.; Choi, S.H.; Park, E.H.; Yun, E.H.; Kim, H.J.; Kong, H.J.; Im, J.S.; Seo, H.G.; et al. Cancer Statistics in Korea: Incidence, Mortality, Survival, and Prevalence in 2020. Cancer Res. Treat. 2023, 55, 385–399. [Google Scholar] [CrossRef] [PubMed]
  2. Bray, F.; Laversanne, M.; Sung, H.; Ferlay, J.; Siegel, R.L.; Soerjomataram, I.; Jemal, A. Global cancer statistics 2022; GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 2024, 74, 229–263. [Google Scholar] [CrossRef] [PubMed]
  3. Global Cancer Observatory. Available online: https://gco.iarc.who.int/media/globocan/factsheets/populations/900-world-fact-sheet.pdf (accessed on 31 August 2024).
  4. Park, E.H.; Jung, K.W.; Park, N.J.; Kang, M.J.; Yun, E.H.; Kim, H.J.; Kim, J.E.; Kong, H.J.; Im, J.S.; Seo, H.G.; et al. Cancer Statistics in Korea: Incidence, Mortality, Survival, and Prevalence in 2021. Cancer Res. Treat. 2024, 56, 357–371. [Google Scholar] [CrossRef] [PubMed]
  5. Kang, H.T. Current Status of the National Health Screening Programs in South Korea. Korean J. Fam. Med. 2022, 43, 168–173. [Google Scholar] [CrossRef] [PubMed]
  6. García-Albéniz, X.; Hsu, J.; Etzioni, R.; Chank, J.M.; Shi, J.; Dickerman, B.; Hernan, M.A. Prostate-Specific Antigen Screening and Prostate Cancer Mortality: An Emulation of Target Trials in US Medicare. JCO Clin. Cancer Inform. 2024, 8, e2400094. [Google Scholar] [CrossRef]
  7. Ilic, D.; Djulbegovic, M.; Jung, J.H.; Hwang, E.C.; Zhou, Q.; Cleves, A.; Agoritsas, T.; Dahm, P. Prostate cancer screening with prostate-specific antigen (PSA) test: A systematic review and meta-analysis. BMJ 2018, 362, k3519. [Google Scholar] [CrossRef]
  8. Pak, S.; You, D.; Jeong, I.G.; Lee, D.E.; Kim, S.H.; Joung, J.Y.; Lee, K.H.; Hong, J.H.; Kim, C.S.; Ahn, H. Cause of Mortality after Radical Prostatectomy and the Impact of Comorbidity in Men with Prostate Cancer: A Multi-institutional Study in Korea. Cancer Res. Treat. 2020, 52, 1242–1250. [Google Scholar] [CrossRef]
  9. Onerup, A.; Mehlig, K.; Ekblom-Bak, E.; Lissner, L.; Börjesson, M.; Åberg, M. Cardiorespiratory fitness and BMI measured in youth and 5-year mortality after site-specific cancer diagnoses in men-A population-based cohort study with register linkage. Cancer Med. 2023, 12, 20000–20014. [Google Scholar] [CrossRef]
  10. Han, K.T.; Kim, D.W.; Kim, W. Impact of Cardiovascular Diseases on Mortality in Gastric Cancer Patients with Preexisting Chronic Disease. Yonsei Med. J. 2022, 63, 1043–1049. [Google Scholar] [CrossRef]
  11. Hong, J.S.; Yi, S.W.; Yi, J.J.; Hong, S.; Ohrr, H. Body Mass Index and Cancer Mortality Among Korean Older Middle-Aged Men: A Prospective Cohort Study. Medicine 2016, 95, e3684. [Google Scholar] [CrossRef]
  12. Bressi, B.; Iotti, C.; Cagliari, M.; Fugazzaro, S.; Cavuto, S.; Bergamaschi, F.A.M.; Moscato, A.; Costi, S. Physical exercise habits, lifestyle behaviors, and motivation to change among men with prostate cancer: A cross-sectional study. Support Care Cancer 2022, 30, 5017–5026. [Google Scholar] [CrossRef]
  13. Coughlin, S.S. A review of social determinants of prostate cancer risk, stage, and survival. Prostate Int. 2020, 8, 49–54. [Google Scholar] [CrossRef] [PubMed]
  14. Health Insurance Review & Assessment Service. Special Diseases. Available online: https://www.hira.or.kr/dummy.do?pgmid=HIRAA030037000000&isPopupYn=Y&isNewWindow=Y&cmsurl=/cms/popup/03/03/1344228_26965.html (accessed on 31 August 2024).
  15. Matthes, K.L.; Pestoni, G.; Korol, D.; Van Hemelrijck, M.; Rohrmann, S. The risk of prostate cancer mortality and cardiovascular mortality of nonmetastatic prostate cancer patients: A population-based retrospective cohort study. Urol. Oncol. 2018, 36, 309.e15–309.e23. [Google Scholar] [CrossRef] [PubMed]
  16. Kunutsor, S.K.; Apekey, T.A.; Khan, H. Liver enzymes and risk of cardiovascular disease in the general population: A meta-analysis of prospective cohort studies. Atherosclerosis 2014, 236, 7–17. [Google Scholar] [CrossRef] [PubMed]
  17. Yun, K.E.; Shin, C.Y.; Yoon, Y.S.; Park, H.S. Elevated alanine aminotransferase levels predict mortality from cardiovascular disease and diabetes in Koreans. Atherosclerosis 2009, 205, 533–537. [Google Scholar] [CrossRef] [PubMed]
  18. Charlson, M.E.; Pompei, P.; Ales, K.L.; MacKenzie, C.R. A new method of classifying prognostic comorbidity in longitudinal studies: Development and validation. J. Chronic Dis. 1987, 40, 373–383. [Google Scholar] [CrossRef]
  19. Chan, T.C.; Luk, J.K.; Chu, L.W.; Chan, F.H. Validation study of Charlson Comorbidity Index in predicting mortality in Chinese older adults. Geriatr. Gerontol. Int. 2014, 14, 452–457. [Google Scholar] [CrossRef]
  20. Wolf, P.A.; D’Agostino, R.B.; Belanger, A.J.; Kannel, W.B. Probability of stroke: A risk profile from the Framingham Study. Stroke 1991, 22, 312–318. [Google Scholar] [CrossRef]
  21. Jee, S.H.; Park, J.W.; Lee, S.Y.; Nam, B.H.; Ryu, H.G.; Kim, S.Y.; Kim, Y.N.; Lee, J.K.; Choi, S.M.; Yun, J.E. Stroke risk prediction model: A risk profile from the Korean study. Atherosclerosis 2008, 197, 318–325. [Google Scholar] [CrossRef]
  22. Hanley, J.A.; McNeil, B.J. The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology 1982, 143, 29–36. [Google Scholar] [CrossRef]
  23. Culp, M.B.; Soerjomataram, I.; Efstathiou, J.A.; Bray, F.; Jemal, A. Recent Global Patterns in Prostate Cancer Incidence and Mortality Rates. Eur. Urol. 2020, 77, 38–52. [Google Scholar] [CrossRef] [PubMed]
  24. National Cancer Center. Annual Cancer Statistics Reports in 2021. Available online: https://ncc.re.kr/cancerStatsView.ncc?bbsnum=678&searchKey=total&searchValue=&pageNum=1 (accessed on 31 August 2024).
  25. Wei, J.T.; Barocas, D.; Carlsson, S.; Coakley, F.; Eggener, S.; Etzioni, R.; Fine, S.W.; Han, M.; Kim, S.K.; Kirkby, E.; et al. Early Detection of Prostate Cancer: AUA/SUO Guideline Part I: Prostate Cancer Screening. J. Urol. 2023, 210, 46–53. [Google Scholar] [CrossRef] [PubMed]
  26. Elmehrath, A.O.; Afifi, A.M.; Al-Husseini, M.J.; Saad, A.M.; Wilson, N.; Shohdy, K.S.; Pilie, P.; Sonbol, M.B.; Alhalabi, O. Causes of Death Among Patients With Metastatic Prostate Cancer in the US From 2000 to 2016. JAMA Netw. Open 2021, 4, e2119568. [Google Scholar] [CrossRef] [PubMed]
  27. Zaorsky, N.G.; Churilla, T.M.; Egleston, B.L.; Fisher, S.G.; Ridge, J.A.; Horwitz, E.M.; Meyer, J.E. Causes of death among cancer patients. Ann. Oncol. 2017, 28, 400–407. [Google Scholar] [CrossRef]
  28. Park, J.; Han, K.; Shin, D.W.; Park, S.H.; Shin, H.B. Conditional Relative Survival and Competing Mortality of Patients with Prostate Cancer in Korea: A Nationwide Cohort Study. Cancer Epidemiol. Biomark. Prev. 2021, 30, 326–334. [Google Scholar] [CrossRef]
  29. Koo, K.C.; Lee, K.S.; Kim, S.; Min, C.; Min, G.R.; Lee, Y.H.; Han, W.K.; Rha, K.H.; Hong, S.J.; Yang, S.C.; et al. Long short-term memory artificial neural network model for prediction of prostate cancer survival outcomes according to initial treatment strategy: Development of an online decision-making support system. World J. Urol. 2020, 38, 2469–2476. [Google Scholar] [CrossRef]
  30. Woo, Y.; Son, T.; Song, K.; Okumura, N.; Hu, Y.; Cho, G.S.; Kim, J.W.; Choi, S.H.; Noh, S.H.; Hyung, W.J. A Novel Prediction Model of Prognosis After Gastrectomy for Gastric Carcinoma: Development and Validation Using Asian Databases. Ann. Surg. 2016, 264, 114–120. [Google Scholar] [CrossRef]
  31. Chang, Y.; Hwang, S.H.; Cho, S.A.; Lee, H.; Cho, E.; Lee, J.Y. Health Inequities in Cancer Incidence According to Economic Status and Regions Are Still Existed even under Universal Health Coverage System in Korea: A Nationwide Population Based Study using National Health Insurance Database. Cancer Res. Treat. 2023, 56, 380–403. [Google Scholar] [CrossRef]
  32. Kweon, S.S.; Kim, M.G.; Kang, M.R.; Shin, M.H.; Choi, J.S. Difference of stage at cancer diagnosis by socioeconomic status for four target cancers of the National Cancer Screening Program in Korea: Results from the Gwangju and Jeonnam cancer registries. J. Epidemiol. 2017, 27, 299–304. [Google Scholar] [CrossRef]
  33. Khang, Y.H.; Kim, H.R. Socioeconomic Inequality in mortality using 12-year follow-up data from nationally representative surveys in South Korea. Int. J. Equity Health 2016, 15, 51. [Google Scholar] [CrossRef]
Figure 1. Flowchart of study population.
Figure 1. Flowchart of study population.
Jpm 14 01058 g001
Figure 2. Area under receiver operating characteristic curves via 10-fold cross-validation.
Figure 2. Area under receiver operating characteristic curves via 10-fold cross-validation.
Jpm 14 01058 g002
Table 1. Participants’ characteristics at baseline.
Table 1. Participants’ characteristics at baseline.
CategoryMale
Number14,228
Age, years69.13 ± 6.97
Body mass index, kg/m223.92 ± 2.85
Systolic blood pressure, mmHg127.89 ± 15.43
Fasting glucose, mg/dL127.89 ± 27.39
Alanine aminotransferase, IU/L25.09 ± 22.49
Gamma-glutamyl transferase, IU/L44.68 ± 63.19
Total cholesterol, mg/dL186.67 ± 39.11
Smoking status, n (%)
 Never smokers6236 (43.8%)
 Former smokers5000 (35.2%)
 Current smokers2985 (21.0%)
Alcohol consumption, n (%)
 Rare7363 (51.8%)
 Moderate3808 (26.8%)
 Heavy3057 (21.5%)
Physical activity, n (%)
 Low10,229 (71.9%)
 Moderate741 (5.2%)
 High3258 (22.9%)
Household income, n (%)
 0th–20th 2542 (17.9%)
 21st–40th 2294 (16.1%)
 41st–60th 2870 (20.2%)
 61st–80th3412 (24.0%)
 81st–100th3110 (21.9%)
Charlson Comorbidity Index, n (%)
 09538 (67.0%)
 13020 (21.2%)
 2966 (6.8%)
 ≥3704 (4.9%)
Values are means ± standard deviations for continuous variables or number (%) for categorical variables.
Table 2. Survival rates among patients with prostate cancer.
Table 2. Survival rates among patients with prostate cancer.
Time since diagnosis1 year3 years5 years
Number at risk13,58512,52111,436
Number of deaths6341050880
Survival rate (95% CIs *)0.955 (0.952−0.959)0.882 (0.876−0.887)0.820 (0.813−0.826)
* Cis: confidence intervals.
Table 3. Hazard ratios and 95% confidence intervals for death among patients with prostate cancer.
Table 3. Hazard ratios and 95% confidence intervals for death among patients with prostate cancer.
CategoryCoefficientHR (95% CIs)
Age, years0.07771.081 (1.075–1.086)
Body mass index, kg/m2−0.04770.953 (0.944–0.963)
Systolic blood pressure, mmHg0.00091.001 (0.999–1.003)
Fasting glucose, mg/dL0.00231.002 (1.001–1.003)
Alanine aminotransferase, IU/L−0.00200.998 (0.996–1.000)
Gamma-glutamyl transferase, IU/L0.00221.002 (1.002–1.003)
Total cholesterol, mg/dL−0.00230.998 (0.997–0.998)
Smoking status, n (%)
 Never smokers0.000Reference
 Former smokers−0.00820.992 (0.930–1.058)
 Current smokers0.40201.495 (1.394–1.603)
Alcohol consumption, n (%)
 Rare0.000Reference
 Moderate−0.15630.855 (0.798–0.916)
 Heavy−0.11890.888 (0.826–0.955)
Physical activity, n (%)
 Low0.000Reference
 Moderate−0.25920.772 (0.681–0.875)
 High−0.21650.805 (0.751–0.864)
Household income, n (%)
 0th–20th 0.000Reference
 21st–40th−0.05050.951 (0.868–1.041)
 41st–60th −0.11260.894 (0.819–0.974)
 61st–80th−0.17530.839 (0.772–0.912)
 81st–100th−0.29050.748 (0.685–0.817)
Charlson Comorbidity Index, n (%)
 00.000Reference
 10.15561.168 (1.091–1.251)
 20.55321.202 (1.582–1.911)
 ≥31.00922.743 (2.488–3.024)
Abbreviations: CI, confidence interval; HR, hazard ratio.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Kim, J.; Kim, Y.-H.; Kim, Y.-J.; Kang, H.-T. A 5-Year Mortality Prediction Model for Prostate Cancer Patients Based on the Korean Nationwide Health Insurance Claims Database. J. Pers. Med. 2024, 14, 1058. https://doi.org/10.3390/jpm14101058

AMA Style

Kim J, Kim Y-H, Kim Y-J, Kang H-T. A 5-Year Mortality Prediction Model for Prostate Cancer Patients Based on the Korean Nationwide Health Insurance Claims Database. Journal of Personalized Medicine. 2024; 14(10):1058. https://doi.org/10.3390/jpm14101058

Chicago/Turabian Style

Kim, Joungyoun, Yong-Hoon Kim, Yong-June Kim, and Hee-Taik Kang. 2024. "A 5-Year Mortality Prediction Model for Prostate Cancer Patients Based on the Korean Nationwide Health Insurance Claims Database" Journal of Personalized Medicine 14, no. 10: 1058. https://doi.org/10.3390/jpm14101058

APA Style

Kim, J., Kim, Y. -H., Kim, Y. -J., & Kang, H. -T. (2024). A 5-Year Mortality Prediction Model for Prostate Cancer Patients Based on the Korean Nationwide Health Insurance Claims Database. Journal of Personalized Medicine, 14(10), 1058. https://doi.org/10.3390/jpm14101058

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop