Machine Learning Approaches to Predict Chronic Lower Back Pain in People Aged over 50 Years

Shim, Jae-Geum; Ryu, Kyoung-Ho; Cho, Eun-Ah; Ahn, Jin Hee; Kim, Hong Kyoon; Lee, Yoon-Ju; Lee, Sung Hyun

doi:10.3390/medicina57111230


by Jae-Geum Shim, Kyoung-Ho Ryu, Eun-Ah Cho, Jin Hee Ahn, Hong Kyoon Kim, Yoon-Ju Lee and Sung Hyun Lee *

Department of Anesthesiology and Pain Medicine, Kangbuk Samsung Hospital, Sungkyunkwan University School of Medicine, Seoul 03181, Korea

* Author to whom correspondence should be addressed.

Medicina 2021, 57(11), 1230; https://doi.org/10.3390/medicina57111230

Submission received: 22 October 2021 / Accepted: 9 November 2021 / Published: 11 November 2021


Abstract

Background and Objectives: Chronic lower back pain (LBP) is a common clinical disorder. The early identification of patients who will develop chronic LBP would help develop preventive measures and treatment. We aimed to develop machine learning models that can accurately predict the risk of chronic LBP. Materials and Methods: Data from the Sixth Korea National Health and Nutrition Examination Survey conducted in 2014 and 2015 (KNHANES VI-2, 3) were screened to identify respondents with chronic LBP. LBP lasting >30 days in the past 3 months was defined as chronic LBP in the survey. The following classification models with machine learning algorithms were developed and validated to predict chronic LBP: logistic regression (LR), k-nearest neighbors (KNN), naïve Bayes (NB), decision tree (DT), random forest (RF), gradient boosting machine (GBM), support vector machine (SVM), and artificial neural network (ANN). The performance of these models was compared with respect to the area under the receiver operating characteristic curve (AUROC). Results: A total of 6119 patients were analyzed in this study, of whom 1394 had chronic LBP. The feature-selected data consisted of 13 variables. The LR, KNN, NB, DT, RF, GBM, SVM, and ANN models showed performances (in terms of AUROCs) of 0.656, 0.656, 0.712, 0.671, 0.699, 0.660, 0.707, and 0.716, respectively, with ten-fold cross-validation. Conclusions: In this study, the ANN model was identified as the best machine learning classification model for predicting the occurrence of chronic LBP. Therefore, machine learning could be effectively applied in the identification of populations at high risk of chronic LBP.

Keywords: chronic lower back pain; machine learning; artificial neural network; logistic regression; k-nearest neighbors; naïve Bayes; decision tree; random forest; gradient boosting machine; support vector machine; prediction

1. Introduction

Lower back pain (LBP) is one of the most common musculoskeletal disorders and affects people of all ages [1]. In the United States, around 60–80% of the general population experiences LBP at least once in their lifetime [2,3]. Globally, LBP is the leading cause of years lived with disability, a burden reported to have increased substantially in the Global Burden of Disease, Injuries, and Risk Factors Study 2017 [4,5]. LBP causes significant personal and social losses in terms of reduced productivity and increased health care costs [6]. Although most patients with acute LBP recover well within a few weeks or months, approximately one-quarter of those who present to primary care settings develop chronic LBP (pain lasting for >3 months) [7]. Therefore, understanding the risk factors for chronic LBP and the populations prone to developing it can help identify people at high risk and implement suitable preventive or treatment measures.

Machine learning is a scientific discipline that uses computer algorithms to identify patterns in large amounts of data, which can also be used to make predictions based on novel datasets [8]. Machine learning has shown excellent performance in improving the predictive value of statistics in medical imaging and postoperative clinical outcomes [9,10,11,12,13]. Although there have been studies in the past attempting to predict LBP risk, previous research had limitations, such as only applying the Cox proportional-hazards model or not incorporating psychosocial factors and ergonomics-related variables [14].

To our knowledge, no previous study has used machine learning to predict the occurrence of LBP. Therefore, we undertook this study to develop and validate a selection of machine learning models for predicting chronic LBP.

2. Materials and Methods

2.1. Data Collection

Data from respondents who participated in the Korea National Health and Nutrition Examination Surveys (KNHANES) VI-2 and VI-3 (2014–2015) were retrospectively analyzed. The KNHANES is a nationwide, cross-sectional survey conducted annually by the Korea Centers for Disease Control and Prevention using a multistage, stratified, clustered, random sampling method [15]. It collects demographic and clinical data, including sex, geographic information, and age, to assess the health and nutritional status of the Korean population [16]. Individuals under 50 years of age were excluded because the KNHANES VI-2 and VI-3 did not evaluate or provide data related to LBP in this age group. Therefore, 6119 respondents aged 50–89 years who participated in the chronic LBP examination survey were included in the study.

2.2. Clinical Data and Outcomes

We collected data on all patients’ demographic and clinical characteristics from the KNHANES VI-2 and VI-3. Twenty-five predictor variables were collected and used in our proposed models. Patient demographic variables included age, sex, body mass index (BMI), occupation, education level, household income, and marital status. Comorbidity variables included hypertension, diabetes mellitus, hyperlipidemia, ischemic heart disease, cerebrovascular disease, osteoarthritis, and rheumatoid arthritis. Psychosocial variables included depression symptoms, stress, sleep duration, smoking status, and alcohol intake status. We also collected data on sitting time, physical activity, and fasting blood glucose levels. Chronic LBP was defined by a simple survey response to a question about experiencing LBP lasting >30 days in the past 3 months. Sitting time was divided into two categories, >7 h and <7 h, based on the median (7 h). Physical activity was divided into two categories based on the response “yes” to the question: “Does your job involve medium-intensity physical activity that lasts for at least 10 min or makes your heart beat slightly faster?”. Stress was divided into two categories based on responses to the question “How much stress do you feel in your daily life?”. Smoking status and alcohol intake status were each divided into two categories, depending on whether the participants usually smoked or drank.

2.3. Statistical Analyses

R software version 3.6.1 (R Development Core Team, Vienna, Austria) was used for the analysis. The following packages for machine learning were used: Caret (https://CRAN.R-project.org/package=caret, accessed on 10 September 2021), Xgboost (https://CRAN.R-project.org/package=xgboost, accessed on 11 August 2021), and Keras (https://CRAN.R-project.org/package=keras, accessed on 11 August 2021). The Caret package was used for logistic regression, k-nearest neighbor, naïve Bayes, decision tree, random forest, and support vector machine. The Xgboost package was used for gradient boosting machine. The Keras package was used for the artificial neural network (ANN). The entire code of our machine learning algorithm (https://github.com/jgshim/chronicLBP, accessed on 11 August 2021) is freely available.

Before constructing the machine learning models, the collected data were randomly split into training and test sets: 70% of the data were used for training the prediction models, and 30% were held out as the test set for verification. A 10-fold cross-validation approach was used to choose a set of optimal hyperparameters. Missing data were estimated using a nearest neighbor imputation algorithm, a similarity-based method that relies on distance metrics to fill in missing values [17]. Because chronic LBP had a low incidence, the synthetic minority oversampling technique (SMOTE), a method for addressing imbalanced datasets, was used to oversample the minority class in the training set [18].
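The split and oversampling steps described above can be sketched as follows. This is an illustrative Python sketch, not the authors' R code: the function names `stratified_split` and `smote`, the stratification by class, the neighbour count `k = 5`, and the scaled-down toy data are all assumptions made for the example.

```python
import random

def stratified_split(labels, train_frac=0.7, seed=0):
    """Return (train_idx, test_idx), keeping the class ratio in both parts."""
    rng = random.Random(seed)
    train_idx, test_idx = [], []
    for cls in sorted(set(labels)):
        idx = [i for i, y in enumerate(labels) if y == cls]
        rng.shuffle(idx)
        cut = int(train_frac * len(idx))
        train_idx += idx[:cut]
        test_idx += idx[cut:]
    return sorted(train_idx), sorted(test_idx)

def smote(minority_rows, n_new, k=5, seed=0):
    """Create n_new synthetic rows by interpolating between a minority row
    and one of its k nearest minority neighbours (squared Euclidean distance)."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority_rows)
        others = [r for r in minority_rows if r is not base]
        others.sort(key=lambda r: sum((a - b) ** 2 for a, b in zip(base, r)))
        neighbour = rng.choice(others[:k])
        gap = rng.random()  # position on the segment from base to neighbour
        synthetic.append([a + gap * (b - a) for a, b in zip(base, neighbour)])
    return synthetic

# Toy data: 473 negatives and 139 positives (roughly the 22.8% prevalence
# reported in the study); the two features are random placeholders.
rng = random.Random(1)
labels = [0] * 473 + [1] * 139
rows = [[rng.gauss(y, 1.0), rng.gauss(0.0, 1.0)] for y in labels]

train_idx, test_idx = stratified_split(labels)
train_pos = [rows[i] for i in train_idx if labels[i] == 1]
train_neg = [rows[i] for i in train_idx if labels[i] == 0]

# Oversample only the training minority class; the test set stays untouched.
new_rows = smote(train_pos, n_new=len(train_neg) - len(train_pos))
print(len(train_idx), len(test_idx), len(train_pos) + len(new_rows), len(train_neg))
```

Oversampling is applied after the split so that no synthetic row derived from a test case can leak into training.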

We identified 25 potential features, including demographic and clinical variables, based on previous studies of factors that may affect LBP risk. Feature selection is the process of selecting the features that contribute most to the output prediction so that the machine learning algorithms function efficiently [19,20]. We used recursive feature elimination (RFE), a wrapper-type feature selection algorithm. RFE, implemented here with the random forest function from the Caret package as the core model, fits the model, ranks the features by importance, and removes the least important features; this is repeated until a specified number of features remains, as seen in Figure 1. Only the subset of features retained by RFE was used to construct the machine learning models.
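The elimination loop at the heart of RFE can be sketched in a few lines of Python. This is a hedged illustration only: the study ranked features with a random forest in R, whereas the stand-in `univariate_importance` below uses the absolute Pearson correlation with the label, and the feature names and toy data are hypothetical. With a real model the importances are recomputed on each round because they depend on which features remain; with this univariate stand-in they do not change between rounds.

```python
import random
from statistics import mean

def univariate_importance(columns, y):
    """Stand-in ranking (assumption): |Pearson r| of each feature with the label.
    In the study this role is played by random forest variable importance."""
    my = mean(y)
    vy = sum((v - my) ** 2 for v in y)
    scores = {}
    for name, xs in columns.items():
        mx = mean(xs)
        cov = sum((a - mx) * (b - my) for a, b in zip(xs, y))
        vx = sum((a - mx) ** 2 for a in xs)
        scores[name] = abs(cov) / (vx * vy) ** 0.5 if vx and vy else 0.0
    return scores

def rfe(columns, y, n_keep, importance=univariate_importance):
    """Repeatedly rank the remaining features and drop the least important one."""
    remaining = dict(columns)
    eliminated = []
    while len(remaining) > n_keep:
        scores = importance(remaining, y)  # re-rank on the current subset
        worst = min(scores, key=scores.get)
        eliminated.append(worst)
        del remaining[worst]
    return list(remaining), eliminated

# Hypothetical toy data: two informative features and two pure-noise features.
rng = random.Random(0)
y = [i % 2 for i in range(500)]
columns = {
    "signal": [2.0 * v + rng.gauss(0, 1) for v in y],
    "weak": [0.8 * v + rng.gauss(0, 1) for v in y],
    "noise_a": [rng.gauss(0, 1) for _ in y],
    "noise_b": [rng.gauss(0, 1) for _ in y],
}
kept, dropped = rfe(columns, y, n_keep=2)
print("kept:", kept, "dropped:", dropped)
```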

Model performance was evaluated using the area under the receiver operating characteristic curve (AUROC), accuracy, sensitivity, and specificity. The receiver operating characteristic curve of each machine learning model was plotted using the test dataset, because the AUROC is a strong indicator of classifier performance in imbalanced datasets [21,22]. In addition to simple cross-validation, nested cross-validation was used to obtain an unbiased estimate of generalization performance, even though our data come from a nationwide study. Nested cross-validation consists of a double loop. The inner loop selects hyperparameters by fitting models to each training set and evaluating them on the corresponding validation set. The outer loop estimates the generalization error by averaging the test set scores over several dataset splits.
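The double loop can be illustrated with a small, self-contained Python sketch. Everything here is an assumption made for illustration: the one-feature toy data, the k-nearest-neighbours scorer `knn_scores`, the candidate grid, and the three inner folds are not taken from the study (Table 3 indicates five outer folds). The AUROC is computed as the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative case, with ties counted as one half.

```python
import random
from statistics import mean

def auroc(scores, labels):
    """Rank-based AUROC: P(score of a positive > score of a negative)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def knn_scores(train, test_x, k):
    """Score = share of positives among the k nearest training points."""
    out = []
    for x in test_x:
        nearest = sorted(train, key=lambda r: abs(r[0] - x))[:k]
        out.append(sum(y for _, y in nearest) / k)
    return out

def folds(n, k, seed):
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]

def evaluate(data, held, k_neighbours):
    held_set = set(held)
    train = [d for i, d in enumerate(data) if i not in held_set]
    test = [data[i] for i in held]
    scores = knn_scores(train, [x for x, _ in test], k_neighbours)
    return train, auroc(scores, [y for _, y in test])

def cv_auroc(data, k_neighbours, n_folds, seed):
    return mean(evaluate(data, held, k_neighbours)[1]
                for held in folds(len(data), n_folds, seed))

def nested_cv(data, grid, outer=5, inner=3, seed=0):
    outer_scores = []
    for held in folds(len(data), outer, seed):
        train, _ = evaluate(data, held, grid[0])
        # Inner loop: choose the hyperparameter using the outer training part only.
        best_k = max(grid, key=lambda k: cv_auroc(train, k, inner, seed + 1))
        # Outer loop: score the chosen setting on the untouched outer fold.
        outer_scores.append(evaluate(data, held, best_k)[1])
    return outer_scores

assert auroc([0.9, 0.8, 0.3, 0.1], [1, 0, 1, 0]) == 0.75

rng = random.Random(0)
data = [(rng.gauss(y, 1.0), y) for y in [0, 1] * 100]
outer_scores = nested_cv(data, grid=[3, 7, 15])
print([round(s, 3) for s in outer_scores], round(mean(outer_scores), 3))
```

Because the hyperparameter is chosen without ever seeing the outer fold, the averaged outer score is not inflated by the tuning step.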

2.4. Ethics Statement

The VI-2 and VI-3 versions of the KNHANES were approved by the Institutional Review Board of the Korea Centers for Disease Control and Prevention (approval no. 2013-12EXP-03-5C and 2015-01-02-6C) and complied with the Declaration of Helsinki. Each participant voluntarily provided written informed consent before participating in this study. Additionally, the Institutional Review Board of Kangbuk Samsung Hospital waived the need for approval because the KNHANES survey data are openly published (approval no. KBSMC 2020-07-001).

3. Results

3.1. Patients’ Characteristics

We analyzed the data of 6119 patients who participated in the KNHANES VI-2 and VI-3 from 1 January 2014 to 31 December 2015. A total of 1394 patients (22.8%) experienced chronic LBP. The demographic and patient characteristics of the complete dataset are summarized in Table 1.

3.2. Feature Selection

The input variables after RFE included age, sex, BMI, household income, diabetes mellitus, hyperlipidemia, ischemic heart disease, osteoarthritis, depression symptoms, smoking status, physical activity, sitting time, and fasting blood glucose levels. The 13 features following final feature selection were used as input variables in creating the machine learning models for predicting the occurrence of chronic LBP. Correlation analyses showed a weak positive correlation between age, osteoarthritis, and chronic LBP (Figure 2).

3.3. Model Performance

When the models were applied to the test dataset to predict chronic LBP, the AUROCs were 0.656 (95% CI, 0.634–0.678) for logistic regression, 0.656 (95% CI, 0.628–0.685) for k-nearest neighbors, 0.712 (95% CI, 0.685–0.740) for naïve Bayes, 0.671 (95% CI, 0.643–0.698) for decision tree, 0.699 (95% CI, 0.671–0.728) for random forest, 0.660 (95% CI, 0.631–0.690) for gradient boosting machine, 0.707 (95% CI, 0.678–0.735) for support vector machine, and 0.716 (95% CI, 0.689–0.744) for ANN, as seen in Table 2. The ANN model achieved the highest AUROC, accuracy, and specificity (Figure 3), whereas its sensitivity (0.84) was slightly lower than that of the decision tree and support vector machine models (0.85). The results of nested cross-validation are shown in Table 3.
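For reference, accuracy, sensitivity, and specificity all follow from the four cells of a confusion matrix. The Python sketch below uses made-up counts chosen only to keep the arithmetic easy to check; they are not the study's confusion matrix, and the function name `classification_metrics` is hypothetical.

```python
def classification_metrics(tp, fp, tn, fn):
    """Standard definitions from confusion-matrix counts."""
    return {
        "accuracy": (tp + tn) / (tp + fp + tn + fn),
        "sensitivity": tp / (tp + fn),   # true positive rate
        "specificity": tn / (tn + fp),   # true negative rate
    }

# Hypothetical counts: 100 actual positives and 100 actual negatives.
metrics = classification_metrics(tp=80, fp=40, tn=60, fn=20)
print(metrics)  # {'accuracy': 0.7, 'sensitivity': 0.8, 'specificity': 0.6}
```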

4. Discussion

Previous reports have highlighted a lack of well-validated models for predicting LBP [23]. Well-verified risk prediction models help in the identification of patients at a high risk of disease and help in the implementation of preventive measures in advance. The objective of this study was to demonstrate that machine learning algorithms could accurately predict the occurrence of chronic LBP.

Feature selection is an important concept in machine learning, especially when dealing with a dataset that contains numerous features. Such high-dimensional datasets pose a multitude of problems, including long training times for machine learning models. The objective of feature selection is to improve the prediction performance of the predictors and to aid understanding of the underlying structure of the dataset. In our study, analysis and modeling with RFE facilitated the identification of patients at a high risk of LBP and the determination of clinical factors associated with chronic LBP. Previous studies employed the Cox proportional-hazards model to identify patients at a high risk of LBP [14,24]. However, only one model’s performance was obtained, so different models could not be compared. Another previous study applied stepwise logistic regression analysis to predict whether a patient with a recent new episode of LBP would develop persistent pain [25]. Stepwise methods have well-known limitations, such as unstable variable selection and biased coefficient estimation. In this study, we developed and validated our models by performing feature selection, cross-validation, and testing using different machine learning algorithms. Thus, we anticipate that the effective implementation of machine learning methods in clinical settings may facilitate the provision of personalized medicine to patients with chronic LBP in the future.

The handling of missing data is a major concern in machine learning across application domains, including medicine. In this study, we applied the nearest neighbor imputation algorithm to estimate missing values rather than deleting incomplete records. However, other methods exist for imputing missing data. Recently, oversampling methods have been proposed to impute missing data or generate valid synthetic instances to train classifiers when training data are extremely scarce. Izonin et al. achieved highly accurate prediction using a data augmentation procedure and support vector regression [26]. Additionally, Salazar et al. proposed a new method using generative adversarial networks and vector Markov random fields to effectively improve classifier performance [27].
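A minimal Python sketch of nearest neighbor imputation is given below. It is an illustration under stated assumptions, not the procedure used in the study: the function name `knn_impute`, the choice of `k`, the root-mean-square distance over jointly observed columns, and filling with the neighbours' mean are all choices made for the example, and the sketch assumes numeric columns with at least one donor row per missing value.

```python
import math

def knn_impute(rows, k=3):
    """Fill None entries with the mean of that column over the k nearest rows.
    Distance uses only the columns observed in both rows."""
    def distance(a, b):
        shared = [(x, y) for x, y in zip(a, b) if x is not None and y is not None]
        if not shared:
            return math.inf
        return math.sqrt(sum((x - y) ** 2 for x, y in shared) / len(shared))

    filled = [list(r) for r in rows]
    for i, row in enumerate(rows):
        for j, value in enumerate(row):
            if value is not None:
                continue
            # Donors are other rows in which column j was actually observed.
            donors = [r for m, r in enumerate(rows) if m != i and r[j] is not None]
            donors.sort(key=lambda r: distance(row, r))
            nearest = donors[:k]
            filled[i][j] = sum(r[j] for r in nearest) / len(nearest)
    return filled

# Toy example: the last row is missing its second value.
rows = [[1.0, 10.0], [2.0, 20.0], [3.0, 30.0], [2.1, None]]
filled = knn_impute(rows, k=2)
print(filled[3])  # [2.1, 25.0]: mean of the two closest donors (20.0 and 30.0)
```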

Each machine learning algorithm has its own hyperparameters, that is, parameters that are set before the learning process begins, such as the number of hidden layers in an ANN or the number of features available for splitting at each tree node in a random forest [11]. In our study, we found that the optimal ANN for predicting the occurrence of chronic LBP was a multilayer perceptron with two hidden layers. The first and second hidden layers included 20 and 10 nodes, respectively, which were interconnected. Since optimal hyperparameters often have to be specified by the researcher or set using heuristics when constructing an ANN model, we obtained suitable hyperparameters empirically, as seen in Appendix A. The hyperparameters found in this study could be useful for further research using the ANN method.
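To make the architecture concrete, the following Python sketch builds a multilayer perceptron with hidden layers of 20 and 10 nodes and runs one forward pass. Only the two hidden-layer sizes come from the text. The 13 inputs (one per selected feature), the single sigmoid output unit, the ReLU hidden activations, and the random untrained weights are assumptions for illustration; the study's network was built with Keras in R and its other settings are not stated here.

```python
import math
import random

def relu(z):
    return max(0.0, z)

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def init_layer(n_in, n_out, rng):
    """Random weights and zero biases (untrained, for shape illustration only)."""
    weights = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out

def dense(inputs, weights, biases, activation):
    """Fully connected layer: every node sees every input."""
    return [activation(sum(w * x for w, x in zip(row, inputs)) + b)
            for row, b in zip(weights, biases)]

rng = random.Random(0)
sizes = [13, 20, 10, 1]  # inputs (assumed), hidden 1, hidden 2, output (assumed)
layers = [init_layer(a, b, rng) for a, b in zip(sizes, sizes[1:])]

def predict(x):
    h = x
    for i, (w, b) in enumerate(layers):
        is_output = i == len(layers) - 1
        h = dense(h, w, b, sigmoid if is_output else relu)
    return h[0]  # a value in (0, 1), read as a predicted probability

n_params = sum(len(w) * len(w[0]) + len(b) for w, b in layers)
print(n_params)  # 501 = (13*20 + 20) + (20*10 + 10) + (10*1 + 1)
print(predict([0.0] * 13))  # 0.5, because all biases are zero
```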

This study had certain limitations. First, it used data from a cross-sectional survey, which captures a population at one specific point in time. Thus, the temporal relationship between predictor variables and chronic LBP cannot be determined, and the sample is not guaranteed to be representative. Second, the prediction model was based on a Korean population over 50 years of age. Thus, it may be difficult to generalize our results to other age groups or to other populations, considering characteristics of Korean culture such as sitting posture and high-intensity working hours. Third, although an accuracy of 71.7% in prediction is clinically meaningful, further research is required. It is doubtful that demographic data and clinical information alone are enough to accurately predict chronic LBP, and the models will most likely need to be modified to achieve better predictive performance. One possible direction is to incorporate lumbar spine X-ray, computed tomography, or magnetic resonance imaging.

5. Conclusions

This study is important because it promotes the identification of patients at high risk of chronic LBP in a population of Koreans over 50 years of age using machine learning. Among the machine learning models that were developed and validated, the ANN model was found to be the best machine learning classification model for predicting the occurrence of chronic LBP.

Author Contributions

Conceptualization, J.-G.S. and S.H.L.; methodology, J.-G.S., J.H.A. and S.H.L.; software, J.-G.S.; validation, K.-H.R. and E.-A.C.; formal analysis, J.-G.S. and S.H.L.; data curation, H.K.K. and Y.-J.L.; writing—original draft preparation, J.-G.S. and S.H.L.; writing—review and editing, J.-G.S., H.K.K., K.-H.R., E.-A.C. and S.H.L.; visualization, Y.-J.L.; supervision, J.-G.S. and S.H.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

The VI-2 and VI-3 versions of the KNHANES were approved by the Institutional Review Board of the Korea Centers for Disease Control and Prevention (approval no. 2013-12EXP-03-5C and 2015-01-02-6C) and complied with the Declaration of Helsinki. The Institutional Review Board of Kangbuk Samsung Hospital waived the need for approval because the KNHANES survey data are openly published (approval no. KBSMC 2020-07-001 on 7 March 2020).

Informed Consent Statement

Retrospective data collection and analysis were approved by the Institutional Review Board of the Korea Centers for Disease Control and Prevention (approval no. 2013-12EXP-03-5C and 2015-01-02-6C)—informed consent statement not applicable.

Data Availability Statement

The data used to support the findings of this study are available from the corresponding author upon request.

Acknowledgments

This study was not supported by any kind of funding.

Conflicts of Interest

Jae-Geum Shim, Kyoung-Ho Ryu, Eun-Ah Cho, Jin Hee Ahn, Hong Kyoon Kim, and Sung Hyun Lee declare that they have no conflict of interest.

Appendix A

Table A1. Optimal Hyperparameters of All Machine Learning Models.

Model	Optimal Hyperparameters
LR	nIter = 21
KNN	k = 7
NB	usekernel, Laplace = 0, Adjust = 1
DT	Maximum depth = 5, Criterion = Gini index
RF	Mtry * = 3
GBM	Maximum depth = 3, Number of estimators = 50, Gamma = 0
SVM	Degree = 3, Scale = 0.1, C = 1.0
ANN	Number of hidden layers = 2, Number of nodes per layer = 20, 10

LR, logistic regression; KNN, k-nearest neighbors; NB, naïve Bayes; DT, decision tree; RF, random forest; GBM, gradient boosting machine; SVM, support vector machine; ANN, artificial neural networks. * mtry indicates the number of variables available for splitting at each tree node.

References

1. Hartvigsen, J.; Hancock, M.; Kongsted, A.; Louw, Q.; Ferreira, M.L.; Genevay, S.; Hoy, D.; Karppinen, J.; Pransky, G.; Sieper, J.; et al. What low back pain is and why we need to pay attention. Lancet 2018, 391, 2356–2367.
2. Ganesan, S.; Acharya, A.S.; Chauhan, R.; Acharya, S. Prevalence and Risk Factors for Low Back Pain in 1355 Young Adults: A Cross-Sectional Study. Asian Spine J. 2017, 11, 610–617.
3. Patrick, N.; Emanski, E.; Knaub, M.A. Acute and chronic low back pain. Med. Clin. N. Am. 2014, 98, 777–789.
4. Wu, A.; March, L.; Zheng, X.; Huang, J.; Wang, X.; Zhao, J.; Blyth, F.M.; Smith, E.; Buchbinder, R.; Hoy, D. Global low back pain prevalence and years lived with disability from 1990 to 2017: Estimates from the Global Burden of Disease Study 2017. Ann. Transl. Med. 2020, 8, 299.
5. Safiri, S.; Kolahi, A.A.; Cross, M.; Carson-Chahhoud, K.; Almasi-Hashiani, A.; Kaufman, J.; Mansournia, M.A.; Sepidarkish, M.; Ashrafi-Asgarabad, A.; Hoy, D.; et al. Global, regional, and national burden of other musculoskeletal disorders 1990–2017: Results from the Global Burden of Disease Study 2017. Rheumatology 2021, 60, 855–865.
6. Lambeek, L.C.; Bosmans, J.; Van Royen, B.J.; van Tulder, M.; Van Mechelen, W.; Anema, J.R. Effect of integrated care for sick listed patients with chronic low back pain: Economic evaluation alongside a randomised controlled trial. BMJ 2010, 341, c6414.
7. Chou, R.; Shekelle, P. Will this patient develop persistent disabling low back pain? JAMA 2010, 303, 1295–1302.
8. Motwani, M.; Dey, D.; Berman, D.S.; Germano, G.; Achenbach, S.; Al-Mallah, M.; Andreini, D.; Budoff, M.J.; Cademartiri, F.; Callister, T.Q.; et al. Machine learning for prediction of all-cause mortality in patients with suspected coronary artery disease: A 5-year multicentre prospective registry analysis. Eur. Heart J. 2016, 38, 500–507.
9. Kim, J.; Merrill, R.K.; Arvind, V.; Kaji, D.; Pasik, S.D.; Nwachukwu, C.C.; Vargas, L.; Osman, N.S.; Oermann, E.K.; Caridi, J.M.; et al. Examining the Ability of Artificial Neural Networks Machine Learning Models to Accurately Predict Complications Following Posterior Lumbar Spine Fusion. Spine 2018, 43, 853–860.
10. Lee, H.-C.; Yoon, H.-K.; Nam, K.; Cho, Y.J.; Kim, T.K.; Kim, W.H.; Bahk, J.-H. Derivation and Validation of Machine Learning Approaches to Predict Acute Kidney Injury after Cardiac Surgery. J. Clin. Med. 2018, 7, 322.
11. Lee, H.-C.; Bin Yoon, S.; Yang, S.-M.; Kim, W.H.; Ryu, H.-G.; Jung, C.-W.; Suh, K.-S.; Lee, K.H. Prediction of Acute Kidney Injury after Liver Transplantation: Machine Learning Approaches vs. Logistic Regression Model. J. Clin. Med. 2018, 7, 428.
12. Wang, Y.; Lei, L.; Ji, M.; Tong, J.; Zhou, C.-M.; Yang, J.-J. Predicting postoperative delirium after microvascular decompression surgery with machine learning. J. Clin. Anesth. 2020, 66, 109896.
13. Han, S.S.; Azad, T.D.; Suarez, P.A.; Ratliff, J.K. A machine learning approach for predictive models of adverse events following spine surgery. Spine J. 2019, 19, 1772–1781.
14. Mukasa, D.; Sung, J. A prediction model of low back pain risk: A population based cohort study in Korea. Korean J. Pain 2020, 33, 153–165.
15. Kweon, S.; Kim, Y.; Jang, M.-J.; Kim, Y.; Kim, K.; Choi, S.; Chun, C.; Khang, Y.-H.; Oh, K. Data Resource Profile: The Korea National Health and Nutrition Examination Survey (KNHANES). Int. J. Epidemiol. 2014, 43, 69–77.
16. Park, S.M.; Kim, H.J.; Jang, S.; Kim, H.; Chang, B.S.; Lee, C.K.; Yeom, J.S. Depression is Closely Associated with Chronic Low Back Pain in Patients Over 50 Years of Age: A Cross-sectional Study Using the Sixth Korea National Health and Nutrition Examination Survey (KNHANES VI-2). Spine 2018, 43, 1281–1288.
17. Beretta, L.; Santaniello, A. Nearest neighbor imputation algorithms: A critical evaluation. BMC Med. Inform. Decis. Mak. 2016, 16, 197–208.
18. Wang, Q.; Luo, Z.; Huang, J.; Feng, Y.; Liu, Z. A Novel Ensemble Method for Imbalanced Data Learning: Bagging of Extrapolation-SMOTE SVM. Comput. Intell. Neurosci. 2017, 2017, 1827016.
19. Wu, C.-C.; Hsu, W.-D.; Islam, M.; Poly, T.N.; Yang, H.-C.; Nguyen, P.-A.; Wang, Y.-C.; Li, Y.-C. An artificial intelligence approach to early predict non-ST-elevation myocardial infarction patients with chest pain. Comput. Methods Programs Biomed. 2019, 173, 109–117.
20. Wu, C.-C.; Yeh, W.-C.; Hsu, W.-D.; Islam, M.; Nguyen, P.A.; Poly, T.N.; Wang, Y.-C.; Yang, H.-C.; Li, Y.-C. Prediction of fatty liver disease using machine learning algorithms. Comput. Methods Programs Biomed. 2019, 170, 23–29.
21. Buda, M.; Maki, A.; Mazurowski, M.A. A systematic study of the class imbalance problem in convolutional neural networks. Neural Netw. 2018, 106, 249–259.
22. Mehmood, A.; Maqsood, M.; Bashir, M.; Shuyuan, Y. A Deep Siamese Convolution Neural Network for Multi-Class Classification of Alzheimer Disease. Brain Sci. 2020, 10, 84.
23. McIntosh, G.; Steenstra, I.; Hogg-Johnson, S.; Carter, T.; Hall, H. Lack of Prognostic Model Validation in Low Back Pain Prediction Studies: A Systematic Review. Clin. J. Pain 2018, 34, 748–754.
24. Hancock, M.J.; Maher, C.M.; Petocz, P.; Lin, C.-W.C.; Steffens, D.; Luque-Suarez, A.; Magnussen, J.S. Risk factors for a recurrence of low back pain. Spine J. 2015, 15, 2360–2368.
25. Traeger, A.C.; Henschke, N.; Hübscher, M.; Williams, C.M.; Kamper, S.J.; Maher, C.G.; Moseley, G.L.; McAuley, J.H. Estimating the Risk of Chronic Pain: Development and Validation of a Prognostic Model (PICKUP) for Patients with Acute Low Back Pain. PLoS Med. 2016, 13, e1002019.
26. Izonin, I.; Tkachenko, R.; Shakhovska, N.; Lotoshynska, N. The Additive Input-Doubling Method Based on the SVR with Nonlinear Kernels: Small Data Approach. Symmetry 2021, 13, 612.
27. Salazar, A.; Vergara, L.; Safont, G. Generative Adversarial Networks and Markov Random Fields for oversampling very small training sets. Expert Syst. Appl. 2021, 163, 113819.

Figure 1. Schematic representation of recursive feature elimination (RFE) in random forest algorithm.

Figure 2. Correlation between variables.

Figure 3. Areas under the receiver operating characteristic curve for test data.

Table 1. Demographic data and variable features of the included population.

Variables	All Cases (n = 6119)	No Lower Back Pain (n = 4725)	Lower Back Pain (n = 1394)	p-Value
Age (years)	64 (56–72)	62 (56–70)	69 (60–76)	<0.001
Sex (female)	3511 (57.4%)	2464 (52.1%)	1047 (75.1%)	<0.001
BMI (kg/m²)	23.9 (22.0–26.0)	23.9 (22.0–25.9)	24.1 (21.9–26.4)	0.001
Comorbidities (n)
Hypertension	3006 (49.1%)	2249 (47.6%)	757 (54.3%)	<0.001
Diabetes mellitus	1020 (16.7%)	747 (15.8%)	273 (19.6%)	<0.001
Hyperlipidemia	1449 (23.7%)	1027 (21.7%)	422 (30.3%)	<0.001
Ischemic heart disease	280 (4.6%)	182 (3.8%)	98 (7.0%)	<0.001
Cerebrovascular accident	253 (4.1%)	161 (3.4%)	92 (6.6%)	<0.001
Osteoarthritis	1294 (21.1%)	736 (15.6%)	558 (40.0%)	<0.001
Rheumatoid arthritis	163 (2.7%)	106 (2.2%)	57 (4.1%)	<0.001
Education (n)	878 (14.3%)	773 (16.4%)	105 (7.5%)	<0.001
Marital status (n)	6045 (98.8%)	4666 (98.8%)	1379 (98.9%)	0.70
Household income (n)	2636 (43.1%)	2238 (47.4%)	398 (28.6%)	<0.001
Occupation (n)				<0.001
Managers, experts	330 (5.4%)	295 (6.2%)	35 (2.5%)
Office work	213 (3.5%)	185 (3.9%)	28 (2.0%)
Sales and services	599 (9.8%)	490 (10.4%)	109 (7.8%)
Agriculture, forestry, and fishery	493 (8.1%)	370 (7.8%)	123 (8.8%)
Machine fitting	509 (8.3%)	448 (9.5%)	61 (4.4%)
Simple labor	672 (11.0%)	531 (11.2%)	141 (10.1%)
Unemployed (student, housewife, etc.)	3303 (54.0%)	2406 (50.9%)	897 (64.3%)
Sitting time (n)	2845 (46.5%)	2110 (44.7%)	735 (52.7%)	<0.001
Duration of sleep (n)	3210 (52.5%)	2548 (53.9%)	662 (47.5%)	<0.001
Smoking (n)	2402 (39.3%)	2022 (42.8%)	380 (27.3%)	<0.001
Alcohol intake (n)	4940 (80.7%)	3928 (83.1%)	1012 (72.6%)	<0.001
Depressive symptom (n)	364 (6.0%)	206 (4.4%)	158 (11.3%)	<0.001
Stress (n)	4633 (75.7%)	3515 (74.4%)	1118 (80.2%)	<0.001
Physical activity (n)	437 (7.1%)	297 (6.3%)	140 (10.0%)	<0.001
Fasting blood glucose (mg/dL)	99 (92–110)	99 (92–110)	99 (92–109)	0.69

KNHANES, The Korea National Health and Nutrition Examination Survey; BMI, body mass index. The data are presented as medians (interquartile ranges) or numbers (%).

Table 2. Performance of all machine learning models.

Model	AUROC (95% CI)	Accuracy (95% CI)	Sensitivity (95% CI)	Specificity (95% CI)
LR	0.656 (0.634–0.678)	0.608 (0.582–0.634)	0.82 (0.79–0.84)	0.36 (0.32–0.40)
KNN	0.656 (0.628–0.685)	0.631 (0.608–0.653)	0.83 (0.81–0.85)	0.35 (0.32–0.39)
NB	0.712 (0.685–0.740)	0.713 (0.692–0.733)	0.84 (0.82–0.86)	0.43 (0.39–0.47)
DT	0.671 (0.643–0.698)	0.665 (0.643–0.687)	0.85 (0.83–0.87)	0.39 (0.35–0.42)
RF	0.699 (0.671–0.728)	0.701 (0.680–0.722)	0.84 (0.81–0.86)	0.42 (0.38–0.46)
GBM	0.660 (0.631–0.690)	0.689 (0.667–0.710)	0.82 (0.80–0.84)	0.39 (0.35–0.43)
SVM	0.707 (0.678–0.735)	0.677 (0.656–0.699)	0.85 (0.83–0.87)	0.40 (0.36–0.44)
ANN	0.716 (0.689–0.744)	0.717 (0.696–0.734)	0.84 (0.82–0.86)	0.44 (0.40–0.48)

AUROC, area under the receiver operating characteristic curve; CI, confidence interval; LR, logistic regression; KNN, k-nearest neighbors; NB, naïve Bayes; DT, decision tree; RF, random forest; GBM, gradient boosting machine; SVM, support vector machine; ANN, artificial neural network.

Table 3. Nested cross validation results of all machine learning models.

Model	AUROC (k = 1)	AUROC (k = 2)	AUROC (k = 3)	AUROC (k = 4)	AUROC (k = 5)	AUROC (mean ± SD)
LR	0.690	0.607	0.679	0.637	0.651	0.653 ± 0.033
KNN	0.612	0.676	0.604	0.626	0.579	0.619 ± 0.036
NB	0.610	0.649	0.602	0.671	0.671	0.641 ± 0.033
DT	0.636	0.710	0.579	0.669	0.597	0.638 ± 0.053
RF	0.654	0.714	0.677	0.633	0.636	0.663 ± 0.034
GBM	0.538	0.612	0.661	0.637	0.628	0.615 ± 0.047
SVM	0.700	0.665	0.674	0.726	0.691	0.691 ± 0.024
ANN	0.728	0.718	0.739	0.662	0.724	0.714 ± 0.030

AUROC, area under the receiver operating characteristic curve; LR, logistic regression; KNN, k-nearest neighbors; NB, naïve Bayes; DT, decision tree; RF, random forest; GBM, gradient boosting machine; SVM, support vector machine; ANN, artificial neural network; k, number of folds in the outer loop of nested cross-validation; SD, standard deviation.
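The summary column of Table 3 can be reproduced from the five per-fold AUROCs. The short Python check below uses the values from the table; treating the reported SD as the sample standard deviation (n − 1 denominator) is an assumption, although it matches the tabulated figures to three decimals.

```python
from statistics import mean, stdev

# Per-fold AUROCs copied from Table 3 (outer folds k = 1..5).
outer_fold_auroc = {
    "LR":  [0.690, 0.607, 0.679, 0.637, 0.651],
    "KNN": [0.612, 0.676, 0.604, 0.626, 0.579],
    "NB":  [0.610, 0.649, 0.602, 0.671, 0.671],
    "DT":  [0.636, 0.710, 0.579, 0.669, 0.597],
    "RF":  [0.654, 0.714, 0.677, 0.633, 0.636],
    "GBM": [0.538, 0.612, 0.661, 0.637, 0.628],
    "SVM": [0.700, 0.665, 0.674, 0.726, 0.691],
    "ANN": [0.728, 0.718, 0.739, 0.662, 0.724],
}

# stdev() is the sample standard deviation (divides by n - 1).
summary = {model: f"{mean(v):.3f} ± {stdev(v):.3f}"
           for model, v in outer_fold_auroc.items()}
for model, text in summary.items():
    print(model, text)
```

For the ANN, for example, the five folds sum to 3.571, giving a mean of 0.714 and a sample standard deviation of about 0.030.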

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
