Next Article in Journal
Psychometric Properties of the Spanish Version of the Caregiver Burden Inventory
Next Article in Special Issue
Racial/Ethnic Disparities in Mortality: Contributions and Variations by Rurality in the United States, 2012–2015
Previous Article in Journal
Health Risk Assessment of Inorganic Mercury and Methylmercury via Rice Consumption in the Urban City of Guiyang, Southwest China
Previous Article in Special Issue
The Relationship between Infectious Diseases and Housing Maintenance in Indigenous Australian Households
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Socioeconomic Status and Morbidity Rate Inequality in China: Based on NHSS and CHARLS Data

1
School of Economics, Peking University, Beijing 100871, China
2
School of Economics and Management, Beihang University, Beijing 100083, China
*
Author to whom correspondence should be addressed.
Int. J. Environ. Res. Public Health 2019, 16(2), 215; https://doi.org/10.3390/ijerph16020215
Submission received: 20 November 2018 / Revised: 3 January 2019 / Accepted: 10 January 2019 / Published: 14 January 2019

Abstract

:
Previous studies have shown there are no consistent and robust associations between socioeconomic status and morbidity rates. This study focuses on the relationship between the socioeconomic status and the morbidity rates in China, which helps to add new evidence for the fragmentary relationship between socioeconomic status and morbidity rates. The National Health Services Survey (NHSS) and China Health and Retirement Longitudinal Study (CHARLS) data are used to examine whether the association holds in both all-age cohorts and in older only cohorts. Three morbidity outcomes (two-week incidence rate, the prevalence of chronic diseases, and the number of sick days per thousand people) and two socioeconomic status indicators (income and education) are mainly examined. The results indicate that there are quadratic relationships between income per capita and morbidities. This non-linear correlation is similar to the patterns in European countries. Meanwhile, there is no association between education years and the morbidity in China, i.e., either two-week incidence rate or prevalence rate of chronic diseases has no statistically significant relationship with the education level in China.

1. Introduction

So far, an abundance of research has been undertaken on the relationship between socioeconomic status and mortality rates. These studies were carried out in the United States [1,2], Canada [3,4], Europe [5,6], and China [7,8], where they found that the people with socioeconomical disadvantages usually have higher mortality rates than those who have higher education or income level. However, mortality is not the only one which matters. Morbidity, which is another basic element of health, is, at least, equally important as mortality [9]. Morbidity has a vital impact on life expectancy and the length of dependent life [10] when it has a different geographic distribution from the distribution of mortality. Specific in China, morbidity in different regions of China varies widely: 2008 statistics show that the Chengguan District in Tibet has a minimal two-week incidence rate of 5.2%, while the Dongcheng District in Beijing has a maximal two-week incidence rate of 53.2%. As for the prevalence of chronic diseases, the Chengguan District in Tibet has a minimal rate of 5.4%, while the Luwan District in Shanghai, however, has a maximum of 33.6%.
Studies tried to explain how income and education level relate to mortality rates. Those who are wealthy and highly educated are less likely to die younger, as they tend to enjoy advantaged access to health-enhancing resources [11,12,13,14,15,16,17,18], and are more likely to live in well-built houses situated in safe neighborhoods in a non-toxic environment [19,20,21,22]. In addition to affording the cares of better quality [23,24,25], people who have a greater socioeconomic status tend to better understand and follow the instructions given by their health care providers [21,26]. Similarly, people who have a better socioeconomic status may also more easily adopt healthy lifestyles [25,27,28], which decreases their exposures to material deprivation and stressful psychosocial environments [25,29,30,31].
However, while the inverse effects of socioeconomic factors on mortality have been reported by a number of studies, no consistent and robust associations have been found between socioeconomic status and morbidity rates [4,9]: A higher income levels experienced lower levels of morbidity in England [32,33], European countries [9], and Nordic countries [34], while no association between income and morbidity was observed in Canada [4]. Meanwhile, a higher education level experienced lower levels of morbidity in the United States [33] and European countries [9,35], while no association between education and morbidity was observed in Canada [4], England [32], and Nordic countries [34]. Nevertheless, in China, the relationship between socioeconomic status and the overall morbidity rates was less focused on: Though a research has found no wealth and education gradients in the prevalence of hypertension [36], and another found there is a lack of socioeconomic gradients in the overall incidence of non-hospitalized injuries for children in China [37]. Therefore, it becomes important to understand and discuss the morbidity rate itself. It is not only because the lack of literature but also because that simple associations between mortality and morbidity cannot be made, nor can the trend of morbidity be inferred from the trend of mortality because of the development of vaccinations and medical technology [38]. We cannot directly apply the conclusions about mortality in China to morbidity in China, where morbidity also has an important impact on people’s life expectancy in China.
This study aims to focus on the relationship between socioeconomic status and the morbidity rates in China, which helps to add new evidence for the fragmentary relationship between socioeconomic status and morbidity [9]. There are three main contributions of this paper. First, we examine the relationship between socioeconomic status and morbidity rates in China. Previous research mainly focused on other countries except for China. Second, both all-age cohorts and older only aged cohorts’ socioeconomic status and morbidity rate are examined in China. Attention was given to the relationship between socioeconomic status and health at all ages [1,2,4,33,39] and older only ages [3,7,8,9,25]. It is a meaningful observation for China to examine the relationship between socioeconomic status and morbidity still hold only in old aging cohorts after checking all the age cohorts. We use two data sources: NHSS (National Health Services Survey) questionnaire collects data from Chinese residents at all ages; CHARLS (China Health and Retirement Longitudinal Study) questionnaire collects data from Chinese residents aged 45 and older. Third, we discover the non-linearity in the association between income and the morbidity rate in China by incorporating the quadratic term of income into the regression model. Previous studies [9,32] have found the non-linear relationship between income and morbidity, we further examine the existence of the quadratic relationship between income and morbidity.
In summary, this study provides a detailed analysis of the relationship between socioeconomic status and morbidity in China at all ages and old only ages by using three morbidity indicators and two socioeconomic statuses. The structure of the paper is as follows: in Section 2, we describe the data used in our empirical analysis then outline the model; we present empirical results in Section 3; finally, Section 4 contains our conclusion.

2. Data and Methods

2.1. Data Sources

Our data come from the National Health Services Survey (NHSS) in China, and the China Health and Retirement Longitudinal Study (CHARLS) [40]. The NHSS survey began in 1993 and is conducted every five years. In this paper, we use the data in 1998, 2003, and 2008. The data of NHSS in 1993, 2013 and later are not used because some important socioeconomic variables were not collected in 1993, and the detailed data of 2013 and later have not been published by the time we conduct this research. As for the CHARLS survey, we use the Harmonized CHARLS data (Version C) published in April 2018. The harmonized CHARLS data contains the data from wave 1 (2011) to wave 4 (2015). We use the data of 2011, 2013, and 2015 surveys which collected consistent variables with which we are concerned.
The NHSS is a national sample survey, and its respondents are the actual population of selected households. A household is defined as the person or people in the same dwelling, regardless of whether family members and others live together or individuals living alone. A multistage stratified random cluster sampling method was adopted in the NHSS. One hundred and fifty-six counties (cities or districts) from 31 provinces, autonomous regions, and municipalities were randomly selected, from which five sample townships (or neighborhoods) were collected, totaling 780 nationwide. In each township (neighborhood), two administrative villages (or neighborhood committees)—1560 across the whole country—were collected. Furthermore, 60 households from each sample village (or neighborhood committee) were randomly selected for further analysis, coming to a total of 93,600 households (nearly 300,000 people) from all over China.
The questionnaires of the NHSS covered: (1) The health service needs of urban and rural residents, including a survey of population and socioeconomic characteristics and health status; (2) health service demands and the utilization of urban and rural areas residents, including treatment of illness, satisfaction rate and reasons for dissatisfaction, public health, maternal and child care, emergency and inpatient service, the utilization of hospital services, and personal payment of medical expenses; (3) urban and rural health security, including health insurance and the composition of the medical security system; and (4) residents’ satisfaction, including satisfaction with the service systems, service delivery, and coverage and level of health insurance.
CHARLS is a nationally representative, multi-disciplinary and public dataset. It covers many aspects from across the interviewees’ lifetimes, including the household, income, health, finance, social security, and so forth. This study used the harmonized CHARLS (Version C), which contains 25,504 observations or rows. The national baseline survey comprises information on about 17,000 individuals and 10,000 households. Our reasons for choosing the CHARLS data are: First, the CHARLS includes detailed information on individuals’ socioeconomic circumstances and prevalence of chronic diseases; second, it collects data from the persons over 45 years old, which will provide data to check whether the association between socioeconomic status and morbidity still hold in older age cohorts.

2.2. Variables

Table 1 displays the definitions of the original variables selected from the NHSS data. In the NHSS data, health indicators of morbidity rates includes: cut down in daily activities due to a physical or mental problem [9,41] and long-term disability [9], incidence of severe illness [42], number of bedridden days [42], multimorbidity [39,43], chronic medical morbidity [44], etc. Considering the availability of data, three health outcomes are taken as our dependent variables: two-week incidence rate (illnessratio), number of sick days per thousand people (illnessday), and prevalence of chronic diseases (chronicratio). The two-week incidence rate measures the respondents’ feelings regarding disease, mainly from the perspective of health services. The rate has three outcomes according to reaction to sickness in two weeks: receiving medical treatment in a health institution, taking medicine or some other adjuvant therapy by themselves, and resting for at least one day without receiving medical treatment or taking medicine. The number of sick days is defined as the average number of sick days in two weeks per 1000 surveyed people, which measures the severity of illness. It is highly correlated with the two-week incidence rate, with a correlation coefficient of 0.946. The prevalence of chronic diseases (chronicratio) refers to the prevalence of chronic diseases among the surveyed population. The variable is positively related to the other two, with both correlation coefficients being over 0.7.
The variables on socioeconomic status used frequently includes income [3,41,45], education [3,41], occupational prestige [3], and housing tenure [46]. We focus on income and education in this paper. Real income per capita (income) is defined as the average annual income per capita, deflated by the GDP deflators of each city to its 1998 purchasing value. The deflator data are from the China Statistical Yearbook for Regional Economy (1998–2008). The weighted education years (edu) is calculated by years of completed education composition. In addition, several kinds of controlling variables are usually considered in literature: demographic factors [41], consumption and health expenses level [42,47], accessibility and affordability of health services [48,49], and environment factor [19]. The demographic factors in this paper are measured by four variables: the average age of all age group weighted by group size (average), the population over 65 years old (age65), male population proportion (male), and a dummy variable of urbanization (urban). Consumption and health service characteristics are measured by the following indicators: total annual consumptions per capita (expend), proportion of family health expenditure to total living expenses (mediratio), average annual medical treatment cost (permedicost) and average hospitalization expense of each time (perhospitalcost). The accessibility of health services has two perspectives: geographical accessibility and economic accessibility. The first one considers the distance from and the time cost to medical institutions which measure the physical convenience of accessing health services. Economic accessibility measures people’s capacity to afford medical bills, i.e., a patient’s income level and whether he/she has medical insurance [48]. The accessibility and affordability of health services are measured by the following indicators: proportion of the population within 1 km of the nearest medical and health unit (distance), proportion of the population whose time cost to the nearest hospital is less than 10 min (time10), and proportion of the population with medical insurance (insurance). The environment factor is shown by the proportion of hygienic toilets (washroom).
In this paper, the variables from the Harmonized CHARLS data are selected then aggregated to correspond with the variables selected from NHSS data. Table A1 in Appendix A displays the definitions of aggregated variables. Corresponding to the dependent variables in the NHSS dataset, we use the prevalence of chronic diseases (CHRONIC_RATIO) as our dependent variable in the analysis on the CHARLS dataset. Meanwhile, the earned income per capita (AVGINDIINCOME_EARN) and average education years (AVGEDU) are the two main socioeconomic variables that we study in the CHARLS dataset. All variables in the CHARLS dataset can be roughly classified to eight sub-categories: morbidity, income, education, demographic backgrounds, health expenditure, health insurance, job status, and family relationships. The first six categories separately correspond to the variables selected from NHSS data. However, because the respondents of CHARLS are older people who possibly have been retired or been receiving extra financial support from other family members, we add the variables of the last two categories to control the effects of non-earned incomes, such as the transfer payments from children.

2.3. Descriptive Statistics

In this paper, we use two-panel datasets to perform econometric analysis: a 3-year (1998, 2003, 2008) panel dataset of NHSS, and a 3-year (2011, 2013, 2015) panel dataset of CHARLS. The NHSS dataset was aggregated by the National Health Commission of China and published at the county level. It covers around 95 counties every year. Meanwhile, the CHARLS dataset was aggregated from the Harmonized CHARLS dataset to city level, where 126 cities are included.
In addition to simplify aggregating variables, we also adjusted the variables of income: the real income per capita (income) in the NHSS dataset and the average earned income per capita (AVGINDIINCOME_EARN) in the CHARLS dataset. In the NHSS dataset, because of the collinearity (correlation coefficient is 0.639) between the real income per capita and weighted education years, the original real income per capita was firstly regressed by the weighted education years with OLS (ordinary least squares). Then, it was replaced by the regression residuals. However, considering the lower correlation coefficient (0.532) between the earned income per capita and education years in the CHARLS dataset, we did not adjust the earned income per capita. Then, to discover whether there is a quadratic relationship between income per capita and health outcomes, we separately added the squared real income per capita (income2) to our NHSS dataset, and the squared earned income per capita (AVGINDIINCOME_EARN2) to the CHARLS dataset. Table 2 displays the general descriptive statistics of health outcome indicators, income indicators, and education indicators of the NHSS and CHARLS datasets. Because we used principle component analysis (PCA) to aggregate and rotate our candidate variables to orthogonal components, their descriptive statistics are not reported. In Section 2.4, we introduce how we performed the PCA.
To discover the existence of the inequality of health outcomes, we compute the Gini coefficients of health outcome indicators at county/city level: two-week incidence rate, number of sick days per thousand people, the prevalence of chronic diseases in the NHSS dataset, and the prevalence of chronic diseases in the CHARLS dataset. Table 3 reports the Gini coefficients in every year. The Gini coefficients of the three indicators in the NHSS dataset reveal a moderate inequality of health outcomes. However, the prevalence of chronic diseases in the CHARLS dataset shows a lack of variance. It is partly because that the mean value of the prevalence (75%) is much higher than its counterpart (16.35%) in the NHSS dataset: If most people have at least one kind of chronic diseases, their differences of whether to have a chronic disease lead to a smaller variance in numbers. In addition to Gini coefficients, we also compute other types of inequality indices as a reference. Note: "prop" indicates proportions, a digit in the range from 0 to 1
Table A2 displays the complete tabular, where we can also find the existence of similar inequality of health outcomes.
The inequality of health outcomes is related to the difference among counties/cities. For example, Figure 1 displays this relationship using the prevalence of chronic diseases. In Figure 1, every circle marks a county (NHSS) or city (CHARLS) whose X-Y coordinates are defined with real longitudes and latitudes. The circles’ color indicates the relative level of a county’s or a city’s prevalence of chronic diseases when compared with other counties or cities. The red color becomes deeper with the chronic disease prevalence increasing. To make the two sub-figures comparable, we dyed these circles in quantile measure rather than the absolute values of the prevalence of chronic diseases, i.e., a circle’s color indicates the county’s or city’s relative level of chronic disease prevalence in the whole sample. In Figure 1, both sub-figures show a consistent geographic inequality of chronic disease prevalence among counties/cities, e.g., coastal metropolises such as Shanghai (N31.23°, E121.47°) and Guangzhou (N23.13°, E113.27°) have a relatively higher prevalence of chronic diseases; and even counties/cities in the same province may have different levels of chronic disease prevalence. However, this kind of descriptive inequality among counties/cities cannot finally answer whether the inequality of the prevalence of chronic diseases is significantly related to the counties/cities themselves.
In addition to counties/cities themselves, the inequality of chronic disease prevalence is also descriptively related to the income per capita in counties/cities. In Figure 1, circle size denotes the level of income per capita. A larger circle means the county/city has a higher average income level. When combing the circles’ color and size, Figure 1 displays an intuitive pattern that counties/cities with a higher level of income per capita tend to have a relatively higher prevalence of chronic diseases. cities with higher income level, such as Shanghai, Beijing (N39.90°, E116.40°), and Chengdu (N30.67°, E104.07°), usually have higher chronic disease prevalence. However, this pattern is not universal, e.g., Luoyang (N36.03°, E103.73°) and Lanzhou (N36.07°, E103.82°), in the sub-figure of CHARLS dataset have higher incomes per capita but a relatively lower prevalence of chronic diseases. Meanwhile, similar patterns can be observed in Figure 2, where circles’ color and size separately denote chronic disease prevalence and average education years. In the first sub-figure of Figure 2, counties with higher average education years tend to have a higher prevalence of chronic diseases, while this pattern is less intuitive in the second sub-figure. Nevertheless, all these possible correlations are descriptive but without statistical proof: no individual difference considered, no other socioeconomic factors controlled, etc. Therefore, further econometric analysis is required to accurately answer whether there are significant correlations between health outcomes and income per capita or education years.

2.4 Model Specification

In this paper, we use panel data models with individual effects as our benchmark models. Because individual difference needs to be controlled to evaluate the real correlations between health outcomes and main socioeconomic status indicator variables such as income per capita and education years. On the one hand, we use panel datasets of both NHSS and CHARLS data. On the other hand, the descriptive statistics, e.g., Figure 1, display possible individual differences at county/city level. Nevertheless, there are 95 counties from 31 provinces in the NHSS dataset and 126 cities from 28 provinces in the CHARLS dataset, while only three years’ data are acquirable. The incidental parameter problem [50] suggested we use province as individual fixed effects and county/city for individual random effects to obtain consistent estimates. In this context, Hausman tests are no longer required if the results of different model specifications are consistent and robust.
Through principle component analysis (PCA), the socioeconomic indicator variables other than income per capita and education years are converted to orthogonal components which are used as control variables in our regression analysis. There are two reasons not to use original socioeconomic indicator variables: One is that many similar variables can together describe a specific aspect of socioeconomic status, e.g., consumptions can be described by both the average amount of annual food consumptions and Engel’s coefficient; the other one is that similar or related socioeconomic indicator variables usually have severe collinearity. Selecting original socioeconomic indicator variables are arbitrary and may result in the failure of estimation. Meanwhile, the trading-off among similar but different variables may also result in omitted variable bias. Therefore, we use dimensionality reduction techniques to solve the problems.
PCA, as one of the most popular dimensionality reduction techniques, has been widely used to construct socioeconomic status indices [51,52,53,54], because there are many similar variables to collect in surveys, where similar information may be covered by different variables. PCA can eliminate variable duplication, distinguish dimensions of information, and save as much variance as possible while reducing dimensionality [55]. Thus, considering we have many candidate socioeconomic indicator variables with similar but different economic meanings, we use PCA to summarize these variables to several interpretable components. These components, rather than original socioeconomic indicator variables, are used as control variables in our final model specifications. Specifically, we perform the regression-based PCA with maximum variance on the centralized and scaled socioeconomic indicator variables. Components are selected according to their eigen values (greater than or equal to 1). Finally, these components are named according to loading matrices. In Appendix B, Figure A1 reports the scree plots, when Table A3, Note: 1. Loadings are colored by column. Larger correlations have deeper red colors.
Table A4, and Note: 1. loadings are colored by column. Larger correlations have deeper red colors.
Table A5 display the loading matrices with component names.
The final model specifications for different health outcomes are:
i l l n e s s r a t i o i t = i n c o m e i t + i n c o m e 2 i t + e d u i t + u r b a n i z a t i o n i t + m e d i c a l   b u r d e n i t + g e o g r a p h i c   a c c e s s i b i l i t y i t + ε i t
i l l n e s s d a y i t = i n c o m e i t + i n c o m e 2 i t + e d u i t + u r b a n i z a t i o n i t + m e d i c a l   b u r d e n i t + g e o g r a p h i c   a c c e s s i b i l i t y i t + h e a l t h   i n s u r a n c e i t + ε i t
c h r o n i c r a t i o i t = i n c o m e i t + i n c o m e 2 i t + e d u i t + u r b a n i z a t i o n i t + m e d i c a l   b u r d e n i t + g e o g r a p h i c   a c c e s s i b i l i t y i t + h e a l t h   i n s u r a n c e i t + ε i t
C H R O N I C _ R A T I O i t = A V G I N D I I N C O M E _ E A R N i t + A V G I N D I I N C O M E _ E A R N 2 i t + A V G E D U i t + c h i l d r e n   s u p p o r t i t + f a m i l y   r e l a t i o n s i t + c o n s u m p t i o n i t + p h y s i c a l   b u r d e n i t + m e d i c a l   b u r d e n i t + d r i n k i n g i t + u n e m p l o y m e n t i t + ε i t

3. Results

In the regression analysis on both the NHSS and CHARLS datasets, we use feasible generalized least square (FGLS) estimators to avoid possible heteroscedasticity problems. Meanwhile, in addition to individual fixed effect model and individual random effect model, three kinds of regressions are also reported as robustness check: Two-ways fixed effect models are used to exclude the effect of time in short panel datasets; pooling models with FGLS estimator are used to see whether individual effects significantly affect estimation results; and pooling models with OLS estimator are reported as the most conservative estimates. Table 4 displays the main results of the regression analysis of the county-level NHSS dataset, and Table 5 reports the main results of the analysis of the city-level CHARLS dataset. Meanwhile, in Appendix C, we provide complete regression results without effect terms.
Table 4 presents the estimates of the effects of income per capita, squared income per capita and education years on three kinds of morbidities. Every column summarizes the result of a specific model, where the standard errors of coefficients are reported in parentheses. In regressions on the three morbidities, the squared income per capita has significant positive coefficients whose estimates are robust among different models. Education years, however, do not display a statistically significant impact on all the three dependent variables. Meanwhile, the income per capita shows a significant negative effect on the prevalence of chronic diseases, while it does not show such an impact on two-week incidence rate and the number of sick days per thousand people.
The coefficients of squared income per capita indicate non-linear correlations between income per capita and our three kinds of morbidities: two-week prevalence, the number of sick days and chronic disease prevalence. Table A6 in Appendix C displays the variance inflation factor (VIF) whose values are less than 5. The VIF excludes the possibility that the significance of the coefficient of squared income per capita is a fake one raised by the collinearity with the linear term (income). When correlations can be profiled with quadratic curves, it means that there are turning-point levels of income per capita: when people’s income is lower than these levels, the three health outcome indicators decline with the growth of income level; however, when people’s income is higher than the turning-point levels, the increase of incidence and prevalence is positively correlated with income per capita. In this context, the linear term of income per capita (income) does not solely reflect income’s correlation with morbidity, but its coefficient decides the turning-point level of income per capita together with the coefficient of squared income per capita. However, in our analysis on two-week incidence rate and the number of sick days per thousand people, the specific turning-point level of income per capita cannot be determined, because the coefficients of the linear term of income per capita are not statistically significant but also not robust among different models.
When age and other socioeconomic indicators controlled, the analysis on the CHARLS dataset presents similar estimation results: the coefficient of squared earned income per capita is significantly positive among different models; the average education years has no robust significant impact on the prevalence of chronic diseases. Nevertheless, the coefficient of the linear term of earned income per capita becomes significantly negative in the analysis on the CHARLS dataset. Therefore, we estimate that the turning-point level of earned income per capita locates in the range from 11.1 thousand yuan to 12.1 thousand yuan.
The regression analysis on the NHSS and CHARLS datasets do not fully support the conclusions of descriptive statistics: a non-linear correlation between morbidities and income per capita was discovered; and education, however, was found to have no significant impact on morbidities. The quadratic relationship between income per capita and morbidities is a new answer to the argument whether there is a universal income—morbidity correlation in China. So far, different correlations have been discovered in different countries: negative correlation was found in the U.S. and Europe [9,33,46], while no specific correlation was found in Canada [4]. Meanwhile, a study in Europe [9] pointed out that the negative income—morbidity correlation in Europe is non-linear among different income strata. Thus, what is the case of China? In our analysis, both positive and negative income—morbidity correlations are found in China, where the relationship is also found to be non-linear: morbidity decreases with growing per capita income; however, when income per capita exceeds a specific turning-point level, morbidity begins to increase with continuing income growth. This non-linear relationship in China can be profiled with a quadratic curve. Therefore, this paper suggests distinguishing different income groups when discussing the relationship between income and morbidity in China, e.g., designing gradient contribution policies of the health insurance plans in China. Other than the effect of income per capita on morbidity, the effect of education years in our analysis is consistent with previous research conducted in China [36], i.e., education years have no significant effect on morbidity in different cohorts. However, a negative correlation between education and morbidity is found in the older population in other countries, e.g., the U.S. [33]. This paper does not discuss the reasons for this difference. It should be discussed with causality analysis.

4. Conclusions

This study focuses on the relationship between socioeconomic status and the morbidity rate in China. It concerns the cohorts at not only all age stages but also old age stages to add new evidence for the fragmentary relationship between socioeconomic status and morbidity. In our regression analysis on the NHSS and CHARLS datasets, three morbidities are used as dependent variables: two-week incidence rate, the number of sick days per thousand people, and the prevalence of chronic diseases. Meanwhile, we use PCA to convert different socioeconomic indicator variables to several interpretable components as the controlling variables in our model specifications. Then, these specifications are estimated with five models, where our robustness check shows consistent estimates among different models.
The quadratic relationship between income per capita and morbidities were found in both the NHSS and CHARLS data. This relationship is statistically robust on different models and on all the three dependent variables. Such a non-linear correlation means there is an all-age quadratic pattern between the income and morbidities in China. This correlation is similar to the patterns in European countries [9] and England [32]. Meanwhile, our study found no correlation between the education level and the two-week incidence rate in both all-age and old-age cohorts in China. The same conclusion was also found on the number of sick days and chronic disease prevalence in the NHSS data, also the chronic disease prevalence in the CHARLS data. Our conclusions indicate that the relationship between education and morbidity rates in China is consistent to the cases in Canada [4], England [32], and Nordic countries [34]. Such a relationship is different from the cases in the United States [33] and European countries [9,35], where previous research found that education is associated with the onset of health problems.
However, the average education level may visibly affect the correlation between education and morbidity rates. Previous studies suggest that education affects morbidity rates through people’s medical knowledge. However, the average education years is about 7 years in NHSS data and 2 years in CHARLS data, where CHARLS interviews people over 45 years old in China. These respondents were all born no later than 1970 and received less education than the younger generations in China. It suggests that the low education level and generally insufficient medical knowledge of this cohort may be one of the causes of absent association between education level and morbidity in China.
In summary, the negative correlation between socioeconomic status and morbidity might not be proved to be a general pattern, but instead depends on the studied countries. In this paper, the relationship between socioeconomic status and morbidity in China was proved to be not universally negative, which is similar patterns to the cases in some other countries

Author Contributions

Y.J. conceived and designed the empirical study; H.Z. performed the empirical study; T.Z. analyzed the data; Y.J., H.Z. and T.Z. wrote the paper.

Funding

This work was financially supported by the National Natural Science Foundation of China (Nos 71371021, 71401192, 71420107025, 71333014, 71873012). The first author would like to thank Major Program Sponsored by Philosophy and Social Sciences Foundation of China (No. 13&ZD042). The second author would also like to thank the support of the Humanities and Social Sciences Planning Fund of Ministry of Education (No. 17YJA790097). This analysis uses data or information from the Harmonized CHARLS dataset and Codebook, Version C as of April 2018 developed by the Gateway to Global Aging Data. The development of the Harmonized CHARLS was funded by the National Institute on Ageing (R01 AG030153, RC2 AG036619, R03 AG043052). For more information, please refer to www.g2aging.org. Finally, we want to express our gratitude to all the reviewers who provided many valuable suggestions.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A: Variables and other descriptive statistics

Table A1. Variable definitions of CHARLS (complete).
Table A1. Variable definitions of CHARLS (complete).
VariablesCodesUnitVariable explanation
Prevalence of chronic diseaseCHRONIC_RATIOpropProportion of the respondents who have at least one of the 13 kinds of chronic diseases
1-year earned incomes per capitaAVGINDIINCOME_EARN10-thousand yuanAnnually incomes from employment per capita
Square of the 1-year earned incomes per capitaAVGINDIINCOME_EARN210-thousand yuan squareSquare of AVGINDIINCOME_EARN
Average education yearsAVGEDUyearApproximate education years
Average ageAVGAGEyearAverage age
Marriage rateMARITAL_RATIOpropRatio of married respondents
Marriage yearsMARITAL_AVELENyearYears of current or the latest marriage
Drinking rateDRINK1Y_RATIOpropRatio of the respondents who ever drank in the past 1 year
Smoking rateSMOKENOW_RATIOpropRatio of smoking respondents
Smoking frequencyAVGSMOKENUMintegerNumber of cigarettes smoked per day
1-year inpatient expenditureAVGHOSP1Y_REALEXPyuanOut-of-pocket inpatient expenditure in the past 1 year
1-month outpatient expenditureAVGOUTP1M_REALEXPyuanOut-of-pocket outpatient expenditure in the past 1 month
1-week food consumptionAVGEXP1W_FOODyuanFood consumptions in the past 1 week
Coverage rate of health insuranceINSURANCE_RATIOpropRatio of the respondents who are covered by any kind of health insurance plans
Coverage rate of public health insuranceINSGOV_RATIOpropRatio of the respondents who are covered by government or public health insurance plans
1-year total consumptionAVGEXP1Y_TOTALyuanTotal amount of consumptions in the past 1 year
Children support rateCHILDCARE_RATIOpropRatio of the respondents who receive any kind of supports from children or grandchildren
Children co-residence rateCHILDCORESD_RATIOpropRatio of the respondents living with children or grandchildren
Children living-nearby rateCHILDLVNEAR_RATIOpropRatio of the respondents whose children live nearby
Children financial support rateTRANSCHILD_RATIOpropRatio of the respondents who receive financial support from children
Working rateWORK_RATIOpropRatio of the respondents who reported they are still working de facto
Agricultural-work rateJOBSTATUS_AGRI_RATIOpropRatio of the respondents who reported they are doing agricultural jobs
Non-agricultural employment rateJOBSTATUS_NAGE_RATIOpropRatio of the respondents who reported they are employed by non-agricultural jobs
Non-agricultural self-employment rateJOBSTATUS_NAGS_RATIOpropRatio of the respondents who reported they are self-employed by non-agricultural jobs
Unemployment rateJOBSTATUS_UNEM_RATIOpropRatio of the respondents who reported they are unemployed but not retired yet
Never-worked rateJOBSTATUS_NEWK_RATIOpropRatio of the respondents who reported they never worked before
Note: “prop” indicates proportions, a digit in the range from 0 to 1.
Table A2. Inequality indices (complete).
Table A2. Inequality indices (complete).
IndexNHSSCHARLS
YearCountiesIllnessratioIllnessdayChronicratioYearCitiesCHRONIC_RATIO
Gini1998940.23160.25120.279020111260.0744
2003950.19980.23950.231220131260.0640
2008930.25140.28370.219620151260.0511
Theil-I1998940.08740.10260.121520111260.0089
2003950.06350.09090.086020131260.0065
2008930.10260.12870.076320151260.0041
Theil-II1998940.08520.10030.128620111260.0091
2003950.06700.09560.093220131260.0067
2008930.09940.12750.079920151260.0041
Coef of Variation1998940.44180.48080.504820111260.1326
2003950.36030.43600.418920131260.1141
2008930.48250.53750.397720151260.0901

Appendix B: PCA Results

Figure A1. Scree plots.
Figure A1. Scree plots.
Ijerph 16 00215 g0a1
Table A3. Component loadings (illnessratio).
Table A3. Component loadings (illnessratio).
ComponentsRC2RC1RC3
NamesMedical BurdenUrbanizationGeographic Accessibility
urban0.16990.85650.2240
expend0.77310.50700.1652
medicost0.86230.29980.1742
male−0.3136−0.69830.0610
age650.54390.6531−0.0027
average0.66340.59760.1000
time100.08810.11950.9264
permedicost0.62680.28270.2826
perhospitalcost0.68120.50510.2325
insurance0.78410.0106−0.1891
washroom0.15890.87590.1772
Note: 1. Loadings are colored by column. Larger correlations have deeper red colors.
Table A4. Component loadings (illnessday/chronicratio).
Table A4. Component loadings (illnessday/chronicratio).
ComponentsRC3RC1RC2RC4
NamesUrbanizationMedical BurdenGeographic AccessibilityHealth Insurance
urban0.81400.33030.2173−0.0703
expend0.47630.66730.13570.4397
medicost0.26330.74710.10830.4842
male−0.7305−0.0573−0.1031−0.3919
age650.65010.40760.03250.3698
average0.59010.49280.12190.4591
distance0.21850.06550.9288−0.0077
time100.07660.12920.94500.0397
permedicost0.18980.87720.06790.0250
perhospitalcost0.45830.68380.16240.2964
insurance0.06770.2284−0.03120.8704
washroom0.84740.28640.1460−0.0374
Note: 1. loadings are colored by column. Larger correlations have deeper red colors.
Table A5. Component loadings (CHRONIC_RATIO).
Table A5. Component loadings (CHRONIC_RATIO).
ComponentsRC5RC1RC2RC3RC4RC6RC7
NamesChildren SupportFamily RelationsConsumptionPhysical BurdenDrinkingMedical BurdenUn−Employment
AVGAGE−0.21460.88110.06710.0786−0.00630.1024−0.0350
MARITAL_RATIO−0.1523−0.4380−0.32840.19570.39460.18680.2155
MARITAL_AVELEN−0.15600.8706−0.0073−0.14430.06260.07630.1515
DRINK1Y_RATIO−0.2836−0.06580.00670.09260.7237−0.0122−0.1155
AVGHOSP1Y_REALEXP0.10420.25340.31650.16630.14290.5617−0.0350
AVGOUTP1M_REALEXP−0.1089−0.02780.1028−0.0954−0.05700.78730.1441
AVGEXP1W_FOOD0.00230.06750.9361−0.01170.03530.10280.0147
AVGEXP1Y_TOTAL0.03200.03710.91270.15260.05830.1972−0.0049
CHILDCARE_RATIO0.9551−0.1080−0.05140.0988−0.06290.0202−0.0128
CHILDCORESD_RATIO0.7243−0.22370.2799−0.1026−0.2160−0.11790.0216
CHILDLVNEAR_RATIO0.9489−0.1397−0.05710.0848−0.07840.0073−0.0054
TRANSCHILD_RATIO−0.20220.63910.0789−0.24190.26430.05400.3319
WORK_RATIO0.0219−0.1650−0.1095−0.59460.3037−0.40710.3917
JOBSTATUS_NAGE_RATIO0.1384−0.13950.07880.76400.18200.0212−0.1084
JOBSTATUS_NAGS_RATIO−0.0906−0.22290.02550.6041−0.0895−0.35630.4260
JOBSTATUS_UNEM_RATIO−0.0371−0.21120.00180.0694−0.0347−0.1339−0.7833
JOBSTATUS_NEWK_RATIO−0.0098−0.3213−0.14210.0629−0.69850.0012−0.1844
Note: 1. Loadings are colored by column. Larger correlations have deeper red colors.

Appendix C: Other Results

Table A6. VIF.
Table A6. VIF.
NHSSCHARLS
IllnessratioIllnessdayChronicratioCHRONIC_RATIO
income4.857income4.875income4.875AVGINDIINCOME_EARN15.597
income22.298income22.304income22.304AVGINDIINCOME_EARN211.190
edu7.238edu7.388edu7.388AVGEDU2.210
RC1 3.934RC14.309RC14.309RC11.107
RC24.764RC21.881RC21.881RC21.053
RC31.992RC33.564RC33.564RC32.815
RC42.091RC42.091RC41.049
RC51.022
RC61.354
RC71.281
Note: 1. Rotated principle components (RCn) of different specifications have the same variables names but distinct definitions, loadings and scores. 2. Blank lines are spared to keep the table equally high among columns.
Table A7. Complete regression results (NHSS).
Table A7. Complete regression results (NHSS).
VariableIndividual Fixed Effect (FGLS) ⱡIndividual Random Effect (FGLS) ⱴTwo-way Fixed Effect (FGLS) ⱡPooling (FGLS)Pooling (OLS)
illnessratio
(Intercept)21.8469 ***10.0209 **24.5788 ***13.0202 ***13.0202 ***
(4.4718)(3.9618)(4.6590)(4.0368)(4.0368)
income−0.31180.1051−0.5214−0.2278−0.2278
(0.3252)(0.3149)(0.3371)(0.3391)(0.3391)
income20.2167 ***0.1758 ***0.2243 ***0.1756 ***0.1756 ***
(0.0379)(0.0387)(0.0378)(0.0395)(0.0395)
edu−0.15210.6774−0.45920.29530.2953
(0.5851)(0.5469)(0.5938)(0.5557)(0.5557)
RC2 (medical burden)1.9514 **0.69782.3577 **1.7011 **1.7011 **
(0.8429)(0.8103)(1.0439)(0.8601)(0.8601)
RC1 (urbanization)0.97460.04381.40620.50490.5049
(0.8793)(0.7792)(0.8967)(0.7816)(0.7816)
RC3 (geographic accessibility)0.0508−0.47410.4391−0.1480−0.1480
(0.5704)(0.5538)(0.5890)(0.5562)(0.5562)
illnessday
(Intercept)1631.8347 ***744.3945 **1706.3932 ***1068.9368 ***1068.9368 ***
(401.3076)(351.3749)(415.9300)(360.3742)(360.3742)
income−13.71663.3721−26.7318−34.9995−34.9995
(28.749)(27.7268)(30.0394)(30.0065)(30.0065)
income218.7594 ***16.8128 ***19.0379 ***17.7110 ***17.7110 ***
(3.3440)(3.3296)(3.3100)(3.4914)(3.4914)
edu−20.040162.9851−37.859019.517919.5179
(53.0576)(48.5439)(52.7612)(49.5975)(49.5975)
RC3 (urbanization)174.3866 **32.8512183.2006**84.500784.5007
(76.2881)(65.5659)(75.7176)(65.7198)(65.7198)
RC1 (health burden)54.1368−59.3752155.9446 **55.778855.7788
(69.9565)(68.4793)(76.2863)(72.2603)(72.2603)
RC2 (geographic accessibility)34.860720.791441.740738.983638.9836
(52.4659)(48.0439)(51.8061)(47.7470)(47.7470)
RC4 (health insurance)171.5954 ***135.6597 ***247.2420 ***200.8171 ***200.8171 ***
(49.6029)(46.1464)(69.9900)(50.3339)(50.3339)
chronicratio
(Intercept)16.9854 ***13.0889 ***16.9771 ***14.5350 ***14.5350 ***
(3.0980)(2.7301)(3.1958)(2.7004)(2.7004)
income−0.5160 **−0.4471**−0.5789 **−0.7147 ***−0.7147 ***
(0.2219)(0.2074)(0.2308)(0.2249)(0.2249)
income20.0865 ***0.0789 ***0.0868 ***0.0974 ***0.0974 ***
(0.0258)(0.0246)(0.0254)(0.0262)(0.0262)
edu−0.37650.0614−0.4943−0.1397−0.1397
(0.4096)(0.3752)(0.4054)(0.3717)(0.3717)
RC3 (urbanization)4.0814 ***3.1392 ***4.1033 ***3.3727 ***3.3727 ***
(0.5889)(0.5082)(0.5818)(0.4925)(0.4925)
RC1 (health burden)2.0875 ***1.6458 ***2.9994 ***2.2899 ***2.2899 ***
(0.5400)(0.5160)(0.5862)(0.5415)(0.5415)
RC2 (geographic accessibility)0.59260.52480.62550.5997 *0.5997 *
(0.4050)(0.3805)(0.3981)(0.3578)(0.3578)
RC4 (health insurance)2.5506 ***2.4239 ***3.3971 ***2.6387 ***2.6387 ***
(0.3829)(0.3520)(0.5378)(0.3772)(0.3772)
Note: 1. Standard errors are reported in parentheses. 2. Models marked with ⱡ use province-level individual fixed effects. Models marked with ⱴ use city-level individual random effects. 3. *** p < 0.01, ** p < 0.05, * p < 0.1.
Table A8. Complete regression results (CHARLS).
Table A8. Complete regression results (CHARLS).
VariableIndividual Fixed Effect (FGLS) ⱡIndividual Random Effect (FGLS) ⱴTwo-way Fixed Effect (FGLS) ⱡPooling (FGLS)Pooling (OLS)
CHRONIC_RATIO
(Intercept)0.9415 ***0.7798 ***0.9772 ***0.7906 ***0.7906 ***
(0.0549)(0.0480)(0.0515)(0.0460)(0.0460)
AVGINDIINCOME_EARN−0.1982 ***−0.1724 ***−0.1482 ***−0.2609 ***−0.2609 ***
(0.0452)(0.0342)(0.0433)(0.0493)(0.0493)
AVGINDIINCOME_EARN20.0882 ***0.0792 ***0.0670 ***0.1081 ***0.1081 ***
(0.0253)(0.0187)(0.0239)(0.0285)(0.0285)
AVGEDU−0.0768 **0.0221−0.05090.03380.0338
(0.0362)(0.0405)(0.0340)(0.0379)(0.0379)
RC5 (children support)−0.0137 ***−0.0203 ***−0.0059−0.0217 ***−0.0217 ***
(0.0039)(0.0033)(0.0039)(0.0041)(0.0041)
RC1 (family relations)0.0409 ***0.0462 ***0.00900.0333 ***0.0333 ***
(0.0037)(0.0032)(0.0060)(0.0042)(0.0042)
RC2 (consumption)0.0236 ***0.0163 ***0.0119 ***0.0295 ***0.0295 ***
(0.0037)(0.0030)(0.0039)(0.0041)(0.0041)
RC3 (physical burden)0.0150 **0.00260.00700.00740.0074
(0.0065)(0.0060)(0.0062)(0.0068)(0.0068)
RC4 (drinking)0.0192 ***0.0109 ***0.0107 **0.0122 ***0.0122 ***
(0.0043)(0.0039)(0.0042)(0.0041)(0.0041)
RC6 (medical burden)0.0244 ***0.0148 ***0.0158 ***0.0241 ***0.0241 ***
(0.0042)(0.0031)(0.0041)(0.0047)(0.0047)
RC7 (unemployment)0.00130.0043−0.00510.00230.0023
(0.0041)(0.0031)(0.0042)(0.0046)(0.0046)
Note: 1. Standard errors are reported in parentheses. 2. Models marked with ⱡ use province-level individual fixed effects. Models marked with ⱴ use city−level individual random effects. 3. *** p < 0.01, ** p < 0.05.
Table A9. Normality tests on regression residuals.
Table A9. Normality tests on regression residuals.
NHSS
IllnessratioIllnessdayChronicratio
Shapiro-Wilk0.0000.0030.091
Lilliefor0.0070.0020.321
Pearson Chi-square0.3210.0010.371
Anderson-Darling0.0010.0010.256
Cramer-von Mises0.0020.0020.295
CHARLS
CHRONIC_RATIO
Shapiro-Wilk0.151
Lilliefor0.401
Pearson Chi-square0.569
Anderson-Darling0.134
Cramer-von Mises0.239
Note: 1. residuals are calculated based on the fixed individual effect models; 2. H0: if p > α, the sample is from a normal distribution.

References

  1. Myers, G.C. Differential Mortality in the United States: A Study in Socioeconomic Epidemiology; Kitagawa, E.M., Hauser, P.M., Eds.; Harvard University Press: Cambridge, MA, USA, 1973; Volume 69, pp. 654–662. [Google Scholar]
  2. Elo, I.T.; Preston, S.H. Educational differentials in mortality: United States, 1979–1985. Soc. Sci. Med. 1996, 42, 47. [Google Scholar] [CrossRef]
  3. Bassuk, S.S.; Berkman, L.F.; Amick, B.C. Socioeconomic Status and Mortality among the Elderly: Findings from Four US Communities. Am. J. Epidemiol. 2002, 155, 520–533. [Google Scholar] [CrossRef] [Green Version]
  4. Mustard, C.A.; Derksen, S.; Berthelot, J.-M.; Wolfson, M.; Roos, L.L. Age-specific education and income gradients in morbidity and mortality in a Canadian province. Soc. Sci. Med. 1997, 45, 383–397. [Google Scholar] [CrossRef]
  5. Marmot, M.G.; Shipley, M.J.; Rose, G. Inequalities in death-specific explanations of a general pattern? Lancet 1984, 323, 1003–1006. [Google Scholar] [CrossRef]
  6. Davey, S.G.; Hart, C.; Hole, D.; Mackinnon, P.; Gillis, C.; Watt, G.; Blane, D.; Hawthorne, V. Education and occupational social class: Which is the more important indicator of mortality risk? J. Epidemiol. Community Health 1998, 52, 153–160. [Google Scholar] [CrossRef]
  7. Zhu, H.; Xie, Y. Socioeconomic differentials in mortality among the oldest old in China. Res. Aging 2007, 29, 125–143. [Google Scholar] [CrossRef]
  8. Luo, Y.; Zhang, Z.; Gu, D. Education and mortality among older adults in China. Soc. Sci. Med. 2015, 127, 134–142. [Google Scholar] [CrossRef] [PubMed]
  9. Huisman, M.; Kunst, A.E.; Mackenbach, J.P. Socioeconomic inequalities in morbidity among the elderly: A European overview. Soc. Sci. Med. 2003, 57, 861–873. [Google Scholar] [CrossRef]
  10. Crimmins, E.M.; Hayward, M.D.; Saito, Y. Changing mortality and morbidity rates and the health status and life expectancy of the older population. Demography 1994, 31, 159–175. [Google Scholar] [CrossRef]
  11. Elo, I.T. Social class differentials in health and mortality: Patterns and explanations in comparative perspective. Annu. Rev. Sociol. 2009, 35, 553–572. [Google Scholar] [CrossRef]
  12. Han, C. Health implications of socioeconomic characteristics, subjective social status, and perceptions of inequality: An empirical study of china. Soc. Indic. Res. 2014, 119, 495–514. [Google Scholar] [CrossRef]
  13. Kwon, S. Economic segmentation and health inequalities in urban post-reform China. Aims Public Health 2016, 3, 487. [Google Scholar] [CrossRef] [PubMed]
  14. Mirowsky, J.; Ross, C.E. Education, Social Status, and Health; Transaction Publishers: Piscataway, NJ, USA, 2003. [Google Scholar]
  15. Ross, C.E.; Mirowsky, J. Does medical insurance contribute to socioeconomic differentials in health? Milbank Q. 2000, 78, 291–321. [Google Scholar] [CrossRef] [PubMed]
  16. Ross, C.E.; Wu, C.-L. The links between education and health. Am. Sociol. Rev. 1995, 719–745. [Google Scholar] [CrossRef]
  17. Wang, H.; Yu, Y. Increasing health inequality in China: An empirical study with ordinal data. J. Econ. Inequal. 2016, 14, 41–61. [Google Scholar] [CrossRef]
  18. Winkleby, M.A.; Jatulis, D.E.; Frank, E.; Fortmann, S.P. Socioeconomic status and health: How education, income, and occupation contribute to risk factors for cardiovascular disease. Am. J. Public Health 1992, 82, 816–820. [Google Scholar] [CrossRef]
  19. Anderson, N.B.; Armstead, C.A. Toward understanding the association of socioeconomic status and health: A new challenge for the biopsychosocial approach. Psychosom. Med. 1995, 57, 213–225. [Google Scholar] [CrossRef]
  20. Evans, G.W.; Kantrowitz, E. Socioeconomic status and health: The potential role of environmental risk exposure. Annu. Rev. Public Health 2002, 23, 303–331. [Google Scholar] [CrossRef]
  21. Luo, W.; Xie, Y. Socio-economic disparities in mortality among the elderly in China. Popul. Stud. 2014, 68, 305–320. [Google Scholar] [CrossRef] [Green Version]
  22. Rosenbaum, E. Racial/ethnic differences in asthma prevalence: The role of housing and neighborhood environments. J. Health Soc. Behav. 2008, 49, 131–145. [Google Scholar] [CrossRef]
  23. Feinstein, J.S. The relationship between socioeconomic status and health: A review of the literature. Milbank Q. 1993, 279–322. [Google Scholar] [CrossRef]
  24. Mackenbach, J.P.; Stronks, K.; Kunst, A.E. The contribution of medical care to inequalities in health: Differences between socio-economic groups in decline of mortality from conditions amenable to medical intervention. Soc. Sci. Med. 1989, 29, 369–376. [Google Scholar] [CrossRef]
  25. Von dem Knesebeck, O.; Lschen, G.; Cockerham, W.C.; Siegrist, J. Socioeconomic status and health among the aged in the United States and Germany: A comparative cross-sectional study. Soc. Sci. Med. 2003, 57, 1643–1652. [Google Scholar] [CrossRef]
  26. Lutfey, K.; Freese, J. Toward some fundamentals of fundamental causality: Socioeconomic status and health in the routine clinic visit for diabetes. Am. J. Sociol. 2005, 110, 1326–1372. [Google Scholar] [CrossRef]
  27. Lantz, P.M.; Lynch, J.W.; House, J.S.; Lepkowski, J.M.; Mero, R.P.; Musick, M.A.; Williams, D.R. Socioeconomic disparities in health change in a longitudinal study of US adults: The role of health-risk behaviors. Soc. Sci. Med. 2001, 53, 29–40. [Google Scholar] [CrossRef]
  28. Lynch, J.W.; Kaplan, G.A.; Cohen, R.D.; Tuomilehto, J.; Salonen, J.T. Do cardiovascular risk factors explain the relation between socioeconomic status, risk of all-cause mortality, cardiovascular mortality, and acute myocardial infarction? Am. J. Epidemiol. 1996, 144, 934–942. [Google Scholar] [CrossRef] [PubMed]
  29. Blane, D.; Brunner, E.; Wilkinson, R.; Wilkinson, R.G. Health and Social Organization: Towards a Health Policy for the Twenty-First Century; Psychology Press: Hove, UK, 1996. [Google Scholar]
  30. Marmot, M.G.; Theorell, T.; Siegrist, J. Work and Coronary Heart Disease; BMJ Books: London, UK, 2002. [Google Scholar]
  31. Smith, G.D.; Dorling, D.; Gordon, D.; Shaw, M. The widening health gap: What are the solutions? Crit. Public Health 1999, 9, 151–170. [Google Scholar] [CrossRef]
  32. Potente, C.; Monden, C. Disability pathways preceding death in England by socio-economic status. Popul. Stud. A J. Demogr. 2018, 72, 175–190. [Google Scholar] [CrossRef]
  33. Liao, Y.; Mcgee, D.L.; Kaufman, J.S.; Cao, G.; Cooper, R.S. Socioeconomic status and morbidity in the last years of life. Am. J. Public Health 1999, 89, 569–572. [Google Scholar] [CrossRef]
  34. Huijts, T.; Eikemo, T.A.; Skalicka, V. Income-related health inequalities in the Nordic countries: Examining the role of education, occupational class, and age. Soc. Sci. Med. 2010, 71, 1964–1972. [Google Scholar] [CrossRef]
  35. Knesebeck, O.V.D.; Verde, P.E.; Dragano, N. Education and health in 22 European countries. Soc. Sci. Med. 2006, 63, 1344–1351. [Google Scholar] [CrossRef] [PubMed]
  36. Lei, X.; Yin, N.; Zhao, Y. Socioeconomic status and chronic diseases: The case of hypertension in China. China Econ. Rev. 2012, 23, 105–121. [Google Scholar] [CrossRef]
  37. Fang, X.; Jing, R.; Zeng, G.; Linnan, H.W.; Zhu, X.; Linnan, M. Socioeconomic status and the incidence of child injuries in China. Soc. Sci. Med. 2014, 102, 33–40. [Google Scholar] [CrossRef] [PubMed]
  38. Riley, J.C. The risk of being sick: Morbidity trends in four countries. Popul. Dev. Rev. 1990, 403–432. [Google Scholar] [CrossRef]
  39. Ki, M.; Lee, Y.H.; Kim, Y.-S.; Shin, J.-Y.; Lim, J.; Nazroo, J. Socioeconomic inequalities in health in the context of multimorbidity: A Korean panel study. PLoS ONE 2017, 12, e0173770. [Google Scholar] [CrossRef] [PubMed]
  40. Beaumaster, S.; Chien, S.; Lau, S.; Lin, A.; Phillps, D.; Wilkens, J.; Lee, J. Harmonized CHARLS Documentation, Version C.; Center for Economic and Social Research, USCDornsife: Santa Monica, CA, USA, 2018. [Google Scholar]
  41. Beydoun, M.A.; Popkin, B.M.J.S.S. The impact of socio-economic factors on functional status decline among community-dwelling older adults in China. Medicine 2005, 60, 2045–2057. [Google Scholar] [CrossRef] [PubMed]
  42. Cheng, L.; Liu, H.; Zhang, Y.; Shen, K.; Zeng, Y.J.H.E. The impact of health insurance on health outcomes and spending of the elderly: Evidence from china’s new cooperative medical scheme. Health Econ. 2015, 24, 672–691. [Google Scholar] [CrossRef]
  43. Elwellsutton, T.; Jiang, C.Q.; Zhang, W.S.; Cheng, K.K.; Lam, T.H.; Leung, G.; Schooling, C.M.J.S.S. P2-395 Socioeconomic influences at different life stages on self-rated health in Guangzhou, China. Medicine 2011, 73, 1884–1892. [Google Scholar]
  44. Kempen, G.I.; Brilman, E.I.; Ranchor, A.V.; Ormel, J.J.S.S. Morbidity and quality of life and the moderating effects of level of education in the elderly. Medicine 1999, 49, 143–149. [Google Scholar] [CrossRef]
  45. Bell, A.C.; Adair, L.S.; Popkin, B.M.J.S.S. Understanding the role of mediating risk factors and proxy effects in the association between socio-economic status and untreated hypertension. Medicine 2004, 59, 275–283. [Google Scholar]
  46. Huisman, M.; Kunst, A.E.; Andersen, O.; Bopp, M.; Borgan, J.K.; Borrell, C.; Costa, G.; Deboosere, P.; Desplanques, G.; Donkin, A.; et al. Socioeconomic inequalities in mortality among elderly people in 11 European populations. J. Epidemiol. Community Health 2004, 58, 468–475. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  47. Filmer, D.; Pritchett, L.J.S.S. The impact of public spending on health: Does money matter? Medicine 1999, 49, 1309. [Google Scholar] [CrossRef]
  48. Rosenberg, M.W. Accessibility to health care: A North-American perspective. Prog. Hum. Geogr. 1983, 7, 78–87. [Google Scholar] [CrossRef]
  49. Eggleston, K.; Ling, L.; Qingyue, M.; Lindelow, M.; Wagstaff, A. Health service delivery in China: A literature review. Health Econ. 2008, 17, 149–165. [Google Scholar] [CrossRef] [PubMed]
  50. Lancaster, T. The incidental parameter problem since 1948. J. Econom. 2000, 95, 391–413. [Google Scholar] [CrossRef] [Green Version]
  51. Gordon, D.; Howe, L.D.; Galobardes, B.; Matijasevich, A.; Johnston, D.; Onwujekwe, O.; Patel, R.; Webb, E.A.; Lawlor, D.A.; Hargreaves, J.R. Authors’ response to alternatives to principal components analysis to derive asset-based indices to measure socio-economic position in low- and middle-income countries: The case for multiple correspondence analysis. Int. J. Epidemiol. 2012, 41, 1209–1210. [Google Scholar] [CrossRef]
  52. Kolenikov, S.; Angeles, G. Socioeconomic status measurement with discrete proxy variables: Is principle component analysis a reliable answer? Rev. Income Wealth 2010, 55, 128–165. [Google Scholar] [CrossRef]
  53. Vyas, S.; Kumaranayake, L. Constructing socio-economic status indices: How to use principal components analysis. Health Policy Plan. 2006, 21, 459–468. [Google Scholar] [CrossRef]
  54. Zell, J.A.; Cinar, P.; Mobasher, M.; Ziogas, A.; Meyskens, F.L., Jr.; Anton-Culver, H. Survival for patients with invasive cutaneous melanoma among ethnic groups: The effects of socioeconomic status and treatment. J. Clin. Oncol. 2008, 26, 66–75. [Google Scholar] [CrossRef]
  55. Giri, N.C. Multivariate Statistical Analysis: Revised and Expanded; CRC Press: Boca Raton, FL, USA, 2003. [Google Scholar]
Figure 1. Geographic distributions: income and the prevalence of chronic diseases. (a) NHSS; (b) CHARLS.
Figure 1. Geographic distributions: income and the prevalence of chronic diseases. (a) NHSS; (b) CHARLS.
Ijerph 16 00215 g001
Figure 2. Geographic distributions: average education years and the prevalence of chronic diseases. (a) NHSS; (b) CHARLS.
Figure 2. Geographic distributions: average education years and the prevalence of chronic diseases. (a) NHSS; (b) CHARLS.
Ijerph 16 00215 g002
Table 1. Variables from National Health Services Survey (NHSS) Data.
Table 1. Variables from National Health Services Survey (NHSS) Data.
VariablesCodesUnitVariable Explanation
Dependent variableTwo-week incidence rateillnessratio%Number of injuries per two weeks in every 100 respondents
Prevalence of chronic diseaseschronicratio%Number of chronic illness cases in every 100 years of age 15 and over
Number of sick days per thousand peopleillnessdaydayAverage number of sick days in two weeks per 1000 people
Socioeconomic characteristicsIncome per capitaincome10 thousandAverage annual income per capita
Average years of educationeduyearThe years required by a respondent to obtain his/her highest degree
Demographic characteristicsAverage age in countyaverageageThe average age of all age group weighted by group size
Older population proportionage65%Proportion of the population over 65 years old
Male population proportionmale%Proportion of male population
Urban flagurban Urban = 1, rural = 0
Consumption and health service characteristicsAnnual consumptionexpendyuanTotal annual consumption per capita
Average of medical treatment costspermedicostyuanAverage annual medical treatment costs
Average hospitalization expenseperhospitalcostyuanAverage hospitalization expense of each time
Health expenditure per capitamedicostyuanProportion of family health expenditure to total living expenses
Accessibility and affordability of health servicesAccessibility of distance to the nearest hospitaldistance%Proportion of the population whose distance from the nearest hospital to their home is less than 1 km
Accessibility of time to the nearest hospitaltime10%Proportion of the population whose time cost to the nearest hospital is less than 10 min
Coverage of social health insurance plansinsurance%Proportion of the population covered by social health insurance plans
Environment factorHygienic toilets shareswashroom%Proportion of hygienic toilets
Table 2. Descriptive statistics.
Table 2. Descriptive statistics.
VariablesMeanStdevMinPct 25%MedianPct 75%Max
NHSS (282 observations in 3 years)
illnessratio16.357.533.7111.4314.7019.2953.20
illnessday1328.88691.82231.00854.751167.501589.254128.00
chronicratio14.146.382.899.8412.9717.9733.55
income *0.002.56−4.98−1.57−0.641.1411.07
income26.5415.140.000.611.975.80122.54
edu7.391.911.706.157.008.7411.65
CHARLS (378 observations in 3 years)
CHRONIC_RATIO0.750.100.450.680.750.820.98
AVGINDIINCOME_EARN0.440.320.030.210.350.591.81
AVGINDIINCOME_EARN20.300.470.000.050.120.353.29
AVGEDU1.170.161.001.071.121.201.94
* The original income variable has been regressed with education years. Its statistics in this table are based on the regression residuals.
Table 3. Gini coefficients of health outcomes.
Table 3. Gini coefficients of health outcomes.
NHSS
YearCountiesIllnessratioIllnessdayChronicratio
1998940.23160.25120.2790
2003950.19980.23950.2312
2008930.25140.28370.2196
CHARLS
YearCitiesCHRONIC_RATIO
20111260.0744
20131260.0640
20151260.0511
Table 4. Results of the regression analysis on NHSS dataset.
Table 4. Results of the regression analysis on NHSS dataset.
VariableIndividual Fixed Effect (FGLS) ⱡIndividual Random Effect (FGLS) ⱴTwo-way Fixed Effect (FGLS) ⱡPooling (FGLS)Pooling (OLS)
illnessratio
income−0.31180.1051−0.5214−0.2278−0.2278
(0.3252)(0.3149)(0.3371)(0.3391)(0.3391)
income20.2167 ***0.1758 ***0.2243 ***0.1756 ***0.1756 ***
(0.0379)(0.0387)(0.0378)(0.0395)(0.0395)
edu−0.15210.6774−0.45920.29530.2953
(0.5851)(0.5469)(0.5938)(0.5557)(0.5557)
illnessday
income−13.71663.3721−26.7318−34.9995−34.9995
(28.7490)(27.7268)(30.0394)(30.0065)(30.0065)
income218.7594 ***16.8128 ***19.0379 ***17.7110 ***17.7110 ***
(3.3440)(3.3296)(3.3100)(3.4914)(3.4914)
edu−20.040162.9851−37.859019.517919.5179
(53.0576)(48.5439)(52.7612)(49.5975)(49.5975)
chronicratio
income−0.5160**−0.4471**−0.5789**−0.7147 ***−0.7147 ***
(0.2219)(0.2074)(0.2308)(0.2249)(0.2249)
income20.0865 ***0.0789 ***0.0868 ***0.0974 ***0.0974 ***
(0.0258)(0.0246)(0.0254)(0.0262)(0.0262)
edu−0.37650.0614−0.4943−0.1397−0.1397
(0.4096)(0.3752)(0.4054)(0.3717)(0.3717)
Note: 1. Standard errors are reported in parentheses. 2. Models marked with ⱡ use province-level individual fixed effects. Models marked with ⱴ use city-level individual random effects. 3. *** p < 0.01, ** p < 0.05.
Table 5. Results of the regression analysis on China Health and Retirement Longitudinal Study (CHARLS) dataset.
Table 5. Results of the regression analysis on China Health and Retirement Longitudinal Study (CHARLS) dataset.
VariableIndividual Fixed Effect (FGLS) ⱡIndividual Random Effect (FGLS) ⱴTwo-way Fixed Effect (FGLS) ⱡPooling (FGLS)Pooling (OLS)
CHRONIC_RATIO
AVGINDIINCOME_EARN−0.1982 ***−0.1724 ***−0.1482 ***−0.2609 ***−0.2609 ***
(0.0452)(0.0342)(0.0433)(0.0493)(0.0493)
AVGINDIINCOME_EARN20.0882 ***0.0792 ***0.0670 ***0.1081 ***0.1081 ***
(0.0253)(0.0187)(0.0239)(0.0285)(0.0285)
AVGEDU−0.0768 **0.0221−0.05090.03380.0338
(0.0362)(0.0405)(0.0340)(0.0379)(0.0379)
Note: 1. Standard errors are reported in parentheses. 2. Models marked with ⱡ use province-level individual fixed effects. Models marked with ⱴ use city-level individual random effects. 3. *** p < 0.01, ** p < 0.05.

Share and Cite

MDPI and ACS Style

Jiang, Y.; Zheng, H.; Zhao, T. Socioeconomic Status and Morbidity Rate Inequality in China: Based on NHSS and CHARLS Data. Int. J. Environ. Res. Public Health 2019, 16, 215. https://doi.org/10.3390/ijerph16020215

AMA Style

Jiang Y, Zheng H, Zhao T. Socioeconomic Status and Morbidity Rate Inequality in China: Based on NHSS and CHARLS Data. International Journal of Environmental Research and Public Health. 2019; 16(2):215. https://doi.org/10.3390/ijerph16020215

Chicago/Turabian Style

Jiang, Yunyun, Haitao Zheng, and Tianhao Zhao. 2019. "Socioeconomic Status and Morbidity Rate Inequality in China: Based on NHSS and CHARLS Data" International Journal of Environmental Research and Public Health 16, no. 2: 215. https://doi.org/10.3390/ijerph16020215

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop