3.2. Model Specification
To investigate multilateral linkages among school performance, housing prices, and population in-migration, we apply a spatial simultaneous equations model composed of three equations described as:
where
W is a spatial weights matrix of known constants, and the vectors
Y1,
Y2, and
Y3 represent school performance, housing prices, and population in-migration, respectively.
Xi denotes an
N ×
K matrix of exogenous explanatory variables, and
is an associated
K × 1 vector with unknown parameters to be estimated. The disturbance vector
exhibits spatial dependence, and the vector
is an independent and identically distributed (i.i.d.) error term with zero mean and finite variance (
). The scalar
is the spatial autoregressive coefficient, and the scalar
is the spatial autocorrelation coefficient. To address both feedback simultaneity between
Y2 and
Y3 and spatial dependence specified in the model, the generalized spatial three-stage least squares procedure (GS3SLS) suggested by Kelejian and Prucha [
45] is applied. The two-stage least squares procedure method (GS2SLS) is also available to estimate our model, but GS3SLS estimator is more efficient than GS2SLS estimator by utilizing full system information concerning potential correlation across equations. For details in estimation method, refer to Kelejian and Prucha [
45] and Jeanty et al. [
46], in which the simultaneous interaction and spatial interdependence between housing prices and population migration was analyzed using two-stage least squares procedure method.
The elements of the spatial weights matrix are defined as below (LeSage and Pace [
47] showed the estimates and inferences were not sensitive to the specification of spatial weights matrix, and flexible spatial spillover specification could save the efforts to build an accurate matrix. In this study, a contiguity based spatial weights matrix is chosen over those in other types (e.g., a distance decay type) to illustrate local attributes shared across the administrative boundary).
The matrix is row-standardized to provide information on the scale of spatial interactions across observation. Among 369 observations, there are 5.36 neighbors per row (column) on average. The number of neighbors per row (column) ranges from one to eleven, and the medium is five.
In a spatial econometric model, spillover effects should be considered to evaluate the impact of the change in the explanatory variables on dependent variables. Regarding this issue, Lesage and Pace [
48] proposed the partial derivative effects estimates composed of direct and indirect effects. In detail, the indirect effect is defined as the sum of the marginal change in dependent variable of a particular observation due to the change in explanatory variable of all other observations, whereas the direct effect is caused by the change in its own observation. The accumulation of spatial spillover effects due to the feedback between observations is described as a form of a multiplier matrix as in Equation (3).
The direct effect of
kth independent variable of
ith observation is represented by its coefficient
βk multiplied by the value of
ith diagonal element of the multiplier matrix. The corresponding indirect effect due to the change in all other observations (except for the
ith) is the coefficient
βk multiplied by the sum of off-diagonal elements of
ith row of the matrix. In the same time, the indirect effect, representing the change in the rest owing to the change in
ith observation, is calculated by multiplying
βk by the sum of off-diagonal elements of
ith column, instead of
ith row. Due to the symmetry of the multiplier matrix, the indirect effect can be measured by either row sums or the column sums of off-diagonal elements of the matrix [
48]. Total effect, the sum of direct and indirect effects, is used to examine the impact of the change in an explanatory variable on the dependent variable. In order to test the spatial spillover, standard errors of the effects are calculated using simulated parameter sets based on 1000 random draws with a normal distribution assumption and variance-covariance matrixes provided by Elhorst [
49].
3.3. Estimation
This study uses a cross-sectional data set of 369 town-level units of Seoul in 2010.We use same-year-data for all variable because the relationship between school performance, housing prices, and residential mobility we explore in this paper is association, rather than causation. Further, using different-year data seems odd due to the feedback linkage between the three key variables because the three key variables serve as both dependent and independent variables.The matching of schools to these spatial units is done using the information on school catchment areas announced by the Seoul Metropolitan Office of Education. Since 2008, Korean Ministry of Education has made public announcement of individual school information including the outcome of national assessment of academic achievement conducted for 6th grade elementary school students. The information on the academic achievement of a school is provided in the form of the percentages of students who were rated above-average level, average level, and below-average level. The odds-ratio of students in a school who achieved above-average levels in all three subjects of Korean, Mathematics, and English to those who achieved average and below-average level (SCORE) is used as a proxy for school performance. As a proxy of local level housing prices, the annual average of median sale prices per square meter (PRICE), provided by a realty bank is used (In order to relieve aggregation bias, price information is collected from plats with common physical attributes (e.g., size, age, type, and the scale of apartment complex)). The residential in-migration (IN_MIG) is measured by the population inflow excluding those from neighboring towns and residential relocation within the same town.
Explanatory variables for school performance are composed of the frequency of short-distance residential moves (SHORT), the access of private tutoring services (PRIVATE), and the level of family wealth (WEALTH), which is a predictor of social economic status of the family. Note that in Seoul, elementary school quality is not much dependent on teacher quality due to highly qualified teacher requirement and teachers’ rotation across schools. In addition, school funding for academic curriculum is relatively equivalent across schools as well because a large portion of the finance is covered by the budget of the Seoul, rather than that of local authority of education. The SHORT variable is measured by the annual record of the residential changes within the own and neighboring towns in Seoul. Though the literature discussing the difference in relationship between move and academic achievement by the distance of move is not very rare, there is a lack of consensus in identifying “short-distance” in terms of quantitative measure. In accordance with Schafft [
44], we consider residential mobility driven by housing instability or any other type of financial hardships is represented by short-distance migration in our model. In South Korea, the majority of housing lease in South Korea is characterized by key-money system: instead of paying monthly rent, tenants make a lump sum payment as a deposit for the occupancy. After a termination of two-year lease contract, many renters change home due to the burden of renewed increase in key deposit money. They tend to relocate into no further than neighboring towns, in order to avoid major change in daily life and reduce search cost of new housing. Therefore, we consider the residential changes within the own and neighboring towns as short-distance moves. The PRIVATE variable and the WEALTH variable are measured using the number of employees in private education industries (confined to those related to tutoring services for primary and secondary school age kids) and local property tax revenue, respectively.
Housing prices are estimated using the distance to central areas (DIST), access to public transport (TRANS), endowment of public sports facilities (SPORTS) (Some of local governmental agencies operate sport facilities in which fitness programs or lessons of specific sports are provided to residents for a reasonable fee) and parks (PARK), both representing local amenities. In addition, the effect from the scale of out-migration (OUT_MIG) is controlled along with the IN_MIG, becausethey collectively affect housing prices through the change in housing demand. Access to employment locations, along with the quality of local public goods is the most significant determinants of population in-migration [
50,
51]. The proximity to workplace is measured by average commuting time of residents (CTIME), provided by Seoul Metropolitan Household Travel Survey. Not only the proximity to workplaces, the opportunity of being employed as well explains residential choice [
52,
53,
54,
55]. The effect of job opportunity (Confined to consumer-oriented service sectors, it can be viewed as access to the specific functions) on residential choice might be different across industrial sectors, due to both supply- (e.g., spatial allocation of urban functions) and demand-side reasons (e.g., preference to or avoidance of externalities of the industry). Accordingly, the scale of local industries, measured by the size of employment of the own and neighboring towns, is controlled for both manufacturing (MANU) and service (SERV) sectors. In addition, the size of local housing market (HOUSING), represented by housing stock level is controlled (The size of local housing market on population in-migration might be better represented by the quantity of potentially available housings (e.g., the size of local housing lease market) rather than the housing stock per se, but the latter is advantageous in terms of data availability). To address potential endogeneity issue due to feedback linkage between dependent variables, three excluded instrument variables (IVs) are selected: the ratio of female teachers in a school, log of average of officially assessed land price in 2005, and the residential in-migration in 2005. The use of the ratio of female teachers in a school as an IV for the school performance variable is based on the empirical finding that female teachers were associated with higher test scores in the U.S. [
56,
57]. While some authors suggested that female teachers would be more favorable to female students (e.g., Ehrenberg [
58]), the hypothesis that a same-sex teacher improves students’ performance was not empirically supported in Holmlund [
59]. To check the validity of the IVs, F-tests concerning the joint significance of IVs are performed (The statistics are reported on
Table A1 in
Appendix A). Null hypotheses that the IVs are weak are rejected at the 1% of significance level in both models.
Table 1 gives the summary statistics and a description of the variables used in this study. The left skewed distribution of school performance and its explanatory variables (e.g., the access to private tutoring services) imply that the benefit from accessibility to high quality of educational environment is limited to a few neighborhoods. To illustrate, a local indicator of spatial association (LISA) developed by Anselin [
60] is analyzed with respect to the access to private tutoring services and school performance (see
Figure 2a,b). In both figures, hot spots, defined as local spatial clusters, are observed in the south-eastern part and mid-western part. These neighborhoods were initially developed in the 1970s and the 1980s as urban sub-centers of Seoul. Massive investment in transportation infrastructure (e.g., subway networks) and transfers of urban facilities including prestigious high schools from central zone attracted a population. In addition, concentration of top-ranked schools and upper-middle income households led to the growth of private education industries. Combined with speculative housing demand, the fame as a Mecca of education created housing price premium in these area (see
Figure 2c).
Figure 2d shows that spatial pattern of the ratio of short-distance in-migration is different from the former ones. The hot spots of access to private tutoring services, school performance and housing prices tend to be cold spots in terms of the ratio of short-distance in-migration. Relatively longer-distance in-migration indicates that relocations to these neighborhoods are more likely to be inter-school district moves motivated by the quality of educational settings or involved with speculative housing investment, rather than related with unstable housing conditions.
Table 2 shows the estimation results using the GS3SLS method (in
Appendix B, we present estimation results using the OLS and the 3SLS methods, and test results for endogeneity and spatial autocorrelation, by which the use of the GS3SLS method is justified). The results are by and large consistent with our expectation and findings from the literature. Focusing on the key variables involved with our hypotheses, the academic performance of schools in catchment areas (SCORE) is positively associated with both housing prices (PRICE) and population in-migration (IN-MIG); housing prices (PRICE) is negatively linked to population in-migration (IN_MIG), but population in-migration (IN_MIG) is positively linked to housing prices (PRICE). Overall, the results satisfy the hypotheses in terms of sign and statistical significance, except that the estimate of IN_MIG of the housing price equation is not significant at 10% level. While short-distance residential move (SHORT) has a negative association with school performance, the access to private tutoring services (TUTOR) and wealth level (WEALTH) shows a positive linkage to it. This supports the importance of educational environment including financial stability of the own and peer students, and the provision of private tutoring services on school outcome. The positive and significant estimates of spatial lags (
ρ) evidence the existence of the spatial spillovers and dependencies.
The estimation results provide a schematic sketch of linkages among school performance, housing prices, and residential moves. However, as mentioned above, the impact of a change in independent variables on the dependent variable can be more precisely measured based on partial derivative effects estimates composed of direct and indirect effects.
Table 3 presents the average direct, indirect, and total effects of the GS3SLS results. The direct and indirect effects are statistically significant in key variables of our model, supporting the hypothesis of linkage among school performance, housing prices, and residential in-migration. Beginning with the school performance equation, a 1% increase in the frequency of residential change within its own and neighboring towns relates to 0.081unit decrease in the school performance of the town on average. As for indirect effect, a 1% increase in the frequency of short-distance residential moves of the town is related to 0.232 unit decrease in the school performance of all other towns on average, being greater than the direct effect. From a different point of view, a 1% increase in the frequencies of short-distance residential move of all other towns relates to 0.232 unit decrease in the academic performance of a particular town on average. The total effect arising from a percent increase in the frequency of short-distance residential move of a town is 0.313 unit decrease, which is significant at 1% level. The change of school performance in a town also relates to the change in the access to private education service and financial status of family in its own and the other towns. Comparing the size of direct and indirect effects, the school performance is more elastic to changes in the educational environments of other towns than the particular town. (Constant ratio between the direct and indirect effects for every explanatory variable is a limitation of SAC type model (without spatial lags of explanatory variables) [
49]). This finding suggests that local educational setting generates a significant spatial externality in either positive or negative way, so inter-school level cooperation to improve educational environment is needed.
Housing prices are highly elastic to school performance of the catchment area: they would go up by 1.281% in response to a unit increase in academic performance on average. Taking into consideration of both direct and indirect effects, a unit increase in academic performance relates to 1.311% increase in housing prices on average. The size of the population inflow into a town is negatively associated with the increase in housing prices of its own and neighboring towns: the sizes of direct effect and indirect effect are 1.086% and 0.192%, respectively. A unit increase in school performance of a town relates to the increase in population inflow by 1.086% in the town and 0.192% in neighboring towns. The indirect effect as well as the indirect effect is statistically significant, suggesting that the impact of the increase in housing prices spread to neighboring towns, due to inter-connectivity across local housing markets. A 1% increase in school performance of a town leads to the increase in population inflow by 1.033% in the town and 0.182% in neighboring towns. Though significant at 1% level, the effect of school performance on population in-migration is smaller than that of housing prices.
Table 4 shows the change in housing prices and population in-migration with respect to a unit change in school performance, calculated based on the size of total effect in GS3SLS specification. A unit increase in school performance of a town leads to 1.355% of increase in housing prices, and the majority of the effect is generated through a direct channel (1.311%). In contrast, a unit increase in school performance of a town leads to the decrease in population in-migration by 0.46%, because the negative effect through the indirect channel (−1.675%) outweighs the positive effect through the direct channel (1.216%).
3.4. Discussion
According to the figures above, housing prices are more than twice elastic than population in-migration with respect to school performance, and a potential explanation is speculative investments in housing in good school districts, which impose further financial restriction on the residential choices of lower-income households. This suggest that inter-relationship between school performance, housing prices, and residential mobility under rigid geographic school assignment contributes to persistent school segregation. By identifying the multilateral linkage between these variables, we elucidate the underlying cause of substantial segregation of schools and residences with limited school choice, observed in the literature (e.g., [
61]). Interestingly, while Clapp and Ross [
62] found that racial (and income)-oriented segregation have only slight effects on middle- and high-income residences, the school quality (and housing price)-oriented segregation would affect both high- and low-income residences due to their difference in the drivers of residential moves(i.e., pull factors versus push factors).
Given the high dependence of school performance on peer composition in elementary school level [
16], limited access to good school districts may further worsen the inequality of school performance. Furthermore, considerable sizes of indirect effects in three equations imply a spatially clustered pattern of the cumulative process of inequality. Therefore, spatially comprehensive approaches to enhance the general achievement level of weak school districts and dampen the speculative demand for housing in affluent school districts would be needed. In addition, governmental initiated land use plan to improve social mix (e.g., provision of public rental housings in affluent school districts) could increase positive peer effects that students from economically disadvantaged households can enjoy.
In order to reduce the inequality of educational achievement, more financial support is required for school districts with low fiscal self-reliance. The housing prices of these areas tend to be relatively low, which affects not only the amount of property taxes (a primary source of local government revenue) but also also school development funds and donations by parents and graduates. The budget support from the national government could be an effective solution to the problem of educational inequality. In addition, the national and local governments need to take an active role in the operation of after-school programs in an integrated way. An after-school program could provide children from lower income families with experiences similar to those enjoyed by children from middle-class families, who have access to a rich array of lessons, coached sports, and private tutoring. Such a program is one way to reduce the gap in access to qualified educational services [
63]. The after-school program in South Korea has been operated to reduce the disparity of access to better educational services across school districts and income level groups since 2006, but the gap in the program’s quality between rich and poor districts remains due to income disparities. Finally, public education of children under school age could ease spatial inequality of educational environment. The public education system for children under school age has been critical in students’ academic performance. A long term research work showed strong correlation between kindergarten test score and adult outcomes such as college enrollment, mean income, and home ownership, implying the government support to enhance the quality of early childhood education in underprivileged areas would be valuable in light of the equality of education [
64].