You are currently viewing a new version of our website. To view the old version click .
Sustainability
  • Article
  • Open Access

13 July 2020

Strengthening the Statistical Summaries of Economic Output Areas for Urban Planning Support Systems

,
,
and
1
Department of Geography, Kyung Hee University, Dongdaemun-Gu, Seoul 02447, Korea
2
Department of Geography Education, Dongguk University, Jung-gu, Seoul 04620, Korea
*
Author to whom correspondence should be addressed.
This article belongs to the Special Issue Geospatial Technologies and the 4th Industrial Revolution for Sustainable Urban Environment

Abstract

Despite efforts to research the transformation of urban structures, difficulties remain in estimating credible statistical information in the existing census output areas. This research proposes two alternatives to construct new economic output areas by considering the socioeconomic homogeneities where economic activities occur. In particular, we developed an algorithm to aggregate new economic zones into the existing census output areas. For this purpose, we utilized matrix systems that consider population sizes, the number of workers and workplaces, and a combination of these factors in the two alternatives. Urban planners need to provide credible statistical summaries at the census output areas. Our findings contribute to this research by suggesting that it is essential to consider the population and the number of workplaces with socioeconomic homogeneity. These findings will also help other researchers who study the transformation of urban structures because they can use more reliable statistical information for their simulation model that predicts an urban structure. Furthermore, it will help improve the national statistics office’s roles for public and urban planners and provide an important source for the national statistical geographic information services.

1. Introduction

As stressed by many speakers at the United Nations 2030 Agenda for Sustainable Development in 2015 and 2019, realistic and robust statistical data are seen as critical [1,2]. Without reliable and robust measurements of where economic activities particularly occur, it would not be easy to successfully achieve sustainable development goals and monitor sustainable transformations of urban environments. The economy is one of the three pillars (economy, society, and the environment) of sustainability. An economic factor is often used for assessing the performances of cities. Consequently, statistical information in an urban area is seen as more important than ever to accurately monitor sustainable transformations of urban environments and as one of the important indicators of urban sustainability. However, each country has used different types of theories or issues that determine economic zones or census output areas in a hierarchy of census geographic units because transformations of urban structures are complex and depend on a variety of factors.
Normally, the transformations have been affected by the different types of sectors [3,4,5], the size of sub-centers [6,7], the population [8,9], employment densities [10,11,12,13], transportation costs [14,15], agglomeration patterns of the commercial establishment [16,17], and land-use planning and regulations [18]. As a result, many researchers have paid attention to the structure of urban transformation, but few papers have discussed statistical information in the small census output areas or economic zones. Credible statistical summaries in economic zones can be one of the important factors supporting the right policy-making systems or planning support systems [17,19]. As noted above, there are a variety of theories or computational approaches that construct smaller sub-zones that are spatially aggregated. The approaches help resolve spatial location concerns and validate statistical information in a small economic zone [20,21,22]. Because each country has used a different hierarchy of census geographic units, different statistical summaries in the same economic zones may result [22]. As a result, statistical summaries can often be overestimated or underestimated. Thus, urban planners need to construct an optimal economic district with the proper homogeneity of the geographic area.
In response to the reliable and robust statistical data in the existing census output areas, this research raised the following questions: (1) Do the existing census output areas that are aggregated guarantee the homogeneity of the geographic area? (2) What are the scientifically defensible approaches to accurately estimate statistical information about where economic activity occurs in the census output areas? and (3) What types of social and economic factors should be considered to maintain the consistency of the statistical information in the census output areas? With respect to these considerations, this study proposes two main alternatives to constructing new census output areas within the current basic unit districts. We are particularly interested in a central business district where economic activities frequently occur.
In this study, we developed a new algorithm to delimit new census output areas that contain reasonable statistical summaries of the number of workers resulting from the basic unit districts. Note that a basic unit district is the smallest unit (i.e., zone) in the current hierarchy of census geographic units of the study area [23]. The smallest districts are delimited through physical characteristics such as roads, creeks, railroads, and mountain ranges. Districts are also determined through a block unit or the smallest administrative boundaries in an urban area [23]. The existing census output areas are determined by the population size (at least 300, proper 500, and a max of 1000), type of residency and property values, shapes of census output areas, and more. One census output area includes multiple basic unit districts and is an upper-level unit of the basic unit districts and a lower-level unit to administrative boundaries [22]. This study used matrix systems that consider population sizes, the number of workers and workplaces, and a combination of these factors in two main alternatives. The factors are quantitatively assessed through descriptive statistics and used as weighted values in a matrix that computationally creates new economic census zones. Accordingly, this research constructs new economic census zones that satisfy the reliable statistical summaries of where economic activities highly occur. The economic zones include the existing basic unit districts. The following show the key contributions of this work:
  • Through this research, we expect that the resultant outputs will be used for a new method to determine business districts or economic census zones in a country. It is important for urban planners to use scenario-based approaches that directly apply a new policy to reality.
  • On the research side, urban scientists can statistically monitor how the economic zones have been transformed over time and consider the social and economic homogeneities that contain a combination of populations, workers, and workplaces.
  • Furthermore, this approach will contribute to individuals who would like to use reliable statistical summaries for general data analysis.
  • Additionally, due to the issues related to maps’ aerial unit problem, statistical summaries for the same location can be overestimated or underestimated. However, the outputs of this work will be used for urban planners to provide more accurate national services associated with statistical geographic information services.
Accordingly, this research assists in estimating reliable and accurate statistical information that is used for the development of a sustainable economy and high-quality spatial data. This paper is structured as follows. In Section 2, we describe the previous studies related to the automated zoning program (AZP) and the existing small census output areas used in multiple countries. Furthermore, modifiable areal unit problems are addressed in the existing census output areas. In Section 3, we address the primary research approaches. Section 4 provides the results obtained from the primary approaches. Finally, in Section 5, we discuss the significant contribution of this research, and then provide our conclusions in Section 6.

3. Methods

3.1. Schematic Diagram

This study proposes two main alternatives that help construct new census output areas where economic activities heavily happen. The following shows a schematic diagram developed in this research.
As shown in Figure 2, the first alternative considered population, the number of workers, and the number of workplaces in a matrix (Figure 2(A1)). The second alternative considered only the number of workers and the number of workplaces in a matrix (Figure 2(A2)). The first alternative had three scenarios: population (Scenario 1.1), number of workplaces (Scenario 1.2), and a combination of the population and the number of workplaces (Scenario 1.3). This approach integrated information of the workspace into the information of the existing COAs, which is beneficial for individuals using the workplace-centered statistical information. The second alternative only considered new economic zones without considering the existing COAs. It also had three scenarios: the number of workers (Scenario 2.1), the number of workplaces (Scenario 2.2), and a combination of the number of workers and workplaces (Scenario 2.3). The second alternative was designed for applications of business information.
Figure 2. Schematic diagram of this research: Two main alternatives to identify an optimized small economic zone.
Next, we classified all types of land into two main classes: urbanized and non-urbanized areas. The sub-classes (Table 1) in the two main classes were used to quantitatively assess all scenarios. We used unique identifiers (UIDs) to represent all types of land classes, which we inserted into a matrix to determine a weighted value for the newly determined economic zones (Figure 2(A3)). Finally, to examine how homogeneity improved, this study computed means and standard deviation (SD) of both the existing small census output areas and new economic zones (Figure 2(A4)). Furthermore, the study explored spatial patterns of the economic zones using hot spot analysis that used Getis-Ord Gi*, which can determine statistically significant hot or cold spots and reveal spatial trends in the clustering of polygon features, i.e., it assesses where high values in the economic zones spatially cluster [55]. The following sub-sections in Section 3.2 address detailed analysis procedures to delimit new economic zones.
Table 1. Classes of unique identifiers (UIDs) in the basic statistics areas.

3.2. Analysis Procedures for Delimiting New Economic Places

In this study, we considered social and economic homogeneities in the analysis procedures. To quantitatively assess these homogeneities, we classified unique identifiers (UIDs) by areas of building types, land use, and urbanized and non-urbanized areas (Figure 2(A3)). For example, if a building area was greater than 10% in the area of interest, we classified it as an urbanized area; however, if it was less than 10%, we classified it as a non-urbanized area. If the areas of residence were greater than 70% in the area of interest, we classified them as residences (e.g., single-family housing districts, apartment areas, multiunit dwellings, and mixed housing districts); however, if they were less than 30%, we classified them as non-residences (See Table 1). In the case of non-urbanized areas, we divided them into farmlands, forest lands, and rivers. The classes and sub-classes in Table 1 are pre-defined land-use classes determined by Statistics Korea [49].
When considering population, workers, and workplaces, the unit of the value is different from the others; thus, the values in UIDs must be standardized. This study used a z-value that standardized the three different measurement units. Using this standardization process, this research explained the similarities between factors (Figure 2(A4)). The formula of the z-value is calculated as follows:
Z = X m σ .
Here, Z is a standardized variable, x denotes the raw data value, m denotes the mean, and σ denotes the standard deviation (SD).
Social and economic homogeneity between UIDs can be represented as a distance in a matrix (UID-Matrix). Based on the standardized values (Table 2), we computed the means of each of the standardized UIDs and then, we calculated the geometric distance in a matrix (e.g., in the case of Alternative 1, we considered three factors: population, workers, and workplaces). We computed the three factors using the following formula:
D i j = ( p i p j ) 2 + ( e i e j ) 2 + ( c i c j ) 2 Alternative   1 .
D i j = ( e i e j ) 2 + ( c i c j ) 2 Alternative   2 .
Table 2. Mean of standardized values per UID.
p = population;
e = number of workers;
c = number of workplaces.
Here, D i j represents degrees of social and economic homogeneity between UIDs i and j in a matrix. Thus, as D i j decreases, the levels of social and economic homogeneity increase.
We used the AZTool, developed by Statistics Korea, in this study. In the AZTool, the above matrix was used to determine a weighted value after considering each of the alternatives (Table 3). It automatically merges, splits, and retains economic zones in a COA. In addition to the matrix, this study provided the following criteria (Table 4) to delimit new economic zones (i.e., census output areas).
Table 3. UID Matrix for Alternative 1.
Table 4. Indices and weighted values for six scenarios (“S” stands for scenario).
In Scenario 1.1, new economic zones were delimited by computing standardized population values in the above matrix. The given criteria in Scenario 1 were that the optimized population was 500, the lowest population was 300, and the tolerance of the population was 100. The tolerance value was used to protect overestimated values. Note that Scenarios 1.2 and 1.3 used log values, whereas Scenario 1.2 used log (number of workers + 1) × log (number of workplaces + 1). Note that the optimal value was 10 (approximate values to the mean of the new economic statistics zone), the least was 4 (close to the mean of the basic statistic zones), and the tolerance value was 2 (because there were two factors, the least value was divided by 2 [1/2]).
Scenario 1.3 computed log (population + 1) × log (number of worker + 1) × log (number of workplace + 1); thus, the optimal value was 65 (approximate values to the mean of the new economic statistics zone), the least was 15 (close to mean of the basic statistic zones), and the tolerance value was 5 (because there were three factors, the least value was divided by 2 [1/3]). Scenario 2.1 considered 200 as the optimal number of workers, 100 as the minimum number of workers, and 40 as the tolerance value. Scenario 2.2 considered 30 as the optimal number of workplaces, 10 as the minimum number of workplaces, and 7 as the tolerance value. Scenario 2.3 computed log (number of worker + 1) × log (number of workplace + 1); thus, the optimal value was 10 (approximate values to mean of the new economic statistics zone), the least was 4 (close to mean of the basic statistic zones), and the tolerance value was 2 (because there were two factors, the least value was divided by 2 [1/2]). It was important to assess social and economic homogeneity between UIDs in the matrix. To examine how homogeneity improved, this study computed the means and SDs of both the existing small census output areas and new economic zones. Furthermore, the study explored spatial patterns of the economic zones using hot spot analysis that used Getis-Ord GI*, which can determine statistically significant hot or cold spots and reveal spatial trends in the clustering of polygon features, i.e., it assesses where high values in the economic zones spatially cluster [55].
G i * = j = 1 n w i j x j X _ j = 1 n w i j j = 1 n x j 2 n ( X _ ) 2 × [ n j = 1 n w i j 2 ( j = 1 n w i j ) 2 ] n 1
X _ = j = 1 n x j n
-
Getis-Ord G i * .
When using Getis-Ord GI*, we visualized highly overcrowded areas as hot spots in a red color. The spatial trends were used to determine an optimal scenario by observing the increase or decrease of hot spots in areas of interest. This research used one method that is the default weight for Self-Potential in the Hot Spot Analysis tool in ArcMap.

4. Results

4.1. Means and SDs for Population, Workers and Workplaces.

The total number of COAs was 1085, the average population size in each of the COAs was 477.09, the SD of each COA was 145.74, and the average number of workers in each COA was 616.19. Note that the SD of the number of workers was 1824, and the mean and SD of workplaces were 62.75 and 149.16, respectively. The following table shows the mean and SD of population, workers, and workplaces.
As shown in Table 5 and Figure 3a, the mean (477.09) of the population in the existing COA was similar to the mean (465.20) in Scenario 1.1; however, this confirmed that the mean of other scenarios was lower than the mean of the existing COA and all COAs. As for the SDs, except for Scenario 1.1, all of the SDs of the other scenarios showed big differences with the mean of all of the COAs. Regarding workers (Figure 3b), the means of Scenarios 2.2 and 2.3 were lower than others. When comparing the SD of Scenario 1.1, the SDs of other scenarios were much lower than the others. As for workplaces (Figure 3c), the trend of the differences was similar to those shown in Figure 3b, i.e., the SDs of other scenarios were much lower than the means of the existing COA and Scenario 1.1. The above differences meant that the extent of social and economic homogeneity was different, and it depended on the situation.
Table 5. Means and SDs of population, workers, and workplaces in the Gangnam district.
Figure 3. Means and standard deviations (SDs) of the (a) population, (b) workers, and (c) workplaces in the Gangnam district.

4.2. Spatial Distributions of Hot Spots and Second SD

This study used hot spot analysis to explore highly overcrowded populations, workers, and workplaces of the economic zones used in the two alternatives. It was important to investigate the general spatial trend of the variables in a place. The spatial trend of the pattern was used to select an optimal scenario among the proposed alternatives. Figure 4 shows hot and cold spot trends of the population in the existing and new economic zones.
Figure 4. Choropleth maps of the population size in the existing COAs (a-1) and Scenarios 1.1 (b-1), 1.2 (c-1), and 1.3 (d-1). Hot and cold spot trends of the population size in the existing COAs (a-2), Scenarios 1.1 (b-2), 1.2 (c-2), and 1.3 (d-2).
Figure 5 shows hot and cold spot trends of the population size in Alternative 2. In the case of Scenario 2.1, the spatial trend was similar to that of Scenario 1.3, whereas Scenario 2.2 was similar to Scenario 2.1. As for Scenario 2.3, the hot spot trends occurring in Gaepo-dong, Segok-dong, and Irwon-dong (c-2) were moderated (i.e., disseminated).
Figure 5. Choropleth maps of the population size in Scenarios 2.1 (a-1), 2.2 (b-1), and 2.3 (c-1). Hot and cold spot trends of the population size in Scenarios 2.1 (a-2), 2.2 (b-2), and 2.3 (c-2).
Regarding the hot and cold spot trends of the number of workers, Alternative 1 showed that hot spot trends existed around Nonhyeon-dong and Teheran street (Figure 6(a-2)). Furthermore, the hot spot patterns expanded to Suseo station (Figure 6(d-2)), and cold spot trends were observed around Nonhyeon-dong, Dachi 4-dong, and Gaepo 4-dong. Alternative 2, associated with the number of workers, showed that hot spots occurred around Teheran street (Figure 6(d-2)) and expanded to Suseo station, whereas cold spots were observed around Cheongdam-dong, Yeoksam2-dong, Gaepo 1 and 4-dong, Teheran street, and Nohyeon 1-dong.
Figure 6. Choropleth maps of the number of workers in the existing COAs (a-1) and Scenarios 1.1 (b-1), 1.2 (c-1), and 1.3 (d-1). Hot and cold spot trends of the number of workers in the existing COAs (a-2), Scenario 1.1 (b-2), 1.2 (c-2), and 1.3 (d-2).
Alternative 2, associated with the number of workers, showed that hot spots occurred around Teheran street and expanded to Suseo station, whereas cold spots were observed around Cheongdam-dong, Yeoksam2-dong, Gaepo 1 and 4-dong, Teheran street, and Nohyeon 1-dong (Figure 7).
Figure 7. Choropleth maps of the number of workers in Scenarios 2.1 (a-1), 2.2 (b-1), and 2.3 (c-1). Hot and cold spot trends of the number of workers in Scenarios 2.1 (a-2), 2.2 (b-2), and 2.3 (c-2).
Regarding the hot and cold spot trends for the number of workplaces in the existing COAs (a-2), a high number of workplaces were clustered near Teheran street, Nohyeon-dong, and Suseo Stations (Figure 8). As for Scenario 1.1, it was obvious that a high number of workplaces were clustered in Teheran street, Nohyeon-dong, and Suseo Stations; however, cold spots were observed around Yeocksam 2-dong and Irwon-dong. Interestingly, hot spot patterns were fewer than in Scenarios 1.1 to 1.3, particularly on Teheran street; however, in Scenarios 1.2 and 1.3, hot spot zones were observed around Suseo station and the patterns were sporadic.
Figure 8. Choropleth maps of the number of workplaces in the existing COAs (a-1) and Scenarios 1.1 (b-1), 1.2 (c-1), and 1.3 (d-1). Hot and cold spot trends of the number of workplaces in the existing COAs (a-2), Scenarios 1.1 (b-2), 1.2 (c-2), and 1.3 (d-2).
Figure 9 shows hot spot trends still existed in areas near Teheran Street, Daechi 1-dong, and Suseo station, whereas cold spot trends were observed in Dogok-dong, Chungdam-dong, and Segok-dong. Overall, the hot spot trend decreased
Figure 9. Choropleth maps of the number of workplaces in Scenarios 2.1 (a-1), 2.2 (b-1), and 2.3 (c-1). Hot and cold spot trends of the number of workplaces in Scenarios 2.1 (a-2), 2.2 (b-2), and 2.3 (c-2).

4.3. Second SDs of Population, Worker and Workplace

The following three tables show the means of the SDs for population, workers, and workplaces. The number from the existing census output areas (i.e., COAs) with the population was 59; however, the numbers were higher in the three scenarios. Scenario 1.3 had the highest value (1070.83) and Alternative 2 showed a similar pattern (Table 6); however, Scenario 2.1 showed the highest value (1164.98).
Table 6. Second SDs of the Gangnam district—Population.
Regarding second SDs for workers, the number of the existing census output areas was 114, but the numbers increased in Scenarios 1 and 2 (Table 7). Scenario 1.2 showed the highest number of economic zones (369). As for Alternative 2, Scenario 2.1 had the highest number of economic zones (829.62).
Table 7. Second SDs of the Gangnam district—Workers.
Regarding second SDs for workplaces, the number from the existing census output areas was 149, but the numbers increased in Scenarios 1 and 2 (Table 8). Scenario 2.2 showed the highest number of economic zones (320). As for Alternative 2, Scenario 2.2 had the highest number of economic zones (339).
Table 8. Second SDs of the Gangnam district—Workplaces.

4.4. Examples of the Number of Workplaces in the Existing COAs and Two Alternatives

Figure 10 shows an example of the number of workplaces in Nonhyeon-dong in the Gangnam district. As shown in Figure 10, there were 1105 workplaces in the existing basic statistic zone; however, there were 938 workplaces in Scenario 1.1. Both Scenarios 1.2 and 1.3 provided more segmented values. This protected the statistical information that might have been overestimated or underestimated in the same district or the economic zones.
Figure 10. The number of workplaces in the existing COAs and Scenarios 1.1, 1.2, and 1.3.
In the case of Alternative 2 (Figure 11), the total number of the workplaces was similar; nevertheless, each of the outputs focused on just the number of workers, the number of workplaces, and a combination of workers and workplaces. Eventually, new economic zones were delimited through the algorithm proposed in this research.
Figure 11. The number of workplaces in Scenarios 2.1, 2.2, and 2.3.

5. Discussion

Despite efforts to research the transformation of the urban structure, difficulties remain in estimating credible statistical information in the census output areas. Furthermore, there are a variety of theories and computational methods that delimitate a hierarchy of census geographic units or economic zones. Even though the methods consider the homogeneity of the geographic area to the aggregate zones, the size of the zones will vary [6,7]. Because each country used a different hierarchy of census geographic units and economic places or zones, it may result in different statistical summaries in the same geographic scale. As a result, the number of summaries can be often overestimated or underestimated. However, the existing census output areas in a hierarchy of census geographic units do not reflect the reality of the current spatial urban structure, particularly in providing business-relevant information. Thus, urban planners should consider all possible scenarios that estimate reliable statistical information in the existing census output areas and even create new economic census output areas.
As noted in this paper, we first evaluated the statistical summaries in the existing census output areas, and secondly, we considered social and economic homogeneities that contained a combination of populations, workers, and workplaces. Specifically, three scenarios in Alternative 1 considered population, number of workers, and number of workplaces in a matrix. Another three scenarios in Alternative 2 referred to only the number of workers and workplaces in a matrix. Each of the scenarios had three scenarios. Furthermore, this study assessed the degrees of social and economic homogeneity using the SDs (standard deviations) in the six scenarios. Because the means of different sizes of economic zones showed big differences, it was difficult to compare the differences of newly aggregated economic zones. Thus, we used SDs to investigate the degrees of social and economic homogeneity and monitor the trends of social and economic homogeneity at a given location.
Comparing the SDs of the existing census output areas with the SDs of new economic census zones, the results indicated that the extent of social and economic homogeneity was different, and it depended on the situation given. In other words, when the SD was lower than those of the other scenarios, it meant that the scenario with the lower value improved social and economic homogeneity. As a result, given an urban planning or a specific project, it meant that we could choose one of the scenarios with the lower SD values. We also developed an algorithm to newly aggregate new economic zones, including the existing basic unit districts. In addition to the UID matrix and AZP-based algorithms, new economic zones (census output areas) were delimited for central business districts. Consequently, we selected Scenarios 1.3 and 2.3 as the best working scenarios. Specifically, in the case of the Gangnam district, the SD of the population increased, whereas the SDs of workers and workplaces decreased, indicating that socioeconomic homogeneity improved.
When exploring the hot and cold spot trends, Scenarios 1.3 (a combination of the population and the number of workplaces) and 2.3 (a combination of the number of workers and workplaces) showed that hot and cold spot trends decreased compared with the spatial patterns of the existing census output areas, indicating that the overcrowded patterns (overestimated) in the existing census output areas were lesser in Scenarios 1.3 and 2.3. Therefore, Scenario 1.3 showed that the SDs for the number of workplaces were lesser, suggesting that the degree of homogeneity between economic zones of Scenarios 1.3 was high. Thus, we selected Scenario 1.3 as the optimal scenario to delimit new economic zones. In the case of Scenario 2.3, because the SDs for the number of workplaces were less, this scenario was suitable for areas with a high number of workplaces, such as Gangnam. Compared with the existing census output areas, the district should consider two factors (workers and workplaces) regarding the population (Scenarios 1.3 and 2.3). The two alternatives can make the results more meaningful. Accordingly, this work includes the following key contributions: (1) a new method can be used to determine new economic census zones through scenario-based approaches, and it is crucial for urban planners to directly apply this method to new urban planning; (2) researchers are able to scrutinize urban transformations over time where economic activities frequently occur; (3) even the general public can use reliable statistical summaries from the newly aggregated census output areas; and (4) eventually, policymakers or public officers can provide more accurate national statistical summaries, particularly those related to statistical geographic information services. After completing this research, the outputs of this research were validated by the research group at the Department of Geospatial Information Service Division of Statistics Korea [56]. Based on the outputs of this research, Statistics Koreas is currently working on developing new census output areas to serve the public with credible statistical summaries in economic zones.
Even though this research produces reliable statistical outputs in new economic zones, there are limitations to this research. The data used in this research are census output data including population, the number of workers and businesses, transportation, households, property values, and more that are officially surveyed by the National Statistics office in South Korea. For research purposes, we were able to use the smallest geographic unit, which is the BUD. However, the BUDs also need to be re-aggregated when an urban structure transforms over time. Additionally, statistical summaries in the BUD often provide very detailed information. For example, only one or two employees existed in the economic zone. However, the BUDs are not publicly available because many of the zones have lower numbers, even one or two employees. This will unintentionally share personal information with the public. Thus, we also need to consider ways to protect personal information at the smallest census output area (i.e., economic zone).
In sum, this research is significant for two reasons. First, our findings suggest that it is essential to consider the population and the number of workplaces and a combination of the number of workers and workplaces, particularly for central business districts. The numbers should be periodically recalculated when the urban structure has transformed. It will help other researchers who study the transformations of the urban structure because they can use more reliable statistical information for their simulation models that predict an urban structure. Second, as Pan and Deal suggested [17], it is important for the urban planner to use scenario-driven exercises. Through the analysis procedures and algorithms proposed in this research, we could easily apply a variety of combinations with socioeconomic homogeneity on purpose. Accordingly, Statistics Korea can use the results to serve reliable statistical information in an economic zone and for the development of a sustainable economy and high-quality spatial data.

6. Conclusions

In the era of big data, realistic and robust statistical data are seen as critical. This will help us gather more individualized and sophisticated data and applications for sustainable urban development. We note that this research was focused on proposing two main alternatives to find the best scenarios that can provide reliable statistical information where economic activity occurs. Moreover, we developed a model to aggregate new economic zones based on the proposed scenarios. Consequently, we identified Scenarios 1.3 and 2.3 as the best working scenarios for the study, and the National Statistical Offices can provide more accurate statistical information or summaries of workers and workplaces at central business districts. We contend that the resultant outputs can help urban planners to evaluate the existing economic zones that improve the usability of the national statistics data. Eventually, the results can be connected to the national geographic information service. As stated in the Discussion, we will next develop a masking tool that can reasonably hide or protect personal information in the smallest economic zones and maintain the consistency and accuracy of statistical information in the economic zone. Furthermore, we will also forecast changes in the urban structure using the credible statistical information in the new economic zones.

Author Contributions

C.S.H. had developed the theory and S.-Y.H. and T.H. performed the computation, in addition to verifying the analytical methods and writing this paper. B.Y. had participated in this research project to design, review, assess, and write this paper. All authors have read and agreed to the published version of the manuscript.

Funding

The research received no external funding and the APC was funded by the Dongguk University Research fund of 2019.

Acknowledgments

This research was supported from the International Scholar Program at Kyung Hee University during the Fall Semester of 2018.

Conflicts of Interest

No potential conflict of interest is reported by the author.

References

  1. United Nations Headquarters. United Nations Summit on Sustainable Development 2015, Informal Summary; United Nations: New York, NY, USA, 2015. [Google Scholar]
  2. Independent Group of Scientists. The Future Is Now—Science for Achieving Sustainable Development; United Nations: New York, NY, USA, 2019. [Google Scholar]
  3. Giuliano, G.; Small, K.A. Subcenters in the Los Angeles region. Reg. Sci. Urban Econ. 1991, 21, 163–182. [Google Scholar] [CrossRef]
  4. Combes, P.-P. Economic Structure and Local Growth: France, 1984–1993. J. Urban Econ. 2000, 47, 329–355. [Google Scholar] [CrossRef]
  5. Combes, P.-P.; Magnac, T.; Robin, J.-M. The dynamics of local employment in France. J. Urban Econ. 2004, 56, 217–243. [Google Scholar] [CrossRef]
  6. McMillen, D.P. The return of centralization to Chicago: Using repeat sales to identify changes in house price distance gradients. Reg. Sci. Urban Econ. 2003, 33, 287–304. [Google Scholar] [CrossRef]
  7. Duranton, G.; Puga, D. Micro-foundations of urban agglomeration economies. In Handbook of Regional and Urban Economics; Elsevier: Amsterdam, The Netherlands, 2004; Volume 4, pp. 2063–2117. [Google Scholar]
  8. Stern, D.I. Historical path-dependence of the urban population density gradient. Ann. Reg. Sci. 1993, 27, 259–283. [Google Scholar] [CrossRef]
  9. McMillen, D.P.; Lester, T.W. Evolving subcenters: Employment and population densities in Chicago, 1970–2020. J. Hous. Econ. 2003, 12, 60–81. [Google Scholar] [CrossRef]
  10. McMillen, D.P. Employment Densities, Spatial Autocorrelation, and Subcenters in Large Metropolitan Areas. J. Reg. Sci. 2004, 44, 225–244. [Google Scholar] [CrossRef]
  11. Glaeser, E.L.; Kahn, M.E. Sprawl and urban growth. In Handbook of Regional and Urban Economics; Elsevier: Amsterdam, The Netherlands, 2004; Volume 4, pp. 2481–2527. [Google Scholar]
  12. Muñiz, I.; Garcia-López, M.-Á.; Galindo, A. The Effect of Employment Sub-centres on Population Density in Barcelona. Urban Stud. 2008, 45, 627–649. [Google Scholar] [CrossRef]
  13. Gilli, F. Sprawl or Reagglomeration? The Dynamics of Employment Deconcentration and Industrial Transformation in Greater Paris. Urban Stud. 2009, 46, 1385–1420. [Google Scholar] [CrossRef]
  14. Cervero, R. America’s Suburban Centers: The Land Use–Transportation Link; Unwin Hyman: Boston, MA, USA, 1989. [Google Scholar]
  15. Cavailhès, J.; Gaigne, C.; Thisse, J.-F. Trade Costs Versus Urban Costs: Do Jobs Move to the Suburbs or to The Sticks? Centre for Economic Policy Research (CEPR) Discussion Paper; Université catholique de Louvain: Ottignies-Louvain-la-Neuve, Belgium, 2004; Volume 4440, pp. 1–34. [Google Scholar]
  16. Yang, T.; Pan, H.; Hewings, G.J.D.; Jin, Y. Understanding urban sub-centers with heterogeneity in agglomeration economies—Where do emerging commercial establishments locate? Cities 2019, 86, 25–36. [Google Scholar] [CrossRef]
  17. Pan, H.; Deal, B. Reporting on the Performance and Usability of Planning Support Systems—Towards a Common Understanding. Appl. Spat. Anal. Policy 2019, 13, 137–159. [Google Scholar] [CrossRef]
  18. Kim, J.H. Linking Land Use Planning and Regulation to Economic Development: A Literature Review. J. Plan. Lit. 2010, 26, 35–47. [Google Scholar] [CrossRef]
  19. Deal, B.; Pan, H.; Pallathucheril, V.; Fulton, G. Urban Resilience and Planning Support Systems: The Need for Sentience. J. Urban Technol. 2017, 24, 29–45. [Google Scholar] [CrossRef]
  20. Costanza, R. Model goodness of fit: A multiple resolution procedure. Ecol. Model. 1989, 47, 199–215. [Google Scholar] [CrossRef]
  21. Wu, F. Calibration of stochastic cellular automata: The application to rural-urban land conversions. Int. J. Geogr. Inf. Sci. 2002, 16, 795–818. [Google Scholar] [CrossRef]
  22. Pontius, J.R.G.; Peethambaram, S.; Castella, J.-C. Comparison of Three Maps at Multiple Resolutions: A Case Study of Land Change Simulation in Cho Don District, Vietnam. Ann. Assoc. Am. Geogr. 2011, 101, 45–62. [Google Scholar] [CrossRef]
  23. Statistics Korea. Planning a Standard System for Spatial Statistics Information; Statistics Korea: Daejeon, Korea, 2008.
  24. Fujita, M.; Ogawa, H. Multiple equilibria and structural transition of non-monocentric urban configurations. Reg. Sci. Urban Econ. 1982, 12, 161–196. [Google Scholar] [CrossRef]
  25. Boarnet, M.G. The Monocentric Model and Employment Location. J. Urban Econ. 1994, 36, 79–97. [Google Scholar] [CrossRef]
  26. Sivitanidou, R. Do Office–Commercial Firms Value Access to Service Employment Centers? A Hedonic Value Analysis within Polycentric Los Angeles. J. Urban Econ. 1996, 40, 125–149. [Google Scholar] [CrossRef]
  27. Bogart, W.T.; Ferry, W.C. Employment Centres in Greater Cleveland: Evidence of Evolution in a Formerly Monocentric City. Urban Stud. 1999, 36, 2099–2110. [Google Scholar] [CrossRef]
  28. Anderson, N.B.; Bogart, W.T. The Structure of Sprawl: Identifying and Characterizing Employment Centers in Polycentric Metropolitan Areas. Am. J. Econ. Sociol. 2001, 60, 147–169. [Google Scholar] [CrossRef]
  29. Hewings, G.J.; Sonis, M.; Guo, J.; Israilevich, P.R.; Schindler, G.R. The hollowing-out process in the Chicago economy, 1975–2011. Geogr. Anal. 1998, 30, 217–233. [Google Scholar] [CrossRef]
  30. USA General Services Administration. Economic Places. 2017. Available online: https://catalog.data.gov/dataset/economic-places (accessed on 13 June 2018).
  31. United State Census Bureau. Introduction to Geographies Tutorial. 2019. Available online: https://factfinder.census.gov/help/en/tutorials/introduction_to_geographies_tutorial.htm (accessed on 20 August 2018).
  32. Martin, D.; Cockings, S.; Harfoot, A. Development of a geographical framework for census workplace data. J. R. Stat. Soc. Ser. A 2012, 176, 585–602. [Google Scholar] [CrossRef]
  33. Cockings, S.; Harfoot, A.; Martin, D.; Hornby, D. Maintaining Existing Zoning Systems Using Automated Zone-Design Techniques: Methods for Creating the 2011 Census Output Geographies for England and Wales. Environ. Plan. A 2011, 43, 2399–2418. [Google Scholar] [CrossRef]
  34. United State Census Bureau. Center for Statistical Research and Methodology. 2018. Available online: https://www.census.gov/srd/csrm/SmallArea.html (accessed on 1 March 2019).
  35. Cockings, S.; Martin, D.; Harfoot, A. A Classification of Workplace Zones for the England and Wales (COWZ-EW); University of Southampton: Southampton, UK, 2015. [Google Scholar]
  36. Statistics Norway (SN). Classification of Economic Regions. 2000. Available online: https://www.ssb.no/a/english/publikasjoner/pdf/nos_c634_en/nos_c634_en.pdf (accessed on 10 December 2018).
  37. Statistics Canada. Economic Region. 2018. Available online: https://www150.statcan.gc.ca/n1/pub/92-195-x/2011001/geo/er-re/er-re-eng.htm (accessed on 20 December 2018).
  38. Hu, L.; Sun, T.; Wang, L. Evolving urban spatial structure and commuting patterns: A case study of Beijing, China. Transp. Res. Part D Transp. Environ. 2018, 59, 11–22. [Google Scholar] [CrossRef]
  39. Czepkiewicz, M.; Ottelin, J.; Ala-Mantila, S.; Heinonen, J.; Hasanzadeh, K.; Kytta, M. Urban structural and socioeconomic effects on local, national and international travel patterns and greenhouse gas emissions of young adults. J. Transp. Geogr. 2018, 68, 130–141. [Google Scholar] [CrossRef]
  40. Kang, Y.; Jang, S. Revision of delimitation of census output areas in Korea considering dwelling characteristics. J. Korean Cartogr. Assoc. 2008, 8, 47–57. [Google Scholar]
  41. Yang, B. GIS based 3-D landscape visualization for promoting citizen’s awareness of coastal hazard scenarios in flood prone tourism towns. Appl. Geogr. 2016, 76, 85–97. [Google Scholar] [CrossRef]
  42. Li, X.; Yeh, A.G.-O. Data mining of cellular automata’s transition rules. Int. J. Geogr. Inf. Sci. 2004, 18, 723–744. [Google Scholar] [CrossRef]
  43. Li, X.; Yang, Q.; Liu, X. Discovering and evaluating urban signatures for simulating compact development using cellular automata. Landsc. Urban Plan. 2008, 86, 177–186. [Google Scholar] [CrossRef]
  44. Li, X.; Gong, P.; Yu, L.; Hu, T. A segment derived patch-based logistic cellular automata for urban growth modeling with heuristic rules. Comput. Environ. Urban Syst. 2017, 65, 140–149. [Google Scholar] [CrossRef]
  45. De Almeida, C.M.; Monteiro, A.M.V.; Câmara, G.; Soares-Filho, B.S.; Cerqueira, G.C.; Pennachin, C.L.; Batty, M. GIS and remote sensing as tools for the simulation of urban land-use change. J. Geogr. Inf. Sci. 2005, 26, 759–774. [Google Scholar] [CrossRef]
  46. Al-Kheder, S.; Wang, J.; Shan, J. Fuzzy inference guided cellular automata urban-growth modelling using multi-temporal satellite images. Int. J. Geogr. Inf. Sci. 2008, 22, 1271–1293. [Google Scholar] [CrossRef]
  47. Wikipedia. Gangnam District. 2020. Available online: https://en.wikipedia.org/wiki/Gangnam_District (accessed on 3 October 2019).
  48. Statistics Korea. Interactive Statistical Map. 2017. Available online: https://sgis.kostat.go.kr/jsp/english/interactive.jsp (accessed on 1 December 2018).
  49. Statistics Korea. Development of Spatial Statistics Information Data Standard; Statistics Korea: Daejeon, Korea, 2014.
  50. Openshaw, D.K.; Thomas, D.L.; Rollins, B.C. Adolescent self-esteem: A multidimensional perspective. J. Early Adolesc. 1981, 1, 273–282. [Google Scholar] [CrossRef]
  51. Openshaw, S. The Modifiable Areal Unit Problem; Concepts and Techniques in Modern Geography; Geo Book: Norwich, UK, 1983; Volume 38, pp. 1–40. [Google Scholar]
  52. Parenteau, M.-P.; Sawada, M. The modifiable areal unit problem (MAUP) in the relationship between exposure to NO2 and respiratory health. Int. J. Health Geogr. 2011, 10, 58. [Google Scholar] [CrossRef] [PubMed]
  53. Stafford, M.; Duke-Williams, O.; Shelton, N. Small area inequalities in health: Are we underestimating them? Soc. Sci. Med. 2008, 67, 891–899. [Google Scholar] [CrossRef] [PubMed]
  54. Cockings, S.; Martin, D. Zone design for environment and health studies using pre-aggregated data. Soc. Sci. Med. 2005, 60, 2729–2742. [Google Scholar] [CrossRef] [PubMed]
  55. Anselin, L. Local Indicators of Spatial Association-LISA. Geogr. Anal. 2010, 27, 93–115. [Google Scholar] [CrossRef]
  56. Shin, W.R. Development trend and current status of the national statistics service. In Proceedings of the 9th National Statistical Methodology Symposium, Sejong, Korea, 23–24 October 2019. [Google Scholar]

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.