4.1. User Groups and Travel Statistics
Transit card users are classified into 10 groups; however, only four groups constitute the majority of transit users. From largest to smallest, the four groups are regular users, middle- and high-school students, seniors, and card holders with disabilities. Here, the regular group refers to the card users that do not belong to any other nine groups. Since around 99% of the total transit card users are from these four groups, we focused on analyzing their mobility patterns.
Table 1 presents the number of unique card IDs, the number of trip transactions made, and the daily travel time of user groups. The number of card IDs is not necessarily equal to the number of card users, because there may be one-time cards used during a week—however, we treated each unique ID as one traveler. We can also find out that the number of transactions are basically proportionate to the number of card IDs. AFC data also showed that seniors’ and people with disabilities’ daily travel times are usually longer than the other two groups.
Table 2 shows the travel characteristics of the top four groups of users. Within one week, most users across all population groups tend to travel more frequently on weekdays compared to weekends. If they swipe their cards during weekends as often as weekdays, the weekday/weekend ratio should be close to 2.5. However, the ratios for all four groups were much higher than 2.5. Compared to the regular users, students travel more often on weekends.
Modal preferences (
Table 3) vary across population groups. Regular riders have similar shares for traveling on buses and metro trains. However, students prefer buses more since buses are ideal for short distance traveling, and students usually live near the schools. On the other hand, seniors and passengers with disabilities usually prefer metros when traveling within the city since the metro is free for seniors and people with disabilities.
For weekdays, we defined 6:00 a.m. to 10:00 a.m. as morning peak hours (MP), 4:00 p.m. to 8:00 p.m. as afternoon peak hours (AP), and all remaining time as non-peak hours (NP). We also defined a trip of a user as the combination of transactions between his or her origin and the final destination. The reason is that sometimes a passenger transfers between modes in order to reach his or her final destination, but the transactions of this trip will be recorded separately. A transfer needs to occur within 30 min, and there is little to no charge for transfers.
As shown in
Table 4, only around 30% of trips of regular and student groups are made during non-peak hours. However, the percentage is higher for seniors and people with disabilities. The transfer time is the time spent to transfer between routes, modes, or stations. It is usually higher during non-peak hours since the headway of the bus and metro is shorter. Students have overall the shortest travel time among all groups. They spend less time on buses and the metro when traveling. Compared to seniors and the people with disabilities groups, regular users have a slightly lower travel time over all times of day.
In Seoul, students pay a discounted fare on both modes of transportation. For seniors and people with disabilities, taking the metro is free but taking the bus is not. This is one of the reasons why the metro is highly popular among these two population groups. We also found that the travel distance for all groups were similar during peak and non-peak hours, since the travel time and travel cost did not change much with the time changes.
The data shows that seniors and individuals with disabilities usually choose routes with fewer transfers (see
Table 4). Regular users and students, on average, transfer twice as often as the other two groups. In terms of the time per transfer, we can see that the senior group has longer transfers every time, sometimes even twice as long as the regular and student groups.
We also measured destination variety and regularity (the last two columns of
Table 4) to understand mobility patterns. Destination variety is defined as the number of unique destinations a traveler visited during a time period. We found that the variety was decreasing in the order of the regular, student, and people-with-disabilities groups. Destination variety is higher during weekdays than weekends. For weekends, this variety is much lower on Saturdays. On an hourly scale, we found the variety is much higher during 5:00–9:00 a.m. on weekdays. The groups with the highest variety during peak hours are usually the regular and student groups. However, the senior group has a high destination variety during non-peak hours.
Destination regularity measures how regularly a passenger visits his/her frequent destinations during each hour. Here, “frequent destination” is defined as the most-frequently-visited destination for this traveler during the same hour for all weekdays.
The set of all the destinations traveler
t visited during hour
h is denoted as
, for all weekdays. The most-visited destination during hour
h for traveler
t is denoted as
.
is denoted as a function to calculate the number of records in the AFC database, which indicates that traveler
t travels to destination
during hour
h. Then, destination regularity (
) for traveler
t during hour
h will be calculated as:
For each traveler t, we used the average value to represent his/her daily, morning-peak-hour (MP), afternoon-peak-hour (AP), and non-peak-hour (NP) destination regularity. Thus, hour set for these four time periods is: or or or . A higher destination regularity value indicates a more regular and predictable pattern of their daily/MP/AP/NP travel activity. For the regular and student user groups, the destination regularity value is low during the middle of the day. However, destination regularity for the other two groups is relatively higher, especially during peak hours.
4.3. Spatial Patterns of Human Mobility
Figure 2 presents the spatial distributions of potential home, work, and other activity locations. Most of the potential home locations are concentrated in the northeast and southwest urban fringe areas (a,d). These two clusters are more preferred by seniors and users with disabilities as their home locations (g,j). However, the work locations for regular users are agglomerated near central city areas of Seoul (b). School locations are basically scattered all over the city (e). Daily routine places for seniors and people with disabilities are clustered near home as well as the central city areas (h,k). The distribution of other activity locations is similar among the four groups, although the purpose of these trips are various. These places are normally located between home and work locations (c,f,i,l).
The radius of gyration measures the standard deviation of distances between users’ traveled stations and stations’ centers of mass. It measures the frequency and distance of a user’s mobility. A high radius of gyration value indicates that the user moves longer distances, while a low value of radius of gyration shows a short range of movements. It is defined as
in which
n is the number of stations a user travels, and
is the distance between station
and center of mass
.
Figure 3 shows that, on average, the radius of gyration for students is significantly shorter than that of the other three groups. For seniors and riders with disabilities, the radius of gyration is not as affected by their identity, compared to the regular group of users.
The number of unique destinations is also a spatial measurement of human mobility (
Figure 4). Overall, the frequency distribution of the “number of distinct destinations visited” by each age group shows a power-law pattern. Most of the public transit users only visit two to four destinations per week. The average destination variety is various among different age groups. It is decreasing in the order of regular, student, senior, and people-with-disabilities-population groups, but the difference is not obvious.
Destination-regularity measures the hourly percentages of the most-visited destinations (
Figure 5). The frequency graph shows that most of the travelers have a high repeated pattern of destinations, and most of the regularity is between 0.8 and 1. Their most-visited places are more than 80% of all the destinations they have visited according to their travel history. People with disabilities, seniors, and students have higher destination regularity compared with the regular smart-card users.
4.4. Temporal Patterns of Human Mobility
Figure 6 shows the temporal mobility patterns of transit riders. As shown in
Figure 6a, the distributions of stay time at home for different user groups are usually different. For regular users, students, and people with disabilities, there is a peak around 12 h, indicating a typical stay time at home for these users. Seniors have a longer typical stay time at home. All user groups have peaks at a very short time in their stay time distributions. These 1 to 2 h could represent lunch breaks or the breaks between arriving home in the afternoon and leaving for recreational, social, shopping, or extracurricular activities (for student groups) purposes in the evening.
The differences of stay times at work locations are even greater across groups (see
Figure 6b). Regular users and students usually stay at work places or schools for 9–10 h. However, most of the seniors will stay at their daily routine places for 2–3 h per day. People with disabilities are unique because the distribution has two peaks in terms of the stay time. Nearly half of them stay at work locations for 10 h, but the other half only leave home for less than one hour per day. If a card user with disabilities has a job, this individual probably will stay at his/her workplace around 10 h, just like a regular user. However, if a card user with disabilities does not have a job, he/she will tend to make short trips during the day.
Figure 6c shows that all groups usually spend less than four hours for other activities per day.
In
Figure 7a, we observe that regular users, seniors, and people with disabilities usually take about 10 more minutes than students to travel between their homes and workplaces. This information implies that schools are usually close to home compared to work places.
Students also spend less time traveling between places other than home and schools, as shown in
Figure 7b. The reason is that students often do not go too far away from home and schools. Unlike other adult groups, the daily activity region of students is much smaller. It can also be confirmed from the results of comparing the radius of gyration across groups.
Figure 8 shows that the distribution of stay time varies based on the time of day. Usually, from morning to afternoon, the earlier a user reaches home, the longer the stay time is. This pattern is reversed from afternoon to midnight. From morning to afternoon, the later a user arrives at a workplace, the shorter the stay time is. Users will stay less than 4 h in other activity locations during non-peak hours. This changing pattern by time of day is applicable for all user groups.
4.5. Heterogeneity among Mobility Patterns
Although we could observe the differences of travel behaviors through the above distribution figures, it is still necessary to measure and compare them statistically. In order to measure the heterogeneity of stay time, travel time and radius of gyration distributions across various smart card user groups, the best fit of distributions needs to be decided and the parameters need to be estimated and compared. Previous studies indicate that most of the displacements and pauses follow power-law, exponential, or log-normal distributions [
32]. By comparing, we found that the log-normal distribution had the highest log-likelihood and the lowest Akaike Information Criterion (AIC) value. HA hgher log-likelihood is usually preferred when using the maximum-likelihood technique to fit the data with statistical models. AIC is a measurement of the relative quality of a model. The one with the minimum AIC value is normally the best model among candidate models. It has been shown that a log-normal distribution is a statistically better fit for describing the distributions of stay time, travel time, and radius of gyration for all population groups.
As shown in
Table 5, for the stay time at home, all four groups had similar estimated parameters. The only exception is group 6 (seniors), which had the largest mean and standard deviation. It means that the senior group tend to stay longer at home, which could be confirmed by observing
Figure 6a. For the stay time at workplaces, regular and student groups shared similar estimated parameters. Seniors and people with disabilities usually stay less time at their “workplaces,” and this is also true if we check
Figure 6b. For the stay time and other activity locations, there are still two type of patterns: regular/student groups and senior/people with disability groups (also see
Figure 6c). Regular/student groups have a longer stay time but a lower standard deviation, while senior/people-with-disabilities groups have a shorter stay time but higher standard deviation. Nevertheless, the heterogeneity exists across groups in terms of the stay time at various important daily locations.
Except for the student group, all three other groups shared a similar distribution for travel time between commuting locations. Students usually traveled closer to home compared to the other groups. The mean of the distribution was smaller, but the standard deviation was higher. This behavior can also be observed from
Figure 7. The distribution of the radius of gyration for the student group also stands out in the four groups. It had a smaller mean and a higher standard deviation.
We present the results of the Kolmogorov–Smirnov test for pair-wise comparison of distribution. The Kolmogorov–Smirnov test is a non-parametric test of the equality of two distributions. All of the results showed
p-values that are less than 0.001. This data mean that, statistically, none of all the pair-wise comparison of two distributions can be stated as equality (
Table 6).