Analyzing Mobility Patterns at Scale in Pandemic Scenarios Leveraging the Mobile Network Ecosystem
Abstract
:1. Introduction
- Cellular Network Information: Mobile operators register the specific cell to which the smartphone is connected, providing a geographical footprint at the network cell level. Note that companies different from mobile operators can also generate maps of base station locations and register the cell id information on the device.
- GPS Information: OS providers (for example, Google or Apple) as well as mobile app developers (with the permission of the user) can access the GPS location information. This allows to generate a geographical footprint at GPS-level.
- WiFi Signal Information: There are both public and proprietary maps of the location of existing WiFi SSIDs. Using these maps, OS providers and/or mobile app developers, who have access to the WiFI SSIDs within range of the smartphone can use these data to generate a geographical footprint of the device.
- We discuss the alternative solutions for measuring human mobility using network data.
- We leverage data from Predicio to showcase how this type of accessible data can be used to analyze citizens’ mobility patterns at different scales during the COVID-19 pandemic. We focus our analysis on two of the most affected countries: Spain and Italy. For this analysis, we leverage two state-of-the-art methodologies to (i) showcase how data coming from mobile terminals and mobile networks can be used in the context of a pandemic and (ii) discuss possible outcomes of this analysis.
2. Measurement Alternatives and Recent Efforts
2.1. Infrastructure-Based and Passive Probes
2.2. Terminal- and User-Based Solutions
- Added-value service through OS frameworks: Mobile OSes usually consider positioning information as one of the pieces of information available in the mobile terminal, and major mobile OSes offer application developers unified ways to gather such information.For instance, Android has offered the Location Framework API since its inception in version 1 (released in 2008), seeking to integrate location-based services into their applications. However, as technology evolves, as well as the demands for precision, efficiency, and feature diversity, Android now advocates for the adoption of its advanced Awareness API [16]. This API enables better performance in terms of accuracy, battery utilization and feature richness, and it requires the user’s exact location to be sent and processed in the cloud. A parallel evolution can be observed within Apple’s ecosystem, where their location framework, introduced alongside iOS version 2.0, has similarly undergone transformative advancements. While OS providers claim that all the processing is performed with the maximum privacy standards, and offer increased transparency to the users, they can still use it to provide information such as the Google [7] and Apple [6] COVID-19 Mobility reports.
- Digital ads’ bid requests: An increasingly important aspect of the mobile ecosystem is the one related to advertising. Ad providers track and characterize user profiles to offer the most well-suited advertisements and increase the click ratio. Advertisers create profiles out of each user’s individual behavior, and users’ location is an important feature of users’ profiles. Usually, advertisers rely on products such as GeoIP databases to gather location knowledge out of end-user activities.However, the need for an increasingly more precise advertising characterization is forcing ad providers to gather location information (usually available within apps or browsers) and enrich the profiling with this information.Thus, advertising stakeholders have a relatively good picture of user mobility, obtained by analyzing the location information.
- Geolocation SDKs: Another technology currently used for tracking positioning information from users is Geolocation SDKs. Such third-party libraries provide app developers with an alternative way of monetizing their user engagement by including into their codebase the functionality needed to fetch the users’ current position, typically with GPS granularity, and upload them to the cloud.Once there, these third-party providers, a.k.a, location data providers, analyze this information to provide location intelligence analytics or directly sell anonymized data to businesses or researchers for other kinds of analysis. This kind of dataset is the one we use in Section 3 and Section 4 for our analysis.
2.3. Trade-Offs
- Granularity: This is possibly the most critical factor, particularly from a spatial perspective. While infrastructure-based solutions are currently limited to tracking at the cell level, user-based methods can precisely pinpoint the location of each user at any point in time. Without relying on costly triangulation techniques, user-based approaches offer spatial resolutions ranging from tens to hundreds of meters, enabling highly accurate location tracking.Also, time granularity may play a role. Due to the high pervasiveness of cellular technology, logging user mobility at scale is a challenging task that can generate huge amounts of data. Thus, such reporting (especially if performed at the network core) can be aggregated in batches to increase scalability with a price in terms of time resolution. This aspect is particularly important for passive measurements, which may have many events to be logged. Still, it also has relative importance for the user-based solutions. However, the intrinsically human nature of the interaction (i.e., usually events are recorded upon a human-to-device input) makes this factor less important.
- Accuracy: Infrastructure-based solutions are very accurate. Location errors are negligible due to the difficulty of spoofing the point of attachment to the network. User-based solutions, instead, offer a very heterogeneous accuracy level. Firstly, they depend on the quality of the GPS signal, which may be bad indoors. Then, such location reporting can be biased by using mock locations or even with GPS spoofing techniques. Still, such errors could be mitigated during data processing, or improved by using other terminal information such as the one coming from the WiFi/Bluetooth network scans.
- Pervasiveness: Infrastructure-based solutions monitor the entire population in a developed country, as the penetration rate is beyond 100%. By joining the major telco operators in a country, a trustworthy picture of the mobility patterns can be achieved, although with the limitations in granularity discussed above.Similar considerations may apply to OS frameworks, as the major developers also have the totality of the market share, although joining the databases may be challenging. The other solutions (ads or SDKs) have a lower pervasiveness, as they depend on the number of installations that mobile apps that embed the SDK obtain, or the number of visits that pages that show advertisements obtain. Finally, WiFi/Bluetooth based solutions are limited to the areas they can cover.
- Completeness: Infrastructure-based measurements obtain the full view of user mobility practically at any point in time if the mobile terminal is switched on, as control messages are sent frequently enough to provide a continuous view of the mobility trends (at the price of a large amount of generated data). User-based solutions, instead, heavily depend on the interactions between the human and the device. The trajectories generated by these techniques are more event-based (e.g., check-ins) rather than a continuous flow, although frameworks that are deeply embedded into the OS (e.g., maps services) may make this interaction more fluid.
- Accessibility: While all the previously discussed factors are relevant, even the best dataset is useless if it is not available to the right people. The previous subsections show that infrastructure-based data offers the largest pervasiveness with a poorer granularity, whereas OS frameworks offer the best trade-off between pervasiveness and granularity. Unfortunately, none of these data have been made available to the research community, even to fight the COVID-19 pandemic. Big tech companies and telcos have shared only aggregate data [8] which are not enough to perform the required mobility analysis. Instead, location data providers, such as Predicio (the one we use in this paper), have released fine granular data for research purposes in the context of COVID-19. In this paper, we showcase that despite the limitations in the pervasiveness of these types of data compared to infrastructure-based data or OS frameworks, they are valid for conducting both macro- and micro-mobility analyses. Both infrastructure- and terminal-based solutions shall enforce the highest privacy standards. However, while the data generated by infrastructure-based solutions are a side product for the correct operation of the, e.g., mobile networks, the data collected from terminals (hence end-users) shall enforce the highest transparency, as is the case for the data used in this work.
2.4. Current Efforts
3. Large-Scale Mobility Metrics
3.1. Data and Methodology
3.2. Mobility Time Dynamics
3.3. Mobility Extent Dynamics
4. Micro-Mobility
4.1. Motifs
- We compute the nodes of the motifs graphs by retaining only the locations where people spend a non-negligible amount of time, called stay-points. For this step, we apply the stay-point location algorithm available in the scikit-mobility library [31], which compresses a trajectory into a sequence of stay-points, computed using the DBSCAN algorithm, merging all observations belonging to a single trajectory that are closer than 200 m within a time window of 20 min. In this way, we isolate locations where contact may have actually taken place. Stay-points are calculated for each user on a daily basis.
- We detect possible recurrent nodes (i.e., stay-points that are visited more than once in each trajectory), we discretize stay-points by using the Uber-h3 [28] spatial tessellation, which assigns a unique identifier to hexagons on the Earth’s surface with different spatial resolutions. We use the spatial resolution index 9, which corresponds to hexagons with an edge of approximately 120 m, an area comparable to the one used by the stay-point location algorithm. At this point, each motif is a graph, the nodes of which are the h3 identifiers of the stay-point location, and the edges are the movements among them.
- We finally compute the popularity of motifs by understanding if two users’ trajectories yielded to the same graph structure, hence computing the isomorphism between them. For instance, two user trajectories that only have two nodes (e.g., two stay-points) with a back and forth behavior account for the popularity of the same motif.
4.2. Motifs Popularity
5. Ethical Considerations
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Abbreviations
COVID-19 | Coronavirus Disease 2019 |
MME | Mobility Management Entity |
OS | Operating System |
SSID | Service Set Identifier |
ROG | Radius of gyration |
SDK | Software Development Kit |
GPS | Global Positioning System |
GDPR | General Data Protection Regulation |
References
- Deloitte. Global Mobile Consumer Trends, 2nd Edition. Mobile Continues Its Global Reach into All Aspects of Consumers’ Lives. Available online: https://www2.deloitte.com/content/dam/Deloitte/us/Documents/technology-media-telecommunications/us-global-mobile-consumer-survey-second-edition.pdf (accessed on 15 July 2024).
- Toch, E.; Lerner, B.; Ben-Zion, E.; Ben-Gal, I. Analyzing large-scale human mobility data: A survey of machine learning methods and applications. Knowl. Inf. Syst. 2019, 58, 501–523. [Google Scholar] [CrossRef]
- Yang, Y.; Zhang, B.; Guo, D.; Wang, W.; Nie, J.; Xiong, Z.; Xu, R.; Zhou, X. Stochastic Geometry-Based Age of Information Performance Analysis for Privacy Preservation-Oriented Mobile Crowdsensing. IEEE Trans. Veh. Technol. 2023, 72, 9527–9541. [Google Scholar] [CrossRef]
- Han, Z.; Yang, Y.; Bilal, M.; Wang, W.; Krichen, M.; Alsadhan, A.A.; Ge, C. Smart Optimization Solution for Channel Access Attack Defense Under UAV-Aided Heterogeneous Network. IEEE Internet Things J. 2023, 10, 18890–18897. [Google Scholar] [CrossRef]
- Lutu, A.; Perino, D.; Bagnulo, M.; Frias-Martinez, E.; Khangosstar, J. A Characterization of the COVID-19 Pandemic Impact on a Mobile Network Operator Traffic. In Proceedings of the ACM Internet Measurement Conference, Virtual Event, 27–29 October 2020; pp. 19–33. [Google Scholar]
- Apple. Mobility Trends Reports. No Longer Reported. Available online: https://covid19.apple.com/mobility (accessed on 15 July 2024).
- Google. COVID-19 Community Mobility Reports. Available online: https://www.google.com/covid19/mobility/ (accessed on 15 July 2024).
- Meta. Data for Good. COVID-19. Available online: https://dataforgood.facebook.com/dfg/ (accessed on 15 July 2024).
- Safegraph. The Source of Truth for Places Data. Available online: https://www.safegraph.com/ (accessed on 15 July 2024).
- Predicio. Location-Based Behavior Intelligence. Available online: https://proptechzone.com/startups/predicio/ (accessed on 15 July 2024).
- González, M.C.; Hidalgo, C.A.; Barabási, A.L. Understanding individual human mobility patterns. Nature 2008, 453, 779–782. [Google Scholar] [CrossRef] [PubMed]
- Schneider, C.M.; Belik, V.; Couronné, T.; Smoreda, Z.; González, M.C. Unravelling daily human mobility motifs. J. R. Soc. Interface 2013, 10, 20130246. [Google Scholar] [CrossRef] [PubMed]
- Pappalardo, L.; Simini, F.; Rinzivillo, S.; Pedreschi, D.; Giannotti, F.; Barabási, A.L. Returners and explorers dichotomy in human mobility. Nat. Commun. 2015, 6, 8166. [Google Scholar] [CrossRef]
- Alsaeedy, A.A.; Chong, E.K. A review of mobility management entity in LTE networks: Power consumption and signaling overhead. Int. J. Netw. Manag. 2020, 30, e2088. [Google Scholar] [CrossRef]
- Calabrese, F.; Colonna, M.; Lovisolo, P.; Parata, D.; Ratti, C. Real-Time Urban Monitoring Using Cell Phones: A Case Study in Rome. IEEE Trans. Intell. Transp. Syst. 2011, 12, 141–151. [Google Scholar] [CrossRef]
- Google. Awareness API. Available online: https://developers.google.com/awareness (accessed on 15 July 2024).
- Kraemer, M.U.; Yang, C.H.; Gutierrez, B.; Wu, C.H.; Klein, B.; Pigott, D.M.; Du Plessis, L.; Faria, N.R.; Li, R.; Hanage, W.P.; et al. The effect of human mobility and control measures on the COVID-19 epidemic in China. Science 2020, 368, 493–497. [Google Scholar] [CrossRef]
- Rahman, M.M.; Thill, J.C. Associations between COVID-19 pandemic, lockdown measures and human mobility: Longitudinal evidence from 86 countries. Int. J. Environ. Res. Public Health 2022, 19, 7317. [Google Scholar] [CrossRef]
- Zhou, Y.; Xu, R.; Hu, D.; Yue, Y.; Li, Q.; Xia, J. Effects of human mobility restrictions on the spread of COVID-19 in Shenzhen, China: A modelling study using mobile phone data. Lancet Digit. Health 2020, 2, e417–e424. [Google Scholar] [CrossRef] [PubMed]
- Buckee, C.O.; Balsari, S.; Chan, J.; Crosas, M.; Dominici, F.; Gasser, U.; Grad, Y.H.; Grenfell, B.; Halloran, M.E.; Kraemer, M.U.; et al. Aggregated mobility data could help fight COVID-19. Science 2020, 368, 145–146. [Google Scholar] [CrossRef] [PubMed]
- Li, T.; Zhang, M.; Li, Y.; Lagerspetz, E.; Tarkoma, S.; Hui, P. The Impact of COVID-19 on Smartphone Usage. IEEE Internet Things J. 2021, 8, 16723–16733. [Google Scholar] [CrossRef] [PubMed]
- Seufert, A.; Poignée, F.; Hoßfeld, T.; Seufert, M. Pandemic in the digital age: Analyzing WhatsApp communication behavior before, during, and after the COVID-19 lockdown. Humanit. Soc. Sci. Commun. 2022, 9, 140. [Google Scholar] [CrossRef]
- Mejova, Y.; Kourtellis, N. Youtubing at home: Media sharing behavior change as proxy for mobility around covid-19 lockdowns. In Proceedings of the 13th ACM Web Science Conference 2021, Virtual Event, 21–25 June 2021; pp. 272–281. [Google Scholar]
- Pase, F.; Chiariotti, F.; Zanella, A.; Zorzi, M. Bike sharing and urban mobility in a post-pandemic world. IEEE Access 2020, 8, 187291–187306. [Google Scholar] [CrossRef]
- Yabe, T.; Tsubouchi, K.; Fujiwara, N.; Wada, T.; Sekimoto, Y.; Ukkusuri, S.V. Non-compulsory measures sufficiently reduced human mobility in Tokyo during the COVID-19 epidemic. Sci. Rep. 2020, 10, 18053. [Google Scholar] [CrossRef]
- TAPTAP Digital. Intelligence for Marketing. Available online: https://www.taptapdigital.com/ (accessed on 15 July 2024).
- Universidad Carlos III de Madrid. Research Portal. Chair TAPTAP DIGITAL-UC3M in Advanced AI and Data Science Applied to Advertising and Marketing. Available online: https://researchportal.uc3m.es/display/act538274 (accessed on 15 July 2024).
- Uber Engineering. H3: Uber’s Hexagonal Hierarchical Spatial Index. Available online: https://eng.uber.com/h3/ (accessed on 15 July 2024).
- Hale, T.; Webster, S.; Petherick, A.; Phillips, T.; Kira, B. Oxford COVID-19 Government Response Tracker (OxCGRT). Last Update. Available online: https://www.bsg.ox.ac.uk/research/covid-19-government-response-tracker (accessed on 15 July 2024).
- Cao, J.; Li, Q.; Tu, W.; Wang, F. Characterizing preferred motif choices and distance impacts. PLoS ONE 2019, 14, e0215242. [Google Scholar] [CrossRef]
- Pappalardo, L.; Simini, F.; Barlacchi, G.; Pellungrini, R. Scikit-Mobility: A Python Library for the Analysis, Generation and Risk Assessment of Mobility Data. arXiv 2019, arXiv:1907.07062. [Google Scholar] [CrossRef]
- Frik, A.; Kim, J.; Sanchez, J.R.; Ma, J. Users’ Expectations About and Use of Smartphone Privacy and Security Settings. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems, CHI ’22, New Orleans, LA, USA, 29 April–5 May 2022. [Google Scholar] [CrossRef]
- Cox, J. Leaked Location Data Shows Another Muslim Prayer App Tracking Users. Vice. 2021. Available online: https://www.vice.com/en/article/muslim-app-location-data-salaat-first/ (accessed on 15 July 2024).
- Jensen, C.; Potts, C. Privacy Policies as Decision-Making Tools: An Evaluation of Online Privacy Notices. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, CHI ’04, Vienna, Austria, 24–29 April 2004; pp. 471–478. [Google Scholar] [CrossRef]
- McDonald, A.M.; Cranor, L.F. The Cost of Reading Privacy Policies. I/S A J. Law Policy Inf. Soc. 2008, 4, 543. [Google Scholar]
- Swain, G.R.; Burns, K.A.; Etkind, P. Preparedness: Medical Ethics versus Public Health Ethics. J. Public Health Manag. Pract. 2008, 14, 354–357. [Google Scholar] [CrossRef] [PubMed]
- Wang, T.; Guo, L.; Bashir, M. COVID-19 Apps and Privacy Protections from Users’ Perspective. Proc. Assoc. Inf. Sci. Technol. 2021, 58, 357–365. [Google Scholar] [CrossRef] [PubMed]
- World Health Organization. WHO Guidelines on Ethical Issues in Public Health Surveillance; World Health Organization: Geneva, Switzerland, 2017.
- Anom, B.Y. The Ethical Dilemma of Mobile Phone Data Monitoring during COVID-19: The Case for South Korea and the United States. J. Public Health Res. 2022, 11, 22799036221102491. [Google Scholar] [CrossRef] [PubMed]
- GDPR. Complete Guide to GDPR Compliance. Available online: https://gdpr.eu/ (accessed on 15 July 2024).
Technology | Granularity | Accuracy | Pervasiveness | Completeness | Accessibility | |
---|---|---|---|---|---|---|
Infrastructure based | Medium, difficult to achieve | High | High | High | Very Low | |
Terminal Based | OS frameworks | High | Medium | High | High | Very Low |
Digital Ads | High | Medium | Low | Low | High | |
SDK | High | Medium | Low | Low | High |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Callejo, P.; Gramaglia, M.; Cuevas, R.; Cuevas, Á.; Tschantz, M.C. Analyzing Mobility Patterns at Scale in Pandemic Scenarios Leveraging the Mobile Network Ecosystem. Electronics 2024, 13, 3654. https://doi.org/10.3390/electronics13183654
Callejo P, Gramaglia M, Cuevas R, Cuevas Á, Tschantz MC. Analyzing Mobility Patterns at Scale in Pandemic Scenarios Leveraging the Mobile Network Ecosystem. Electronics. 2024; 13(18):3654. https://doi.org/10.3390/electronics13183654
Chicago/Turabian StyleCallejo, Patricia, Marco Gramaglia, Rubén Cuevas, Ángel Cuevas, and Michael Carl Tschantz. 2024. "Analyzing Mobility Patterns at Scale in Pandemic Scenarios Leveraging the Mobile Network Ecosystem" Electronics 13, no. 18: 3654. https://doi.org/10.3390/electronics13183654
APA StyleCallejo, P., Gramaglia, M., Cuevas, R., Cuevas, Á., & Tschantz, M. C. (2024). Analyzing Mobility Patterns at Scale in Pandemic Scenarios Leveraging the Mobile Network Ecosystem. Electronics, 13(18), 3654. https://doi.org/10.3390/electronics13183654