Development of a Voice Virtual Assistant for the Geospatial Data Visualization Application on the Web
Abstract
:1. Introduction
2. Materials and Methods
2.1. Domain and Phenomenon Characterisation
2.2. Methodology
- Recipes: A recipe describes the components of an action in terms of parameters, sub-actions, and constraints.
- Plans: A plan corresponds to a schema describing not only how to act, but more importantly, the mental attitude towards it, such as beliefs, commitments, and Execution status.
- Actions: An action refers to a specific goal as well as the efforts needed to achieve it. An action can be basic or complex.
- Type I: Acquisition of spatial data (e.g., map layer retrieval)
- Type II: Analytical tasks (e.g., finding spatial clusters)
- Type III: Cartographic and visualisation tasks (e.g., zooming, panning)
- Type IV: Domain-specific tasks (e.g., evacuation planning during hurricanes)
2.3. Survey
2.4. Comparison with ChatGPT
3. Results
3.1. Implementation
3.1.1. Frameworks
3.1.2. Web Speech API
3.1.3. Geocoding API
3.1.4. Functions
- MainSpeechAPI.js: This JavaScript script, like others in the application, leverages the Web Speech API for speech recognition and action execution. It initializes essential variables for speech recognition and establishes a crucial connection between the frontend and backend, enabling command recognition and control.
- logToBoxArea(): Logs chat messages, distinguishes user and system messages, transcribes snippets and responds to interpreted commands.
- interpret(): Interprets received commands, matching them using regular expressions. Actions include changing the base map, zooming, panning, geocoding, and more. Accurate geocoding depends on correct transcription.
The interpret() function defines voice commands using regular expressions. Each recognized command triggers the appropriate function in VoiceMapChart.js. - VoiceMapChart.js: This script extends the code for the voice map chart’s structure. It encompasses marker and thematic map functionalities tailored for voice interactions. To enable visualization through voice commands, functions must be adapted to align with defined commands and callbacks in MainSpeechAPI.js.
3.2. Usability Testing
- Task Completion Rate: This metric measured the percentage of participants who successfully completed each task.
- Task Time: The average time taken by participants to complete each task was recorded.
- Error Count: The number of errors made by participants during task completion was measured. The error count, practically, was due to the complexity of the tasks for users, or the accent of the users, which was not recognizable for the application.
- User Satisfaction Score: Participants were asked to rate their satisfaction with the application on a scale of 1 to 10. This metric gauged the overall user experience and their level of contentment with the application’s usability.
4. Discussion
- Using machine learning to enhance natural language processing and find real-time alternatives for the terms not in the initial geospatial interaction corpus.
- Expanding to multilingual voice assistants for global accessibility: assessing the heterogeneity of voice commands across different languages, exploring the mismatches and similarities in terms across various linguistic contexts.
- A dedicated study on evaluating and enhancing virtual assistants’ adherence to existing geospatial standards and the potential development of new standards tailored for voice interfaces.
- Investigating applications in remote sensing scenarios.
- Developing specialised voice interfaces for platforms like Alexa, focusing on geospatial queries.
- Conducting comprehensive user studies to measure the effectiveness and user satisfaction of voice assistants in geospatial visualisation.
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Haklay, M. How good is volunteered geographical information? A comparative study of OpenStreetMap and Ordnance Survey datasets. Environ. Plan. Plan. Des. 2010, 37, 682–703. [Google Scholar] [CrossRef]
- Austerjost, J.; Porr, M.; Riedel, N.; Geier, D.; Becker, T.; Scheper, T.; Marquard, D.; Lindner, P.; Beutel, S. Introducing a Virtual Assistant to the Lab: A Voice User Interface for the Intuitive Control of Laboratory Instruments. SLAS Technol. 2018, 23, 476–482. [Google Scholar] [CrossRef] [PubMed]
- Blanco, T.; Martín-Segura, S.; de Larrinzar, J.L.; Béjar, R.; Zarazaga-Soria, F.J. First Steps toward Voice User Interfaces for Web-Based Navigation of Geographic Information: A Spanish Terms Study. Appl. Sci. 2023, 13, 2083. [Google Scholar] [CrossRef]
- Granell, C.; Pesántez-Cabrera, P.G.; Vilches-Blázquez, L.M.; Achig, R.; Luaces, M.R.; Cortiñas-Álvarez, A.; Chayle, C.; Morocho, V. A scoping review on the use, processing and fusion of geographic data in virtual assistants. Trans. GIS 2021, 25, 1784–1808. [Google Scholar] [CrossRef]
- Lai, P.-C.; Degbelo, A. A Comparative Study of Typing and Speech For Map Metadata Creation. AGILE GISci. Ser. 2021, 2, 7. [Google Scholar] [CrossRef]
- Gilbert, T. VocalGeo: Using Speech to Provide Geospatial Context in the Classroom; figshare: London, UK, 2020. [Google Scholar] [CrossRef]
- Cali, D.; Condorelli, A.; Papa, S.; Rata, M.; Zagarella, L. Improving intelligence through use of Natural Language Processing. A comparison between NLP interfaces and traditional visual GIS interfaces. Procedia Comput. Sci. 2011, 5, 920–925. [Google Scholar] [CrossRef]
- Wang, H.; Cai, G.; MacEachren, A.M. GeoDialogue: A software agent enabling collaborative dialogues between a user and a conversational GIS. In Proceedings of the 2008 20th IEEE International Conference on Tools with Artificial Intelligence, Dayton, OH, USA, 3–5 November 2008; Volume 2, pp. 357–360. [Google Scholar] [CrossRef]
- Kadriu, A.; Rista, A. Automatic Speech Recognition: A Comprehensive Survey. SEEU Rev. 2020, 15, 86–112. [Google Scholar]
- Tulshan, A.S.; Dhage, S.N. Survey on Virtual Assistant: Google Assistant, Siri, Cortana, Alexa. In Advances in Signal Processing and Intelligent Recognition Systems, Proceedings of the 4th International Symposium SIRS 2018, Bangalore, India, 19–22 September 2018; Revised Selected Papers 4; Springer: Singapore, 2018. [Google Scholar] [CrossRef]
- Zhu, A.-X.; Zhao, F.-H.; Liang, P.; Qin, C.-Z. Next generation of GIS: Must be easy. Ann. GIS 2021, 27, 71–86. [Google Scholar] [CrossRef]
- Cai, G.; Wang, H.; MacEachren, A.M.; Fuhrmann, S. Natural conversational interfaces to geospatial databases. Trans. GIS 2005, 9, 199–221. [Google Scholar] [CrossRef]
- Dodge, M. Mapping and geovisualization. In Approaches to Human Geography, 2nd ed.; Aitken, S.C., Valentine, G., Eds.; Sage: London, UK, 2014; pp. 289–305. [Google Scholar]
User | Age | Gender | Mother Tongue | Filed of Study | Task Completion Rate (%) | Task Time (minutes) | Error Count | User Satisfaction Score (Out of 10) |
---|---|---|---|---|---|---|---|---|
1 | 25 | Female | Persian | Geomatics Eng. | 100% | 10 | 4 | 8 |
2 | 30 | Male | Persian | Computer Science | 90% | 15 | 2 | 7 |
3 | 22 | Male | Italian | Computer Science | 100% | 12 | 5 | 9 |
4 | 28 | Female | Italian | Environ. Science 1 | 95% | 18 | 3 | 8 |
5 | 37 | Male | Italian | Urban Planning | 85% | 20 | 4 | 6 |
6 | 20 | Male | Italian | Mechanical Eng. | 100% | 11 | 5 | 9 |
7 | 26 | Male | Italian | Data science | 90% | 14 | 4 | 7 |
8 | 24 | Male | Italian | Data science | 100% | 9 | 3 | 8 |
9 | 31 | Female | Persian | Architecture and Landscape | 95% | 16 | 4 | 9 |
10 | 23 | Male | Italian | Law | 75% | 22 | 2 | 6 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Mahmoudi, H.; Camboim, S.; Brovelli, M.A. Development of a Voice Virtual Assistant for the Geospatial Data Visualization Application on the Web. ISPRS Int. J. Geo-Inf. 2023, 12, 441. https://doi.org/10.3390/ijgi12110441
Mahmoudi H, Camboim S, Brovelli MA. Development of a Voice Virtual Assistant for the Geospatial Data Visualization Application on the Web. ISPRS International Journal of Geo-Information. 2023; 12(11):441. https://doi.org/10.3390/ijgi12110441
Chicago/Turabian StyleMahmoudi, Homeyra, Silvana Camboim, and Maria Antonia Brovelli. 2023. "Development of a Voice Virtual Assistant for the Geospatial Data Visualization Application on the Web" ISPRS International Journal of Geo-Information 12, no. 11: 441. https://doi.org/10.3390/ijgi12110441
APA StyleMahmoudi, H., Camboim, S., & Brovelli, M. A. (2023). Development of a Voice Virtual Assistant for the Geospatial Data Visualization Application on the Web. ISPRS International Journal of Geo-Information, 12(11), 441. https://doi.org/10.3390/ijgi12110441