1. Introduction
In recent decades, the availability of a variety of advanced digital tools for design has advanced the creative process gradually supplanting the traditional analogue pen-and-paper methods used by individual designers to communicate their ideas. The recent shift towards the development of collaborative design systems has enabled multiple designers and citizens to act together interactively (e.g., collaborative Computer Aided Design—CAD, Building Information Modeling—BIM, Volunteered Geographic Information platforms—VGI) [
1,
2] to address the growing complexity of contemporary design challenges. The general trend is now to move farther away from desktop single-user solutions towards web-based interactive multi-user systems [
3]. Novel technological advances are giving rise to potential new ways of designing and mapping in collaborative environments.
Computer-based support tools in urban and regional planning have moved along this trend thanks to the advances both in Information Communication and Technology (ICT) and in the planning approaches. The increasing complexity of current planning challenges requires “smart” support tools which allow creation and evaluation of design alternatives quickly and efficiently, and at the same time in a more engaging way [
4]. A subset of these geo-information technologies, known as planning support systems (PSS), has met these challenges by providing support to the whole, or to some part of, complex planning processes and workflows. PPS combine a range of digital technologies to support different aspects of the planning process in an integrated way [
5,
6], including “computer-based tools for public participation” and collaboration among stakeholders [
7].
The use of digital information technologies and the active engagement of local communities, or
the people of the place in the design process are two key elements of the geodesign approach [
8,
9]. Although traditional public participation has been a challenge in many situations, [
10], geodesign methods have been proven successful in engaging members of the local community in the design phase through genuine collaboration.
In the last decade, the geodesign approach to spatial planning has attracted the interest of the academic community [
9,
11,
12], business companies [
13] and institutional environments [
14,
15]. Ervin [
16,
17] identified “15 essential components of an ideal geodesign toolbox” associating to each of them a specific set of digital tools. Among these components, real-time digital dashboards are proposed as a tool for interactively displaying the impacts of design alternatives against desired performance criteria and, thus, offering real-time support to decision-making. These dashboards can be used to rapidly evaluate the achievement of goals or targets, or to identify at a glance potential conflicts or critical situations.
In addition, real-time digital dashboards are useful for monitoring and analysis of the geodesign collaborative process itself. Within the context of the broader geodesign analytical framework, this study focuses on the development of a set of indicators that may be used by the coordinators of geodesign studies to evaluate both the behavior and the performance of the participants involved in the process, and the evolution of the design outcomes through time. Among the objectives of this study are assessment of the achievement of performance and participation levels (e.g., number of times a participant used the sketching tool in a design support system, or to identify leading vs lagging behind participants), along with analysis of spatial relations (e.g., intersection, proximity) among design alternatives proposed by different groups of stakeholders. This is key in identifying areas of disagreement, which may help to reach consensus among stakeholders. It is argued that investigating these aspects can increase the coordinator’s understanding of the process, which can, in turn, lead to improved outcomes, as well as, inform future process workflows. Moreover, through this approach, it is possible to evaluate existing support tools, as well as, to aid the development of new process-oriented PSS [
18].
In general, traditional data collection methods used to measure design dynamics in collaborative and computer-supported processes are based on traditional data collection tools (e.g., audio/video recording, survey, etc.) [
5,
19], and are very demanding in terms of both time and human resources. ([
20,
21]). However, in the contemporary information age, automated or computer-supported processes leave historical traces behind that exist in several forms, such as event logs [
22]. Many commonly used design software applications capture and make available information about actions taken automatically by the system or actively by the users [
23,
24]. The recently developed collaborative PSS Geodesignhub [
25] records log-data regarding the actions undertaken by the participants involved in a geodesign study and their products.
Increasingly in the last decade, organizations rely on the availability of log-data to improve and support their business processes in competitive and rapidly changing environments [
26]. Robust process mining techniques have been developed in recent years in several domains ranging from web page contents and usage [
27,
28,
29], education and e-learning platforms [
30,
31] and software engineering to detect anomalies and errors in systems processes [
32]. Despite the growing interest in log-based process analysis, very few, although very promising, studies—have been undertaken to exploit the potentialities of this new data source in the design field [
20,
22,
33].
Taking advantage of this approach, we propose a methodology to extract knowledge from Geodesignhub log-data in order to monitor and to understand the design dynamics of a study, ultimately aiming at improving collaborative geodesign processes. Both historical and real-time log-data can be analyzed using the set of indicators developed as a part of a more comprehensive framework for a Geodesign Process Analytics (GDPA). In a previous study, we applied inferential statistics techniques for testing a subset of those indicators using log-data of an existing geodesign study [
34]. Insights gained from ex-post analyses of a large number of collaborative design studies can be used to guide future cases with a view to process, design and coordination improvement.
The objective of the present study is to evaluate whether the analysis of information on the ongoing design dynamics, as recorded in the Geodesignhub log-data and made available in a timely and user-friendly manner through dashboards, can help improve the real-time management of geodesign processes. Several steps are necessary to collect, prepare, and analyze log-data to extract useful information about the current process unfolding. Hence, this paper’s structure is organized as follows: in
Section 2 the main phases and the characteristics of the log-data of the PSS Geodesignhub are presented;
Section 3 is entirely dedicated to the proposed analytical process, from data extraction methods to the detailed description of the indicators, a subset of which was tested with the log-data of the Cagliari case study presented in
Section 4.
3. Analytical Process
3.1. Geodesign Process Analytics
The peculiar data structure of the Geodesignhub log-data provides unparalleled opportunities to analyze geodesign studies. In order to provide the basis for the development of a comprehensive geodesign analytical framework (i.e., , Geodesign Process Analytics or GDPA), we focused on the two meanings of the term design, as stated by Steinitz [
8]: design as a verb, highlighting the importance of the
process itself, and design as a noun, identifying its
product (
Figure 2). Unlike other type of design software, such as CAD or BIM, information on participants’ interactions with the Geodesignhub platform is obtained indirectly by looking at the results of their actions.
On the one hand, diagrams are seen as the fundamental products of a geodesign study and analyzed with regard to geometry, the system they belong to (e.g., Agriculture, Housing, etc.), or other characteristics to assess their individual design quality and the spatial relations between them. A set of spatial analysis models, previously proposed by Freitas and Moura [
44], is extended here in order to create spatial indicators useful to identify in real-time possible conflicts of interest among stakeholder teams (i.e., , incompatible land-uses).
On the other hand, diagrams are considered as design events each representing a task carried out by a participant (i.e., , create a diagram; select a diagram). The quantity of diagrams created or selected can be used to analyze the productivity of the workshop participants and the evolution of the design alternatives. Temporal information of the diagrams allows evaluation of a participant’s behavior and performance over time. The analytical process presented in the following sections aims to extract useful information on the design
product and
process from the Geodesignhub log-data. This process includes data collection, preparation, analysis and the use of spatial analysis and statistics techniques to derive useful insights from this information (
Figure 3).
The proposed methodology is only the first step towards developing a complete Geodesign Process Analytics, yet it offers a set of useful indicators and measures to gain insights on the design dynamics and patterns that drive participants’ actions and their results in a typical workshop workflow. We argue, in fact, that the new knowledge gained from log-data analysis can help the workshop coordinator i) to evaluate the participants’ performance during the ongoing process, and ii) to discover meaningful patterns and trends in post-workshop analyses. Since (geo)design processes are now observable and measurable, ongoing or future processes can be potentially improved on the basis of an empirical understanding of their dynamics. Statistical analytical techniques were applied to the performance, temporal and design evolution indicators proposed in this study. Some of them are more suitable to support the coordinator in real-time in their role of facilitating the process, while others aid post-workshop analysis of the design process itself and in comparisons with other studies. This study focusses on the use of descriptive statistics and interactive dashboards aiming to provide quick and simple real-time updates to the coordinator of the workshop, whose fast pace workflow requires fast-moving attention. Indicators have not been designed to automate decisions but rather as warning systems able to inform the coordinator about potential issues in the process that may require further attention.
3.2. Data Extraction
Geodesignhub exposes its log-data via API. Information related to project objects is made available, including: diagrams (e.g., details of all diagrams created in a project); change teams (e.g., list of all diagrams selected in a group synthesis); the project in general (e.g., list of participants that took part in a project); systems (e.g., details of all systems considered in a project). The API is JSON-based and all the requests return GeoJSON files. The Python
urllib2 module was used to query the API and retrieve information useful for analyzing the design process. An excerpt of the GeoJSON file containing details of all diagrams that were created in a project is shown in
Figure 4. Spatial and non-spatial components of a GeoJSON object (i.e., , FeatureCollection) are stored in the one-row GeoJSON string. In this example, the “diagramid” is the unique identifier assigned by Geodesignhub platform and the “author” information were omitted for privacy. Information on other project objects were similarly retrieved and saved in JSON or GeoJSON formats.
In addition, the plug-in Geodesignhub Dashboard is accessible to the non-technical user and allows downloading individual diagrams in the common Shapefile format. API functionality was used here to get a text-based version of the Geodesignhub interface with textual descriptions of diagrams, change teams and users. Groups’ syntheses can also be manually downloaded in the “Design History” section. Although Geodesignhub enables export of key data from the platform, manual download can be very time consuming because of the many project and policy diagrams (typically200 or more).
More recently, the research team of the
Laboratório de Geoprocessamento of the Universidade Federal de Rio de Janeiro in Brazil developed an open source tool for querying Geodesignhub API and downloading diagrams in Shapefile format. The tool (Geodesign Hub—Vicon SAGA) is integrated in the GIS-based web platform Vicon/SAGA, which provides functionalities for data collection, storage, querying, visualization and exchange in many formats [
45,
46]. Although programming skills are not required, making downloading much easier and faster, the collected data are lacking several available attributes. Data gaps may affect the analysis and should be addressed. Data access through the API, although requiring more advanced IT skills, remains the preferred option to enable collection of a richer set of log-data.
The methods and tools of data pre-processing described in the next section allowed us to extract complete design process information from raw log-data and organize them into a geodatabase [
47], facilitating the analyses. Software and procedures used in this study are not new in the field of data mining, however, the application of these methodologies in the design domain is still substantially unexplored.
3.3. Data Preparation
Data preprocessing aims to offer a structural, reliable, and integrated data source for pattern discovery. This process encompasses a first data cleaning phase that can be implemented using Extract Transform Load (ETL) software. ETL transformation tasks support data optimization for efficient storage and analysis. Performed operations include cleaning, summarization, integration, and aggregation. The spatially-enabled version of Pentaho Data Integration, GeoKettle (version 2.5) provides full control over the entire process through an intuitive graphical user interface. The ETL tool extracts data from a source, transforms it to fit the users’ needs, and then loads it into a destination or database. ETL tools were initially developed to be used in the Information Communication Technologies (ICT) field, specifically in the processes of data migration between relational databases. More recently, the rapid diffusion of Business Intelligence tools (BI) has broadened the scope of ETL software by including the fast and automated building of data warehouses for data visualization rather than only storage. Currently, ETL tools expand their functionalities though the inclusion of spatial data and operations capable of competing with existing desktop Geographic Information System (GIS) software. The Geokettle software employed in this study falls within the category of spatial ETL tools.
Figure 5 shows the ETL data transformation diagram for validating (e.g., eliminate empty diagrams), filtering (e.g., selecting only certain attributes/columns) and splitting (e.g., multi-polygon diagram into single-polygon diagrams) a GeoJSON collection of features containing details of all the diagrams created in a Geodesignhub project. The sequence of data transformation activities began with reading the input data, splitting the GeoJSON unformatted single row into features or feature collections, and identifying diagrams with no geometry to be excluded from the analysis (
Figure 5a). Features and feature collections were then separated for further pre-processing into their spatial component (
Figure 5b,
Figure 5c) and into their non-spatial attributes (
Figure 5d). A series of transformation steps was applied to pull out useful information from the input data, in particular, multipart diagrams were split into single-part features (
Figure 5c) to avoid losing information. Finally, cleaned geometry and properties components were merged and temporarily saved in shapefile format (
Figure 5e) to allow a preliminary data exploration and visualization in a GIS environment.
The ETL transformation task described in
Figure 5 allows automatic preprocessing of GeoJSON files downloaded from Geodesignhub API. Resulting data were used to create and populate the database following a specific data model (
Figure 6). Although small, the relational database was structured in accordance with a series of normal forms to reduce redundancy of data and prepare a clearer and readable data model [
47]. Normalization includes organizing data attributes in tables and establishing relationships between those tables. Relations can be of various types as well as different in their representation. For example, a “one-to-many” relationship (1 - 1*) exists between the tables “group” and “sys_priority”. A system priority value represents design preferences of a single group, but a group assigns different priority values, as many as the systems considered in a project. Whereas, a “zero-to-many” relationship (0 - *) connects the “group” and the “synthesis” tables. A synthesis could be created by a group or by no group, it could, in fact, be the result of a negotiation process within a coalition. Additionally, a group may have created more than one synthesis.
The open source relational database PostegreSQL was used in this study, mainly because of its spatial functions for processing and analysis of geographic objects. The extension PostGIS allows storage and querying of spatial data and supports various geometry types (e.g., POLYGON, MULTI-POLYGON) expressed in Open Geospatial Consortium (OGC) formats (e.g., Well-Known Text—WKT, Well-Known Binary—WKB). Currently, it provides a large set of spatial functions enabling the fast retrieval and processing of geographic information, without the need to use any additional GIS-based analysis tools.
At this point, the “Load” functions of ETL software were exploited for writing the processed data into the database. The second phase of the data transformation (
Figure 7) includes a series of steps for filtering first, and then loading, data attributes into the target tables. Geometries and properties components of all diagrams created in a Geodesign project populated the “system” (
Figure 7a), “author” (
Figure 7b) and “diagram” (
Figure 7c) tables. Additionally, priority weights assigned to the systems by the stakeholder groups can be obtained by Geodesignhub API calls and loaded into the “sys_priority” table (
Figure 7d).
The other tables, identified in the data model in
Figure 6, were populated with data containing details of all diagrams selected in the group/coalition syntheses created in the project (
Figure 8). The input step of the data transformation diagram was a series of shapefiles collected in a File List (a text file listing of the shapefiles contained in the syntheses folder) (
Figure 8a). Data was cleaned and loaded in the target tables including “selection” (
Figure 8b), “synthesis” (
Figure 8c), “group_” (
Figure 8d), “coalition” (
Figure 8e) and “component” (
Figure 8f) tables.
We provided a first set of indicators to analyze participants’ performance and design evolution which combine the analytical dimensions made available by the data model. The following section describes in detail the analysis, which was articulated as follows: (i) spatial queries were used to investigate the relations among geometric objects (diagrams) present in the spatial dataset, and (ii) descriptive statistical analysis was applied to identify the relations between temporal and thematic attributes (e.g., time of creation of a diagram, system name, frequency of selection, etc.). The results of the analyses applied to the Cagliari case study are reported in
Section 4.
3.4. Data Analysis
3.4.1. Spatial Indicators
In 2018, Freitas and Moura [
44] exploited the ETL capabilities to manage spatial data and developed effective methods to analyze topological relationships and positional similarity among diagrams (
Table 1). Two data transformation diagrams were set up and tested using diagram collections from several geodesign workshops. Understanding spatial relationships (e.g., intersection, proximity) among selected diagrams, and combining them with other attributes (e.g., system name, diagram title), allows identification of possible conflicts of interest or areas of agreement within the teams or among the teams. This, in turn, is likely to stimulate dialogue between workshop participants with a view to fostering consensus on a common design. The following indicators are suitable both to be displayed in a real-time dashboard facilitating dialogue and negotiation and to be used as a basis of comparison with other studies.
In this paper, we propose the use of PostGIS spatial functionality for querying the log-data database about spatial relationships among diagrams. Spatial databases are able to manipulate spatial data, rather than simply store and organize them. PostGIS, in particular, supports all the standard OGC geospatial operators (e.g., distance, within, intersects, closest, etc.) and it is considered the most efficient open source solution for managing geospatial data [
48]. Topological and geometrical queries were formulated in SQL to analyze the spatial properties of diagrams. The advantage of using the databases for data analysis (as opposed to ETL tools) is that it eliminates the need to extract the data by creating transformation steps that can be directly performed in the database repository. In addition, data is automatically processed as information and is loaded into the database without the need for further technical intervention.
A first set of spatial functions defines the topological relationships between two overlapping diagrams (A ∩ B ≠ ∅) selected by two different change teams (
Figure 9). The existing relationships between the spatial objects can be determined by evaluating their possible combinations and calculating the proportion between the intersection area and the diagrams areas. Three topological relations—partially based on the 9-intersection model [
49]—were identified as being of interest for the workshop coordinator:
If the intersection area is greater than the 80% of the total area of the two diagrams, they are considered “similar” (
Figure 9a);
If the intersection area is greater than the 80% of the area of the first diagram, A is “within” B (
Figure 9b);
If the intersection area is greater than the 80% of the area of the second diagram, A “contains” B (
Figure 9c).
A second set of spatial functions includes a more elaborate analysis setting to be used in case of a “disjointed” relation between two diagrams (A ∩ B = ∅). The proposed approach relates the concept of proximity (or nearness) between two diagrams to the minimum bounding rectangle of all diagrams created. A bounding box is a rectangular polygon aligned with the coordinate axes that encompass a spatial feature, or group of features, from its minimum and maximum coordinates in the x and y directions. If the distance between two diagrams, based on the length of the segment linking their centroids, is less or equal to 12.5% of the shorter side of the bounding box, the two diagrams are considered to be close. It is worth mentioning that in assessing spatial proximity, the shape of the diagram counts. This approach, which considers the coordinates of the centroids, provides an efficient trade-off between computational time requirements and accuracy of results.
3.4.2. Participants’ Performance Indicators
Drawing design proposals in the form of projects and policies is one of the main tasks performed by the participants involved in a Geodesignhub workshop. Diagrams were used as the basis for assessment of participants’ performance. The number of diagrams created was combined with thematic attributes (i.e., , author, system, type) to construct the first three indicators of the set presented in
Table 2. We argue that those indicators can support the workshop coordinator’s understanding of participants’ performance in real-time, and help detect any related issues in the early stages of the design development process.
The indicator
Top Contributors (
Table 2) identifies leading participants with greatest potential to influence the design and, perhaps more importantly, those lagging behind who may need further attention in performing their work. Also, within the multi-system approach of geodesign, all systems identified as relevant for the development of the study area (e.g., Agriculture, Housing, etc.) will be taken into consideration. The coordinator plays an essential role in identifying systems of major interest and those not sufficiently considered. The real-time assessment of the
Diagram creation by system efficiently supports this task. Geodesignhub uses two types of diagrams to convey ideas for a change in a system, projects and policies. A
project envisages a physical change on the study area and its impacts and costs can be measured. In the Geodesignhub computational logic, a
policy will not have a quantified physical impact on the site. The objectives underlying policies are achieved through private/public incentives or the creation of ad hoc laws, therefore, its impact and cost are not taken into account. The number of diagrams created per type (
Diagram creation by type) helps to get a preliminary idea of most pressing needs (e.g., new infrastructure or buildings or changes in activity patterns).
A second group of performance indicators is used to evaluate the influence of background information on the number of diagrams created. Cocco et al. [
34] explored the relationship between number of diagrams created and the personal and professional profiles. Inferential statistical techniques were used to perform correlation analysis between the available dimensions: age group, education level (e.g., undergraduate, graduate, PhD), level of experience (e.g., previous experience with Geodesignhub/PSS), professional expertise (e.g., architecture, planning, ecology), role within the area of interest (e.g., practitioner, researcher, student). The number of diagrams created is also indicative of participants’ perceived mastery of the built-in sketching tool. However, the tendency of homogeneous categories of participants (in terms of age, expertise, etc.) to reach similar level of performance should be further investigated by using the descriptive indicators proposed here in comparative studies of workshops results, whether or not they are complemented with inferential statistics.
Another aspect to be analyzed in a post-workshop phase is the influence of the role played by system experts and stakeholders) on performance as authors of diagrams. It is expected that stakeholders would create more diagrams if they were not satisfied by the experts’ first design proposals. System experts’ performance can act as an indicator of stakeholders’ satisfaction in respect to the experts’ first proposals/diagrams.
3.4.3. Temporal Indicators
Time stamps contained in the Geodesignhub log-files provide precise temporal information on two events: “diagram creation” and “synthesis creation”. In addition, it is possible to infer temporal information for several different steps in a geodesign workshop workflow (e.g., “Experts Create Diagrams”, “Add Diagrams and Create Synthesis 1”, “Add Diagrams and Create Synthesis 2”, etc.) and establish time intervals between the starting time and the end time of each step. This information was systematically exploited in the set of temporal indicators listed in
Table 3.
More specifically, real-time measurement of Workshop steps duration was useful to assess whether the workflow is properly following the initial schedule. If delays occur, they should be carefully monitored by the coordinator during the workshop and then analyzed in detail by the project coordination team after the workshop to identify the reasons for delays.
The temporal dimension of diagrams was also used to measure various indicators for evaluating participants’ and groups’ performance over time. Diagrams can be associated to workflow steps on the basis of their creation time, making it possible to observe variations in the number of diagrams created as the workshop progresses. Participants’ performance (individually or in groups) can differ significantly throughout the workshop until a final agreement is reached. This analysis was particularly interesting in regard to
Diagram creation over time by group of stakeholders/coalitions. Despite the fact that no statistically significant difference has been found so far [
34], these dynamics need to be examined in more depth using time series graphs to investigate differences in the
Diagram creation over time, both between, and within the workshop steps.
The Average time spent on diagram creation can help to better understand differences across the steps. We argue average time spent on diagram creation may vary greatly from one step to another. Leaving aside differences in the workshop timetable, there could be other reasons for variations (e.g., influence of the coordinator, difficulties experienced by the participants in using the software, disagreements within the team, difficulty of the subject matter, etc.), which should be considered in comparative studies.
3.4.4. Indicators of Design Evolution
Log-data have previously been used to measure the productivity of participants based on the number of diagrams created and assessed over time. Similarly, in this section we relate the number of diagrams selected in the groups’/coalitions’ syntheses to different thematic attributes (i.e., , author, stakeholder group, system, system priority weight) to develop a better understanding of design evolution dynamics. The first set of indicators proposed in
Table 4 should both help the coordinator to have a clearer insight into the intermediate results of the design process, and enable a comprehensive analysis of the results.
The Frequency of diagram selection in all the groups’/negotiation coalitions’ syntheses is a useful indicator of each diagram’s success. The diagrams with the highest frequency of use in the syntheses were identified. The shape of the curve in the frequency distribution graph shows the extent of agreement among groups and provides a first indication of the degree of disagreement that must be addressed in order to achieve consensus among the stakeholders. This metric was also applied at the level of individual participants to identify leading individuals having greater influence in the design. The Top Contributors can be compared to the Top Influencers to identify for each participant potential relationships between the number of diagrams created and their success.
The evolution of the syntheses, measured by the number of diagrams selected, provides an indication of the groups’ performance over time, especially if compared with the analysis of how many diagrams were selected per system. Differences across the syntheses can be associated with possible changes in Groups’ views as the process develops. Significant differences across the group’s syntheses in the number of selected diagrams aggregated by system in relation to its priority weight are likely to indicate that the group has modified its views.
Lastly, the role played by the diagram authors can also be tested. The number of diagrams selected in the syntheses is classified by author type, either system experts or stakeholder teams. The hypothesis according to which the
Diagram selection frequency changes based on the
role played by author has been tested in a previous study [
34]. Preliminary results showed that the chance of diagrams created by expert groups to be selected in the subsequent syntheses was greater than those created by the non-expert stakeholders in the other steps.
The participants’ performance, and the temporal and design evolution indicators described above, and related set of measures, were implemented in R, an open source software environment for statistical computing and graphics. R can be used to analyze data from many different sources including PostgreSQL. The log-data geodatabase was connected, and data access operations were performed using the R driver PostgreSQL. The indicators suitable for display in the coordinator’s real-time dashboards were tested on the Cagliari geodesign workshop log-data.The results are presented in the following section. The R extension Shiny was used for data visualization in interactive dashboards. Dashboards are visual indicators of information based on performance metrics that have been previously defined as relevant. Shiny enables users to rapidly build complex web applications using the R language without web development. It cannot be considered a direct substitute to more complex Business Intelligence (BI) platforms, yet it ensures an interactive data experience of sufficient quality for use in this study.
5. Discussion
In this case, even more than in others, the tasks that are necessary for the correct geodesign workshop development were included in a well-structured workflow. In particular for its academic and educational nature, the coordinator should have constantly had a clear vision of the participants’ performance and of the design evolution. This required ex-post evaluation work after each session by the coordination team in order to better target support during coming session. The implementation of the indicators described in the previous section (
Figure 11) may provide important complementary support aimed at improving the monitoring in real-time.
For example, the information obtained through
Top Contributors and
Top Influencers analysis may not only be used for monitoring the participants’ or students’ performance, but also to analyze whether those who have great potential to influence the design are also those who really influence the design, or in other words to make an assessment of performance in terms of quantity vs quality. As can be seen by comparing
Figure 11a with
Figure 11b there is not a direct relationship between the number of diagrams created by a participant and their selection frequency.
In addition, it is particularly useful to break down the number of diagrams created by system and by type thus offering detailed insights into the participants’ performance. Arguing the fact that all the systems need to be taken into consideration when designing future development alternatives, the
Diagram creation by system and
Diagram creation by type (
Figure 11c,e) facilitate the identification and timely execution of “corrective” measures to ensure a balanced distribution (in terms of number and type) of initial proposals, whether they were created by experts or by stakeholders.
Similarly, the
Diagram selection over time by system helps in identifying that all systems were taken into account in the syntheses creation. Again, the relation between diagrams created and diagram selected broken down by system is not strictly linear (e.g.,
Figure 11f shows they the cultural heritage diagrams were the most numerous in all syntheses created by the group EA35; however cultural heritage is among the less considered systems—in terms of created diagrams - as shown in
Figure 11c), and thus both indicators should be analyzed.
Another interesting aspect that is not clearly observable in the Geodesignhub user interface is the
Diagram selection over time by group that provides a detailed picture of the groups’ performance and highlights possible differences in behavior (e.g., the opposite trend recorded for group EA35 in
Figure 11d).
Summarizing, the chart reports are an effective way to visualize ongoing dynamics in a live dashboard and may ensure the effective coordination of the assistance efforts during the workshop. The overall results show the appropriateness of the first set of indicators proposed to analyze the design log-data made available for the first time by recent PSS, as Geodesignhub. Such an investigation may be useful both for monitoring ongoing processes, and for learning from past case studies with the aim of improving future one. First, while the experience and the observation skills of those involved in the coordination of geodesign studies will always be relevant and needed, the availability of digital dashboard monitoring the process (design as a verb) and its product (design as a noun) real-time may potential add great value, especially in fast-pace intensive geodesign workshops with high number of participants. Second, by identifying recurrent behaviours and pattern which appear to be more or less effective for the process to succeed, better processes could be designed and managed in the future avoiding bottlenecks and facilitating the emergence of positive dynamics.
Lastly, the opportunity of analysing this new type of data with digital dashboards may potentially enable the application of a new business intelligence perspective in real-time geodesign study management, and in retrospective or comparative studies by mining, what may be considered geodesign (processes) big-data.
6. Conclusions
The use of process mining techniques to discover, monitor and improve processes dynamics is gaining acceptance in many fields including collaborative design. Planning processes are becoming increasingly complex as a result of the multi-dimensional context (i.e., , multi-actor, multi-objective, multi-criteria, multi-scale processes) characterizing current practices. The new generation of advanced planning support technology is able to handle complex design workflows and to record detailed information about the history of processes and make it readily available for analysis.
This research develops an analytical framework to exploit information about collaborative geodesign processes recorded in the Geodesignhub log-data aiming at supporting the workshop coordinator in their role of guidance by getting real-time feedback on ongoing dynamics. As an early step towards a comprehensive Geodesign Process Analytics, this paper describes in detail the proposed analytical process: data extraction, preprocessing and analysis.
As highlighted in earlier sections, collaborative design log-data have a peculiar structure which integrates information relate to both the tasks carried out by participants along the process (i.e., , create a diagram; select a diagram) and the outputs of those tasks (i.e., , diagrams created, diagrams selected). The analytics tools, therefore, should cover two types of measures: those linked to the actions of the participants which characterize the process, and those related to design aspects of the products. The acquired knowledge can be applied to facilitate targeted and effective process improvement initiatives regarding on-going and future situations.
To this end, the indicators proposed for the analysis of a typical workshop dynamics provide knowledge about (i) spatial relations among design proposals by different groups of stakeholders, (ii) participants’ performance, (iii) actual compliance of the process with the workshop schedule, (iv) design evolution over time. The usefulness of the analytical framework has been demonstrated ex-post by applying a sub-set of representative indicators to gain insights on the geodesign study in real-world use-cases developed within the International Geodesign Collaboration project and involving postgraduate students in designing the future of the Metropolitan City of Cagliari, Italy.
While the set of the indicators has not been tested yet live during a geodesign workshop, the simulation of the application of a real-time dashboard implementing the indicators demonstrates their potential value in offering contextual advice to the geodesign workshop coordinator. The application of so called "descriptive" analytics does not, in fact, lead to any automated decisions based on the results of the analysis, but rather to better-informed real-time/proactive coordination actions and decisions. The objective is, therefore, to develop an analytical tool to support the coordination and management of running geodesign workshops and, subsequently, to facilitate the identification of recurrent behaviors and rules in the post-workshop analysis. In both cases, the improvement of current/future processes is the focus of the application of the proposed analytics.
In the light of the results of this research, the proposed analytical process could be integrated in the PSS architecture. The recently developed plugin “Geodesign Analytics” in Geodesignhub can be regarded as a first step towards the integration of process analysis tool within a PSS. The tool provides basic analysis including a timeline of when different groups saved their syntheses, a visual representation of how many diagrams were added and subtracted as the design develops. The capabilities offered by current web-based analytic apps should be fully exploited to carry out exploratory log-data analysis directly downloading design information from the cloud-based design platform. Design logs are rich data sources that offer many advantages in comparison with more traditional data gathering techniques and coding systems. The opportunity of obtaining value from this data is unprecedented, and it is worth to be investigated further as it may contribute to offer a better understanding of the process unfolding, and of its results.