Next Article in Journal
Evaluation of the Create@School Game-Based Learning–Teaching Approach
Previous Article in Journal
In Vivo Measurement of Cervical Elasticity on Pregnant Women by Torsional Wave Technique: A Preliminary Study
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Improving IoT Predictions through the Identification of Graphical Features

School of Electrical Engineering and Computer Science, Washington State University, Pullman, WA 99163, USA
*
Author to whom correspondence should be addressed.
Sensors 2019, 19(15), 3250; https://doi.org/10.3390/s19153250
Submission received: 8 June 2019 / Revised: 20 July 2019 / Accepted: 22 July 2019 / Published: 24 July 2019
(This article belongs to the Section Internet of Things)

Abstract

:
IoT sensor networks have an inherent graph structure that can be used to extract graphical features for improving performance in a variety of prediction tasks. We propose a framework that represents IoT sensor network data as a graph, extracts graphical features, and applies feature selection methods to identify the most useful features that are to be used by a classifier for prediction tasks. We show that a set of generic graph-based features can improve performance of sensor network predictions without the need for application-specific and task-specific feature engineering. We apply this approach to three different prediction tasks: activity recognition from motion sensors in a smart home, demographic prediction from GPS sensor data in a smart phone, and activity recognition from GPS sensor data in a smart phone. Our approach produced comparable results with most of the state-of-the-art methods, while maintaining the additional advantage of general applicability to IoT sensor networks without using sophisticated and application-specific feature generation techniques or background knowledge. We further investigate the impact of using edge-transition times, categorical features, different sensor window sizes, and normalization in the smart home domain. We also consider deep learning approaches, including the Graph Convolutional Network (GCN), for the elimination of feature engineering in the smart home domain, but our approach provided better performance in most cases. We conclude that the graphical feature-based framework that is based on IoT sensor categorization, nodes and edges as features, and feature selection techniques provides superior results when compared to the non-graph-based features.

1. Introduction

We seek to leverage the inherent graph structure in IoT sensor networks to find a generic graph representation of sensor network data. We can provide a tool to IoT application builders to boost prediction task performance if we can build a generic graph-based framework and graphical features to support different prediction tasks from different IoT sensor network data. Graph-based approaches have been successful in several IoT applications. For example, George and Shekar [1] used a graph representation for road navigation to efficiently find the shortest route that may change with time. Graph-based methods that were applied to activity recognition by Long and Holder [2] showed uncorrelated errors when compared to non-graph methods, which indicated the potential for improved performance through the use of a graph representation. Salomon and Tirnauca [3] learn users’ behavioral patterns while using weighted finite automata by building a time-aggregated graph from the activity flow of a session (morning, afternoon, evening). They represent each activity as a node and time spent for a particular activity or the time elapsed between different activities as edge labels. George et al. [4] proposed Spatio-Temporal Sensor Graphs (STSG) to capture time dependence of sensor network parameters. They generalized time-aggregated graphs and included the probability parameters to represent the stochastic nature of sensor graphs. They illustrated three case studies of anomaly detection, basic hotspot detection and growing hotspot detection while utilizing the STSG model. Based on these works, we hypothesize that representing the sensor data as a graph and providing this graphical input to graph mining algorithms to identify new features in the data can improve the performance of learning methods in recognition and prediction tasks that are related to sensor networks. We develop a Graphical Feature-based Framework (GFF) to represent and analyze IoT sensor network data. This framework collects data from sensor networks, uses graph structure to represent movement-related data, and employs selected graphical features to improve the corresponding prediction tasks. As part of this framework, the sensors are represented as nodes in a graph, and consecutive triggering of sensors are represented as edges in the graph. Subsequently, graphical features, such as existence of nodes, existence of edges, count of nodes, and count of edges, are extracted from the graph to construct a feature set. This graphical feature set is added to the typical feature set of the IoT application. This augmented feature set is supplied to a classifier to learn a model to perform the prediction task for the IoT application.
We applied the GFF to three different IoT domains with different sensors and different prediction tasks. The first domain is activity recognition from smart home motion sensors, the second domain is demographic information prediction from smart phone GPS sensors, and the third domain is activity recognition from smart phone GPS sensors. Our graphical feature-based approach improved the classification accuracy for the first two domains as compared to most of the benchmarks in each domain [5,6]. For the third domain, we use graphical features along with typical non-graphical features that are based on sensor data, such as accelerometer and gyroscope, to predict activities. Experimental results show that the inclusion of GPS data as node features significantly improves the performance over other typical non-graphical features. Analyzing the confusion matrix shows that the addition of edges may improve the performance for some activities. Using only selected features has the potential to improve the performance with the addition of edges [7]. In this article, along with graphical features used in our above-mentioned work, we propose and evaluate additional graphical features as part of our framework. Here, we assess the use of different window sizes, adding edge transition time, categorization of nodes and edges, normalization of nodes and edges, application of different classifiers, and impact on different evaluation measures for the smart home domain. We also evaluate the use of deep learning and Graph Convolutional Networks to classify activities in the smart home domain in order to compare our framework with these deep learning approaches.
In Section 2, we describe related work that was conducted in the three domains of IoT applications to which we applied our GFF. In Section 3, we describe details of each step in the graphical feature-based method that was designed for IoT applications. In Section 4, we summarize the performance of the GFF in three domains. Afterwards, we describe the implementation details of varying window sizes, normalization of non-graphical and graphical features, using additional features, such as edge transition time and categorical features, use of Linear Support Vector Machine (SVM) and Random Forest as the classifier, and the application of deep learning approaches in the smart home domain. We present the results and the impact of these experiments on the performance of the prediction task. In Section 5, we conclude with a summary of our findings and suggest the best practices for applying graphical features. In Section 6, we discuss the future directions for using graph-based representations for IoT applications.

2. Related Work

In this section, we discuss previous work conducted in the three domains of activity recognition from smart home sensors, demographic prediction from mobile phone sensors, and activity recognition from smart phone sensors. Researchers have approached the problem of activity recognition from motion sensors using various methods, such as Bayes Belief Networks, Artificial Neural Networks, Sequential Minimal Optimization, LogitBoost Ensemble [8], Naive Bayes, Hidden Markov Models [9,10], and Conditional Random Fields [11]. Extensive research has been conducted regarding improving activity recognition performance while using non-graphical features such as motion sensor triggering; temporal information during the activity, such as time of day, day of week, whether the day is a weekend or not; activity length in time; previous/next activity; number of kinds of motion sensors involved; total number of times motion sensor events triggered; and, energy consumption for an activity [8,11].
Aicha et al. [12] detected the presence of multiple people or visits while using the number of sensor transitions that are not topologically connected. Two sensors are considered to be topologically connected if one person can activate those two sensors consecutively without activating another sensor. They also considered the number of sensor transitions where at least one of the sensors is a door sensor. To handle these two heterogeneous information streams with Markov Modulated Poison Process (MMPP), the authors introduced a variant of the process, called Markov Modulated Multidimensional Poison Process (M3P2), which performed better than MMPP. Their features are similar to our graphical features in regard to using sensor transitions. The difference is that we are considering any sensor transitions that occur in the data, regardless of topological connection and the types of sensors involved. Our model is applicable to multiple applications while using different kinds of sensors and prediction tasks. The model that was proposed by Aicha et al. [12] needs to be evaluated for its ability to generalize to other sensor applications and other prediction tasks. In their model, two variables representing the counts of two kinds of sensor transitions are represented as states of time. In our approach, we used the sensor transition times as additional edge features and directly feed these features to classifiers.
Mo et al. [13] used both raw data from all available sensors and applications in the mobile phone, computed tens of thousands of features through a complicated feature engineering process, designed specific features for specific tasks, and used background information, such as task-interdependency, to obtain a high performance. Nadeem et al. [14] used SVM and Random Forest with a bagging scheme over mean, variance, frequency, and many other measures to produce thousands of features that employ call log, visited GSM cells, visited Bluetooth devices, visited wireless LAN devices, accelerometer data, and other information. Brdar et al. [15] represented the data as a graph, where each node is a user, and used k-nearest-neighbor with feature selection to achieve better performance for predicting job-types. Dong et al. [16] predict the gender and age-group from call log and SMS data based on the relationship among populations of different genders and age groups. Ying et al. [17] address the imbalance class problem in the data through the use of multi-level classification and compute features that are related to users’ behavior and environment. Demographics predictions also have been done while using blogger’s writing style [18] and web-page click through data [19]. In “Limits of Predictability in Human Mobility” [20], the findings show that users’ daily mobility has a regularity and it has 93% predictability on average. Hence, it might be possible to build accurate predictive models that are dependent on human mobility. Montjoye et al. [21] emphasize the privacy concerns of an individual’s mobility due to the evidence that the uniqueness of human mobility traces is high and that mobility datasets can be re-identified while only using a few outside locations. Our GFF uses more generalized information of location categories, avoids complicated feature engineering, and uses graphical features that are only derived from GPS sensor data. We discuss more detailed comparisons to related works in the Results section.
Bouchard et al. [22] used raw latitude and longitude as the features to improve activity recognition performance and also discussed the potential of distance, position, shape, and gesture as features. Aminikhanghahi et al. [23] developed an approach, called Thyme, for adapting prompt timing that is based on the context of the user’s activity. Liao et al. [24] attempted to recognize activities from GPS traces through training a conditional random field to iteratively re-estimate significant places and activity labels. In our work, we add graphical features to the typical features that are used in this area and evaluate the potential of obtaining improved performance, along with having a general framework for IoT applications.
Recently, combining symbolic representations of domain knowledge and data-driven learning approaches has approached the activity recognition task. Yordanova et al. [25] proposed a Computational Causal Behaviour Model (CCBM) that is based on the principle of Computational State Space Models (CSSMs). They developed these models to address noisy sensor observations and the complications caused by behavior complexity and variability that are observed in real settings. They applied this approach to a sensor dataset that was collected by monitoring unscripted kitchen activities in order to detect eating behavior and provide indicators of health status. The performance was comparable to Hidden Markov Models and decision tree models. Olaru and Florea [26] used context graphs, context patterns, and continuous, persistent context matching to recognize context in ambient assisted living situations viewed as a set of associations between different concepts. Hao et al. [27] proposed a real-time inference engine while using formal concept analysis to graphically represent the concepts that can be used to improve the recognition of sequential, interleaved, and concurrent human activity patterns. Chen et al. [28] proposed an approach to address the cold-start, reusability, and incompleteness of data-driven activity modeling. They applied an ontology-based hybrid approach in three phases. In the first phase, the cold-start problem is resolved by creating seed activity models that are based on the domain knowledge that most activities of daily living (ADLs) are daily routines and normally take place at a specific time, location, and space. In the second phase, activity models are created in two levels of abstraction starting with generic coarse-grained activity models that are applicable for all users and then iteratively creating individual activity models, thus solving the reusability problem. In the third phase, the incompleteness problem is resolved by using learned activity patterns to update the activity models. Ye et al. [29] conducted similar work of applying a hybrid approach, who first use a general ontology model to represent domain knowledge across different environments and users. They then combine learning techniques with ontological semantics for unsupervised discovery of patterns for each individual’s daily activities.
In previous work, we proposed a graph-based representation of sensor data from which we extract graphical features that can improve the performance of prediction tasks for sensor networks, regardless of prediction tasks and domains [5,6,7]. We applied this concept to three different domains. Our graphical feature-based approach outperformed four previously explored graph-based approaches for activity recognition from motion sensor data in smart homes in [2]: Graph SVM, Subgraph SVM, Nearest Neighbor, and an ensemble of these three. The graphical features also outperformed three widely used baseline approaches in this domain: Naïve Bayes, Hidden Markov Model, and Conditional Random Field for the Cairo, Aruba, and Tulum smart home datasets [5]. In the second domain on demographic prediction, we used GPS location data from the Nokia Mobile Data Challenge 2012 dataset [30] to perform the classification task of demographic attributes, in particular classifying gender (two classes), age group (eight classes), and job type (eight classes). Graphical features or a combination of graphical and non-graphical features performed significantly better than only non-graphical features for all of the prediction tasks. In the third domain, activity prediction from smart phone data, the addition of GPS location category features to typical smart phone sensor-based features provided the best performance.
More recently, deep learning has been applied to the activity recognition task that is based on sensor data [31]. Like our approach, deep learning avoids the need for manual feature engineering. However, deep learning approaches can be overconfident in their classification, even for incorrect classifications, and information regarding the user’s context is not easily extracted from the learned network. To address these issues, Rueda et al. [32] proposed a hybrid activity recognition architecture, called Hybrid Computational Causal Behaviour Model (HCCBM), which combines deep learning with symbolic models, namely, CSSMs. However, deep learning has not been applied to our graph-based representation of sensor data. The Graph Convolutional Network (GCN) [33] is a deep learning approach to classification of graphs, and so it can be applied to our setting to evaluate a deep learning alternative to our GFF. The Results section presents an empirical comparison of the approaches.

3. Graphical Features-Based Framework

We propose a Graphical Feature-based Framework (GFF), where we represent one type of sensor data in a graph, extract graphical features, apply feature selection techniques, and then apply classification methods for a prediction task. Figure 1 shows the workflow for a particular prediction task from an IoT sensor network that is based on the GFF. We describe the details of each step of the GFF in Section 3.1, Section 3.2 and Section 3.3. The main contribution of this article is proposing and evaluating additional graphical features as part of the GFF. We describe these additional features in Section 3.4. In Section 3.5 and Section 3.6, we briefly discuss the deep learning approach and the GCN to compare with our graphical feature-based approach.

3.1. Sensor Categorization and Graph Representation

We consider each sensor in the IoT application, map the sensor to a sensor category if that is provided and available, and represent it as a node in the graph for representing IoT sensor network data as a graph. We construct a graph for each labeled instance of the IoT application. When two consecutive sensors are turned on during a labeled task, an undirected edge is added between the two corresponding nodes in the graph. Two sensors can be consecutively triggered multiple times, that is, an edge can be triggered multiple times. We store this count of multiple edges being triggered as edge attributes. The same sensor can be consecutively triggered in some applications, and hence we allow self-loops in our graph representation to capture this information.
We consider motion sensors as nodes for the task of activity recognition in the smart home. Each motion sensor can be mapped to a room type such as bedroom, kitchen, dining room, and living room, which are used as the sensor’s category [5]. For the tasks of demographic prediction [6] and activity recognition from smart phone sensors [7], we consider location data from the GPS of mobile phones and categorize the locations visited. We represent each location category as a node in the graph. We find the nodes in the graph for the corresponding location categories and add an undirected edge between the two nodes whenever the participant moves from one location to another location. In this way, we create a graph for each user in the demographic prediction task and a graph for each activity in the activity recognition task. We show examples of graph representations generated from motion sensor data and from GPS data in Figure 2.

3.2. Graphical Feature Extraction

We construct one graph for each labeled instance in the sensor network data. We construct one instance per user where we extract the existence of nodes and edges from the user’s graph as features for the demographic prediction problem. For the activity recognition problem, we construct one graph instance per activity. We use the list of all the nodes and edges triggered in all graphs as the feature set. Initially, three different kinds of graphical features have been used: existence of nodes, existence of edges, and existence of both nodes and edges. For the existence of nodes experiment, our feature set consists of all unique sensor categories that are represented as nodes in the graphs across all instances. If a node is present in the graph for a user, we set the value to ‘ON’ for that feature in the corresponding instance; otherwise, we set the value to ‘OFF’. For the existence of edge experiment, we compute all the unique edges triggered by all instances in their sensor-trajectory graph. We use this list of all edges as the feature set for the existence of edge experiment. To construct an instance, we consider the graph for each instance and check the existence of the edges. If the edge exists, then we mark the value as ‘ON’; otherwise, ‘OFF’. In the third experiment, we combined existence of both nodes and edges that we used as features in the previous two experiments as the combined feature set. We also computed how many times each node has been turned on during an activity and used these counts as features. We store the number of times that each edge is triggered as edge attributes and use these edge counts as graphical features. We attempted to introduce additional transitional features for the demographic prediction problem. We save the types of sensors as node attributes, and we denote the time used to move from one sensor to another sensor as the edge transition time and save it as an edge attribute. We use these node and edge attributes as additional graphical features and assess their impact on prediction task performance.

3.3. Feature Selection and Classification

The number of features may be large when compared to the number of instances since we are using nodes and edges triggered across all graphs as our feature set. Feature selection is employed for reducing the number of features to avoid overfitting in this condition. In the literature, there are two main approaches for choosing useful and relevant attributes: filtering and wrapper [34]. The filtering approach ranks each individual feature based on different measures, such as information gain, correlation, gain ratio, and symmetrical uncertainty. In the wrapper approach, subsets of the feature set are checked with a classification algorithm, and the subset that performs the best is provided as the optimal and selected feature subset. When the size of the feature set is large with thousands of features, the wrapper approach might be computationally expensive. For a large number of features, a hybrid approach is used, where a feature set of reduced size is constructed based on the filtering approach, and then the wrapper approach is applied to the reduced feature set to find an optimal feature subset. We then apply an appropriate classification technique on this optimally selected set of features. For the classifiers, we considered the Support Vector Machine and different tree-based classifiers. We describe the details of the feature selection algorithms and the classification techniques that provided the best result for each application in the Results section.

3.4. Additional Features

We experimented with additional features: normalized node counts and edge counts, edge transition times. We present the results of applying these additional features to the smart home domain. We also show the result of using sensor type and room type as categorical information for smart home sensors. We performed an experiment in the smart home domain to evaluate the impact of different window sizes on the use of graphical features while classifying activities. The results from these experiments are presented in the Results section.

3.5. Deep Learning Approach

We explore Deep Learning (DL) approaches as alternatives to the GFF, due to DL’s ability to construct relevant features without user intervention. The advantage of using a DL network as a classifier is that it automatically identifies relevant features and avoids the need for users to construct handcrafted features. Using DL as a classifier may also avoid the need for adding graphical features if the DL classifier can internally construct these features. We provide raw sensor data and graphical features to a DL classifier to experiment with this. Afterwards, we compare the results with our GFF.

3.6. Graph Convolutional Network (GCN) Approach

We also explore applying a Graph Convolutional Network (GCN) to activity prediction in the smart home. Under the umbrella of GCNs, a large number of methods have been developed to re-define convolution for graph data with the first prominent research that was presented in Bruna et al. [35]. Graph based neural networks (GNNs) have been applied in the areas of computer vision, recommender systems, forecasting in traffic networks, and molecular chemistry [33]. We follow the method that was developed by Zhang et al. [36], called Deep Graph Convolutional Neural Network (DGCNN), which directly takes a graph as input to learn a classification function and thus avoids the need of transforming the graphs to vectors. This method performed better than the existing methods on many benchmark datasets. We apply DGCNN to the graphs generated from sensor data in the smart home domain. Subsequently, we compare the results with our GFF.

4. Results

Here, we present results on several variations of the Graphical Feature-based Framework (GFF). These variations include using different window sizes over the sensor data, normalizing feature values, adding additional features, and using different classifiers. We also show the results on using deep learning for activity prediction mainly to compare the feature generation strengths of deep learning networks for graphs to our graph-based features.

4.1. Window-Based Approach with Graphical Features

In previous experiments, we used an activity as an instance for constructing graphs and extracting nodes and edges. Here, we segment the duration of an activity into smaller time windows to see the effect of window size on the accuracy of activity recognition. In a real-world scenario, we may not know when an activity instance begins and ends, so we are more likely to collect sensor data over a fixed-size time window. We divide each activity into segments based on a fixed window size and then use these smaller segments as instances to construct graphs. We compute the minimum, maximum, and average window size for each activity for each dataset to decide the range of window sizes to be varied. These statistics are presented in Table 1, Table 2 and Table 3.
We vary the window sizes for each dataset from five seconds to the average window size for all activities (Aruba 1895 seconds, Cairo 1020 seconds, Tulum 4478 seconds). We apply linear support vector machine classification (LinearSVC from Scikit Learn [37]) with 10-fold cross validation and the one-vs-rest method for this multi-class classification task. Accuracy for activity recognition is shown for increasing the window sizes for dataset ‘Aruba’ in Figure 3, for dataset ‘Cairo’ in Figure 4, and for dataset ‘Tulum’ in Figure 5. In these figures, ‘full activity’ indicates the use of the entire activity for computation of graphical features instead of segmenting the activities into windows of specific time duration. ‘Full Activity’ is depicted using a horizontal line for each case of nodes, edges, and nodes+edges.
From these plots we observe that accuracy increases with increasing window size for all types of features for Tulum and Cairo. For Aruba, the highest accuracy is observed at window size 500, and accuracy decreases with larger window sizes until full activity. For Aruba and Tulum datasets, node features performed better than edge features, but the combination of nodes and edges performed the best for all of the window sizes. Edge features performed low for small window sizes near five seconds, but the performance increased faster with the increase in window size. This indicates that larger window sizes will contain more edge transitions that help to improve prediction performance. For the Cairo dataset, accuracy in general increases with window size for all types of features, but for node and edge features, accuracy sometimes decreases with increasing window size. For some window sizes, edges performed better, and for some window sizes, nodes performed better, as depicted in Figure 4. However, the combination of nodes and edges performed better than nodes and edges separately as features. Cairo accuracies using the combination of nodes and edges are increasing for all increasing window sizes, with one exception at window size of 900 s. We can conclude from the plots in Figure 3, Figure 4 and Figure 5 that either edge counts or the combination of node and edge counts always provides better performance than node and edge separately, regardless of window size.
When we are using varying window sizes from five seconds to average activity duration, there are cases when window sizes for some instances of some activities are smaller than the current window size under evaluation. In this case, we used the whole activity as the training or test instance. In a real-world scenario, the label of the test instances and their window length will be unknown. As a result, we have to use a window size W for all test instances. There is extra irrelevant data in the test instance if the test instance we are trying to classify is an activity whose duration is D in the training set, and D < W, which may confuse the learning model and may decrease performance. In the same way, if D > W, then some relevant data is excluded from the test instance. This experiment of varying window sizes can be re-designed using a fixed window size of W for all test instances to address these possibilities. Afterwards, we can observe how the performance of basic features changes with the change of window size and how adding graphical features to basic features improves for different window sizes. This redesigned experiment will be pursued in future work.

4.2. Normalization

The number of nodes and edges triggered may vary with varying duration of different activities. Thus, we considered normalizing the graphical features (node counts, edge counts, and total number of nodes and edges) with respect to the total number of sensor events that occurred during each activity. Table 4 and Figure 6 show the impact of normalized graphical features on the accuracy of activity prediction. We used linear SVM from Scikit-Learn with default parameters, including one vs. rest as the method for multi-class classification. The combination of nodes and edges performed the best for all three testbeds. For Aruba, normalized features provide the best result; for Tulum and Cairo, unnormalized features provide the best result. Node normalization significantly improved the accuracy in Aruba and slightly in Tulum. Edge normalization improved accuracy in Aruba slightly and in Tulum significantly. Node and edge normalization both decreased performance in Cairo. Normalizing the combination of node-edge increased accuracy slightly in Aruba and decreased accuracy both in Tulum and Cairo. In all tables the best performers are in bold.
According to a paired t-test with 95% confidence, all of the higher-accuracy results for Aruba are statistically significant. For Tulum and Cairo, unnormalized nodes+edges perform significantly better than the nodes and edges. Normalized node-edges perform significantly better than the nodes and edges for both of these testbeds. However, the performance difference between unnormalized nodes+edges and normalized nodes+edges is not significant in Cairo and Tulum.
Models that are based on counts of nodes and counts of edges may be susceptible to imbalanced datasets. These counts may vary depending on the duration of activities. We experimented using normalization to handle these cases. For an imbalanced dataset, the variance in the counts will be lower for activities with higher frequency in the data when these features are discriminating. Variance can be higher for activities with lower frequency in the data. Normalization of these two features may help in reducing the difference in variances among different activities with varying frequencies, and thus provide improved performance for imbalanced datasets.

4.3. Random Forest Classifier on Smart Home Domain

We also considered using a different classifier as the base classifier in the smart home domain. Table 5 presents the performance comparisons between Support Vector Machine (SVM) and Random Forest (RF) applied to three sets of unnormalized transitional features for each testbed. The best performance is from RF with node-edge in Aruba, from SVM with node-edge in Tulum, and from RF with node in Cairo.
We performed paired t-test for entries in Table 5 with 95% confidence. For Aruba, node-edge with Random Forest performed significantly better than the node only and edge only with Random Forest. Node-edge with Random Forest performance is significantly better than node-edge with Linear SVM. For Tulum, the performance difference between Linear SVM with node-edge and Random Forest with node-edge is not statistically significant. For Cairo, RF with nodes performs significantly better than RF with edges or node-edges, and RF with nodes performs significantly better than the Linear SVM with nodes.

4.4. Edge Transition Times as Additional Features

In this experiment, we consider the effect on performance by including the time that is taken to traverse an edge. Edge transition time is the time that is spent to arrive at the second node of an edge after triggering the first node. We compute this transition time for each edge, store it as an edge attribute in the graph, and use this information as an additional feature, along with edge count for each edge. Each edge can be triggered multiple times in an activity graph, resulting in a list of varying transition times. We use the average of these transition times as the feature value for each edge transition time. Table 6 shows the accuracy after adding transition time to the feature list for the three testbeds: Aruba, Tulum, and Cairo. Node-edge features performed the best for Aruba and Tulum. For Cairo, node is the best performer, indicating that adding edge transition times reduced the performance of graphical features in the case of Cairo. Performing a paired t-test on the results of this table show that edge and node-edge performance is significantly better than node performance for Aruba. For Tulum, node-edge performance is significantly better than both node and edge. There are no significant differences in the performance among the graphical features in Cairo. These results after adding edge transition time did not improve upon our best result so far.
Table 7 shows the number of instances, nodes, edges, and combination of nodes and edges. The number of edges doubled due to adding transition time for each edge. Accordingly, we next considered the use of feature selection to see whether the use of only selected features improves performance. We used the SelectKBest feature selection method to select the k most important features based on the mutual information criteria [38,39,40]. We varied the value of k from 10 to the maximum number of features for each testbed. We provide this range of possible values to GridSearchCV in Scikit-Learn that conducts an exhaustive search over the specified parameter values of an estimator. The parameters of the estimator are optimized by cross-validated grid-search over a parameter grid [37]. The parameters selected are those that maximize the score of the left-out data while using accuracy as the score. We separate each testbed data into 67% training and 33% testing. Subsequently, we provide the training set to GridSearchCV to select the best value for k and to find a fit with the Linear SVM and Random Forest classifiers. We use the optimized k value and classifier to classify the testing data. Table 8 presents accuracy for this prediction task while using Linear SVM as the classifier.
We observe that Node-Edge provides the best performance for Aruba and Tulum with linear SVM and with transition times as additional features. For Cairo, Node is the best performing features in this case where edge transition times are used. We observe that a mix of node counts, edge counts, and average edge transition times are selected as the best features for the three test beds.
With Random Forest as the classifier, Edge performed best for Aruba, and Node performed best for Tulum and Cairo. Table 9 presents the results for accuracy.
Table 5 shows the accuracies before adding edge transition time. Statistical significance testing could not be easily done, given the difference in experimental settings, but comparing the above results with this previous result indicates that the best result is without using edge transition time for Tulum and Cairo. However, for Aruba, adding edge transition time and using Random Forest as the classifier provides slightly better accuracy.

4.5. Categorical Features

In this section, we add other types of sensors as nodes along with motion sensors. We add transitions between these different types of sensors as edges. We assign a room type to each sensor, add room types as node attributes, and use room types and transition between room types as the features. Among the three testbeds, only Aruba has door sensors. After adding door sensors, accuracy using nodes significantly improved, accuracy using edge and node-edge improved slightly. Table 10 presents this result.
All of the sensors deployed in a smart home can be grouped according to their locations in the residence. Motion and door sensors located in bedrooms, bathrooms, kitchen, dining room, hallways, and living room can be mapped to the corresponding room type. We store room type as a node attribute while constructing an activity graph. Instead of using individual sensors as nodes, we use their room types as nodes and transitions between room types as edges. We manually create the mapping between sensors and room types from the apartment layout and provide the map to the feature extraction algorithm.
Table 11 shows the accuracy using room types to create an activity graph and nodes and edges as features. We observe that the combination of nodes and edges provided the best accuracy for all test beds.
Next, we add nodes and edges that are related to room types as additional features to our previous motion and door sensor-based graphical features. Table 12 and Table 13 present accuracies while using these additional features with Linear SVM and Random Forest as classifiers, respectively. Edge or the combination of nodes and edges performed the best for all three testbeds and for both classifiers. Performing a paired t-test with 95% confidence on the data for Table 12 shows that, for Aruba, both edge and node-edges perform significantly better than only nodes. However, the performance difference between edges and node-edges is not statistically significant. For Tulum, the node-edges perform significantly better than only nodes and only edges. For Cairo, all results are statistically significant. Conducting a paired t-test on the data for Table 13 demonstrates that node-edge performs significantly better than only nodes or only edges for Aruba. For Cairo, the node-edge performs significantly better than edges, but the performance difference between nodes and node-edge is not significant. For Tulum, edges perform significantly better than node-edges, but the performance difference between nodes and edges are not significant. When comparing with the previous best result shown in Table 5, for Aruba, performance slightly improved after adding room type features; for Tulum, accuracy decreased after adding room type; and for Cairo, adding room type did not affect accuracy.
Raw sensor information, such as GPS coordinates, can be too specific for learning algorithms in sensor applications. Mapping raw sensor information to categories (e.g., mapping GPS to location categories, such as coffee shop, office, home, etc.) improved the accuracy for prediction tasks compared to raw sensor information. Mapping motion sensor data in a smart home to room type information, such as bedroom, living room, kitchen, etc., and then computing graphical features on this categorized sensor information, further improves the performance of prediction tasks for single resident apartments. IoT practitioners who will use the GFF can supply mapping information from sensor id to sensor type.
We conclude that using the combination of nodes and edges as features improves the activity recognition performance in most cases when compared to nodes or edges only. Adding sensor types that are related to movement such as door sensors along with motion sensors improved the recognition accuracy. For a single resident apartment, such as Aruba, feature extraction using an activity window size of less than the average window size showed the best performance instead of using full activity time. Graphical feature extraction using full-activity as the window size provided the best performance for apartments with multiple residents, like Cairo and Tulum. In the case of single resident apartments, like Aruba, adding edge transition time combined with a feature selection method improved the accuracy of activity recognition. Adding edge transition time did not improve performance in the case of multi-resident apartments, such as Cairo and Tulum.

4.6. Deep Learning Approach

In these experiments, we apply a deep learning approach to activity recognition in the smart home domain. We used Keras [41], which is a high-level neural network API that is built on top of TensorFlow [42], which is an open source platform for machine learning. We used Google Collaboratory cloud service to create and train the neural net with GPU support. We created a basic sequential neural network of three dense layers. ‘Dense’ indicates fully connected layers where each node in the next layer is connected to all nodes in the previous layer. We conduct two experiments with existence of graphical features and count of graphical features as inputs to the neural network. The input layer contains 256 nodes and the hidden layer contains 128 nodes. The output layer contains one node for each activity. The hidden layer uses ReLU as the activation function and the output layer uses softmax as the activation function. We used ‘adam’ as the optimizer, ‘sparse categorical crossentropy’ as the loss function, and ‘accuracy’ as the metric. We conducted training for 30 epochs while using the train-test separation as the evaluation method with 70% data in the training set. Table 14 shows average accuracy on the test set for the three test-beds when we provide existence of nodes, edges, and combination of nodes and edges as inputs to the three-layer neural network. We present the accuracies in Table 15 resulting from the same neural network when we provide counts of nodes, edges, and combination of nodes and edges as inputs.
We observe from the above results that edges or node-edge combination work better than only nodes for the DL approach for most of the cases, except for the single-resident apartment Aruba with count features. For Aruba, nodes performed the best with the DL approach. Count features performed better when compared to existence features for the DL approach, similar to the non-DL approach with graphical features. In Table 16, we present the best results from graphical features and the combination of features and algorithm that provided the best result in our GFF approach. Statistical significance testing could not be easily done, given the different in experimental settings, but comparing these performances with the DL accuracies in Table 15, we observe that graphical features perform similar to the DL network for single resident apartment Aruba. The graphical features performed better than the DL network for the multi-resident apartments Cairo and Tulum.

4.7. Graph Convolutional Network (GCN) Approach

We used the Graph Convolutional Neural Network (GCNN) that was developed by Zhang et al. in order to test the ability of deep learning to classify our graph representation of activities directly [36]. We generated the graphs for each test bed and provided those to the GCNN tool that trains the neural net for 500 epochs. We use the train and test separation method as the evaluation method with 70% data in the training set. We report the test set accuracy on the 500th epoch. The GCNN tool also has an option to take continuous node attributes as inputs along with node and edge information. Here, we conduct two different experiments: (1) provide input graphs with nodes and edges information and (2) provide input graphs with nodes, edges, and node attributes indicating the count of how many times each node is triggered. We present the results of these two experiments in Table 17 and Table 18.
We observe from Table 17 and Table 18 that GCNN with node count as attributes performs better than GCNN with existence of nodes and edges for multi-resident apartments Cairo and Tulum, but GCNN with node count performs worse for single resident apartment Aruba. When comparing with the best results from the GFF, as shown in Table 16, we observe that the GFF performs better than GCNN.

5. Conclusions

We studied and assessed the use of graph representations and graphical features in IoT sensor networks to improve the performance on prediction tasks. We proposed a graphical feature-based framework (GFF) to apply to sensor network data. The purpose of the construction and use of this kind of framework is multi-fold. First, the framework leverages inherent graph structure of sensor networks to represent sensor network data. Second, the framework offers a generic approach of applying graphical features to improve the accuracy on prediction tasks across different sensor networks. Third, the framework improves the predictions without using application-specific and prediction-task specific feature crafting.
We evaluated this hypothesis on three application areas, activity recognition from motion sensors in smart homes, demographic prediction from smart phone GPS, and activity recognition from smart phone GPS. The GFF approach improved the accuracy of activity recognition from motion sensors in most cases when compared with other widely used baseline methods for motion sensor data, such as Hidden Markov Model, Naïve Bayes, and Conditional Random Field. For demographic attribute prediction, the GFF outperformed most of the state-of-the-art methods while using no background knowledge and while using no application-specific and task-specific feature construction. The results for activity recognition from GPS data demonstrate that adding edges can increase the accuracy for most activities.
We present the results of using graphical features from different window sizes, while using normalized and categorized graphical features, applying edge transition times as graphical features and comparing Random Forest with Linear SVM for motion sensor data in a smart home. We analyzed the impact of graphical features on activity recognition accuracy from motion sensor data for varying window sizes. For each window size, either the edges or the combination of nodes and edges as features performed better than using only nodes. For the single-resident apartment, slight improvements in accuracy were observed by each of normalizing graphical features, adding graphical features that are based on room types, adding edge transition times as additional features, and using important features with feature selection methods. For multi-resident apartments, these additional features, along with feature selection, did not show improved accuracy. For activity recognition using smart home motion sensor data, SVM worked better for some testbeds and Random Forest worked better for other testbeds. We also presented a comparison between the GFF, a deep learning approach, and the Graph Convolutional Network, with the initial results indicating that the GFF outperforms both of these methods.

6. Future Directions

This study suggests several directions for future research. One research direction is to extract multi-edge paths and small subgraphs along with nodes and edges as features and evaluate the effect of these graphical features on prediction tasks for IoT sensor network applications. We extracted features representing two-edge transitions for one testbed in activity recognition from motion sensors in the smart home, but the initial results showed decreased performance. Moreover, generating longer paths and larger sub-graphs can be computationally expensive and time-consuming combined with higher numbers of features. However, a more systematic exploration of multi-edge transitions with feature selection methods, and evaluating these features on different testbeds and applications, is needed to assess whether multi-edge-based features is a promising direction. We would also like to compare our approach with symbolic representation-based approaches discussed in the related work section and to assess the possibility to combine the methods for improving the performance of sensor network prediction tasks. Symbolic approaches and hidden Markov models exploit the structure of high-level activities, and our approach exploits the sensor’s topology. We can analyze whether there are similar structures in both cases and if the combination of high-level activity structure with low-level sensor structure improves the performance of the model.
We plan to try classifiers other than SVM and Random Forest for improving prediction task performance and evaluate the GFF. We would like to investigate the reasoning behind why different classifiers are working better for different datasets and applications. Based on that, we would like to provide the best practices for classifier selection in the GFF.
For continuous features, like node and edge counts, we can transform them into nominal features. Transforming a continuous variable to a binary variable is called dichotomization [43]. In our work, the existence of nodes and edges can be considered a dichotomization of the counts of nodes and edges. Using dichotomized features may result in information loss and decreased performance [44]. The choice of cut point to dichotomize the feature is another challenging issue as the amount of information lost depends on the prior choice of cut points, with the optimal cut point depending upon the unknown parameters [45]. This aligns with our result in the smart home environment, where we obtained better performance with count features instead of existence features. However, many of the feature selection algorithms provide better results with the discretized features. Discretization is used as a pre-processing step for correlation-based feature selection approaches [46]. This aligns with our result in demographic attribute prediction from GPS data, where selecting fewer and important features from more than 7000 features is required to improve the performance. In this case, discretizing the features before providing them to the feature selection algorithm was helpful. In the future, we can experiment with discretizing counts of nodes and edges into several intervals with discretization methods that were described in [46], so that information loss is minimal and at the same time feature selection methods can effectively use the discretized features.
With time series data, forecasting practitioners usually want to predict future events that are based on historical data. As a result, cross-validation with random subsets of the data is not a good choice, because it will be easier to learn looking at the information both before and after the dates that we are trying to predict, which is different than the real-life scenario, where we need to predict the events that will happen in the future based on events that happened in the past [47]. Additionally, research shows that performance on testing data closer-in-time to the training data will likely be better than performance on testing data farther-in-time from the training data [48]. One way to address this issue is to wait and see in real time. For example, Makridakis et al. [49] made forecasts in September 1987 and evaluated them at the conclusion of 1988. Instead of waiting a long time, we can use earlier data to fit and estimate a model and reserve later data to evaluate the forecasting accuracy. This becomes structurally similar to the real-world environment, where we are in the present and predict the future. In our future work, we will evaluate based on test data that appear later in the dataset instead of a randomly-selected subset of the data for time series forecasting from IoT sensor network data.
Generally, human activities follow a pattern of nodes visited and edge transitions. Over time, activity patterns may change, and new transitions may appear in the test data that were not seen in the training data. We can evaluate the effect of these new transitions on the activity recognition accuracy by observing the performance drift of the models on new test data. Models exhibiting a significant decrease in performance over time will need to be retrained.
While deep learning approaches to activity recognition from sensor data have shown promise [31], they performed worse in our setting than our non-deep learning approach. Deep learning approaches can be improved by tuning hyper parameters, such as number of layers, learning rate, iterations, number of units in hidden layers, choice of activation function, momentum term, mini batch size, regularization parameters, and many others. However, significant network tuning would eventually equate to the effort of feature-crafting in the traditional machine learning approach that our framework was designed to avoid.
The usefulness of the GFF can be assessed in other IoT scenarios that are widely used for tracking and monitoring. We built three different applications for data processing and modeling for three different domains. In the future, we can build one single software application and define a general interface for all IoT data, so that sensor information, sensor id to sensor category mappings, and other inputs can be provided as part of this input interface. As a result, any IoT sensor network data can be fit into this input format and easily tested to evaluate the performance improvement provided by the use of graphical features.

Author Contributions

Methodology, S.A. and L.H.; Software, S.A.; Supervision, L.H.; Validation, S.A.; Visualization, S.A.; Writing—original draft, S.A.; Writing—review and editing, S.A. and L.H.

Funding

This research was funded by the National Science Foundation, grant numbers 1318913 and 1646640.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

References

  1. George, B.; Shekhar, S. Time-Aggregated Graphs for Modeling Spatio-temporal Networks. In Journal on Data Semantics XI; Springer-Verlag: Berlin, Germany, 2008; pp. 191–212. [Google Scholar]
  2. Long, S.S.; Holder, L.B. Using Graphs to Improve Activity Prediction in Smart Environments Based on Motion Sensor Data. In Proceedings of the 9th International Conference on Smart Homes and Health Telematics, Montreal, QC, Canada, 20–22 June 2011; pp. 57–64. [Google Scholar]
  3. Salomón, S.; Tîrnăucă, C.; Salomón, S.; Tîrnăucă, C. Human Activity Recognition through Weighted Finite Automata. Proc. Int. Conf. Ubiquitous Comput. Ambient. Intell. 2018, 2, 1263. [Google Scholar] [CrossRef]
  4. George, B.; Kang, J.M.; Shekhar, S. Spatio-Temporal Sensor Graphs (STSG): A Data Model for the Discovery of Spatio-temporal Patterns. Intell. Data Anal. 2009, 13, 457–475. [Google Scholar] [CrossRef]
  5. Akter, S.S.; Holder, L.B. Activity Recognition Using Graphical Features. In Proceedings of the International Conference on Machine Learning and Applications, Detroit, MI, USA, 3–6 December 2014; pp. 165–170. [Google Scholar]
  6. Akter, S.; Holder, L. Using Graphical Features To Improve Demographic Prediction From Smart Phone Data. In Proceedings of the International Workshop on Network Data Analytics (NDA), Chicago, IL, USA, 14–19 May 2017; pp. 1–8. [Google Scholar]
  7. Akter, S.S.; Holder, L.B.; Cook, D.J. Activity Recognition Using Graphical Features from Smart Phone Sensor. In Proceedings of the International Conference on Internet of Things, Valencia, Spain, 15–18 October 2018; Springer: Cham, Switzerland, 2018; pp. 45–55. [Google Scholar]
  8. Chen, C.; Das, B.; Cook, D.J. A Data Mining Framework for Activity Recognition in Smart Environments. In Proceedings of the 2010 Sixth International Conference on Intelligent Environments, Kuala Lumpur, Malaysia, 19–21 July 2010; pp. 80–83. [Google Scholar]
  9. Singla, G.; Cook, D.J.; Schmitter-Edgecombe, M. Incorporating Temporal Reasoning into Activity Recognition for Smart Home Residents. In Proceedings of the AAAI Workshop on Spatial and Temporal Reasoning, Chicago, IL, USA, 13 July 2008. [Google Scholar]
  10. Singla, G.; Cook, D.J.; Schmitter-Edgecombe, M. Tracking Activities in Complex Settings Using Smart Environment Technologies. Int. J. Biosci. Psychiatr. Technol. 2009, 1, 25–35. [Google Scholar]
  11. Nazerfard, E.; Das, B.; Holder, L.B.; Cook, D.J. Conditional Random Fields for Activity Recognition in Smart Environments. In Proceedings of the 1st ACM International Health Informatics Symposium, Arlington, WV, USA, 11–12 November 2010; ACM: New York, NY, USA, 2010; pp. 282–286. [Google Scholar]
  12. Aicha, A.N.; Englebienne, G.; Kröse, B. Modeling visit behaviour in smart homes using unsupervised learning. In Proceedings of the 2014 ACM International Joint Conference on Pervasive and Ubiquitous Computing Adjunct Publication—UbiComp 14 Adjunct, Seattle, WA, USA, 13–17 September 2014; ACM Press: New York, NY, USA, 2014; pp. 1193–1200. [Google Scholar]
  13. Mo, K.; Tan, B.; Zhong, E.; Yang, Q. Report of Task 3: Your Phone Understands You. In Proceedings of the Nokia Mobile Data Challenge Workshop, Newcastle, UK, 18–19 June 2012. [Google Scholar]
  14. Mohrehkesh, S.; Ji, S.; Nadeem, T.; Weigle, M. Demographic Prediction of Mobile User from Phone Usage. In Proceedings of the Nokia Mobile Data Challenge Workshop, Newcastle, UK, 18–19 June 2012. [Google Scholar]
  15. Brdar, S.; Ćulibrk, D.; Crnojević, V.; Obradovića, T.D. Demographic Attributes Prediction on the Real-World Mobile Data. In Proceedings of the Nokia Mobile Data Challenge Workshop, Newcastle, UK, 18–19 June 2012. [Google Scholar]
  16. Dong, Y.; Yang, Y.; Tang, J.; Yang, Y.; Chawla, N.V. Inferring user demographics and social strategies in mobile social networks. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining—KDD 14, New York, NY, USA, 24–27 August 2014; pp. 15–24. [Google Scholar]
  17. Ying, J.J.; Chang, Y.; Huang, C.; Tseng, V.S. Demographic prediction based on user’s mobile behaviors. Mob. Data Chall. 2012, 1–6. [Google Scholar]
  18. Schler, J.; Koppel, M.; Argamon, S.E.; Pennebaker, J.W. Effects of Age and Gender on Blogging. In Proceedings of the AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs, Palo Alto, CA, USA, 22–25 March 2005. [Google Scholar]
  19. Hu, J.; Zeng, H.J.; Li, H.; Niu, C.; Chen, Z. Demographic prediction based on user’s browsing behavior. In Proceedings of the 16th international conference on World Wide Web—WWW 07, Banff, AB, Canada, 8–12 May 2007; ACM Press: New York, NY, USA, 2007; p. 151. [Google Scholar]
  20. Song, C.; Qu, Z.; Blumm, N.; Barabási, A.L. Limits of predictability in human mobility. Science 2010, 327, 1018–1021. [Google Scholar] [CrossRef] [PubMed]
  21. De Montjoye, Y.A.; Hidalgo, C.A.; Verleysen, M.; Blondel, V.D. Unique in the Crowd: The privacy bounds of human mobility. Sci. Rep. 2013, 3, 1376. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  22. Bouchard, K.; Holder, L.; Cook, D.J. Extracting generalizable spatial features from smart phones datasets. In Proceedings of the AAAI Workshop on Artificial Intelligence Applied to Assistive Technologies and Smart Environments, Phoenix, AZ, USA, 12 February 2016. [Google Scholar]
  23. Aminikhanghahi, S.; Fallahzadeh, R.; Sawyer, M.; Cook, D.J.; Holder, L.B. Thyme: Improving Smartphone Prompt Timing Through Activity Awareness. In Proceedings of the IEEE International Conference on Machine Learning and Applications (ICMLA), Cancun, Mexico, 18–21 December 2017; pp. 315–322. [Google Scholar]
  24. Liao, L.; Fox, D.; Kautz, H. Location-based activity recognition. In Proceedings of the Advances in Neural Information Processing Systems (NIPS), Montreal, QC, Canada, 7–12 December 2005. [Google Scholar]
  25. Yordanova, K.; Lüdtke, S.; Whitehouse, S.; Krüger, F.; Paiement, A.; Mirmehdi, M.; Craddock, I.; Kirste, T.; Yordanova, K.; Lüdtke, S.; et al. Analysing Cooking Behaviour in Home Settings: Towards Health Monitoring. Sensors 2019, 19, 646. [Google Scholar] [CrossRef] [PubMed]
  26. Olaru, A.; Florea, A. Context Graphs as an Efficient and User-Friendly Method of Describing and Recognizing a Situation in AAL. Sensors 2014, 14, 11110–11134. [Google Scholar] [CrossRef] [PubMed]
  27. Hao, J.; Bouzouane, A.; Gaboury, S. Complex behavioral pattern mining in non-intrusive sensor-based smart homes using an intelligent activity inference engine. J. Reliab. Intell. Environ. 2017, 3, 99–116. [Google Scholar] [CrossRef]
  28. Chen, L.; Nugent, C.; Okeyo, G. An Ontology-Based Hybrid Approach to Activity Modeling for Smart Homes. IEEE Trans. Hum. Mach. Syst. 2014, 44, 92–105. [Google Scholar] [CrossRef]
  29. Ye, J.; Stevenson, G.; Dobson, S. USMART: An Unsupervised Semantic Mining Activity Recognition Technique. ACM Trans. Interact. Intell. Syst. 2015, 4, 1–27. [Google Scholar] [CrossRef]
  30. Laurila, J.K.; Gatica-Perez, D.; Aad, I.; Bornet, O.; Do, T.M.T.; Dousse, O.; Eberle, J.; Miettinen, M. The Mobile Data Challenge: Big Data for Mobile Computing Research. In Proceedings of the Pervasive and Mobile Computing, Trento, Italy, 26–30 March 2012. [Google Scholar]
  31. Wang, J.; Chen, Y.; Hao, S.; Peng, X.; Hu, L. Deep learning for sensor-based activity recognition: A survey. Pattern Recognit. Lett. 2019, 119, 3–11. [Google Scholar] [CrossRef] [Green Version]
  32. Rueda, F.M.; Ludtke, S.; Schroder, M.; Yordanova, K.; Kirste, T.; Fink, G.A. Combining Symbolic Reasoning and Deep Learning for Human Activity Recognition. In Proceedings of the 2019 IEEE International Conference on Pervasive Computing and Communications Workshops (PerCom Workshops), Kyoto, Japan, 11–15 March 2019; pp. 22–27. [Google Scholar]
  33. Wu, Z.; Pan, S.; Chen, F.; Long, G.; Zhang, C.; Yu, P.S. A Comprehensive Survey on Graph Neural Networks. arXiv 2019, arXiv:1901.00596. [Google Scholar]
  34. Hall, M.A.; Holmes, G. Benchmarking attribute selection techniques for discrete class data mining. IEEE Trans. Knowl. Data Eng. 2003, 15, 1437–1447. [Google Scholar] [CrossRef]
  35. Bruna, J.; Zaremba, W.; Szlam, A.; Lecun, Y. Spectral networks and locally connected networks on graphs. In Proceedings of the International Conference on Learning Representations (ICLR), Banff, AB, Canada, 14–16 April 2014. [Google Scholar]
  36. Zhang, M.; Cui, Z.; Neumann, M.; Chen, Y. An End-to-End Deep Learning Architecture for Graph Classification. In Proceedings of the AAAI, New Orleans, LA, USA, 2–7 February 2018. [Google Scholar]
  37. Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-learn: Machine Learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
  38. Kozachenko, L.F.; Leonenko, N.N. Sample Estimate of the Entropy of a Random Vector. Probl. Inf. Transm. 1987, 23, 95–101. [Google Scholar]
  39. Kraskov, A.; Stögbauer, H.; Grassberger, P. Estimating mutual information. Phys. Rev. E 2004, 69, 66138. [Google Scholar] [CrossRef]
  40. Ross, B.C. Mutual Information between Discrete and Continuous Data Sets. PLoS ONE 2014, 9, e87357. [Google Scholar] [CrossRef]
  41. Chollet, F. Keras. Available online: https://keras.io/ (accessed on 7 June 2019).
  42. Abadi, M.; Agarwal, A.; Barham, P.; Brevdo, E.; Chen, Z.; Citro, C.; Corrado, G.S.; Davis, A.; Dean, J.; Devin, M.; et al. TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems. arXiv 2016, arXiv:1603.04467. [Google Scholar]
  43. Royston, P.; Altman, D.G.; Sauerbrei, W. Dichotomizing continuous predictors in multiple regression: A bad idea. Stat. Med. 2006, 25, 127–141. [Google Scholar] [CrossRef]
  44. Ravichandran, C.; Fitzmaurice, G.M. To dichotomize or not to dichotomize? Nutrition 2008, 24, 610–611. [Google Scholar] [CrossRef] [PubMed]
  45. Fedorov, V.; Mannino, F.; Zhang, R. Consequences of dichotomization. Pharm. Stat. 2009, 8, 50–61. [Google Scholar] [CrossRef] [PubMed]
  46. Hall, M.A. Correlation-Based Feature Selection for Machine Learning; The University of Waikato: Hamilton, New Zealand, 1999. [Google Scholar]
  47. Thomas, R. How (and Why) to Create a Good Validation Set Fast.ai. Available online: https://www.fast.ai/2017/11/13/validation-sets/ (accessed on 7 June 2019).
  48. Tashman, L.J. Out-of-sample tests of forecasting accuracy: An analysis and review. Int. J. Forecast. 2000, 16, 437–450. [Google Scholar] [CrossRef]
  49. Makridakis, S.; Chatfield, C.; Hibon, M.; Lawrence, M.; Mills, T.; Ord, K.; Simmons, L.F. The M2-competition: A real-time judgmentally based forecasting study. Int. J. Forecast. 1993, 9, 5–22. [Google Scholar] [CrossRef]
Figure 1. The Graphical Feature-based Framework (GFF) for IoT sensor network prediction tasks.
Figure 1. The Graphical Feature-based Framework (GFF) for IoT sensor network prediction tasks.
Sensors 19 03250 g001
Figure 2. Example graph representations: (a) graph representation for activity ‘Breakfast’ in testbed ‘Cairo’; and, (b) selected edges that are triggered for a participant of age 22–27 in the Nokia smartphone dataset.
Figure 2. Example graph representations: (a) graph representation for activity ‘Breakfast’ in testbed ‘Cairo’; and, (b) selected edges that are triggered for a participant of age 22–27 in the Nokia smartphone dataset.
Sensors 19 03250 g002
Figure 3. Accuracy of activity recognition for varying window sizes for testbed ‘Aruba’. ‘nodes’, ‘edges’ and ‘nodes+edges’ indicate the types of features extracted from the graph: node-based features only, edge-based features only, and both node and edge-based features, respectively. ‘full activity’ means the window size is set to the whole duration for an activity for building the graph.
Figure 3. Accuracy of activity recognition for varying window sizes for testbed ‘Aruba’. ‘nodes’, ‘edges’ and ‘nodes+edges’ indicate the types of features extracted from the graph: node-based features only, edge-based features only, and both node and edge-based features, respectively. ‘full activity’ means the window size is set to the whole duration for an activity for building the graph.
Sensors 19 03250 g003
Figure 4. Accuracy of activity recognition for varying window sizes for testbed ‘Cairo’. See Figure 3 caption for explanation of different plots.
Figure 4. Accuracy of activity recognition for varying window sizes for testbed ‘Cairo’. See Figure 3 caption for explanation of different plots.
Sensors 19 03250 g004
Figure 5. Accuracy of activity recognition for varying window sizes for testbed ‘Tulum’. See Figure 3 caption for explanation of different plots.
Figure 5. Accuracy of activity recognition for varying window sizes for testbed ‘Tulum’. See Figure 3 caption for explanation of different plots.
Sensors 19 03250 g005
Figure 6. Accuracy with Normalized vs. Unnormalized Features.
Figure 6. Accuracy with Normalized vs. Unnormalized Features.
Sensors 19 03250 g006
Table 1. Aruba Dataset: minimum, maximum, and average window length of each activity.
Table 1. Aruba Dataset: minimum, maximum, and average window length of each activity.
ActivityMinimumMaximumAverage
Respirate971493512
Enter home1545
Work5070161024
Eating144167608
Housekeeping9253991216
Relax5615,3032009
Wash dishes342200428
Sleeping3038,78314,417
Leave home11093
Bed to toilet22520163
Meal preparation719,573469
Table 2. Cairo Dataset: minimum, maximum, and average window length of each activity.
Table 2. Cairo Dataset: minimum, maximum, and average window length of each activity.
ActivityMinimumMaximumAverage
Night wandering791701298
Medicine2739882
Laundry225807468
Work42156132029
Bed32517659
Lunch61151521920
Dinner100062362757
Bed to toilet91703238
Leave home320120
Breakfast55128871735
Table 3. Tulum Dataset: minimum, maximum, and average window length of each activity.
Table 3. Tulum Dataset: minimum, maximum, and average window length of each activity.
ActivityMinimumMaximumAverage
Bathing153,360270
Wash dishes1828119
Personal hygiene12343138
Leave home1134821
Enter home154,099477
R1 sleeping in bed202123,77232,463
R2 sleeping in bed1000123,77634,490
Work bedroom 1114,011548
Work table15100147
Watch TV110,100362
Meal preparation164,968169
Yoga263302827
Eating211,156404
Bed toilet transition180079
Work bedroom 2127,328550
Work living room112,728584
Table 4. Accuracy with Normalized vs. Unnormalized Features.
Table 4. Accuracy with Normalized vs. Unnormalized Features.
Test BedNode AccuracyEdge AccuracyNode Edge Accuracy
Un-NormNormUn-NormNormUn-NormNorm
Aruba81.0392.9092.1393.1192.0393.41
Tulum71.5172.6267.9571.4076.3473.74
Cairo73.1772.0074.1773.5076.5074.83
Table 5. Accuracy (Unnormalized features) Support Vector Machine (SVM) vs. Random Forest (RF).
Table 5. Accuracy (Unnormalized features) Support Vector Machine (SVM) vs. Random Forest (RF).
Test BedNode AccuracyEdge AccuracyNode Edge Accuracy
SVMRFSVMRFSVMRF
Aruba81.0392.9192.1393.0692.0393.07
Tulum71.5172.4567.9770.5376.3472.46
Cairo73.1779.1774.1775.3376.577.67
Table 6. Accuracy with Edge Transition Time as Additional Feature.
Table 6. Accuracy with Edge Transition Time as Additional Feature.
Test BedNodeEdgeNode Edge
Aruba89.0491.7691.80
Tulum72.0167.9973.38
Cairo73.6766.1765.67
Table 7. Dimension of Feature Matrices with Edge Transition Time as Additional Features.
Table 7. Dimension of Feature Matrices with Edge Transition Time as Additional Features.
Test BedInstancesNodesEdgesNodes + Edges
Aruba6477349761010
Tulum12635319721003
Cairo60027722749
Table 8. Accuracy with Selected Edge Transition Time and Linear SVM.
Table 8. Accuracy with Selected Edge Transition Time and Linear SVM.
Test BedNodeEdgeNode Edge
Aruba89.7191.1691.39
Tulum72.0168.8272.69
Cairo74.7569.1961.62
Table 9. Accuracy with Selected Edge Transition Time and Random Forest.
Table 9. Accuracy with Selected Edge Transition Time and Random Forest.
Test BedNodeEdgeNode Edge
Aruba92.7093.5993.26
Tulum72.1369.2171.56
Cairo78.2874.2473.23
Table 10. Accuracy Before and After Adding Door Sensors with Motion Sensors for “Aruba”.
Table 10. Accuracy Before and After Adding Door Sensors with Motion Sensors for “Aruba”.
Test BedNodeEdgeNode Edge
Motion Sensors81.0392.1392.03
Motion and Door Sensors (Linear SVM)89.4992.6292.87
Motion and Door Sensors (Random Forest)93.0793.2793.42
Table 11. Accuracy after Mapping Sensors to Room Type.
Table 11. Accuracy after Mapping Sensors to Room Type.
Test BedNodeEdgeNode Edge
Aruba92.7392.6592.81
Tulum60.4260.8161.98
Cairo73.0075.1776.50
Table 12. Accuracy after Adding Mapped Room Type to Motion Sensors (Linear SVM).
Table 12. Accuracy after Adding Mapped Room Type to Motion Sensors (Linear SVM).
Test BedNodeEdgeNode Edge
Aruba89.1992.7692.47
Tulum71.9568.2575.56
Cairo71.5074.3374.67
Table 13. Accuracy after Adding Mapped Room Type to Motion Sensors (RF).
Table 13. Accuracy after Adding Mapped Room Type to Motion Sensors (RF).
Test BedNodeEdgeNode Edge
Aruba93.0593.2293.53
Tulum72.4970.5372.60
Cairo78.0079.1777.50
Table 14. Accuracy from Deep Learning (DL) approach with input of existence of graphical features.
Table 14. Accuracy from Deep Learning (DL) approach with input of existence of graphical features.
TestbedsNodesEdgesNodes + Edges
Aruba91.8792.7492.49
Cairo62.2267.2267.22
Tulum69.1869.8271.35
Table 15. Accuracy from DL approach with input of count of graphical features.
Table 15. Accuracy from DL approach with input of count of graphical features.
TestbedsNodesEdgesNodes + Edges
Aruba93.5293.3692.49
Cairo65.5673.3372.78
Tulum73.874.0972.66
Table 16. Best results from GFF.
Table 16. Best results from GFF.
TestbedsAccuracyFeatures + Algorithms for the Best Result
Aruba93.53Nodes + edges + doors-room types as additional features + Random Forest
Tulum 76.34Nodes + edges + Linear SVM
Cairo79.17Nodes + Random Forest
Table 17. Graph inputs with only existence of node and edges.
Table 17. Graph inputs with only existence of node and edges.
Test BedsAccuracy
Aruba70.56
Cairo33.33
Tulum21.11
Table 18. Graph inputs with nodes, edges and count of nodes triggered as node attributes.
Table 18. Graph inputs with nodes, edges and count of nodes triggered as node attributes.
Test BedsAccuracy
Aruba35.00
Cairo36.66
Tulum22.32

Share and Cite

MDPI and ACS Style

Akter, S.; Holder, L. Improving IoT Predictions through the Identification of Graphical Features. Sensors 2019, 19, 3250. https://doi.org/10.3390/s19153250

AMA Style

Akter S, Holder L. Improving IoT Predictions through the Identification of Graphical Features. Sensors. 2019; 19(15):3250. https://doi.org/10.3390/s19153250

Chicago/Turabian Style

Akter, Syeda, and Lawrence Holder. 2019. "Improving IoT Predictions through the Identification of Graphical Features" Sensors 19, no. 15: 3250. https://doi.org/10.3390/s19153250

APA Style

Akter, S., & Holder, L. (2019). Improving IoT Predictions through the Identification of Graphical Features. Sensors, 19(15), 3250. https://doi.org/10.3390/s19153250

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop