Human Activities Recognition Based on Neuro-Fuzzy Finite State Machine

Mohmed, Gadelhag; Lotfi, Ahmad; Pourabdollah, Amir

doi:10.3390/technologies6040110

Open AccessArticle

Human Activities Recognition Based on Neuro-Fuzzy Finite State Machine

by

Gadelhag Mohmed

,

Ahmad Lotfi

^*

and

Amir Pourabdollah

School of Science and Technology, Nottingham Trent University, Clifton Lane, Nottingham NG11 8NS, UK

^*

Author to whom correspondence should be addressed.

Technologies 2018, 6(4), 110; https://doi.org/10.3390/technologies6040110

Submission received: 7 November 2018 / Revised: 19 November 2018 / Accepted: 22 November 2018 / Published: 26 November 2018

(This article belongs to the Special Issue The PErvasive Technologies Related to Assistive Environments (PETRA))

Download

Browse Figures

Versions Notes

Abstract

:

Human activity recognition and modelling comprise an area of research interest that has been tackled by many researchers. The application of different machine learning techniques including regression analysis, deep learning neural networks, and fuzzy rule-based models has already been investigated. In this paper, a novel method based on Fuzzy Finite State Machine (FFSM) integrated with the learning capabilities of Neural Networks (NNs) is proposed to represent human activities in an intelligent environment. The proposed approach, called Neuro-Fuzzy Finite State Machine (N-FFSM), is able to learn the parameters of a rule-based fuzzy system, which processes the numerical input/output data gathered from the sensors and/or human experts’ knowledge. Generating fuzzy rules that represent the transition between states leads to assigning a degree of transition from one state to another. Experimental results are presented to demonstrate the effectiveness of the proposed method. The model is tested and evaluated using a dataset collected from a real home environment. The results show the effectiveness of using this method for modelling the activities of daily living based on ambient sensory datasets. The performance of the proposed method is compared with the standard NNs and FFSM techniques.

Keywords:

activities of daily living; activities of daily working; finite state machine; fuzzy finite state machine; learning; ADL; ADW; FSM; activity recognition

1. Introduction

Monitoring and recognising human activities in an indoor environment (home or office) are studied within the general topic of Ambient Intelligence (AmI) [1,2]. Human activities can be sensed using unobtrusive sensors such as Passive Infrared (PIR) sensors and images/videos captured using cameras [3]. Attention has predominantly been focused on data collected by unobtrusive sensors, which are more acceptable by users [4]. The analysis of the captured data can be used to optimise energy consumption, address health and safety concerns, or lead to an improved level of the residences’ comfort and living quality [5].

In order to recognise human activities based on low-level sensory information, different modelling techniques are investigated. One of the promising techniques in modelling and recognising human activities is based on the Finite State Machine (FSM) [6]. The classic FSM is employed to represent the states and the functionality of transitions between different states (here, the activities). However, considering the uncertainties incorporated in human activities, a fusion of fuzzy logic with FSM allows a more powerful tool to model the dynamic processes that may change over time [6,7,8]. Considering the uncertainty involved in the collected data that represent human activities, it is argued that the Fuzzy Finite State Machine (FFSM) is a suitable technique to deal with large uncertain data collected from real-world environments. Furthermore, the system can assign a degree of truth to the occurrence of each activity.

In an FFSM model, the transitions between states are triggered by fuzzy variables instead of crisp values. This provides an accurate model supported by a reasoning mechanism represented with a degree of truth related to each state transition. Thus, the system can be in more than one state at any time based on the membership values or degree of belonging to each state [5,6]. Readers are referred to [9] for the theoretical definition of the FFSM with some recent developments reported in [10,11].

The research reported in this paper is part of the on-going research works in our research group to support the independent living and enhance the quality of life for elderly residents. The aim of the research reported in this paper is to develop a model representing the Activities of Daily Living (ADL). The work could be easily expanded to include Activities of Daily Working (ADW) if required. Once the human activities are modelled, an individual profile for each user (house residence or office worker) can be created to adjust automatically his/her environment (residence-places/work-spaces) conditions according to the user’s preferences [3,5]. Recognising a user’s activity leads to predicting what that person is going to do next. The predicting process might be made on the basis of the person’s behavioural pattern repeatedly observed in the past. By understanding the human behavioural patterns, the future activities can be predicted. Once the predicted activities are identified, many different aspects of human lifestyle can be improved; e.g., safety, security, and energy saving. For instance, in the case of ADW, the environmental conditions of the workspace such as the heating system, ambient light, and turning on/off the computers could be adjusted based on the prediction of the worker’s arriving and leaving times. In this case, the energy consumption can be reduced and the workers’ comfortability will be increased. It should be mentioned that activity prediction is beyond the scope of this paper. In this paper, a learning method to create the FFSM rules is proposed, which allows the model to generate automatically the rules based on the numerical information gathered from the sensors and the knowledge gathered from human experts. The rules represent the probabilistic transitions between the states, whereas the states themselves are still defined based on the experts’ knowledge.

This paper is an extension of the authors’ research work in developing an FFSM used for Human Activity Recognition (HAR) based on the data gathered from low-level ambient sensors [12]. Employing FFSMs to model the HAR is justified mainly due to their capabilities in modelling and recognising the uncertainties in human behaviour. The original approach is extended in this paper by integrating the learning capabilities of Neural Networks (NNs) to generate the fuzzy rules that govern the fuzzy states’ transitions. Moreover, experts’ knowledge is used to identify the number of sates, the number of linguistic labels associated with each input, and the general structure of the rules.

The rest of this paper is organised as follows: after a review of the literature in Section 2, the methodologies are presented in Section 3, including FFSM and the proposed human activity modelling using the Neuro-Fuzzy Finite State Machine (N-FFSM). In Section 4, a human activity recognition case study is detailed, including the experimental results, followed by the discussion of the results in Section 5. The pertinent conclusions are drawn in Section 6.

2. Literature Review

Modelling human activities in an indoor environment is a challenging task, as humans behave with great uncertainty within their living and working places. Many research works have been conducted to monitor and analyse the activities of people using many different machine learning techniques including genetic-fuzzy FSM [13], dynamic Bayesian network modelling [14], echo-state neural networks [15], and regression models [16]. In [5], it was shown that the ADW in an AmI environment can be modelled using the FFSM technique by means of sequential events based on a dataset collected from a real smart office environment. Although the authors have presented the human activities by fuzzy states, they faced difficulties in generating fuzzy rules solely based on experts’ knowledge.

The researchers in [2,17,18] investigated different ways in which human behaviour can be detected and modelled using the Markov Model (MM) and the Hidden Markov Model (HMM). Their experiments were based on the data collected from some wearable sensors and cameras, with a focus on using the Hierarchical Context Hidden Markov Model (HC-HMM) from video streams. In [19], the authors presented a novel approach for monitoring people’s behaviour using an indoor localisation system based on the stigmergy technique. They suggested that a further work is required to implement the same concepts for enhancing the system’s ability to monitor human behaviour. This enhancement can be processed after training the system using a dataset collected from a real environment. A relatively new research work [4] presented a new model based on the Markov Modulated Poisson Process (MMPP) that promises to come up with a model to represent multi-visitor recognition with more accuracy. In [20], a framework was proposed to integrate temporal and spatial contextual information to determine the wellness of an elderly person living alone in a home environment.

The swarm intelligence method was used in [19] to monitor an elderly person’s activities via indoor position-based stigmergy. Other evolutionary computing and machine learning techniques based on MMPP are similarly employed to enhance human activity monitoring accuracy. Some works used a dataset collected by a smartphone’s accelerometer [4,21]. Hybrid computational techniques, such as data mining [22], pattern recognition, and human activity profiling using Convolutional Neural Network (CNN) [23], are also used in the context of ADL and ADW in order to divide the monitored human behaviours into activities and preferences [7].

Many published papers addressed the issue of modelling human behaviour using wearable sensors [24,25]. Developing activity recognition systems using the smartphones’ built-in accelerometer together with employing CNNs to model the activities was addressed in some recent publications [26,27]. In [28], the authors proposed a novel way of implementing the task of recognition by using probabilistic graphical models such as Bayesian Network (BN) and Dynamic Bayesian Network (DBN). These techniques are widely used in different domains including speech recognition and bio-sequence analysis [28]. Furthermore, they used the proposed DBN to recognise the current pair of activity-object and predict the most probable task based on the features extracted from RGB Depth (RGB-D) raw data. This information was then used to make the human-robot cooperation more efficient.

In [29], the authors proposed a sequential meta-cognitive learning algorithm for a Neuro-Fuzzy Inference System (McFIS) to develop a classifier for human actions recognition based on a video sequence. They used a four-layer NN to determine the number of rules and their corresponding parameters. The motion features were used for each action by extracting the accumulated motion information over a small time window. The results obtained from this work indicate superior performance of the McFIS classifier compared to the standard Support Vector Machine (SVM). The developed system uses a Neuro-Fuzzy Inference System for HAR. Based on the literature review conducted for this research, a gap is identified where NN learning could be integrated with FFSM.

3. Methodology

In this section, first the fuzzy finite state machine is introduced, then a proposed enhancement is introduced that is able to implement a learning method using neural networks.

3.1. Fuzzy Finite State Machine

A Fuzzy Finite State Machine (FFSM) [30,31] is an extended version of the classical Finite State Machine (FSM), in which a computation model can simulate the sequence of events in a dynamic process. The FSM computation is based on a model made of one or more states. Only one single state of this machine can be active at a time. The machine performs different actions, by transiting from one state to another, triggered by fixed values. By adding the fuzziness aspect to the state transitions, the states are not only triggered by binary values, but also by means of fuzzy variables. Moreover, the states could be represented by fuzzy variables as well [5,13,30]. However, in some FFSM applications, it is assumed that states are still defined as fixed values, whereas the fuzzy values are to be used to control the state transitions [32]. In both approaches, the system is not necessarily in one state at the same time [30], i.e., fuzzy membership values are associated with the states at each time [31].

In an FFSM, the state variables are shown as a set of linguistic variables

S = {s_{1}, s_{2}, \dots, s_{n}}

where n is the number of states. For a non-sequential system at time t, the FFSM state is represented as a state vector

S (t)

(as opposed to the scalar state of the FSM). When the system evolves in time, the next state is represented as a vector

S (t + 1)

.

FFSM is defined as a tuple

(S (t), U (t), f, Y (t), g)

, where

S (t) = [s_{1} (t), s_{2} (t), \dots, s_{n} (t)]

is the state vector,

U (t) = [u_{1} (t), u_{2} (t), \dots, u_{k} (t)]

is the input vector to the system, with k being the number of input variables,

Y (t) = [y_{1} (t), y_{2} (t), \dots y_{p} (t)]

is the vector of output variables with p being the number of output variables, f is the function that calculates the next state at time t, and g is the function that calculates the output vector Y at time t [5,6,13]. Considering the complexity of our modelling cases, it may be impossible to identify analytically the functions f and g.

In a general time-invariant model, the FFSM’s states and outputs are therefore expressed [5,13] as:

S (t + 1) = f (S (t), U (t))

(1)

Y (t) = g (S (t), U (t))

(2)

More details about each of these elements are provided below:

Fuzzy state $(S (t)$ ) is a vector representing the system’s states at time t. Each individual state at time t $s_{i} (t); i = 1 \dots n$ is a numerical value that is in fact the membership grade (between 0 and 1) given to each linguistic variable $s_{i}$ within the set of FFSM’s states (S).
Input vector $(U (t))$ represents the values associated with the linguistic variables that are generally obtained after a fuzzification process of sensors’ data, a combination of different signals, or any other calculation of numerical data. The fuzzification process, which is designed based on experts’ view, translates the numerical input values into a set of membership grades given to each linguistic label, which defines all the acceptable values. The labels that are associated with the input $u_{i}$ are represented as $A_{u_{i}} = {A_{u_{i}}^{1}, A_{u_{i}}^{2}, \dots, A_{u_{i}}^{k_{i}}}$ , where $k_{i}$ is the number of associated linguistic labels [6].
Output vector $(Y (t))$ is the output vector consisting of crisp values associated with each output, which are calculated based on the current state of the system $S (t)$ and the input vector $(U (t))$ .
Output function $(g)$ is the output function that is used to calculate the value of output vector $Y (t)$ , at each time instant t.
Transition function $(f)$ is the state transition function that is used to calculate the next state vector $S (t + 1)$ , at each time instant. The transition function f controls the allowed transitions between the defined relevant states in the system. f is defined as a set of fuzzy rules. There are different ways to define the rules; e.g., using human experts’ knowledge [5] or learning from the numerical input-output data by applying machine learning algorithms [32,33,34]. A combination of these approaches can also be implemented to have one framework that contains the rules that were generated by learning from the numerical data and those assigned by the human experts’ knowledge [34].

Figure 1 illustrates the system states and transition mechanism between two exemplary states

s_{i}

and

s_{j}

. The transition mechanism from

s_{i}

to

s_{j}

is represented by the following general fuzzy rule:

R_{i j} : IF (S (t) is s_{i}) AND H_{i j} THEN S (t + 1) is s_{j}

The antecedent part of the rule is a combination of two terms: The first term,

(S (t)

is

s_{i})

, is used to determine if the state

s_{i}

is an activated state in time instant t. The second term of the antecedent part is

H_{i j}

, which represents all constraints imposed on the input variables that are required to either remain in state

s_{i}

(when,

i = j

) or change to state

s_{j}

, e.g.,

H_{i j} = (u_{1} (t)

is

A_{u 1}^{3})

AND

(u_{2} (t)

is

A_{u 2}^{4}

OR

A_{u 2}^{2})

. The consequent part of the given rule is

S (t + 1)

is

s_{j}

, which determines the next value of the state vector

S (t + 1)

for being in state

s_{j}

. The linguistic variables of the consequent part are considered as being singletons, i.e., all elements of the

S (t)

vector are zero, except for the

j^{th}

element, which is 1 [13].

For the

k^{th}

rule, a t-norm method (e.g., minimum) is used to calculate the rule’s firing degree

w_{k}

. For a rule-base consisting of

κ

rules, the next value of the state vector

S (t + 1)

is the weighted average utilising the firing degree of each rule, defined as:

S (t + 1) = \frac{\sum_{k = 1}^{κ} w_{k} . S (t)}{\sum_{k = 1}^{κ} w_{k}} i f \sum_{k = 1}^{κ} w_{k} \neq 0

(3)

S (t + 1) = S (t) i f \sum_{k = 1}^{κ} w_{k} = 0

(4)

The expression above is considered as an inference process that is applied to a set of fuzzy rules where the linguistic variables of the consequent part are singletons. Readers are referred to [5,13,35] for more information about FFSM. More details about the transition function element based on fuzzy rules are explained in the next section.

3.2. Neuro-Fuzzy Finite State Machine

A common approach in incorporating learning capabilities into the fuzzy systems is based on the combination of fuzzy systems and Artificial Neural Networks (ANNs), also known as Neural Networks (NNs), leading to a well-known hybrid system called a neuro-fuzzy system [36]. Fuzzy rules are generally based on the numerical data rather than experts’ knowledge [37]. In this section, the fusion framework between FFSM and NNs is explained.

Figure 2 illustrates the schematic diagram of the proposed Neuro-Fuzzy Finite State Machine (N-FFSM). The proposed N-FFSM model can automatically generate the fuzzy rules representing the state transitions. In this approach, the experts are also allowed to introduce their own knowledge over the whole system by defining the system states and specifying the general structure of the fuzzy rules representing the state transitions. The fuzzy rules and the associated linguistic labels to each input are automatically derived by the neuro-fuzzy rule-based system. Therefore, it is possible to construct the Membership Functions (MFs) associated with the linguistic label used in the fuzzy rules.

The N-FFSM system is considered as an adaptive network, which is functionally equivalent to the fuzzy systems in terms of representing the fuzzy rules linguistically with the capabilities of neural learning. This network is comprised of nodes (neurons) identifying specific functions gathered in layers. The final output of these layers is able to construct a network generating the fuzzy rules.

Based on the explanation given in the proceeding sections, a new model is proposed to generate automatically the rules representing the transition based on learning from the sensors’ data.

4. Case Study

Modelling the ADL for a single user living within a smart home environment is represented as a case study in this section. The N-FFSM approach introduced earlier is applied to the data gathered from a smart home environment representing the activities of the occupant. The fuzzy rules are automatically generated based on the data gathered from the sensors.

4.1. Human Activity Recognition

Recognising the human activities based on data gathered from some low-level sensors comprises some challenges. They are:

Recognising concurrent or simultaneous activities: By nature, several activities can be undertaken by a single user at the same time [38]. For example, people can read a book while they are watching TV or eating. In this case, it is not necessary to know which activity started first; that means the existence of concurrent activities when an activity (e.g., eating) starts while the other activity is already started (e.g., reading a book). A specialised approach is required to recognise these non-sequential behaviours.
Recognising interleaved activities: In real life, a certain activity can be interrupted by another activity [8]. For example, while the current activity of an office worker is recorded as “computer activity”, a visitor comes to the office. In this case, the first activity is paused for the period of the second activity’s duration before the previous activity is resumed.
Recognising multiple residents: In many situations, more than one user is present in the environment at the same time. It is even harder than the previous scenarios to recognise parallel activities for multiple people living/working together in the same place. Different statistical measurements are provided in this research area, but it is still considered as a challenge [16].

4.2. Data Collection System

The experiment was conducted at the Smart Home facilities within Nottingham Trent University. A floor plan of the house is shown in Figure 3. The list of sensors embedded in the environment that were used for this experiment is also provided in Table 1. A set of data was collected from this environment.

Figure 4 shows a sample of the collected binary data from PIR sensors. Each data record from the sensors contained the information presented as a triple

(t, a_{o n}, a_{o f f})

, where t is the timestamp of the action or activity, and

a_{o n}

and

a_{o f f}

are the sensor status at the time instant t, represented as a binary data (1 or 0). The information involved in the raw data is used to extract the required features for the input variables.

4.3. System States’ Definition

As explained in Section 3, each state represents an activity. Multiple activities could be associated with one room. Based on the available experts’ knowledge, eight different states were defined representing eight distinguishable activities. This is easily represented by means of the proposed state diagram illustrated in Figure 5. These eight states are defined as follows:

$s_{1}$ : The sleeping state represents sleeping activity either during the night or while taking a nap during the daytime. Intuitively, the collected starting time and the duration of this activity could vary depending on the day of the week, even for the same user. Furthermore, the state can be interrupted by other activities such as going to the toilet, etc.
$s_{2}$ : The bedroom state is used to represent the other duties in the bedroom except for the sleeping activity.
$s_{3}$ : The toilet state represents the times when the user is using the toilet.
$s_{4}$ : The kitchen state is where the user spends time in the kitchen to prepare food or to clean.
$s_{5}$ : The dining room state usually comes after the kitchen state, when the user stays in the dining room to eat the prepared meal.
$s_{6}$ : The living room state corresponds to the time spent in the living room to watch TV or other social activities.
$s_{7}$ : The garden state is used when the user uses the back door to go to the garden.
$s_{8}$ : The leaving home state becomes active when the individual leaves the home from the front door. This can be for any duties away from the home such as shopping. This state might occur regularly at a certain time (in the case of the individual having a daily job) or irregularly (in the case of shopping and social visiting).

4.4. Input Variables’ Definition

Once the data have been collected from all sensors, three features are extracted to be used as the inputs to the proposed N-FFSM system. The input variable vector is

U = [u_{1}, u_{2}, u_{3}]

.

u_{1}

represents activity start time;

u_{2}

denotes the duration of each activity, which is represented in minutes;

u_{3}

is the activity count, which defines a number that represents how many times the activity was sensed per day. Each input variable is fuzzified to translate the numerical data to their relevant linguistic values. These values are represented as fuzzy Membership Functions (MFs).

The linguistic labels for each input variable are shown in Figure 6, as explained below:

For input variable $u_{1}$ , five linguistic labels are used, which represent five activity start times during a day, as: ${E M_{u_{1}}, M_{u_{1}}, A F_{u_{1}}, E V_{u_{1}}, N I_{u_{1}}}$ . $E M$ is Early Morning; M is Morning; $A F$ is Afternoon; $E V$ is Evening; and $N I$ is Night. Therefore, $A_{u_{1}} = {A_{u_{1}}^{1}, A_{u_{1}}^{2}, A_{u_{1}}^{3}, A_{u_{1}}^{4}, A_{u_{1}}^{5}}$ , where $A = {E M, M, A F, E V, N I}$ .
For input variable $u_{2}$ , this input variable has five linguistic labels, as well, which represent five different periods of time for the activity duration as: ${V S_{u_{2}}, S H_{u_{2}}, M E_{u_{2}}, L O_{u_{2}}, V L O_{u_{2}}}$ . $V S$ is Very Short; $S H$ is Short; $M E$ is Medium; $L O$ is Long; and $V L O$ is Very Long. Therefore, $A_{u_{2}} = {A_{u_{2}}^{1}, A_{u_{2}}^{2}, A_{u_{2}}^{3}, A_{u_{2}}^{4}, A_{u_{2}}^{5}}$ , where $A = {V S, S H, M E, L O, V L O}$ .
For input variable $u_{3}$ , only three linguistic labels are used, which represent three different usage levels, as: ${H U_{u_{3}}, M U_{u_{3}}, R U_{u_{3}}}$ . $H U$ is Heavy Usage; $M U$ is Medium usage; and $R U$ is Rare Usage. Therefore, $A_{u_{3}} = {A_{u_{3}}^{1}, A_{u_{3}}^{2}, A_{u_{3}}^{3}}$ , where $A = {R U, M U, H U}$ .

4.5. Transition Function Definition

In order to control the transitions between the system’s states, a set of fuzzy rules is required. From the state diagram shown in Figure 5, the required rules that can define the transitions between the system states are determined. The rules have the following structure:

IF (S (t) is s_{1}) AND H_{12} THEN S (t + 1) is s_{2}

IF (S (t) is s_{2}) AND H_{23} THEN S (t + 1) is s_{3}

IF (S (t) is s_{3}) AND H_{34} THEN S (t + 1) is s_{4}

\dots \dots \dots \dots \dots \dots \dots \dots \dots

IF (S (t) is s_{5}) AND H_{57} THEN S (t + 1) is s_{7}

It should be noted that there are limitations in the transition between states. For example, when the user is in the sleeping state

(S_{1})

, the system can go to the bedroom state

S_{2}

. However, while the user is in the bedroom state

(S_{2})

, the model only allows going to the toilet state

S_{3}

, kitchen state

S_{4}

, or leaving home state

S_{8}

. These limitations are enforced by the physical layout of the house shown in Figure 3. The state transitions are governed by the fuzzy rules, which are generated by means of NNs based on the data gathered from sensors.

4.6. Output Definition

For this specific system model, the output vector Y is the result of states’ activation or the membership degrees given to each state, i.e.,

Y (t) = S (t)

.

4.7. Results

Based on the information provided in the previous section, an N-FFSM model was implemented to model the Activities of Daily Living (ADL) for a single user in a smart home environment. A sample of ADL data for five days is illustrated in Figure 7. A multilevel activity graph is illustrated in Figure 7a, and a scattered plot of the same data is shown in Figure 7b. This section presents the obtained results of the conducted experiments.

Datasets representing human activities are often imbalanced where some activities appear much more frequently than others. It is evident that if the dominant activity is recognised with a high level of performance, the overall level of accuracy is high, even if all other activities are not well recognised [39]. Therefore, we did a cross-validation for each activity over the whole model. Table 2 shows the recall (known as sensitivity), precision, and accuracy obtained by using the N-FFSM model for each activity. Moreover, a confusion matrix plot representing the precision and accuracy scores over the whole model, as well as for each activity is illustrated in Figure 8. The information given in the confusion matrix is explained as follows:

The rows and columns represent the output activities and target activities, respectively. The activities are identified as $s_{1}, s 2, \dots, s_{8}$ .
The diagonal cells from the upper left to the lower right indicate activities that are correctly recognised.
The off-diagonal cells represent the incorrectly-recognised activities.
The right-most column shows the accuracy of each activity.
The last row at the bottom shows the precision for each activity.
The bottom right cell represents the accuracy over the whole model.

The expressions that were used to calculate accuracy, precision, and recall are given below:

A c c u r a c y = \frac{1}{N} \sum_{i = 1}^{C} t p_{i}

(5)

P r e c i s i o n = \frac{1}{C} \sum_{i = 1}^{C} \frac{t p_{i}}{t p_{i} + f p_{i}}

(6)

R e c a l l = \frac{1}{C} \sum_{i = 1}^{C} \frac{t p_{i}}{t p_{i} + f n_{i}}

(7)

where N is the total number of events

N = t p_{i} + t n_{i} + f p_{i} + f n_{i}

for

i^{th}

activity in the source data.

t p_{i}

,

t n_{i}

,

f n_{i}

, and

f p_{i}

are the number of true positives, true negative, false negatives, and false positives of the

i^{th}

activity, respectively. C is the number of activities for which their accuracy, recall, and precision are calculated. For this study,

t p_{i}, t n_{i}, f p_{i}

and

f n_{i}

were defined as follows:

-: True positive $(t p_{i})$ : the case when $i^{th}$ activity is correctly recognised as being the $i^{th}$ activity.
-: True negative $(t n_{i})$ : the case when all the other activities are correctly recognised as being not the $i^{th}$ activity.
-: False positive $(f p_{i})$ : the case when all the other activities are incorrectly recognised as being the $i^{th}$ activity.
-: False negative $(f n_{i})$ : the case when the $i^{th}$ activity is incorrectly recognised as being not the $i^{th}$ activity.

4.8. Comparison with Existing Modelling Techniques

In order to evaluate the proposed method, we compared the performance of the proposed N-FFSM with some existing methodologies. The dataset mentioned earlier was applied to a classical FFSM and standard NNs, and the results were compared with the proposed N-FFSM. The FFSM contained eight states representing the same eight activities mentioned earlier. The state transitions were controlled by fuzzy rules that were generated based on the experts’ knowledge only. Accuracy, recall, and precision of models based on FFSM and NNs are shown in Table 3 and Table 4 respectively.

5. Discussion

Considering the results obtained from our experiments, this section discusses three different aspects related to the proposed model: i.e., accuracy, interpretability, and the importance of using experts’ knowledge.

Accuracy: The results illustrated in Table 2 show that the N-FFSM model exhibited a high accuracy, recall, and precision when its performance was tested for each activity separately. The results presented in Table 5 show the overall activity recognition performance when it was compared with the existing FFSM and NNs in terms of accuracy, recall, and precision. According to the achieved results, the N-FFSM model was considerably better at ADL recognition based on data gathered from low-level ambient sensors. Furthermore, it can be seen how the N-FFSM model was able to follow the proper sequence of states with the correct state activation degree.
Interpretability discussion: From the interpretability point of view, the most commonly-used approaches in human activity recognition research works have been NNs and HMMs. These models are considered as black-box approaches because of the complexity of understanding their underlying concepts. This complexity increases when a large number of input and output variables are used. Nevertheless, the proposed N-FFSM model is described linguistically using eight linguistic states representing eight different activities, as well as fuzzy rules associated with the linguistic inputs.
The importance of using human experts’ knowledge: In order to achieve a robust model for representing human activities, the advantages of using experts’ knowledge with the learning capabilities in NNs can be integrated with the N-FFSM model. Designing an FFSM only based on the linguistic information assigned by human experts is not enough for a successful human activity recognition model. On the other hand, information derived from the gathered sensor data is not usually enough to achieve a high-performance model. Experts’ knowledge was used to define the fuzzy rules, as well as distinguishing the system’s current state(s). This allowed obtaining a linguistic description of the ADL, i.e., the final set of fuzzy rules that control the transition between states.

6. Conclusions

This paper has presented a practical application of utilising FFSM to model and recognise human activities using NNs and human experts’ knowledge. The principal elements of the FFSM were explained in detail, as well as the developed NN learning technique for generating the fuzzy rules and MFs associated with the linguistic labels in the inputs and outputs of the FFSM. Experts’ knowledge can still be used to define the system’s states and the general structure of the state transitions. Experimental results were presented to demonstrate the effectiveness of the proposed method. The advantage of the proposed system is that it integrates the experts’ knowledge with the information derived from the automatic learning process. The results obtained from the proposed N-FFSM model show that human activities could be modelled/learned with a high degree of accuracy based on the data gathered from low-level sensors.

Author Contributions

This project was conducted by G.M. as part of his PhD research at Nottingham Trent University. The project was supervised by A.L. The work reported here is part of ongoing research conducted by A.L. and A.P. G.M. conducted the experiments and data analysis. All the authors have contributed to the preparation of this paper.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare that they have no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

ADL	Activities of Daily Living
ADW	Activities of Daily Working
FSM	Finite State Machine
FFSM	Fuzzy Finite State Machine
MF	Membership Function
NNs	Neural Network
N-FFSM	Neuro-Fuzzy Finite State Machine

References

Basu, D.; Moretti, G.; Gupta, G.S.; Marsland, S. Wireless sensor network based smart home: Sensor selection, deployment and monitoring. In Proceedings of the 2013 IEEE Sensors Applications Symposium Proceedings, Galveston, TX, USA, 19–21 February 2013; pp. 49–54. [Google Scholar]
Chen, L.; Hoey, J.; Nugent, C.D.; Cook, D.J.; Yu, Z. Sensor-based activity recognition. IEEE Trans. Syst. Man Cybern. Part C 2012, 42, 790–808. [Google Scholar] [CrossRef]
Cook, D.J.; Crandall, A.S.; Thomas, B.L.; Krishnan, N.C. CASAS: A smart home in a box. Computer 2013, 46, 62–69. [Google Scholar] [CrossRef] [PubMed]
Aicha, A.N.; Englebienne, G.; Kröse, B. Unsupervised visit detection in smart homes. Pervasive Mob. Comput. 2017, 34, 157–167. [Google Scholar] [CrossRef]
Langensiepen, C.; Lotfi, A.; Puteh, S. Activities recognition and worker profiling in the intelligent office environment using a fuzzy finite state machine. In Proceedings of the 2014 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE), Beijing, China, 6–11 July 2014; pp. 873–880. [Google Scholar]
Alvarez-Alvarez, A.; Trivino, G.; Cordón, O. Body posture recognition by means of a genetic fuzzy finite state machine. In Proceedings of the 2011 IEEE 5th International Workshop on Genetic and Evolutionary Fuzzy Systems (GEFS), Paris, France, 11–15 April 2011; pp. 60–65. [Google Scholar]
Yin, J.; Yang, Q.; Pan, J.J. Sensor-based abnormal human-activity detection. IEEE Trans. Knowl. Data Eng. 2008, 20, 1082–1090. [Google Scholar] [CrossRef]
Mohmed, G.; Lotfi, A.; Langensiepen, C.; Pourabdollah, A. Clustering-Based Fuzzy Finite State Machine for Human Activity Recognition. In UK Workshop on Computational Intelligence; Springer: Cham, Switzerland, 2018; pp. 264–275. [Google Scholar]
Ying, M. A formal model of computing with words. IEEE Trans. Fuzzy Syst. 2002, 10, 640–652. [Google Scholar] [CrossRef]
Cao, Y.; Ying, M.; Chen, G. Retraction and generalized extension of computing with words. IEEE Trans. Fuzzy Syst. 2007, 15, 1238–1250. [Google Scholar]
Alvarez, A.; Trivino, G. Comprehensible model of a quasi-periodic signal. In Proceedings of the 2009 Ninth International Conference on Intelligent Systems Design and Applications, Pisa, Italy, 30 November–2 December 2009; pp. 450–455. [Google Scholar]
Mohmed, G.; Lotfi, A.; Langensiepen, C.; Pourabdollah, A. Unsupervised Learning Fuzzy Finite State Machine for Human Activities Recognition. In Proceedings of the 11th PErvasive Technologies Related to Assistive Environments Conference, Corfu, Greece, 26–29 June 2018. [Google Scholar]
Alvarez-Alvarez, A.; Trivino, G.; Cordon, O. Human gait modeling using a genetic fuzzy finite state machine. IEEE Trans. Fuzzy Syst. 2012, 20, 205–223. [Google Scholar] [CrossRef]
Zhu, C.; Sheng, W.; Liu, M. Wearable sensor-based behavioral anomaly detection in smart assisted living systems. IEEE Trans. Autom. Sci. Eng. 2015, 12, 1225–1234. [Google Scholar] [CrossRef]
Lotfi, A.; Langensiepen, C.; Mahmoud, S.M.; Akhlaghinia, M.J. Smart homes for the elderly dementia sufferers: identification and prediction of abnormal behaviour. J. Ambient Intell. Hum. Comput. 2012, 3, 205–218. [Google Scholar] [CrossRef]
Alberdi, A.; Weakley, A.; Schmitter-Edgecombe, M.; Cook, D.J.; Aztiria, A.; Basarab, A.; Barrenechea, M. Smart Homes predicting the Multi-Domain Symptoms of Alzheimer’s Disease. IEEE J. Biomed. Health Inf. 2018, 22, 1720–1731. [Google Scholar] [CrossRef] [PubMed]
Chung, P.C.; Liu, C.D. A daily behavior enabled hidden Markov model for human behavior understanding. Pattern Recognit. 2008, 41, 1572–1580. [Google Scholar] [CrossRef]
Nguyen, N.T.; Phung, D.Q.; Venkatesh, S.; Bui, H. Learning and detecting activities from movement trajectories using the hierarchical hidden Markov model. In Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Diego, CA, USA, 20–25 June 2005; Volume 2, pp. 955–960. [Google Scholar]
Barsocchi, P.; Cimino, M.G.; Ferro, E.; Lazzeri, A.; Palumbo, F.; Vaglini, G. Monitoring elderly behavior via indoor position-based stigmergy. Pervasive Mob. Comput. 2015, 23, 26–42. [Google Scholar] [CrossRef]
Suryadevara, N.K.; Mukhopadhyay, S.C.; Wang, R.; Rayudu, R. Forecasting the behavior of an elderly using wireless sensors data in a smart home. Eng. Appl. Artif. Intell. 2013, 26, 2641–2652. [Google Scholar] [CrossRef]
Dawadi, P.; Cook, D.; Parsey, C.; Schmitter-Edgecombe, M.; Schneider, M. An approach to cognitive assessment in smart home. In Proceedings of the 2011 Workshop on Data Mining for Medicine and Healthcare, San Diego, CA, USA, 21 August 2011; pp. 56–59. [Google Scholar]
Lu-An, T.; Jiawei, H.; Guofei, J. Mining Sensor Data in CyberPhysical Systems. Tsinghua Sci. Technol. 2015, 19, 225–234. [Google Scholar] [CrossRef]
Panwar, M.; Dyuthi, S.R.; Prakash, K.C.; Biswas, D.; Acharyya, A.; Maharatna, K.; Gautam, A.; Naik, G.R. CNN based approach for activity recognition using a wrist-worn accelerometer. In Proceedings of the 2017 39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Seogwipo, Korea, 11–15 July 2017; pp. 2438–2441. [Google Scholar]
Jordao, A.; Nazare, A.C., Jr.; Sena, J.; Schwartz, W.R. Human Activity Recognition Based on Wearable Sensor Data: A Standardization of the State-of-the-Art. arXiv 2018, arXiv:1806.05226. [Google Scholar]
Jordao, A.; Torres, L.A.B.; Schwartz, W.R. Novel approaches to human activity recognition based on accelerometer data. Signal Image Video Process. 2018, 1–8. [Google Scholar] [CrossRef]
Ignatov, A. Real-time human activity recognition from accelerometer data using Convolutional Neural Networks. Appl. Soft Comput. 2018, 62, 915–922. [Google Scholar] [CrossRef]
Inoue, M.; Inoue, S.; Nishida, T. Deep recurrent neural network for mobile human activity recognition with high throughput. Artif. Life Robot. 2018, 23, 173–185. [Google Scholar] [CrossRef]
Magnanimo, V.; Saveriano, M.; Rossi, S.; Lee, D. A bayesian approach for task recognition and future human activity prediction. In Proceedings of the 23rd IEEE International Symposium on Robot and Human Interactive Communication, Edinburgh, UK, 25–29 August 2014; pp. 726–731. [Google Scholar]
Subramanian, K.; Suresh, S. Human action recognition using meta-cognitive neuro-fuzzy inference system. Int. J. Neural Syst. 2012, 22, 1250028. [Google Scholar] [CrossRef] [PubMed]
Unal, F.A.; Khan, E. A fuzzy finite state machine implementation based on a neural fuzzy system. In Proceedings of the 1994 IEEE 3rd International Fuzzy Systems Conference, Orlando, FL, USA, 26–29 June 1994; pp. 1749–1754. [Google Scholar]
Reyneri, L.M. An introduction to fuzzy state automata. In International Work-Conference on Artificial Neural Networks; Springer: Berlin/Heidelberg, Germany, 1997; pp. 273–283. [Google Scholar]
Bombardier, V.; Schmitt, E. Fuzzy rule classifier: Capability for generalization in wood color recognition. Eng. Appl. Artif. Intell. 2010, 23, 978–988. [Google Scholar] [CrossRef] [Green Version]
Wang, Z.; Jiang, M.; Hu, Y.; Li, H. An incremental learning method based on probabilistic neural networks and adjustable fuzzy clustering for human activity recognition by using wearable sensors. IEEE Trans. Inf. Technol. Biomed. 2012, 16, 691–699. [Google Scholar] [CrossRef] [PubMed]
Wang, L.X.; Mendel, J.M. Generating fuzzy rules by learning from examples. IEEE Trans. Syst. Man Cybern. 1992, 22, 1414–1427. [Google Scholar] [CrossRef]
Ambres, O.; Trivino, G. Gait quality monitoring using an arbitrarily oriented smartphone. In International Workshop on Ambient Assisted Living; Springer: Berlin/Heidelberg, Germany, 2012; pp. 224–231. [Google Scholar]
Jang, J.S.R.; Sun, C.T.; Mizutani, E. Neuro-fuzzy and soft computing; a computational approach to learning and machine intelligence. IEEE Trans. Autom. Control 1997, 42, 1482–1484. [Google Scholar] [CrossRef]
Nauck, D.; Klawonn, F.; Kruse, R. Foundations of Neuro-Fuzzy Systems; John Wiley & Sons, Inc.: New York, NY, USA, 1997. [Google Scholar]
Helal, S.; Lee, J.W.; Hossain, S.; Kim, E.; Hagras, H.; Cook, D. Persim-Simulator for human activities in pervasive spaces. In Proceedings of the 2011 Seventh International Conference on Intelligent Environments, Nottingham, UK, 25–28 July 2011; pp. 192–199. [Google Scholar]
Benmansour, A.; Bouchachia, A.; Feham, M. Modeling interaction in multi-resident activities. Neurocomputing 2017, 230, 133–142. [Google Scholar] [CrossRef] [Green Version]

Figure 1. State diagram of the fuzzy finite state machine.

Figure 2. A schematic diagram of the proposed neuro-fuzzy finite state machine.

Figure 3. Floor plan layout and location of installed sensors.

Figure 4. A sample of raw data gathered from PIR sensors over a one-month period.

Figure 5. State diagram of human activities in the experimental home.

Figure 6. Fuzzy membership functions representing input variables.

Figure 7. Data collected from a real environment over five days: (a) multilevel activity graph; and (b) scattered data based on start times and activity duration.

Figure 8. Confusion matrix plot for ADL recognition results.

Table 1. List of sensors used in the experiment that can measure different conditions and activities (* denotes the unused sensors in this research).

Sensors	Sensors’ Quantity	Used for
Passive Infrared Red (PIR)	6	Detecting the movement and occupancy
On/off door switches	2	Detecting when doors are open
Mat pressure sensor	1	Measuring bed occupancy
Electricity usage sensors	2	Plugs measuring electricity consumption
		usage, e.g., microwave and kettle.
* Indoor temperature sensors	5	Measuring ambient temperature
* Humidity sensors	5	Measuring ambient humidity
* Outdoor temperature sensors	1	Measuring outside temperature
* Light intensity sensors	6	Measuring ambient light intensity

Table 2. Accuracy, recall, and precision for each activity obtained based on the proposed N-FFSM method.

Activities	Accuracy	Recall	Precision
(1) Sleeping	85.7	84.1	81.8
(2) Bedroom	93.9	93.3	96.8
(3) Toilet	87.5	86.4	100
(4) Kitchen	100	99.1	90.5
(5) Dining room	100	97.8	100
(6) Living room	100	98.3	100
(7) Garden	100	90.9	100
(8) Leaving home	100	95.2	100

Table 3. Accuracy, recall, and precision for each activity obtained based on FFSM.

Activities	Accuracy	Recall	Precision
(1) Sleeping	38.4	37.8	100
(2) Bedroom	24.2	24.0	66.6
(3) Toilet	75	74.0	31.5
(4) Kitchen	40.7	40.4	27.5
(5) Dining room	100	98.9	81.8
(6) Living room	0	0	NaN
(7) Garden	100	83.3	100
(8) Leaving home	0	0	NaN

Table 4. Accuracy, recall, and precision for each activity obtained based on NNs.

Activities	Accuracy	Recall	Precision
(1) Sleeping	100	98.4	76.4
(2) Bedroom	89.1	88.7	100
(3) Toilet	93.7	92.5	93.7
(4) Kitchen	85.7	85.1	88.8
(5) Dining room	70	69.3	77.7
(6) Living room	100	95.2	66.6
(7) Garden	NaN	0	0
(8) Leaving home	100	95.2	100

Table 5. Overall accuracy, recall, and precision obtained based on the N-FFSM, FFSM, and NN methods.

Activities	Accuracy	Recall	Precision
N-FFSM (proposed)	95.2%	93.18%	96.16%
NNs model	78.72%	78.81%	75.44%
FFSM model	46.61%	44.84%	NaN

© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Mohmed, G.; Lotfi, A.; Pourabdollah, A. Human Activities Recognition Based on Neuro-Fuzzy Finite State Machine. Technologies 2018, 6, 110. https://doi.org/10.3390/technologies6040110

AMA Style

Mohmed G, Lotfi A, Pourabdollah A. Human Activities Recognition Based on Neuro-Fuzzy Finite State Machine. Technologies. 2018; 6(4):110. https://doi.org/10.3390/technologies6040110

Chicago/Turabian Style

Mohmed, Gadelhag, Ahmad Lotfi, and Amir Pourabdollah. 2018. "Human Activities Recognition Based on Neuro-Fuzzy Finite State Machine" Technologies 6, no. 4: 110. https://doi.org/10.3390/technologies6040110

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Human Activities Recognition Based on Neuro-Fuzzy Finite State Machine

Abstract

1. Introduction

2. Literature Review

3. Methodology

3.1. Fuzzy Finite State Machine

3.2. Neuro-Fuzzy Finite State Machine

4. Case Study

4.1. Human Activity Recognition

4.2. Data Collection System

4.3. System States’ Definition

4.4. Input Variables’ Definition

4.5. Transition Function Definition

4.6. Output Definition

4.7. Results

4.8. Comparison with Existing Modelling Techniques

5. Discussion

6. Conclusions

Author Contributions

Funding

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI