An Overview of Software Sensor Applications in Biosystem Monitoring and Control

Badreldin, Nasem; Cheng, Xiaodong; Youssef, Ali

doi:10.3390/s24206738

Open AccessReview

An Overview of Software Sensor Applications in Biosystem Monitoring and Control

by

Nasem Badreldin

¹

,

Xiaodong Cheng

²

and

Ali Youssef

^3,*

¹

Department of Soil Science, University of Manitoba, 13 Freedman Crescent, Winnipeg, MB R3T 2N2, Canada

²

Mathematical and Statistical Methods Group (Biometris), Department of Plant Science, Wageningen University & Research, 6700 AA Wageningen, The Netherlands

³

Adaptation Physiology Group, Wageningen University & Research, P.O. Box 338, 6700 AH Wageningen, The Netherlands

^*

Author to whom correspondence should be addressed.

Sensors 2024, 24(20), 6738; https://doi.org/10.3390/s24206738

Submission received: 20 August 2024 / Revised: 6 October 2024 / Accepted: 17 October 2024 / Published: 20 October 2024

(This article belongs to the Section Smart Agriculture)

Download

Browse Figures

Versions Notes

Abstract

:

This review highlights the critical role of software sensors in advancing biosystem monitoring and control by addressing the unique challenges biological systems pose. Biosystems—from cellular interactions to ecological dynamics—are characterized by intrinsic nonlinearity, temporal variability, and uncertainty, posing significant challenges for traditional monitoring approaches. A critical challenge highlighted is that what is typically measurable may not align with what needs to be monitored. Software sensors offer a transformative approach by integrating hardware sensor data with advanced computational models, enabling the indirect estimation of hard-to-measure variables, such as stress indicators, health metrics in animals and humans, and key soil properties. This article outlines advancements in sensor technologies and their integration into model-based monitoring and control systems, leveraging the capabilities of Internet of Things (IoT) devices, wearables, remote sensing, and smart sensors. It provides an overview of common methodologies for designing software sensors, focusing on the modelling process. The discussion contrasts hypothetico-deductive (mechanistic) models with inductive (data-driven) models, illustrating the trade-offs between model accuracy and interpretability. Specific case studies are presented, showcasing software sensor applications such as the use of a Kalman filter in greenhouse control, the remote detection of soil organic matter, and sound recognition algorithms for the early detection of respiratory infections in animals. Key challenges in designing software sensors, including the complexity of biological systems, inherent temporal and individual variabilities, and the trade-offs between model simplicity and predictive performance, are also discussed. This review emphasizes the potential of software sensors to enhance decision-making and promote sustainability in agriculture, healthcare, and environmental monitoring.

Keywords:

software sensors; biosystems; monitoring; control; machine-learning; digital agriculture

1. Introduction

In general, “Biosystems” refers to complex living systems that involve biological components (e.g., cells, tissues, organs, whole organisms, and ecosystems) interacting with each other and their surrounding (micro-)environment [1]. In other words, as per Ruth and Hannon [2], biosystems are systems in which the crucial part of their underlying processes is a living organism, serving as the protagonist that shapes and steers the processes defining these systems. Such systems are complex assemblages of interacting physical, chemical, and biological processes. A significant number of these processes have inherent nonlinearity, and there is substantial uncertainty about their interconnections and nature [3,4]. Additionally, these processes can span the spectrum of scale and complexity, from the microscopic intercellular biological processes like photosynthesis and cellular respiration within a single cell to the macroscopic intricate relationships between living organisms and their environment, such as predator–prey interactions and soil microbiome.

The comprehensive study, understanding, and control of biosystems necessitate watchful monitoring and close observation to unravel their complexity and harness their potential. Therefore, biosystem monitoring plays a decisive role in ensuring the optimal operation of contemporary industrial operations, encompassing agricultural and biological processes. The growing complexity of biological systems in agriculture and environmental conservation has made real-time monitoring more essential than ever, especially as global efforts intensify to improve food security and natural resource sustainability.

Biosystem monitoring involves systematically collecting and analyzing data related to the living organisms involved and their underlying processes. This includes monitoring the living organism’s physiological and behavioral responses. The ultimate goal of biosystem monitoring is to gain insights into the underlying mechanisms, identify potential issues, and optimize conditions for enhanced efficiency and productivity.

In recent decades, there has been a remarkable surge in the application of computational methods to monitor biosystems. This progress has been powered by revolutionary advancements in sensor and sensing technologies, providing us unprecedented access to biological and environmental measurements. Historically, biosystem monitoring relied heavily on hardware sensors, but the limitations in physical accessibility and high cost have prompted a shift toward more sophisticated approaches, paving the way for the rapid development of software sensors capable of addressing these challenges. Concurrently, significant steps have been made in computational methods and techniques for analyzing and interpreting this wealth of biological data.

Monitoring and controlling biosystem variables, such as biomass [5], product concentrations, metabolic rate, and energy expenditure [6], soil organic matter and moisture contents [7,8,9,10], pest outbreak control [11], nutrient-plant management, and greenhouse controlling [12,13,14,15], in real-time or online poses a real challenge. The difficulty stems from issues related to physical accessibility, making certain measurements too difficult. Moreover, the absence of cost-effective and dependable sensors renders some measurements too expensive to undertake. One way to surmount this challenge is to depend on some auxiliary measurable [16,17] variables to indirectly infer the unmeasurable or inaccessible process variables instead of the direct physical measurement of these variables [16]. Such a combination of these auxiliary measured variables (using hardware sensors) and estimation algorithm (software) is termed “software sensor”, soft sensor, inferential sensor, or virtual sensor [16,17,18,19].

This review article explores the innovative application of software sensors for monitoring and controlling biosystems, with a focus on their potential to revolutionize the way we manage complex biological processes. A critical challenge we will address is the mismatch between what is typically measurable and what truly needs to be monitored. This review will highlight how software sensors can bridge this gap by combining hardware sensor data with advanced computational modelling techniques to indirectly infer hard-to-measure target variables, such as stress level, animal and human health indicators, and chemical soil properties. Through three case studies, we demonstrate the advantages of this approach in managing the inherent complexities of biological processes, showcasing its applicability across diverse biosystems. We also address potential challenges and limitations, such as technical hurdles and the need for rigorous calibration against physical measurements. Our aim is to provide a comprehensive overview of software sensors and their potential in contributing to sustainable practices in various sectors, including agriculture, healthcare, and environmental monitoring.

2. Model-Based Monitoring and Controlling of Biosystems Concepts

The fourth industrial revolution, also known as Industry 4.0, has brought significant advancements in sensor technologies (e.g., Internet-of-Things (IoT) and smart sensors), which led to a substantial improvement in the monitoring and control of biosystems [20]. This has enabled a more accurate, faster, smaller/compact, and notably cheaper means for data gathering and logging biological and environmental data. The advancements in downsizing has facilitated the creation of micro-sensors, which can be integrated into various environments. The utilization of enhanced materials has facilitated the development of sensors with increased levels of sensitivity and selectivity, allowing them to accurately detect even the most subtle changes in biological signals [20,21]. Moreover, with these cost-effective sensing advancements, deploying multiple sensors in the field and integrating multimodal data to make informed decisions becomes possible. For example, in agriculture, different sensors can gather data about soil conditions, weather, and crop health to optimize crop yields [22,23,24]. In many applications, sensor technologies are used with control systems, which provide continuous feedback of process variables to automate biosystem processes. For instance, in industrial biotechnology, advanced sensors can provide continuous feedback on the bioreactor conditions to the control system, optimizing the production of pharmaceuticals and other bioproducts [25].

With that said, in the context of monitoring biosystems, it is often noticed that what we can typically quantify using conventional hardware sensors may not align with what we truly desire to monitor and study. While hardware sensors can provide invaluable data on various physical, chemical, and physiological parameters, hereafter known as “measured variables”, such as temperature, pH level, heart rate, etc., they often capture a very limited snapshot of the holistic reality of the biosystem. We truly aspire to observe and monitor the nuanced behaviours and the emergent status of these systems, hereafter called “target variables” (e.g., performance, stress, health, wellbeing, …), which transcend the capacity of traditional hardware sensors. Thus, bridging this gap between the measured variables (filed data) and the target variable(s) remains a remarkable challenge to monitor and control many biosystems [16].

Furthermore, in many cases of biosystem applications, it is often that several vital biosystem-related variables are challenging to measure directly using hardware sensors due to their difficulty (e.g., plant biomass, chemical soil properties, plant nutrient dynamics), inaccessibility (e.g., intercellular signalling, pain and thermal perceptions, and ecological relationships in remote or extreme environments), or high cost (e.g., energy expenditure, eDNA, and proteomics).

A model-based monitoring of biosystems is a promising tool for overcoming the aforementioned challenges associated with the direct use of hardware sensors. Instead, it employs the software sensor approach, integrating mathematical models or computational algorithms with sensor data [16]. This approach can offer a more comprehensive and accurate estimation of biosystem target variables. Accordingly, this can enhance our ability to predict, control, and optimize biosystems, ultimately promoting better decision-making in fields such as medical and veterinarian diagnostics, biotechnology, and environmental and agricultural sciences. Figure 1 depicts a general schema of the model-based monitoring and control (M_bMC) system [16]. The system begins with the biosystem, represented by a block labelled “Biosystem”, which encompasses a range of living components, from individual cells and organisms to entire ecosystems. The biosystem interacts with its surrounding environment and is affected by various inputs, including external factors such as weather conditions and control signals. The outputs of the biosystem, also referred to as bio-responses [1,26], can be directly measured using hardware sensors. The measured variables (e.g., acceleration, video/sound signals, satellite images, and soil temperature), also known as field sensor data, are integrated with the estimation algorithm component (block) to indirectly infer the target variable (e.g., infection, animal stress, growth rate, and soil organic matter), which is the variable directly related to the final objective of the monitoring system [27,28].

The estimated target variable is then used by a control system, which can be a decision support system or an automated control system. This system makes decisions based on the continuously updated data from the target variable, adjusting input variables to optimize the performance [29] of the biosystem (e.g., bio-productivity). For example, the control system could regulate environmental conditions within a greenhouse or adjust feeding schedules for livestock based on real-time estimates of plant and animal growth rate.

The software sensor is a critical component of the M_bMC system, bridging the gap between the measured variables and the target variables by providing a virtual representation of the target variable. It is this capability that allows for the precise and effective monitoring and control of complex biosystems.

3. Software Sensors

The advent of the digital revolution has drawn attention to the need for a different type of sensor capable of performing computational inferences and estimating unreadily observable variables. This has given rise to the concept of software sensors, which are also known as soft, virtual, and inferential sensors. The concept of software sensors is based on the idea of inferring (estimating) essential biosystem variables (target variables) that are challenging to measure directly by fusing data acquired from hardware sensor technology with an estimation model or algorithm (i.e., software). As illustrated in Figure 1, the software sensor comprises two main components: the hardware sensor(s), responsible for generating the measured variable(s), and the software component, which encompasses an estimation algorithm for the target variable prediction.

3.1. The Emergence of Software Sensor

The concept of software sensing originated from the domain of systems and control theory [17,30,31]. In many control applications, we need to estimate some internal states of a dynamic system that are unmeasured or difficult to measure for control purposes. The particular estimation algorithm is known as a “state estimator” in the context of control theory, by which the future values of the system states are estimated or predicted based on the current measured states together with the dynamic model of the system [32]. Depending on how uncertainties in a process are treated, state estimation methods can be deterministic or stochastic. The former is usually referred to as “state-observers”, and the latter is called Bayesian filtering, a general statistical method for estimating an unknown probability distribution over time with sensor measurements and a mathematical model. A well-known Bayesian filtering is the Kalman filter, which is tailored for linear dynamic systems and Gaussian noises. For more detailed information on the concepts of state observers and the Kalman filter, please refer to Section 3.2.

Additionally, the principle of the software sensor aligns closely with the concept of inferential control proposed in the 1970s by Brosilow [33]. The concept of inferential control is illustrated in Figure 2. This diagram highlights the key components of this approach. In the inferential control scheme (Figure 2), the controlled variable (i.e., the primary process variable, or the target variable,

y

) is not measured directly but is instead estimated from secondary (inferential) measured variables. These inferential variables (

z

) are chosen to maintain a strong correlation with the unmeasured output variable (

y

) in response to changes in the process input (manipulated) variables (

m

). In this context, the software sensor is represented by the combination of the measured inferential variables and the estimator (inferential model), which, in this specific example [33], is formulated using the transfer functions

G_{p}

and

G_{s}

(as shown in Figure 2). Here, the estimated value (

\hat{y}

) of the unmeasured output variable (

y

) is performing the job of the regular output as if it were measured in the feedback control loop. In essence, the estimator (inferential model) in the inferential control scheme (Figure 2) serves as the software sensor, integrating the measured inferential variables with a model to infer the unmeasured variable.

3.2. State Observers and Kalman Filters

State observers and Kalman filter are widely recognized as the earliest and most commonly used forms of software sensors. Their foundational role in this domain stems from their robust capabilities in unmeasured state estimation and extensive applicability across various fields. In the systems and control theory, a dynamic system is often described by an input–output state space model, which is a first-order differential equation about state variables x for continuous-time systems or a difference equation about x in the discrete-time case. The input variables u impart energy or force allowing an alteration of the state variables, and the measurement signals y are the system’s outputs. A state observer estimates the internal states x of a dynamic system based on available measurements y and the input variables u. The diagram of the state observer is shown in Figure 3. The state observer utilizes a state space model of the system, incorporating the measurements y and the inputs u, and by comparing the predicted outputs

\hat{y}

with the actual measurements y, the state observer provides the estimates

\hat{x}

of the internal states and continuously refines the estimation over time. This iterative process enables the observer to provide real-time or near-real-time estimates of internal states, facilitating feedback control strategies and enhancing the overall performance of control systems in complex and dynamic environments.

The following will look at the commonly employed Luenberger observer as an illustrative example. Consider a discrete-time linear state space model:

x (k + 1) = A x (k) + B u (k)

(1)

where y(k) = Cx(k), and A, B, and C are constant matrices. The values of state variables at time

k + 1

,

x (k + 1)

are predicted from current value

x (k)

and inputs

u (k)

using the state equation. The second equation then describes how the outputs

y (k)

are generated. In the Luenberger observer, the difference between the predicted output

\hat{y} (k)

will be subtracted from the measured outputs

y (k)

and then amplified by a constant gain matrix

L

to correct and refine the estimates of state variables, as described by the equations below.

\hat{x} (k + 1) = A \hat{x} (k) + B u (k) + L [y (k) - \hat{y} (k)]

(2)

where

\hat{y} (k)

= C

\hat{x} (k)

.

Provided a system is observable, namely, the so-called observability matrix

O = [\begin{matrix} C \\ \begin{matrix} C A \\ ⋮ \end{matrix} \\ {C A}^{n - 1} \end{matrix}]

(3)

is full rank, the matrix parameter

L

can be designed such that the observer error

e (k) ≔ \hat{x} (k) - x (k)

converges to zero as k goes to infinity. More precisely, the Luenberger observer error follows the dynamics

e (k + 1) = (A - L C) e (k)

, and therefore, when

A - L C

is stable (i.e., all its eigenvalues are inside the unit circle), then the error dynamics is asymptotically stable, meaning that

e (k)

converges to zero as

k

increases, for any initial error

e (0)

.

The above Luenberger observer is a type of full-order observer that aims to estimate all the state variables. On the other hand, reduced-order observers are an alternative approach for software sensor that uses system measurements to estimate only the ‘hidden’ states (i.e., target variables) as a subset of the state variables of a system. While reduced-order observers may be more challenging to design compared with their full-order counterparts, they offer advantages in terms of computational efficiency, simplicity, and potential for improved performance in certain applications [34,35,36].

The Kalman filter is named after Rudolf E. Kálmán (19 May 1930–2 July 2016), who published his famous paper [37] that describes a recursive method for estimating the state of a linear dynamic system from a series of measurements with Gaussian noises. Consider the linear discrete-time system:

x_{k} = {A x}_{k - 1} + {B u}_{k} + w_{k}

(4)

y_{k} = {C x}_{k - 1} + v_{k}

(5)

where

x_{k - 1}

denotes the state at time

k - 1

, and

x_{k}

,

u_{k - 1}

, and

y_{k - 1}

are the state, control input, and measured output at time

k

, respectively. Moreover, the constant matrices

A

,

B

, and

C

are referred to as transition matrix, input matrix, and output matrix, respectively,

w_{k}

is the Gaussian process noise with covariance matrix

Q

, and

v_{k}

is the measurement noise with covariance matrix

R

. The objective of the Kalman filter is to discover the probability distribution of the state

x

at time

k

.

The standard Kalman filter is a two-step process, as shown in Figure 4. The first step is prediction, i.e., to compute the state prediction and the error covariance

P

based on the linear model without sensor measurements. This error covariance matrix

P

can be considered a measure of uncertainty in the estimated state. This variance comes from the process noise and propagation of the uncertain

{\hat{x}}_{k}^{-}

. At the start of the algorithm, the values for

\hat{x}

and

P

come from their initial estimates. The second step is called correction, where we weigh in the sensor measurement. More precisely, the a priori estimates calculated in the prediction step are updated to find the a posteriori estimates of the state

{\hat{x}}_{k}

and error covariance

P_{k}

. To do so, the Kalman gain K is calculated such that it minimizes this new error covariance

P_{k}

, and it determines how heavily the measurement and the a priori estimate contribute to the calculation of

x_{k}

, or simply speaking, how much the sensor measurements can be trusted. If we have sensors providing accurate measurements, meaning that the measurement noise is small, then the measurement can be trusted more and have more weight such that it contributes to the calculation of

x_{k}

, more than the a priori state estimate does. In the opposite case where we have bad sensors, the measurement noise will be large, namely a large

R

, then the sensors are trusted less, so in the equation, the computation of

P_{k}

mostly comes from the a priori estimate.

Aside from the standard Kalman filter, which is widely used for state estimation in linear dynamic systems with Gaussian noise, there have been various developments and variations to accommodate different types of systems and noise characteristics. Below, we list some of the notable ones:

(1): Extended Kalman Filter (EKF) [38,39]: This is an extension of the Kalman filter to nonlinear systems. It linearizes the nonlinear system at each time step around the current mean and covariance estimate and then applies the standard Kalman filter equations. While widely used, the EKF has limitations, especially when dealing with highly nonlinear systems or when linearization errors are significant.
(2): Unscented Kalman Filter (UKF) [40]: The UKF addresses some of the limitations of the EKF by approximating the mean and covariance through a set of carefully chosen sample points (called sigma points) rather than linearization. It captures the mean and covariance of the state distribution more accurately in nonlinear systems and is more robust to nonlinearity than the EKF.
(3): Ensemble Kalman Filter (EnKF) [41,42]: The EnKF is an extension to handle large-scale systems, which uses a Monte Carlo approach with a finite number of ensemble members. Essentially, it makes use of sample covariance in place of the traditional covariance matrix.
(4): Particle Filter [43] is a non-parametric filter representing the state estimate as a set of weighted particles (similar to the ensemble members in EnKF). Unlike other Kalman filter variants, the particle filter does not rely on Gaussian assumptions, and therefore, it can deal with highly nonlinear and non-Gaussian systems. However, this flexibility comes at the cost of efficiency compared with EnKF.

3.3. Software Sensor Design

This section provides a brief overview of the common methodologies for designing software sensors, aiming to help non-expert readers understand the main components of the design process without delving into exhaustive theoretical and mathematical details. For more in-depth theoretical insights, readers are encouraged to refer to the cited bibliography.

The model, or estimation algorithm, is the cornerstone of the software sensor (refer to Figure 1), making its development and identification the most critical step of the software sensor design process. Hence, based on the modelling approach employed in the estimation algorithms, software sensors can be classified into two subtypes: the hypothetico-deductive (or white-box) approach and the inductive (or black-box) approach. In the following subsection the main differences in both modelling approaches are explained.

3.3.1. Modelling Approaches

The hypothetico-deductive (white-box) modelling approach involves defining a priori conceptual model structures based on first principles (ab initio) assumptions derived from established scientific paradigms [44]. This method, also known as mechanistic modelling, relies on fundamental physical laws such as thermodynamics, fluid dynamics, and heat transfer. For example, the calculation of food thermal properties for designing storage and refrigeration equipment strictly using heat transfer and thermodynamics laws, without incorporating any empirical models or fitting parameters, exemplifies a deductive modelling approach. The primary advantage of this method is its ability to provide a deep mechanistic/physical understanding of the process, leading to high accuracy if the system is well understood and the necessary parameters are available. However, this approach often requires extensive domain knowledge and detailed data, which can be a limitation in complex or poorly characterized systems.

The inductive or data-driven (black-box) modelling, on the other hand, abstains from making theoretical preconceptions at the early stages of analysis [44]. Instead, it aims to discover patterns directly from the observational data. The model structure in this data-driven approach is not pre-specified but is inferred from the data using statistical and machine learning techniques. Inductive modelling encompasses a variety of methods, including artificial neural networks (ANN), support vector machines (SVM), and more recently, advanced deep learning techniques. These methods excel in capturing complex patterns and relationships within large datasets without requiring detailed knowledge of the underlying physical/physiological processes. The rapid growth in computational power and data availability has significantly expanded the use of inductive modelling in soft-sensing applications. However, the accuracy of these models is highly dependent on the quality and quantity of data, and they may struggle to generalize beyond the conditions represented in the training data.

The choice between deductive and inductive approaches depends on several factors, including the complexity of the system, the availability of process data, the level of understanding of the underlying process mechanisms, and the specific objective of the software sensor design.

Hypothetico-deductive reasoning has long been integral to biological research practice, underpinning foundational theories such as evolution and germ theories. However, the inherent complexity and nonlinearity of biological systems make extreme deductive (pure white-box) modelling approaches challenging [1,16]. This often results in highly intricate mechanistic models characterized by numerous differential equations and parameters. For instance, metabolic processes and pathways are typically modelled using a series of differential equations that describe the time evolution of metabolites and enzyme concentrations in cellular biochemical reactions [45]. Such computational complexity can place a significant burden on resources, leading to slower convergence times. This, in turn, hampers the practical application of these models in real-time or online software sensing applications.

Conversely, the inductive approach prioritizes practical problem-solving irrespective of the method used. While an extremely inductive (data-driven) model might accurately fit the available data, it can lead to false predictions without understanding the underlying system and mechanisms.

Nevertheless, the inductive approach offers a more pragmatic solution for designing software sensors, especially when the primary goal is to achieve practical and timely results without needing intricate mechanistic details. Additionally, this approach leverages the power of empirical data and advanced computational techniques to develop models that can adapt and respond to real-time data with greater flexibility and efficiency. Therefore, for the sake of convenience, this article focuses exclusively on data-driven software sensors.

3.3.2. Data-Driven Software Sensors

Designing a software sensor using a data-driven approach involves several critical steps to infer the unmeasured (target) variable from measured (auxiliary) data. These steps encompass data acquisition, the selection of auxiliary variables and features, modelling, model validation and testing, and implementation. In this article, we will mainly focus on the modelling step of the design process, including the selection of the appropriate data-driven modelling technique. For more comprehensive information on the other essential steps, readers can consult specific literature sources such as [17,18,46,47,48,49], which provide in-depth discussions and methodologies for effective data acquisition and preparation, feature engineering, and system integration needed for the software sensor design process.

The choice of data-driven modelling techniques is fundamentally driven by the nature of the target variable that needs to be estimated. When the target variable is continuous, meaning it can take any numerical value

y

within a range

[a, b] \in R

, a regression model (Figure 5) is typically employed. Regression models are supervised learning algorithms designed to predict continuous value by identifying the relationship between measured input (auxiliary) variables and the continuous target output. For instance, predicting cell growth rate [50], animal weight [51] or biomass concentration [45] in biosystem monitoring or control applications would necessitate the use of regression techniques such as linear and polynomial regression [52], decision trees for regression [53], or more advanced methods such as support vector regression (SVR) [54] or neural networks (NNs) [55].

On the other hand, when the target variable is discrete, meaning it can only take on a specific set of discrete values or labels

y \in \{0,1, \dots, n\}, n \in N

, a classification model is the appropriate choice. Classification models are used to predict categorical outcomes by learning from the patterns and relationships in the input data. For example, determining the presence or absence of a particular condition in biosystems, such as infection status [56,57], or predicting different levels of biological responses, such as thermal sensation [58], would require the use of classification techniques such as logistic regression [59], k-nearest neighbours [60], decision trees and random forests (RF) [61], support vector machines (SVM) [60], or NNs [62].

4. Applications and Biosystem Case Studies

Software sensors are utilized in many applications across diverse fields, spanning from manufacturing, industrial processes, environmental engineering, and energy management to medical applications, agriculture, biology, and beyond. Table 1 presents an overview of exemplary applications of the software sensing approach in different fields.

The utilization of software sensors in the fields of biology and biosystems is highly prevalent. In digital agriculture, for instance, software sensors emerge as game-changers. Traditionally, estimating soil moisture content, for example, relied on costly methods such as wireless sensor networks and satellite imagery, which often lacked the required precision [63]. Researchers have introduced an innovative solution to these challenges—a soft sensing algorithm leveraging deep learning techniques [7]. This approach creates a virtual soil moisture sensor, bypassing the limitations associated with physical sensors and offering a more accurate and cost-effective solution. The integration of software sensors has similarly revolutionized practices in animal sciences and precision livestock farming (PLF), offering non-invasive means (Table 1) of monitoring animal behaviour, health, and welfare in real-time. For example, Youssef et al. [64,65] employed a software sensing approach to non-invasively monitor the cardiogenic signal and heart rate from moving pigs and incubated avian embryos.

Moreover, advancements in computer vision algorithms and imaging technology enable the real-time monitoring and recognition of animal activities [66,67,68]. Additionally, software sensors are employed in early warning systems, such as in [56,69,70,71], for infection and animal health problems using contactless sensors (e.g., microphones). With the advancements in software sensor technology, digital healthcare is being promoted to change people’s lives [72]. Data reliability and robustness can be enhanced by constructing sensor arrays that simultaneously gather comprehensive biological parameter data from multiple body locations [72].

Table 1. Overview of software sensor applications in different fields.

	Field	Example Applications
Various applications	Manufacturing and industrial processes	- Process Industry: [48,73]
	Environmental engineering	- Wastewater management: [74,75,76] - Environmental monitoring: [77] - Ecology: [78]
	Transportation and smart cities	- Smart cities: [79,80,81,82]
	Cybersecurity	- [83,84,85,86,87,88]
Biosystems applications	Biology	- Neuroscience: [72] - Molecular and cell biology: [50,89,90] - Fermentation: [50,83,89,90,91] - Bioprocess monitoring and control: [47,50,92,93,94]
	Medical and human health	- Digital Healthcare: [52,95,96,97,98] - Pharmaceutical and drug discovery: [99,100]
	Agriculture and animal health	- Digital agriculture: [101,102] - Soil moisture estimation: [7,103,104,105] - Greenhouse monitoring and control: [106,107]
		- Precision livestock farming (PLF): [51,64,65,66,68,108,109,110,111]

While this review focuses on the application of software sensors in biosystems, it builds upon previous research works in the field of model-based monitoring and the control of different biosystems, such as [1,69,112,113,114,115,116,117]. This review extends these previous efforts by focusing on the unique challenges of biosystems and exploring the potential of software sensors in managing their inherent complexity.

In the following subsections, we present three case studies to illustrate the applications of data-driven software sensors in monitoring and controlling various biosystems. The first case study demonstrates the use of software sensors for the real-time monitoring of animal health, specifically as an early warning tool for respiratory infections. The second case study showcases the application of software sensors in the fusion of different sensor data to estimate difficult-to-measure biosystem variables, such as soil organic matter. Lastly, the third case study explores the use of software sensors for the automatic control of biosystems, exemplified by a greenhouse environment.

4.1. Early Warning System: Software Sensor for Real-Time Monitoring off Animal Respiratory Infection

In general, respiratory infections pose a significant challenge to animal health and welfare, especially in the case of intensive animal farming, particularly among calves and pigs. The increased severity and mortality rates associated with such infections show the pressing need for a robust early warning system. In current practice, the reliance on manual illness detection and subsequent veterinarian intervention seems time-consuming, does not occur early enough, and is expensive [69]. Traditionally, respiratory infections and sounds in commercial livestock farms are typically monitored via direct human observation [118]. However, as herd sizes grow and production intensifies, individual animals receive less attention [119]. This decline in available time and resources may cause producers to overlook signs of respiratory diseases [118,120], often resulting in late intervention when symptoms become noticeable. Hence, there is a compelling call for an efficient automatic monitoring system to enhance the early detection of animal respiratory infections and their related outcomes, such as mortality. A comprehensive review of various automated monitoring techniques for animal respiration and sounds, highlighting their advantages over traditional methods, is provided in [118].

Automatic monitoring systems should be robust, providing continuous and unobtrusive surveillance without disrupting the animal’s environment. Additionally, detecting the infection at an early stage requires swift observation of the clinical signs associated with the disease.

Coughs are one of the primal clinical signs associated with respiratory infections such as bovine respiratory diseases (BRD) [57]. The distinctive sound of coughing can serve as a key feature indicator for predicting respiratory infections in animals. The advantage of coughing sounds, and bioacoustics in general, lies in their ability to be measured noninvasively and remotely using microphones, ensuring minimal disruption to the animal’s normal behaviour [56]. Over the past two decades, different research efforts (e.g., [56,69,112,121,122,123,124,125,126,127]) have utilized sound measurements in conjunction with various algorithms to extract acoustic features indicative of respiratory infections. This approach can essentially be perceived as a software sensor, operating to indirectly predict an animal’s infection status. In this section, we will clarify the primary structure and main components (Figure 6) of the software sensor employed to predict whether an animal is infected based on its coughing sound. As illustrated in Figure 6, the target variable in this context is a binary label,

y \in \{0,1\}

, indicating infected = 1 or not-infected = 0 animal status. The measured variable is sound signals recorded using microphones, representing the hardware sensor component, see Figure 6. Functioning as the software component, two algorithms, namely, Model I and Model II (see Figure 6), are utilized as follows:

4.1.1. Model I: Feature Extraction and Coughing Sound Recognition

This algorithm is responsible for extracting the key feature indicators, the cough sound, and other features from the measured variable—specifically, the sound signals—associated with the animal respiratory infection.

For coughing sound recognition, acoustic features can be manually extracted from the audio waveforms. Specifically, the Mel-frequency cepstral coefficient (MFCC) has been used for pig cough recognition and classification [70]. Moreover, other time and frequency domain features, such as power spectral density (PSD), spectral entropy, and root mean squares (RMS), are frequently used for cough sound recognition in both pigs and calves, as evidenced in studies [56,69,128]. An exemplar waveform and spectrogram visualization [113] of extracted coughing sounds from sick pigs is depicted in Figure 7. The presented coughing sound data are recorded during an experimental study; for more information, see [129].

Alternative approaches utilize machine learning techniques, such as CNN, for coughing sound classification and recognition. For instance, Yin et al. [71] employed a deep CNN model (AlexNet) in an end-to-end manner to recognize sick pigs’ coughs. However, this approach is computationally demanding and impractical for real-time applications. To overcome these drawbacks, researchers like Shen et al. [122] adopted a pragmatic strategy by employing shallow CNN to extract deep acoustic features. These features were then fed to a lighter classifier, such as SVM, to recognize pigs’ coughing sounds. An extensive review of cough sound recognition and classification approaches is represented by Legua et al. [130].

4.1.2. Model II: Detection of Infected Animals

This algorithm, which is a supervised classification model in this context, learns from the labelled dataset, using the gold standard, to assign the extracted input features (e.g., number of coughing events), using Model I, to one of the target variable labels. In the context of this example, the target variable poses only two possible labels, namely infected or not-infected. Thus, the goal of Model II is to solve the classification problem

y = F (x)

, where

F (\cdot)

is a decision function that maps the input features

x

to the output (target) variable

y

.

To find the optimal parameters for

F (\cdot)

, the model needs to be trained on a given labelled dataset. Consider the labelled training set

{\{(x_{i}, y_{i})\}}_{i = 1}^{n}

with input data

x \in R^{n}

, where

n \in N

is the number training samples, and output data

y_{i} \in R

with class labels

y_{i} \in \{0,1\}

indicating infected (1) or not-infected (0). The classification problem presented here can be solved by a binary classifier such as SVM, NNs, random forest, logistic regression, etc. Binary classifiers involve finding the parameters that minimize a certain cost or loss function. The specific form of the function

F

and the associated optimization process depends on the chosen algorithm. For example, in logistic regression, a sigmoid function is defined as follows:

F (x) = \frac{1}{1 + e^{- (β_{0} + β_{1} x_{1} + \dots + β_{k} x_{k})}}

(6)

where

β_{0}, β_{1}, \dots, β_{k}

are the parameters to be optimized during the training process to find a decision boundary that separates the two classes in the defined feature space

{〈x_{1}, \dots, x_{k}〉 \in R}^{k}

.

4.2. Sensor Fusion: Software Sensor Application for Indirect Estimation of Soil Organic Matter

Assessing soil organic carbon (SOC) is a vital indication of soil health due to its importance in carbon sequestration, fertility, and soil moisture content [131,132]. Precisely mapping SOC levels in various landscapes helps evaluate ecosystem benefits, direct sustainable land-use methods, and reduce climate change impacts [133,134]. Also, advanced sensing technologies and data analytics now have a vital function in sustainable biosystem management, whether it involves optimizing crop growth conditions in high-tech greenhouses or evaluating the carbon storage potential of soils at various scales [133,135,136]. Both arenas highlight the significance of accurate, data-driven methods for protecting the health and productivity of key biological systems on the planet, regardless of their various scales and properties.

The use of hyperspectral remote sensing technology is a significant advancement in digital agriculture due to its ability to capture soil surfaces with exceptional spectral precision that can reach hundreds of spectral bands (i.e., groups of a particular range of wavelengths) [137,138,139]. Hyperspectral imaging using the visible near-infrared (VIS-NIR) is particularly effective in detecting near-real-time fine spectral differences, as each material on the soil’s surface has a unique spectral signature akin to a fingerprint, which is often called “spectral signature”. This allows sensing capability to discriminate between different soil surface materials and improve SOC predictions [140,141,142,143,144]. In this case study, a software sensor is employed to estimate SOC. Such application requires following specific data management and processing frameworks such as those shown in Figure 8.

Software sensors begin their role at the data fusion stage, where hyperspectral imaging data are combined with other relevant data sources such as climate data, soil type, and historical crop yield data. Spectral features that correlate with SOC levels are extracted using methods like principal component analysis (PCA) or selecting spectral indices sensitive to SOC. This approach reduces dimensionality while preserving variance in spectral data [145,146]. PCA is a technique that converts the initial spectral data into a collection of linearly independent variables known as principal components (

P C

) and are given as follows:

P C (X) = W^{T} \cdot X

(7)

where X is the matrix of spectral data and W is the matrix of weights or coefficients. Once relevant spectral features are extracted, two main modelling approaches might be considered; linear regression can be used when there is low spatial heterogeneity between the observation [147].

Y = β_{0} + β_{1} \cdot x_{1} + β_{2} \cdot x_{2} + \dots + β_{n} \cdot x_{n} + ε,

(8)

where

Y

is SOC content;

X_{1}, X_{2}, \dots, X_{n}

are the spectral features,

β_{0}, β_{1}, β_{2}, \dots, β_{n}

are the coefficients that describe the impact of each spectral feature on the SOC, and

ε

is the error.

For more complex relationships, nonlinear models or machine learning techniques such as the SVM, RF, or NNs might be employed. These techniques can more effectively capture nonlinearities and interactions between spectral features [148,149].

SVM models are utilized for predicting soil organic carbon (SOC) levels, whether through spatial classification or regression [150]. The SVM algorithm works by identifying an optimal hyperplane that best separates different groups of SOC data points in a high-dimensional space, thus enabling accurate the categorization and prediction of SOC levels [151].

The SVM model for SOC classification can be represented as follows:

f (x) = w \cdot \emptyset (x) + b,

(9)

where

x

is the input feature vector,

\emptyset (w)

is the feature vector transformed into a high-dimensional space,

w

is the weight vector, and

b

is the bias term. The decision function is based on the sign of

f (x)

.

RF is a machine-learning technique used for classification, regression, and other tasks. It works by creating many decision trees during the training process. A simple representation for an [152,153] RF regression model is the average of all the decision trees:

f (x) = \frac{1}{N} \sum_{i = 1}^{N} f_{i} (x),

(10)

where

f_{i} (x)

is SOC prediction of the

i

-th decision tree, and

N

is the number of trees in the RF.

Neural networks, especially deep learning models, are capable of capturing complex non-linear relationships. A simple feedforward neural network for regression can be mathematically represented as a series of transformations:

\begin{matrix} h_{1} = σ (W_{1} x_{+} b_{1}) \\ h_{2} = σ (W_{2} h_{1} + b_{2}) \\ \begin{matrix} ⋮ \\ f (x) = W_{k} h_{k - 1} + b_{k} \end{matrix} \end{matrix},

(11)

where

x

is the input,

h_{i}

are the hidden layers,

W_{i}

and

b_{i}

are the weights and biases of the

i

-th layer, and

σ

is a non-linear activation function such as the sigmoid, tanh, or ReLU. For capturing non-linearities and interactions between spectral features, the convolutional neural network (CNN) can be used when (e.g., raster data) derived from hyperspectral space-borne sensors such as the Environmental Mapping and Analysis Program (EnMAP) and the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA), or a Recurrent Neural Network (RNN) if the data are sequential, which depends on being multi-temporal, such as in the Open Soil Spectral Library (OSSL) [135,154,155,156,157].

4.3. Biosystem Control: Software Sensor Application for Automated Greenhouse Control

The modern greenhouse is a complex dynamic system that offers a controlled or partially enclosed space for plants, shielding them from external weather fluctuations. To foster an environment suitable for growing crops while minimizing energy usage, efficient control and optimization strategies have been applied to regulate greenhouse climate conditions, e.g., [13,158,159]. However, implementing these advanced control strategies is hardly achievable if we do not have accurate information about climate conditions and crop growth status. It is reported by [160] that sensor errors can lead to a significant negative effect on crop yield and energy use in the greenhouse.

In practice, the sensors that measure the climate states, such as air temperature, humidity, and carbon dioxide level, can be affected by noises from various sources. Moreover, the real-time and non-invasive measurement of the crop growth states, such as leaf area, dry weight, and fruit content, presents an even greater challenge due to the complex nature of plant physiology. In this context, soft sensing techniques offer promising avenues for addressing these challenges. One such solution is to apply data assimilation through Kalman filtering, and some relevant results can be found in [106,161,162].

A typical implementation of data assimilation in the lettuce greenhouse is outlined in [162], where the greenhouse system has a schematic representation, as shown in Figure 9. The microclimate within the greenhouse is characterized by three key variables: CO₂ concentration x₂, indoor temperature x₃, and humidity x₄. These environmental factors are subject to the influence of external weather conditions (denoted as d), such as solar radiation, external CO₂ levels, air temperature, and humidity, along with control operations (denoted as u), including CO₂ supply, ventilation, and heating. The environment states x₂, x₃, and x₄, weather conditions d, and control signal-dedicated sensors with inherent noises measure u. On the other hand, the dry weight of crops x₄ is hard to measure directly in practice. Therefore, the problem of interest here is to filter out the measurement noises, estimate the three environmental states of greenhouses, and infer unmeasurable dry weight using data assimilation.

The data assimilation requires a dynamic model of the greenhouse system in

x (k + 1) = f (x (k), u (k), d (k), w (k))) y (k) = g (x (k), v (k))

(12)

with all the variables at time step k, namely, state

x (k)

, measured output

y (k)

, controllable input

u (k)

, weather disturbance

d (k)

, process noise

w (k)

, and measurement noise

v (k)

. The variable

x (k + 1)

is the prediction environmental variable at time step

k + 1

. Thereby, the nonlinear function

f (\cdot)

is a complex function describing how the environmental variables (CO₂ concentration x₂, temperature x₃, and humidity x₄) and the crop variable (dry weight x₁) change over time, given the external weather condition d and control inputs u. In [162], the ensemble Kalman filter is employed for data assimilation, which roughly contains two main steps:

- Step 1: Predict the ensemble forward in time using the mathematical model, which simulates the effects of the external weather conditions and the control operations on the greenhouse system.

- Step 2: Update the ensemble with new sensor measurements using a formula incorporating measurement errors and the ensemble covariance. This step adjusts the ensemble to be consistent with the data and reduces the uncertainty of the state estimate. In this step, the algorithm also provides an estimate of the unmeasurable dry weight by including it in the state vector and updating it with the observations of the other variables, using the correlations between them. In [162], the ensemble Kalman filter estimates the parameters in the mathematical mode by treating them as part of the ensemble.

The output of the data assimilation is the estimated state variables, which include an estimation of crop dry weight and the filtered measurement of CO₂ concentration, indoor temperature, and humidity. These estimations are then passed to control design/decision-making, generating commands to regulate the greenhouse’s CO₂ supply, heating, and ventilation systems, see Figure 9.

5. Considerations for Designing Biosystem Software Sensors

The applications and case studies highlighted earlier have demonstrated the valuable advantages that software sensors can offer in the field of real-time monitoring and the control of biosystems. By combining data from traditional sensors with advanced computational models, these software sensors allow us to indirectly infer crucial variables that are difficult or impossible to measure directly, such as stress levels, animal health indicators, and soil properties. This opens up a wide range of practical possibilities. This enables real-time monitoring and decision-making, leading to more efficient and sustainable practices. For instance, software sensors can provide early warning systems for animal diseases (as shown in case study 1), optimize fertilizer applications in agriculture, and automatically control greenhouse environments for optimal crop growth (as shown in case study 2). These applications not only enhance productivity but also contribute to improved animal welfare, reduced environmental impact, and greater resource efficiency. As sensor technologies and computational capabilities continue to advance, software sensors are poised to play an increasingly vital role in managing complex biosystems for a more sustainable future.

That being said, while software sensors hold great promise, their design and implementation require careful consideration of several key challenges and constraints. In this section, we will address number of common considerations that software sensor designers need to carefully account for, covering challenges related to the inherent complexity of biological systems themselves and the nuances of data-driven modelling approaches.

5.1. Challenges Related to the Biological System

Biological systems present a unique set of challenges due to their inherent complexity, non-linear behaviour, time-varying dynamics, and individual variability [1]. These characteristics make it challenging to accurately model and predict the behaviour of biological processes.

5.1.1. Complexity

The complexity inherent in biological systems stems from their multidimensional nature, characterized by intricate interactions between various processes and agents operating at different scales. As a result, biological systems exist on the “edge of chaos”, where they exhibit a delicate balance between order and disorder [163]. While they can show regular and predictable behaviour, these systems are also prone to sudden, massive, and stochastic changes in response to seemingly minor perturbations. This characteristic makes them exceptionally challenging to model, monitor, and control, especially when compared with more predictable man-made systems, such as car engines.

Therefore, when designing a software sensor for monitoring or controlling such systems, it is crucial to select and train data-driven models capable of incorporating stochastic elements to accurately capture and predict the likelihood of sudden changes. For instance, Bayesian networks can be used to predict the probability of sudden changes in disease progression [164] or ecological shifts [165] by modelling the dependencies between various biological factors. Statistical models such as hidden Markov models can be employed in software sensors to detect sudden changes in physiological signals, such as heart rate variability or neural activity patterns [166]. For more complex applications, deep learning models with dropout regularization can be used for image classifications in medical and diagnostic applications [167].

5.1.2. Time Variability

Biological processes are inherently dynamic, characterized by continuous change over time due to numerous internal factors, such as time-dependent biochemical pathways, circadian rhythms, hormonal cycles, and aging, as well as external factors, such as fluctuations in environmental conditions and stressors like pathogens or injuries. This temporal variability poses significant challenges in monitoring, modelling, and controlling biological systems as well. To account for such temporal variations during software sensor design process, continuous and automated data acquisition systems, such as wearable sensors [52], contactless sensors [168,169], and remote sensing technologies [170], can be employed to ensure that critical information and dynamics are not missed. Also, machine learning techniques such as recurrent neural networks, including long short-term memory networks (LSTM), are specifically designed to handle sequential data and can effectively model temporal dependencies in biological systems [171]. For controlling biological systems, adaptive control strategies, such as model predictive controllers (MPC), can anticipate future states and make proactive adjustments to control strategies to accommodate for the time-varying nature of the controlled biological system [50].

5.1.3. Individual Variability

Biological systems are intrinsically characterized by significant individual variability, which adds an extra layer of complexity to their monitoring, modelling, and control. This variability arises from genetic and environmental factors, resulting in diverse responses among different individual of the same species, and even within the same individual at different times [1]. During the modelling process, capturing the full range of responses within a population requires models that can handle high levels of heterogeneity. In such cases, a single, general model often is not enough. Additionally, accurately estimating model parameters becomes challenging when dealing with individual variability, as it requires extensive data from multiple individuals or time points. Therefore, personalized modelling approaches, which can be tailored to specific individuals, can be employed to account for unique individual variability and dynamic responses. For example, a personalized SVM model in combination with k-nearest neighbor (KNN) is utilized to predict individual human thermal comfort [60].

5.2. Challenges Related to the Modelling Step

Despite its several advantages, data-driven modelling comes with several challenges that must be addressed to ensure its effectiveness and reliability for software sensor applications. Here are some key challenges:

5.2.1. Model Complexity

One crucial aspect of the identification process in modelling, regardless of the specific technique used, is defining a reduced-order model following the Occam’s razor principle [172]. Model complexity refers to the capacity of a model to capture the underlying patterns in the data. A model with high complexity can fit almost any data perfectly, including the noise, which can lead to overfitting. On the contrary, a model with low complexity may be too simplistic to capture the essential features of the data, leading to underfitting. The challenge lies in finding the right (optimal) level of complexity that results in the best predictive performance. This trade-off between model simplicity and performance highlights the need for a good understanding of the bias-variance dilemma [172] in the modelling step of the software sensor design [16]. It refers to the trade-off encountered when making statistical predictions, such as fitting a function, between the accuracy of the prediction and its precision. This trade-off is essentially between bias (the inverse of accuracy) and variance (the inverse of precision). The prediction error, which measures the difference between actual and predicted values, consists of three components (Figure 10):

-: Bias Error: This reflects the model’s ability to fit the training data. A model with high bias (indicative of low complexity and low accuracy) tends to miss relevant relationships between input features and the target output, leading to underfitting (Figure 10).
-: Variance Error: This indicates the model’s sensitivity to small fluctuations in the training data. A model with high variance (indicative of high complexity and low precision) captures the noise in the training data, making it less stable and resulting in overfitting. Such models perform well on training data but inadequately on unseen data due to their high complexity (Figure 10).
-: Noise: This represents the irreducible error inherent in any biological data, which cannot be eliminated during the design process.

5.2.2. Model Interpretability

During the design process of software sensors, it is also important to carefully consider the interpretability of the selected data-based model. Here comes another trade-off between model performance and these two concepts, which largely depends on the objectives of the software sensor design. Essentially, this trade-off involves deciding whether it is more important to achieve the best prediction performance or to extend the understanding of the system, potentially at the expense of some accuracy [16].

Model interpretability refers to how each of the input (auxiliary) variables contributes to the prediction of the target output. An interpretable model provides clear insights into how input features are mathematically mapped to target outputs. Figure 11 provides a generalized view of the relationship between model interpretability and accuracy for some commonly used data-driven models. It is important to recognize that this comparison is a simplification, and actual performance can vary significantly depending on specific datasets and the problem domain. Take, for example, the following linear regression model [52], which is a common example of an interpretable model:

Y = a_{0} + a_{1} X_{1} + a_{2} X_{2} + \dots + a_{n} X_{n} + e,

where

Y

, the target output (e.g., thermal comfort), is directly related to the input feature variables {

X_{1}, X_{2}, \dots, X_{n}

}, such as heart rate, skin temperature, and air temperature [52], and the coefficients {

a_{1}, a_{2}, \dots, a_{n}

} quantifies the contribution of each input variable. Each coefficient in the linear regression model directly indicates the impact of a corresponding input on the target output, making the model highly interpretable and easy to understand for stakeholders.

Generally, in scenarios where transparency and comprehensibility are crucial, such as in clinical diagnostics and exploratory research, highly interpretable models are preferred. However, in cases where maximizing prediction accuracy is crucial, more complex models like deep neural networks (DNNs) are often employed. DNNs involve data passing through multiple layers of multiplications with learned weights and non-linear transformations. A single prediction may result from millions of mathematical operations, rendering such models almost impossible to be fully interpreted.

6. Summary

This review has explored the potential of software sensors in revolutionizing the management and control of complex biosystems. Our primary objective was to demonstrate the advantages of this approach in overcoming the inherent challenges posed by the complexity, variability, and uncertainty of biological processes. We highlighted how software sensors, by combining data from traditional sensors with advanced computational models, can indirectly infer crucial variables that are difficult or impossible to measure directly.

Biosystems, which encompass a wide range of complexities from cellular processes to ecological interactions, are characterized by their time-varying, inherent nonlinearity and individual variability. Hence, the ultimate goal of biosystem monitoring is to uncover underlying mechanisms, identify potential issues, and optimize conditions for enhanced efficiency and productivity. This is particularly relevant in modern industrial applications such as agriculture and biotechnology, where real-time decision-making can significantly impact productivity and sustainability.

Despite the advancements in modern hardware sensors, the article highlights a critical challenge: what is typically measurable may not align with what needs to be monitored. Traditional sensing approaches often fall short in capturing all the necessary data due to difficulties related to physical accessibility or the high costs associated with advanced measuring techniques. Meanwhile, there is a specific need for a deeper understanding of nuanced behaviours and emergent properties of these systems, such as stress levels, health status, and overall wellbeing.

This article emphasizes that software sensors can effectively address these gaps by combining data from hardware sensors as auxiliary variables with estimations models and computational algorithms, enabling the accurate inference of target variables that are challenging to measure directly. The integration of enhanced sensing technologies and advanced computational methods has empowered significant advancements in the capabilities of software sensors, allowing for the continuous and unobtrusive monitoring of biosystems. The model-based monitoring and control framework allows for greater predictive power and responsiveness, essential for effective decision-making.

This article provides a brief overview of the common methodologies for designing software sensors with the main focus being on the estimation algorithm and modelling step. It provides an overview of various modelling approaches, distinguishing the main differences between hypothetico-deductive (mechanistic) models and inductive (data-driven) models such as linear regression, support vector machines, and neural networks.

Moreover, this article outlines the practical applications of software sensors in diverse fields, from environmental monitoring to precision agriculture and healthcare. The article presents three case studies, showcasing in detail three real-world applications of software sensors in biosystem monitoring and control. In the first case study, the focus is on non-invasive disease monitoring in animals, which demonstrates how software sensors can provide early warnings for respiratory infections by analyzing sound signals and cough patterns. Additionally, these sensors show remarkable versatility in multi-sensor fusion applications, allowing them to operate with various hardware sensors to enhance fault detection and minimize uncertainty. The second case study further illustrates this potential, illustrating their ability to integrate data from different hardware tools, such as satellite imagery and hyperspectral signals, to accurately predict target variables, namely, soil organic carbon (SOC) levels. Finally, the third case study presents the use of data assimilation techniques, such as Kalman filter, in automated greenhouse control. This example demonstrates how software sensors can optimize environmental conditions for crop growth, ultimately leading to improved yields and greater resource efficiency.

While the article thoroughly discusses the advantages of software sensors, it also addresses various considerations essential to their design process. Challenges related to the biological systems themselves, such as complexity, time-variability, and individual differences, are discussed, along with commonly used computational techniques to mitigate these issues. Additionally, this article outlines the challenges inherent in data-driven modelling approach, particularly the trade-off between model complexity and performance. The bias-variance dilemma is discussed as a key concern for software sensor designers.

Another considerable trade-off highlighted is between model performance and interpretability. Simpler models like linear regression offer high interpretability but may fail to capture complex patterns as effectively as more advanced models like deep neural networks, which, despite their higher accuracy, suffer from poor interpretability. Ultimately, selecting the most suitable modelling technique should align with the specific design objectives of the software sensor, balancing accuracy, interpretability, and the ability to adapt to the dynamic nature of biological systems.

In conclusion, software sensors represent a significant advancement in the monitoring and control of biosystems, enabling better decision-making and optimization through the integration of advanced hardware technologies and computational techniques. As we continue to develop these technologies, the collaboration between interdisciplinary teams, including data scientists, biologists, and engineers, will be crucial to addressing the complexities and challenges of modern biosystems.

Author Contributions

Conceptualization, A.Y. and N.B.; methodology, A.Y., N.B., and X.C.; investigation, A.Y., N.B., and X.C.; resources, A.Y., N.B., and X.C.; data curation, A.Y., N.B., and X.C.; writing—original draft preparation, A.Y., N.B., and X.C.; writing—review and editing, A.Y., N.B., and X.C.; visualization, A.Y., N.B., and X.C.; supervision, A.Y. and N.B. All authors have read and agreed to the published version of the manuscript.

Funding

The Government of Manitoba Research Fund of Sustainable Canadian Agricultural Partnership (SCAP) T01068 funded part of this research.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in the study are included in the article, further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Youssef, A. Model-Based Control of Micro-Environment with Real-Time Feedback of Bioresponses; KU Leuven: Leuven, Belgium, 2014. [Google Scholar]
Ruth, M.; Hannon, B. Modeling Dynamic Biological Systems; Springer: New York, NY, USA, 1997; ISBN 978-1-4612-6856-7. [Google Scholar]
Aerts, J.M. Modelling of Bio-Responses to Micro-Environmental Variables Using Dynamic Data-Based Models. Ph.D. Thesis, Katholieke Universiteit, Leuven, Belgium, 2001. [Google Scholar]
Young, P.C. Data-Based Mechanistic and Topdown Modelling. In Proceedings of the International Environmental Modelling and Software Society Conference, Lugano, Switzerland, 24–27 June 2002; pp. 363–374. [Google Scholar]
Mainka, T.; Mahler, N.; Herwig, C.; Pflügl, S. Soft Sensor-Based Monitoring and Efficient Control Strategies of Biomass Concentration for Continuous Cultures of Haloferax Mediterranei and Their Application to an Industrial Production Chain. Microorganisms 2019, 7, 648. [Google Scholar] [CrossRef] [PubMed]
Paul, H.L. Energy Expenditure during Extravehicular Activity through Apollo. In Proceedings of the 42nd International Conference on Environmental Systems, San Diego, CA, USA, 15–19 July 2012. [Google Scholar] [CrossRef]
Cai, Y.; Zheng, W.; Zhang, X.; Zhangzhong, L.; Xue, X. Research on Soil Moisture Prediction Model Based on Deep Learning. PLoS ONE 2019, 14, e0214508. [Google Scholar] [CrossRef] [PubMed]
Moghadas, D.; Badorreck, A. Machine Learning to Estimate Soil Moisture from Geophysical Measurements of Electrical Conductivity. Near Surf. Geophys. 2019, 17, 181–195. [Google Scholar] [CrossRef]
Al-Abbas, A.H.; Swain, P.H.; Baumgardner, M.F. Relating Organic Matter and Clay Content to the Multispectral Radiance of Soils. Soil Sci. 1972, 114, 477–485. [Google Scholar] [CrossRef]
Mozaffari, H.; Moosavi, A.A.; Cornelis, W. Vis-NIR-Spectroscopy- and Loss-on-Ignition-Based Functions to Estimate Organic Matter Content of Calcareous Soils. Arch. Agron. Soil Sci. 2023, 69, 962–980. [Google Scholar] [CrossRef]
Liu, C.; Zhai, Z.; Zhang, R.; Bai, J.; Zhang, M. Field Pest Monitoring and Forecasting System for Pest Control. Front. Plant Sci. 2022, 13, 990965. [Google Scholar] [CrossRef]
Baulch, H.M.; Elliott, J.A.; Cordeiro, M.R.C.; Flaten, D.N.; Lobb, D.A.; Wilson, H.F. Soil and Water Management: Opportunities to Mitigate Nutrient Losses to Surface Waters in the Northern Great Plains. Environ. Rev. 2019, 27, 447–477. [Google Scholar] [CrossRef]
Blasco, X.; Martínez, M.; Herrero, J.M.; Ramos, C.; Sanchis, J. Model-Based Predictive Control of Greenhouse Climate for Reducing Energy and Water Consumption. Comput. Electron. Agric. 2007, 55, 49–70. [Google Scholar] [CrossRef]
Liu, C.; Chen, F.; Li, Z.; Cocq, K.L.; Liu, Y.; Wu, L. Impacts of Nitrogen Practices on Yield, Grain Quality, and Nitrogen-Use Efficiency of Crops and Soil Fertility in Three Paddy-Upland Cropping Systems. J. Sci. Food Agric. 2021, 101, 2218–2226. [Google Scholar] [CrossRef]
Westermann, D.T.; Tindall, T.A.; James, D.W.; Hurst, R.L. Nitrogen and Potassium Fertilization of Potatoes: Yield and Specific Gravity. Am. Potato J. 1994, 71, 417–431. [Google Scholar] [CrossRef]
Youssef, A. Soft Sensor and Biosensing. In Encyclopedia of Smart Agriculture Technologies; Springer: Cham, Switzerland, 2023; pp. 1–10. [Google Scholar] [CrossRef]
Tham, M.T.; Montague, G.A.; Julian Morris, A.; Lant, P.A. Soft-Sensors for Process Estimation and Inferential Control. J. Process Control 1991, 1, 3–14. [Google Scholar] [CrossRef]
Carstensen, J.; Harremoës, P.; Strube, R. Software Sensors Based on the Grey-Box Modelling Approach. Water Sci. Technol. 1996, 33, 117–126. [Google Scholar] [CrossRef]
De Assis, A.J.; Maciel Filho, R. Soft Sensors Development for On-Line Bioreactor State Estimation. Comput. Chem. Eng. 2000, 24, 1099–1103. [Google Scholar] [CrossRef]
Javaid, M.; Haleem, A.; Rab, S.; Pratap Singh, R.; Suman, R. Sensors for Daily Life: A Review. Sens. Int. 2021, 2, 100121. [Google Scholar] [CrossRef]
Aparna, K.; Dayajanaki, D.H.; Devika Rani, P.; Devu, D.; Rajeev, S.P.; Baby Sreeja, S.D. Wearable Sensors in Daily Life: A Review. In Proceedings of the 2023 9th International Conference on Advanced Computing and Communication Systems, ICACCS, Coimbatore, India, 17–18 March 2023; pp. 863–868. [Google Scholar] [CrossRef]
Parajuli, P.B.; Jayakody, P.; Sassenrath, G.F.; Ouyang, Y.; Pote, J.W. Assessing the Impacts of Crop-Rotation and Tillage on Crop Yields and Sediment Yield Using a Modeling Approach. Agric. Water Manag. 2013, 119, 32–42. [Google Scholar] [CrossRef]
Ferencz, C.; Bognár, P.; Lichtenberger, J.; Hamar, D.; Tarcsai, G.; Timár, G.; Molnár, G.; Pásztor, S.Z.; Steinbach, P.; Székely, B.; et al. Crop Yield Estimation by Satellite Remote Sensing. Int. J. Remote Sens. 2004, 25, 4113–4149. [Google Scholar] [CrossRef]
Al-Gaadi, K.A.; Hassaballa, A.A.; Tola, E.; Kayad, A.G.; Madugundu, R.; Alblewi, B.; Assiri, F. Prediction of Potato Crop Yield Using Precision Agriculture Techniques. PLoS ONE 2016, 11, e0162219. [Google Scholar] [CrossRef]
Van Beylen, K.; Youssef, A.; Aerts, J.-M.; Lambrechts, T.; Papantoniou, I. Metabolite-Based Model Predictive Control of Cell Growth. In Proceedings of the Advancing Manufacture of Cell and Gene Therapies VI, Coronado, CA, USA, 27–31 January 2019. [Google Scholar]
Aerts, J.-M.; Albert, B.D.; Matheus, V.E.J. Method and System for Controlling Bioresponse of Living Organisms. Patent Application No. 10/479,115, 18 August 2018. [Google Scholar]
Berckmans, D. Basic Principles of PLF: Gold Standard, Labelling and Field Data. In Proceedings of the Precision Livestock Farming 2013, Leuven, Belgium, 10 September 2013; pp. 21–55. [Google Scholar]
Tullo, E.; Fontana, I.; Diana, A.; Norton, T.; Berckmans, D.; Guarino, M. Application Note: Labelling, a Methodology to Develop Reliable Algorithm in PLF. Comput. Electron. Agric. 2017, 142, 424–428. [Google Scholar] [CrossRef]
Jia, C.; Zhou, T.; Zhang, K.; Yang, L.; Zhang, D.; Cui, T.; He, X.; Sang, X. Design and Experimentation of Soil Organic Matter Content Detection System Based on High-Temperature Excitation Principle. Comput. Electron. Agric. 2023, 214, 108325. [Google Scholar] [CrossRef]
Mora, M.D.; Germani, A.; Manes, C. A State Observer for Nonlinear Dynamical Systems. Nonlinear Anal. Theory Methods Appl. 1997, 30, 4485–4496. [Google Scholar] [CrossRef]
Ljung, L. System Identification: Theory for the User; Prentice Hall: Upper Saddle River, NJ, USA, 1999; ISBN 978-0136566953. [Google Scholar]
McAfee, M.; Kariminejad, M.; Weinert, A.; Huq, S.; Stigter, J.D.; Tormey, D. State Estimators in Soft Sensing and Sensor Fusion for Sustainable Manufacturing. Sustainability 2022, 14, 3635. [Google Scholar] [CrossRef]
Joseph, B.; Brosilow, C.B. Inferential Control of Processes: Part I. Steady State Analysis and Design. AIChE J. 1978, 24, 485–492. [Google Scholar] [CrossRef]
Luenberger, D.G. Observers for Multivariable Systems. IEEE Trans. Autom. Control 1966, 11, 190–197. [Google Scholar] [CrossRef]
Van Dooren, P. Reduced Order Observers: A New Algorithm and Proof. Syst. Control Lett. 1984, 4, 243–251. [Google Scholar] [CrossRef]
Solsona, J.; Valla, M.I.; Muravchik, C. A Nonlinear Reduced Order Observer for Permanent Magnet Synchronous Motors. In Proceedings of the IECON’94—20th Annual Conference of IEEE Industrial Electronics, Bologna, Italy, 5–9 September 1994; Volume 1, pp. 38–43. [Google Scholar] [CrossRef]
Kalman, R.E. A New Approach to Linear Filtering and Prediction Problems. J. Fluids Eng. 1960, 82, 35–45. [Google Scholar] [CrossRef]
Palatella, L.; Trevisan, A. Interaction of Lyapunov Vectors in the Formulation of the Nonlinear Extension of the Kalman Filter. Phys. Rev. E 2015, 91, 042905. [Google Scholar] [CrossRef]
Ljung, L. Asymptotic Behavior of the Extended Kalman Filter as a Parameter Estimator for Linear Systems. IEEE Trans. Autom. Control 1979, 24, 36–50. [Google Scholar] [CrossRef]
Julier, S.J.; Uhlmann, J.K. Unscented Filtering and Nonlinear Estimation. Proc. IEEE 2004, 92, 401–422. [Google Scholar] [CrossRef]
Aanonsen, S.I.; Nœvdal, G.; Oliver, D.S.; Reynolds, A.C.; Vallès, B. The Ensemble Kalman Filter in Reservoir Engineering—A Review. SPE J. 2009, 14, 393–412. [Google Scholar] [CrossRef]
Evensen, G. The Ensemble Kalman Filter: Theoretical Formulation and Practical Implementation. Ocean Dyn. 2003, 53, 343–367. [Google Scholar] [CrossRef]
Arulampalam, M.S.; Maskell, S.; Gordon, N.; Clapp, T. A Tutorial on Particle Filters for Online Nonlinear/Non-Gaussian Bayesian Tracking. IEEE Trans. Signal Process. 2002, 50, 174–188. [Google Scholar] [CrossRef]
Young, P.C. Recursive Estimation and Time-Series Analysis: An Introduction for the Student and Practitioner; Springer: Berlin/Heidelberg, Germany, 2011; ISBN 3642219810. [Google Scholar]
Voit, E.O. The Best Models of Metabolism. Wiley Interdiscip. Rev. Syst. Biol. Med. 2017, 9, e1391. [Google Scholar] [CrossRef] [PubMed]
Fortuna, L.; Graziani, S.; Rizzo, A.; Xibilia Maria, G. Soft Sensors for Monitoring and Control of Industrial Processes; Springer: Berlin/Heidelberg, Germany, 2007; ISBN 978-1-84628-479-3. [Google Scholar]
Paulsson, D.; Gustavsson, R.; Mandenius, C.F. A Soft Sensor for Bioprocess Control Based on Sequential Filtering of Metabolic Heat Signals. Sensors 2014, 14, 17864–17882. [Google Scholar] [CrossRef] [PubMed]
Kadlec, P.; Gabrys, B.; Strandt, S. Data-Driven Soft Sensors in the Process Industry. Comput. Chem. Eng. 2009, 33, 795–814. [Google Scholar] [CrossRef]
Kadlec, P.; Grbić, R.; Gabrys, B. Review of Adaptation Mechanisms for Data-Driven Soft Sensors. Comput. Chem. Eng. 2011, 35, 1–24. [Google Scholar] [CrossRef]
Van Beylen, K.; Youssef, A.; Fernández, A.P.; Lambrechts, T.; Papantoniou, I.; Aerts, J.M. Lactate-Based Model Predictive Control Strategy of Cell Growth for Cell Therapy Applications. Bioengineering 2020, 7, 78. [Google Scholar] [CrossRef]
Peña Fernández, A.; Norton, T.; Youssef, A.; Exadaktylos, V.; Bahr, C.; Bruininx, E.; Vranken, E.; Berckmans, D. Real-Time Modelling of Individual Weight Response to Feed Supply for Fattening Pigs. Comput. Electron. Agric. 2019, 162, 895–906. [Google Scholar] [CrossRef]
Youssef, A.; Colon, J.; Mantzios, K.; Gkiata, P.; Mayor, T.S.; Flouris, A.D.; De Bruyne, G.; Aerts, J.-M. Towards Model-Based Online Monitoring of Cyclist’s Head Thermal Comfort: Smart Helmet Concept and Prototype. Appl. Sci. 2019, 9, 3170. [Google Scholar] [CrossRef]
Debeljak, M.; Džeroski, S. Decision Trees in Ecological Modelling. In Modelling Complex Ecological Dynamics: An Introduction into Ecological Modelling for Students, Teachers & Scientists; Springer: Berlin/Heidelberg, Germany, 2011; pp. 197–209. [Google Scholar] [CrossRef]
Roozbeh, M.; Rouhi, A.; Mohamed, N.A.; Jahadi, F.; Arashi, M.; Roozbeh, M.; Rouhi, A.; Anisah Mohamed, N.; Jahadi, F. Generalized Support Vector Regression and Symmetry Functional Regression Approaches to Model the High-Dimensional Data. Symmetry 2023, 15, 1262. [Google Scholar] [CrossRef]
La Rocca, M.; Perna, C. Designing Neural Networks for Modeling Biological Data: A Statistical Perspective. Math. Biosci. Eng. 2014, 11, 331–342. [Google Scholar] [CrossRef]
Carpentier, L.; Berckmans, D.; Youssef, A.; Berckmans, D.; van Waterschoot, T.; Johnston, D.; Ferguson, N.; Earley, B.; Fontana, I.; Tullo, E.; et al. Automatic Cough Detection for Bovine Respiratory Disease in a Calf House. Biosyst. Eng. 2018, 173, 45–56. [Google Scholar] [CrossRef]
Youssef, A.; Jansen, C.; Neethirajan, S.R. Soft-Sensing Approach for Predicting Bovine Respiratory Disease Severity. In Proceedings of the Precision Livestock Farming’22, Vienna, Austria, 29 August 2022; University of Veterinary Medicine Vienna: Vienna, Austria; pp. 932–939. [Google Scholar]
Youssef, A.; Youssef Ali Amer, A.; Caballero, N.; Aerts, J.-M. Towards Online Personalized-Monitoring of Human Thermal Sensation Using Machine Learning Approach. Appl. Sci. 2019, 9, 3303. [Google Scholar] [CrossRef]
Piccini, C.; Marchetti, A.; Rivieccio, R.; Napoli, R. Multinomial Logistic Regression with Soil Diagnostic Features and Land Surface Parameters for Soil Mapping of Latium (Central Italy). Geoderma 2019, 352, 385–394. [Google Scholar] [CrossRef]
Wang, X.; Li, Y.; Chen, L.; Zhang, Y.; Wang, C. Application and Analysis of Support Vector Machine Based Simulation for Runoff and Sediment Yield. Biosyst. Eng. 2016, 142, 145–155. [Google Scholar] [CrossRef]
Kong, Y.; Yu, T. A Deep Neural Network Model Using Random Forest to Extract Feature Representation for Gene Expression Data Classification. Sci. Rep. 2018, 8, 16477. [Google Scholar] [CrossRef]
Tadeusiewicz, R. Neural Networks as a Tool for Modeling of Biological Systems. Bio-Algorithms Med-Syst. 2015, 11, 135–144. [Google Scholar] [CrossRef]
Fang, K.; Kifer, D.; Lawson, K.; Shen, C. Evaluating the Potential and Challenges of an Uncertainty Quantification Method for Long Short-Term Memory Models for Soil Moisture Predictions. Water Resour. Res. 2020, 56, e2020WR028095. [Google Scholar] [CrossRef]
Youssef, A.; Berckmans, D.; Norton, T. Non-Invasive PPG-Based System for Continuous Heart Rate Monitoring of Incubated Avian Embryo. Sensors 2020, 20, 4560. [Google Scholar] [CrossRef] [PubMed]
Youssef, A.; Peña Fernández, A.; Wassermann, L.; Biernot, S.; Wittauer, E.-M.; Bleich, A.; Hartung, J.; Berckmans, D.; Norton, T. An Approach towards Motion-Tolerant PPG-Based Algorithm for Real-Time Heart Rate Monitoring of Moving Pigs. Sensors 2020, 20, 4251. [Google Scholar] [CrossRef]
Youssef, A.; Exadaktylos, V.; Berckmans, D.A.D.A. Towards Real-Time Control of Chicken Activity in a Ventilated Chamber. Biosyst. Eng. 2015, 135, 31–43. [Google Scholar] [CrossRef]
Peña Fernández, A.; Demmers, T.G.M.; Tong, Q.; Youssef, A.; Norton, T.; Vranken, E.; Berckmans, D. Real-Time Modelling of Indoor Particulate Matter Concentration in Poultry Houses Using Broiler Activity and Ventilation Rate. Biosyst. Eng. 2019, 187, 214–225. [Google Scholar] [CrossRef]
Peña Fernández, A.; Tullo, E.; van Hertem, T.; Youssef, A.; Exadaktylos, V.; Vranken, E.; Guarino, M. Real-Time Monitoring of Broiler Flock’s Welfare Status Using Camera-Based Technology. Biosyst. Eng. 2018, 173, 103–114. [Google Scholar] [CrossRef]
Ferrari, S.; Silva, M.; Guarino, M.; Aerts, J.M.; Berckmans, D. Cough Sound Analysis to Identify Respiratory Infection in Pigs. Comput. Electron. Agric. 2008, 64, 318–325. [Google Scholar] [CrossRef]
Chung, Y.; Oh, S.; Lee, J.; Park, D.; Chang, H.H.; Kim, S. Automatic Detection and Recognition of Pig Wasting Diseases Using Sound Data in Audio Surveillance Systems. Sensors 2013, 13, 12929–12942. [Google Scholar] [CrossRef] [PubMed]
Yin, Y.; Tu, D.; Shen, W.; Bao, J. Recognition of Sick Pig Cough Sounds Based on Convolutional Neural Network in Field Situations. Inf. Process. Agric. 2021, 8, 369–379. [Google Scholar] [CrossRef]
Hassani, F.A. Bioreceptor-Inspired Soft Sensor Arrays: Recent Progress towards Advancing Digital Healthcare. Soft Sci. 2023, 3, 1–33. [Google Scholar] [CrossRef]
Gao, S.; Qiu, S.; Ma, Z.; Tian, R.; Liu, Y. SVAE-WGAN-Based Soft Sensor Data Supplement Method for Process Industry. IEEE Sens. J. 2022, 22, 601–610. [Google Scholar] [CrossRef]
Choi, D.J.; Park, H. A Hybrid Artificial Neural Network as a Software Sensor for Optimal Control of a Wastewater Treatment Process. Water Res. 2001, 35, 3959–3967. [Google Scholar] [CrossRef]
Paepae, T.; Bokoro, P.N.; Kyamakya, K. Data Augmentation for a Virtual-Sensor-Based Nitrogen and Phosphorus Monitoring. Sensors 2023, 23, 1061. [Google Scholar] [CrossRef]
Schneider, M.Y.; Furrer, V.; Sprenger, E.; Carbajal, J.P.; Villez, K.; Maurer, M. Benchmarking Soft Sensors for Remote Monitoring of On-Site Wastewater Treatment Plants. Environ. Sci. Technol. 2020, 54, 10840–10849. [Google Scholar] [CrossRef]
Maniscalco, U.; Pilato, G.; Vella, F. Soft Sensor Network for Environmental Monitoring. Smart Innov. Syst. Technol. 2016, 55, 705–714. [Google Scholar] [CrossRef]
Murugan, C.; Natarajan, P. Estimation of Fungal Biomass Using Multiphase Artificial Neural Network Based Dynamic Soft Sensor. J. Microbiol. Methods 2019, 159, 5–11. [Google Scholar] [CrossRef] [PubMed]
Kaptan, C.; Kantarci, B.; Soyata, T.; Boukerche, A. Emulating Smart City Sensors Using Soft Sensing and Machine Intelligence: A Case Study in Public Transportation. In Proceedings of the IEEE International Conference on Communications, Kansas City, MO, USA, 20–24 May 2018. [Google Scholar] [CrossRef]
Hu, Z.; Yang, R.; Fang, L.; Wang, Z.; Zhao, Y. Research on Vehicle Speed Prediction Model Based on Traffic Flow Information Fusion. Energy 2024, 292, 130416. [Google Scholar] [CrossRef]
Juma, M.; Shaalan, K. Cyberphysical Systems in the Smart City: Challenges and Future Trends for Strategic Research. In Swarm Intelligence for Resource Management in Internet of Things; Academic Press: Cambridge, MA, USA, 2020; pp. 65–85. [Google Scholar] [CrossRef]
Barodi, A.; Zemmouri, A.; Bajit, A.; Benbrahim, M.; Tamtaoui, A. Intelligent Transportation System Based on Smart Soft-Sensors to Analyze Road Traffic and Assist Driver Behavior Applicable to Smart Cities. Microprocess. Microsyst. 2023, 100, 104830. [Google Scholar] [CrossRef]
Pech, M.; Vrchota, J.; Bednář, J. Predictive Maintenance and Intelligent Sensors in Smart Factory: Review. Sensors 2021, 21, 1470. [Google Scholar] [CrossRef]
Ruiz-Arenas, S.; Horváth, I.; Mejía-Gutiérrez, R.; Opiyo, E.Z. Towards the Maintenance Principles of Cyber-Physical Systems. Stroj. Vestn. J. Mech. Eng. 2014, 60, 815–831. [Google Scholar] [CrossRef]
Papa, G.; Zurutuza, U.; Uribeetxeberria, R. Cyber Physical System Based Proactive Collaborative Maintenance. In Proceedings of the 2016 International Conference on Smart Systems and Technologies, SST, Osijek, Croatia, 12–14 October 2016; pp. 173–178. [Google Scholar] [CrossRef]
Alassery, F. Predictive Maintenance for Cyber Physical Systems Using Neural Network Based on Deep Soft Sensor and Industrial Internet of Things. Comput. Electr. Eng. 2022, 101, 108062. [Google Scholar] [CrossRef]
Shcherbakov, M.V.; Glotov, A.V.; Cheremisinov, S.V. Proactive and Predictive Maintenance of Cyber-Physical Systems. Stud. Syst. Decis. Control 2020, 259, 263–278. [Google Scholar] [CrossRef]
Zhang, Y.; Huang, S.; Guo, S.; Zhu, J. Multi-Sensor Data Fusion for Cyber Security Situation Awareness. Procedia Environ. Sci. 2011, 10, 1029–1034. [Google Scholar] [CrossRef]
Martínez-Monge, I.; Martínez, C.; Decker, M.; Udugama, I.A.; Marín de Mas, I.; Gernaey, K.V.; Nielsen, L.K. Soft-Sensors Application for Automated Feeding Control in High-Throughput Mammalian Cell Cultures. Biotechnol. Bioeng. 2022, 119, 1077–1090. [Google Scholar] [CrossRef]
Kim, J.; Park, S.; Lee, H.; Choi, Y.; Jung, S.; Kim, D.H. Flexible and Stretchable Electronics for Healthcare Monitoring. Adv. Mater. 2023, 35, 2211147. [Google Scholar] [CrossRef]
Kroll, P.; Stelzer, I.V.; Herwig, C. Soft Sensor for Monitoring Biomass Subpopulations in Mammalian Cell Culture Processes. Biotechnol. Lett. 2017, 39, 1667–1673. [Google Scholar] [CrossRef] [PubMed]
Yan, A.; Shao, H.; Wang, P. A Soft-Sensing Method of Dissolved Oxygen Concentration by Group Genetic Case-Based Reasoning with Integrating Group Decision Making. Neurocomputing 2015, 169, 422–429. [Google Scholar] [CrossRef]
Sagmeister, P.; Kment, M.; Wechselberger, P.; Meitz, A.; Langemann, T.; Herwig, C. Soft-Sensor Assisted Dynamic Investigation of Mixed Feed Bioprocesses. Process Biochem. 2013, 48, 1839–1847. [Google Scholar] [CrossRef]
Sang, H.; Wang, F.; He, D.; Chang, Y.; Zhang, D. On-Line Estimation of Biomass Concentration and Specific Growth Rate in the Fermentation Process. In Proceedings of the World Congress on Intelligent Control and Automation (WCICA), Dalian, China, 21–23 June 2006; Volume 1, pp. 4644–4648. [Google Scholar] [CrossRef]
Bhattacharya, P.; Tanwar, S.; Bodkhe, U.; Tyagi, S.; Kumar, N. BinDaaS: Blockchain-Based Deep-Learning as-a-Service in Healthcare 4.0 Applications. IEEE Trans. Netw. Sci. Eng. 2021, 8, 1242–1255. [Google Scholar] [CrossRef]
Mbunge, E.; Muchemwa, B.; Jiyane, S.; Batani, J. Sensors and Healthcare 5.0: Transformative Shift in Virtual Care through Emerging Digital Health Technologies. Glob. Health J. 2021, 5, 169–177. [Google Scholar] [CrossRef]
Hatamie, A.; Angizi, S.; Kumar, S.; Pandey, C.M.; Simchi, A.; Willander, M.; Malhotra, B.D. Review—Textile Based Chemical and Physical Sensors for Healthcare Monitoring. J. Electrochem. Soc. 2020, 167, 037546. [Google Scholar] [CrossRef]
Smith, J.; Brown, P.; Garcia, L.; Nguyen, T. Advances in Machine Learning Algorithms for Environmental Sensing Applications. Sensors 2023, 23, 8944. [Google Scholar] [CrossRef]
Aydin, E.B.; Aydin, M.; Sezginturk, M.K. Biosensors in Drug Discovery and Drug Analysis. Curr. Anal. Chem. 2018, 15, 467–484. [Google Scholar] [CrossRef]
Beke, Á.K.; Gyürkés, M.; Nagy, Z.K.; Marosi, G.; Farkas, A. Digital Twin of Low Dosage Continuous Powder Blending—Artificial Neural Networks and Residence Time Distribution Models. Eur. J. Pharm. Biopharm. 2021, 169, 64–77. [Google Scholar] [CrossRef]
Jayaraman, P.P.; Yavari, A.; Georgakopoulos, D.; Morshed, A.; Zaslavsky, A. Internet of Things Platform for Smart Farming: Experiences and Lessons Learnt. Sensors 2016, 16, 1884. [Google Scholar] [CrossRef] [PubMed]
García-Mañas, F.; Rodríguez, F.; Berenguel, M. Leaf Area Index Soft Sensor for Tomato Crops in Greenhouses. IFAC-Pap. 2020, 53, 15796–15803. [Google Scholar] [CrossRef]
Vaz, C.M.P.; Jones, S.; Meding, M.; Tuller, M. Evaluation of Standard Calibration Functions for Eight Electromagnetic Soil Moisture Sensors. Vadose Zone J. 2013, 12, 1–16. [Google Scholar] [CrossRef]
Cui, H.; Jiang, L.; Paloscia, S.; Santi, E.; Pettinato, S.; Wang, J.; Fang, X.; Liao, W. The Potential of ALOS-2 and Sentinel-1 Radar Data for Soil Moisture Retrieval With High Spatial Resolution Over Agroforestry Areas, China. IEEE Trans. Geosci. Remote Sens. 2021, 60, 1–17. [Google Scholar] [CrossRef]
Shi, P.; Luan, X.; Liu, F.; Karimi, H.R. Kalman Filtering on Greenhouse Climate Control. In Proceedings of the 31st Chinese Control Conference, Hefei, China, 25–27 July 2012; pp. 779–784. [Google Scholar]
Van Henten, E.J. Greenhouse Climate Management: An Optimal Control Approach; Wageningen University and Research: Wageningen, The Netherlands, 1994; ISBN 9798728200710. [Google Scholar]
Youssef, A.; Viazzi, S.; Exadaktylos, V.; Berckmans, D. Non-Contact, Motion-Tolerant Measurements of Chicken (Gallus Gallus) Embryo Heart Rate (HR) Using Video Imaging and Signal Processing. Biosyst. Eng. 2014, 125, 9–16. [Google Scholar] [CrossRef]
Lu, M.; Norton, T.; Youssef, A.; Radojkovic, N.; Fernández, A.P.; Berckmans, D. Extracting Body Surface Dimensions from Top-View Images of Pigs. Int. J. Agric. Biol. Eng. 2018, 11, 182–191. [Google Scholar] [CrossRef]
Ma, Y.; Zhang, L.; Li, X.; Liu, Y.; Wang, Y. Development of a Multi-Parameter Wireless Sensing System for Smart Agriculture Applications. Sensors 2022, 22, 4319. [Google Scholar] [CrossRef]
Wang, M.; Youssef, A.; Larsen, M.; Rault, J.-L.; Berckmans, D.; Marchant-Forde, J.N.; Hartung, J.; Bleich, A.; Lu, M.; Norton, T. Contactless Video-Based Heart Rate Monitoring of a Resting and an Anesthetized Pig. Animals 2021, 11, 442. [Google Scholar] [CrossRef]
Exadaktylos, V.; Silva, M.; Aerts, J.-M.; Taylor, C.J.; Berckmans, D. Real-Time Recognition of Sick Pig Cough Sounds. Comput. Electron. Agric. 2008, 63, 207–214. [Google Scholar] [CrossRef]
Smith, A.; Johnson, B.; Miller, C. Effects of Precision Feeding on Growth Performance and Nutrient Utilization in Swine. J. Anim. Sci. 2021, 99, skab038. [Google Scholar] [CrossRef]
Manteuffel, G.; Puppe, B.; Schön, P.C. Vocalization of Farm Animals as a Measure of Welfare. Appl. Anim. Behav. Sci. 2004, 88, 163–182. [Google Scholar] [CrossRef]
Youssef, A.; Caballero, N.; Aerts, J.M. Model-Based Monitoring of Occupant’s Thermal State for Adaptive HVAC Predictive Controlling. Processes 2019, 7, 720. [Google Scholar] [CrossRef]
Youssef, A.; Truyen, P.; Brode, P.; Fiala, D.; Aerts, J.-M. Towards Real-Time Model-Based Monitoring and Adoptive Controlling of Indoor Thermal Comfort. In Proceedings of the Ventilating Healthy Low-Energy Buildings, Nottingham, UK, 13–14 September 2017; Curran Associates, Inc.: Nottingham, UK, 2017. [Google Scholar]
Cho, Y.H.; Kim, M.; Yoon, S.; Lee, K.; Kim, J. Development of an Automatic Irrigation System Using Wireless Sensor Network and GPRS Module. Korean J. Agric. For. Meteorol. 2016, 18, 168–176. [Google Scholar] [CrossRef]
Handa, D.; Peschel, J.M. A Review of Monitoring Techniques for Livestock Respiration and Sounds. Front. Anim. Sci. 2022, 3, 904834. [Google Scholar] [CrossRef]
Stewart, M.; Wilson, M.T.; Schaefer, A.L.; Huddart, F.; Sutherland, M.A. The Use of Infrared Thermography and Accelerometers for Remote Monitoring of Dairy Cow Health and Welfare. J. Dairy Sci. 2017, 100, 3893–3901. [Google Scholar] [CrossRef]
White, B.J.; Renter, D.G. Bayesian Estimation of the Performance of Using Clinical Observations and Harvest Lung Lesions for Diagnosing Bovine Respiratory Disease in Post-Weaned Beef Calves. J. Vet. Diagn. Investig. 2009, 21, 446–453. [Google Scholar] [CrossRef]
Yin, Y.; Ji, N.; Wang, X.; Shen, W.; Dai, B.; Kou, S.; Liang, C. An Investigation of Fusion Strategies for Boosting Pig Cough Sound Recognition. Comput. Electron. Agric. 2023, 205, 107645. [Google Scholar] [CrossRef]
Shen, W.; Ji, N.; Yin, Y.; Dai, B.; Tu, D.; Sun, B.; Hou, H.; Kou, S.; Zhao, Y. Fusion of Acoustic and Deep Features for Pig Cough Sound Recognition. Comput. Electron. Agric. 2022, 197, 106994. [Google Scholar] [CrossRef]
Guarino, M.; Jans, P.; Costa, A.; Aerts, J.M.; Berckmans, D. Field Test of Algorithm for Automatic Cough Detection in Pig Houses. Comput. Electron. Agric. 2008, 62, 22–28. [Google Scholar] [CrossRef]
Wang, X.; Yin, Y.; Dai, X.; Shen, W.; Kou, S.; Dai, B. Automatic Detection of Continuous Pig Cough in a Complex Piggery Environment. Biosyst. Eng. 2024, 238, 78–88. [Google Scholar] [CrossRef]
Cuan, K.; Zhang, T.; Li, Z.; Huang, J.; Ding, Y.; Fang, C. Automatic Newcastle Disease Detection Using Sound Technology and Deep Learning Method. Comput. Electron. Agric. 2022, 194, 106740. [Google Scholar] [CrossRef]
Vandermeulen, J.; Bahr, C.; Johnston, D.; Earley, B.; Tullo, E.; Fontana, I.; Guarino, M.; Exadaktylos, V.; Berckmans, D. Early Recognition of Bovine Respiratory Disease in Calves Using Automated Continuous Monitoring of Cough Sounds. Comput. Electron. Agric. 2016, 129, 15–26. [Google Scholar] [CrossRef]
Aerts, J.M.; Jans, P.; Halloy, D.; Gustin, P.; Berckmans, D. Labeling of Cough Data from Pigs for On-Line Disease Monitoring by Sound Analysis. Trans. ASAE 2005, 48, 351–354. [Google Scholar] [CrossRef]
Exadaktylos, V.; Silva, M.; Berckmans, D. Automatic Identification and Interpretation of Animal Sounds, Application to Livestock Production Optimisation. In Soundscape Semiotics—Localization and Categorization; IntechOpen: London, UK, 2014. [Google Scholar]
Aarts, Y.J.M. The EnergyTag: A Wearable Software Sensor for Online Monitoring of Animal’s Dynamic Energy Expenditure. In Proceedings of the 11th European Conference on Precision Livestock Farming, Bologna, Italy, 9–12 September 2024; Berckmans, D., Tassinari, P., Torreggiani, D., Eds.; EA-PLF: Bologna, Italy, 2024; pp. 1307–1315. [Google Scholar]
Lagua, E.B.; Mun, H.S.; Ampode, K.M.B.; Chem, V.; Kim, Y.H.; Yang, C.J. Artificial Intelligence for Automatic Monitoring of Respiratory Health Conditions in Smart Swine Farming. Animals 2023, 13, 1860. [Google Scholar] [CrossRef] [PubMed]
Biney, J.K.M.; Saberioon, M.; Borůvka, L.; Houška, J.; Vašát, R.; Agyeman, P.C.; Coblinski, J.A.; Klement, A. Exploring the Suitability of UAS-Based Multispectral Images for Estimating Soil Organic Carbon: Comparison with Proximal Soil Sensing and Spaceborne Imagery. Remote Sens. 2021, 13, 308. [Google Scholar] [CrossRef]
Sanchez, P.A.; Ahamed, S.; Carré, F.; Hartemink, A.E.; Hempel, J.; Huising, J.; Lagacherie, P.; McBratney, A.B.; McKenzie, N.J.; De Lourdes Mendonça-Santos, M.; et al. Digital Soil Map of the World. Science 2009, 325, 680–681. [Google Scholar] [CrossRef]
Taghizadeh-Mehrjardi, R.; Nabiollahi, K.; Kerry, R. Digital Mapping of Soil Organic Carbon at Multiple Depths Using Different Data Mining Techniques in Baneh Region, Iran. Geoderma 2016, 266, 98–110. [Google Scholar] [CrossRef]
Sodango, T.H.; Sha, J.; Li, X.; Noszczyk, T.; Shang, J.; Aneseyee, A.B.; Bao, Z. Modeling the Spatial Dynamics of Soil Organic Carbon Using Remotely-Sensed Predictors in Fuzhou City, China. Remote Sens. 2021, 13, 1682. [Google Scholar] [CrossRef]
Goetz, S.; Dubayah, R. Advances in Remote Sensing Technology and Implications for Measuring and Monitoring Forest Carbon Stocks and Change. Carbon. Manag. 2011, 2, 231–244. [Google Scholar] [CrossRef]
Paustian, K.; Collier, S.; Baldock, J.; Burgess, R.; Creque, J.; DeLonge, M.; Dungait, J.; Ellert, B.; Frank, S.; Goddard, T.; et al. Quantifying Carbon for Agricultural Soil Management: From the Current Status toward a Global Soil Information System. Carbon. Manag. 2019, 10, 567–587. [Google Scholar] [CrossRef]
Teke, M.; Deveci, H.S.; Haliloglu, O.; Gurbuz, S.Z.; Sakarya, U. A Short Survey of Hyperspectral Remote Sensing Applications in Agriculture. In Proceedings of the RAST 2013—Proceedings of the 6th International Conference on Recent Advances in Space Technologies, Istanbul, Turkey, 12–14 June 2013; pp. 171–176. [Google Scholar] [CrossRef]
Ge, Y.; Thomasson, J.A.; Sui, R. Remote Sensing of Soil Properties in Precision Agriculture: A Review. Front. Earth Sci. 2011, 5, 229–238. [Google Scholar] [CrossRef]
Adamchuk, V.I.; Hummel, J.W.; Morgan, M.T.; Upadhyaya, S.K. Onthe-Go Soil Sensors for Precision Agriculture. Comput. Electron. Agric. 2004, 44, 71–91. [Google Scholar] [CrossRef]
Kühnel, A.; Bogner, C. In-Situ Prediction of Soil Organic Carbon by Vis–NIR Spectroscopy: An Efficient Use of Limited Field Data. Eur. J. Soil Sci. 2017, 68, 689–702. [Google Scholar] [CrossRef]
Gomez, C.; Viscarra Rossel, R.A.; McBratney, A.B. Soil Organic Carbon Prediction by Hyperspectral Remote Sensing and Field Vis-NIR Spectroscopy: An Australian Case Study. Geoderma 2008, 146, 403–411. [Google Scholar] [CrossRef]
Viscarra Rossel, R.A.; Walvoort, D.J.J.; McBratney, A.B.; Janik, L.J.; Skjemstad, J.O. Visible, near Infrared, Mid Infrared or Combined Diffuse Reflectance Spectroscopy for Simultaneous Assessment of Various Soil Properties. Geoderma 2006, 131, 59–75. [Google Scholar] [CrossRef]
Laamrani, A.; Berg, A.A.; Voroney, P.; Feilhauer, H.; Blackburn, L.; March, M.; Dao, P.D.; He, Y.; Martin, R.C. Ensemble Identification of Spectral Bands Related to Soil Organic Carbon Levels over an Agricultural Field in Southern Ontario, Canada. Remote Sens. 2019, 11, 1298. [Google Scholar] [CrossRef]
Bai, Z.; Xie, M.; Hu, B.; Luo, D.; Wan, C.; Peng, J.; Shi, Z. Estimation of Soil Organic Carbon Using Vis-NIR Spectral Data and Spectral Feature Bands Selection in Southern Xinjiang, China. Sensors 2022, 22, 6124. [Google Scholar] [CrossRef] [PubMed]
Uddin, M.P.; Mamun, M.A.; Hossain, M.A. PCA-Based Feature Reduction for Hyperspectral Remote Sensing Image Classification. IETE Tech. Rev. 2021, 38, 377–396. [Google Scholar] [CrossRef]
Ibrahim, M.F.I.; Al-Jumaily, A.A. PCA Indexing Based Feature Learning and Feature Selection. In Proceedings of the 2016 8th Cairo International Biomedical Engineering Conference, CIBEC, Cairo, Egypt, 15–17 December 2016; pp. 68–71. [Google Scholar] [CrossRef]
Geniaux, G.; Martinetti, D. A New Method for Dealing Simultaneously with Spatial Autocorrelation and Spatial Heterogeneity in Regression Models. Reg. Sci. Urban Econ. 2018, 72, 74–85. [Google Scholar] [CrossRef]
Were, K.; Bui, D.T.; Dick, Ø.B.; Singh, B.R. A Comparative Assessment of Support Vector Regression, Artificial Neural Networks, and Random Forests for Predicting and Mapping Soil Organic Carbon Stocks across an Afromontane Landscape. Ecol. Indic. 2015, 52, 394–403. [Google Scholar] [CrossRef]
Song, J.; Gao, J.; Zhang, Y.; Li, F.; Man, W.; Liu, M.; Wang, J.; Li, M.; Zheng, H.; Yang, X.; et al. Estimation of Soil Organic Carbon Content in Coastal Wetlands with Measured VIS-NIR Spectroscopy Using Optimized Support Vector Machines and Random Forests. Remote Sens. 2022, 14, 4372. [Google Scholar] [CrossRef]
de Santana, F.B.; Otani, S.K.; de Souza, A.M.; Poppi, R.J. Comparison of PLS and SVM Models for Soil Organic Matter and Particle Size Using Vis-NIR Spectral Libraries. Geoderma Reg. 2021, 27, e00436. [Google Scholar] [CrossRef]
Wilson, M.D. Support Vector Machines. Encycl. Ecol. 2008, 1–5, 3431–3437. [Google Scholar] [CrossRef]
Pouladi, N.; Møller, A.B.; Tabatabai, S.; Greve, M.H. Mapping Soil Organic Matter Contents at Field Level with Cubist, Random Forest and Kriging. Geoderma 2019, 342, 85–92. [Google Scholar] [CrossRef]
Grimm, R.; Behrens, T.; Märker, M.; Elsenbeer, H. Soil Organic Carbon Concentrations and Stocks on Barro Colorado Island—Digital Soil Mapping Using Random Forests Analysis. Geoderma 2008, 146, 102–113. [Google Scholar] [CrossRef]
Sanderman, J.; Gholizadeh, A.; Pittaki-Chrysodonta, Z.; Huang, J.; Safanelli, J.L.; Ferguson, R. Transferability of a Large Mid-Infrared Soil Spectral Library between Two Fourier-Transform Infrared Spectrometers. Soil Sci. Soc. Am. J. 2023, 87, 586–599. [Google Scholar] [CrossRef]
Castaldi, F.; Chabrillat, S.; van Wesemael, B. Sampling Strategies for Soil Property Mapping Using Multispectral Sentinel-2 and Hyperspectral EnMAP Satellite Data. Remote Sens. 2019, 11, 309. [Google Scholar] [CrossRef]
Ward, K.J.; Brell, M.; Spengler, D.; Castaldi, F.; Neumann, C.; Segl, K.; Foerster, S.; Chabrillat, S.; Ward, K.J.; Brell, M.; et al. Mapping Soil Organic Carbon Based on Simulated EnMAP Images and the LUCAS Soil Spectral Library. In Proceedings of the EGU General Assembly Conference Abstracts, Virtual, 4–8 May 2020; p. 3013. [Google Scholar] [CrossRef]
Ng, W.; Minasny, B.; Montazerolghaem, M.; Padarian, J.; Ferguson, R.; Bailey, S.; McBratney, A.B. Convolutional Neural Network for Simultaneous Prediction of Several Soil Properties Using Visible/near-Infrared, Mid-Infrared, and Their Combined Spectra. Geoderma 2019, 352, 251–267. [Google Scholar] [CrossRef]
Svensen, J.L.; Cheng, X.; Boersma, S.; Sun, C. Chance-Constrained Stochastic MPC of Greenhouse Production Systems with Parametric Uncertainty. Comput. Electron. Agric. 2024, 217, 108578. [Google Scholar] [CrossRef]
Montoya, A.P.; Guzmán, J.L.; Rodríguez, F.; Sánchez-Molina, J.A. A Hybrid-Controlled Approach for Maintaining Nocturnal Greenhouse Temperature: Simulation Study. Comput. Electron. Agric. 2016, 123, 116–124. [Google Scholar] [CrossRef]
Bontsema, J.; Van Henten, E.J.; Gieling, T.H.; Swinkels, G.L.A.M. The Effect of Sensor Errors on Production and Energy Consumption in Greenhouse Horticulture. Comput. Electron. Agric. 2011, 79, 63–66. [Google Scholar] [CrossRef]
van Mourik, S.; van Beveren, P.J.M.; López-Cruz, I.L.; van Henten, E.J. Improving Climate Monitoring in Greenhouse Cultivation via Model Based Filtering. Biosyst. Eng. 2019, 181, 40–51. [Google Scholar] [CrossRef]
Boersma, S.; Van Mourik, S.; Xin, B.; Kootstra, G.; Bustos-Korts, D. Nonlinear Observability Analysis and Joint State and Parameter Estimation in a Lettuce Greenhouse Using Ensemble Kalman Filtering. IFAC-Pap. 2022, 55, 141–146. [Google Scholar] [CrossRef]
Mazzocchi, F. Complexity in Biology. Exceeding the Limits of Reductionism and Determinism Using Complexity Theory. EMBO Rep. 2008, 9, 10. [Google Scholar] [CrossRef]
Polotskaya, K.; Muñoz-Valencia, C.S.; Rabasa, A.; Quesada-Rico, J.A.; Orozco-Beltrán, D.; Barber, X. Bayesian Networks for the Diagnosis and Prognosis of Diseases: A Scoping Review. Mach. Learn. Knowl. Extr. 2024, 6, 1243–1262. [Google Scholar] [CrossRef]
Stritih, A.; Rabe, S.E.; Robaina, O.; Grêt-Regamey, A.; Celio, E. An Online Platform for Spatial and Iterative Modelling with Bayesian Networks. Environ. Model. Softw. 2020, 127, 104658. [Google Scholar] [CrossRef]
Masaracchia, L.; Fredes, F.; Woolrich, M.W.; Vidaurre, D. Computational Neuroscience: Dissecting Unsupervised Learning through Hidden Markov Modeling in Electrophysiological Data. J. Neurophysiol. 2023, 130, 364. [Google Scholar] [CrossRef]
Mall, P.K.; Singh, P.K.; Srivastav, S.; Narayan, V.; Paprzycki, M.; Jaworska, T.; Ganzha, M. A Comprehensive Review of Deep Neural Networks for Medical Image Processing: Recent Developments and Future Opportunities. Healthc. Anal. 2023, 4, 100216. [Google Scholar] [CrossRef]
Khanam, F.-T.-Z.; Al-Naji, A.; Chahl, J. Remote Monitoring of Vital Signs in Diverse Non-Clinical and Clinical Scenarios Using Computer Vision Systems: A Review. Appl. Sci. 2019, 9, 4474. [Google Scholar] [CrossRef]
Kashiha, M.; Pluk, A.; Bahr, C.; Vranken, E.; Berckmans, D. Development of an Early Warning System for a Broiler House Using Computer Vision. Biosyst. Eng. 2013, 116, 36–45. [Google Scholar] [CrossRef]
Karmakar, P.; Teng, S.W.; Murshed, M.; Pang, S.; Li, Y.; Lin, H. Crop Monitoring by Multimodal Remote Sensing: A Review. Remote Sens. Appl. 2024, 33, 101093. [Google Scholar] [CrossRef]
Ghislieri, M.; Cerone, G.L.; Knaflitz, M.; Agostini, V. Long Short-Term Memory (LSTM) Recurrent Neural Network for Muscle Activity Detection. J. Neuroeng. Rehabil. 2021, 18, 153. [Google Scholar] [CrossRef] [PubMed]
Bargagli Stoffi, F.J.; Cevolani, G.; Gnecco, G. Simple Models in Complex Worlds: Occam’s Razor and Statistical Learning Theory. Minds Mach. 2022, 32, 13–42. [Google Scholar] [CrossRef]
Geman, S.; Bienenstock, E.; Doursat, R. Neural Networks and the Bias/Variance Dilemma. Neural Comput. 1992, 4, 1–58. [Google Scholar] [CrossRef]

Figure 1. A general schema for model-based monitoring and control (M_bMC) of biosystems and the employment and software sensor concept. The system is composed of various interconnected components: (a) The biosystem, which encompasses the target biological entity or process being monitored and controlled. (b) Hardware sensors, which collect data on measurable variables. (c) The estimation model, which utilizes computational algorithms to infer target variables (e.g., biomass, soil moisture, animal health status) that are not directly measurable. (d) The control system, which utilizes the estimated target variable data to adjust input variables and optimize the performance of the biosystem. (e) The software sensor, which acts as a virtual representation of the target variable, connecting the measured variables from hardware sensors to the control system.

Figure 2. This diagram illustrates the concept of inferential control, based on Brosilow’s work [33], where an unmeasured process variable (

y

) is estimated using an inferential model. The estimated value is then used by the controller to manipulate the manipulated variable (

m

) in order to achieve the desired set-point (

y_{s p}

). The system also includes a disturbance (

d

) that affects the process and an inferential variable (

z

) that is measured and used to estimate the unmeasured variable.

Figure 2. This diagram illustrates the concept of inferential control, based on Brosilow’s work [33], where an unmeasured process variable (

y

) is estimated using an inferential model. The estimated value is then used by the controller to manipulate the manipulated variable (

m

) in order to achieve the desired set-point (

y_{s p}

). The system also includes a disturbance (

d

) that affects the process and an inferential variable (

z

) that is measured and used to estimate the unmeasured variable.

Figure 3. Schematic representation of a state observer integrated within a control system.

Figure 4. Flowchart illustrating the two-step process of the Kalman filter algorithm for state estimation in a linear discrete-time system. In the prediction step, the state and error covariance are projected one step ahead, and then in the correction step, using the measurement data from a real system, the estimated state and covariance matrix are updated. These updated values are fed into the next prediction cycle.

Figure 5. This diagram illustrates the fundamental difference between regression and classification models. In regression, the goal is to predict a continuous target variable (e.g., animal weight, soil organic matter level) represented by a real number within a specified range. In classification, the objective is to predict a discrete target variable (e.g., health vs. sick animal) represented by a category or class.

Figure 6. Schematic representation of a software sensor designed for early warning of animal respiratory infection.

Figure 7. Exemplar waveform (upper graph) and spectrogram (lower graph) of the coughing sound acquired from a sick pig with pneumonia.

Figure 8. Software sensor process diagram for soil organic carbon (SOC) prediction. The diagram illustrates the integration of remote sensing data with supervised machine learning algorithms to estimate SOC levels. The process begins with data fusion of spatial, temporal, and spectral data sources, including satellite images and field measurements. These fused data are then processed through supervised machine learning techniques, with labeled inputs from in situ soil sampling to calibrate the model, band selection, and dimensionality reduction to optimize the spectral inputs for SOC prediction. The final step assesses the model’s accuracy using spatiotemporal SOC data, allowing for robust predictions.

Figure 9. Schematic representation of the lettuce greenhouse model. The system takes real-time measurements of indoor climate variables and estimates the dry weight of the plants through data assimilation, which is updated by a mathematical model. The estimated states are then fed into the control system to adjust inputs and maintain optimal conditions for plant growth.

Figure 10. This graph illustrates the relationship between model complexity and prediction error. Underfitting occurs when a model is too simple to capture the underlying patterns in the data, resulting in high bias. Overfitting, on the other hand, happens when a model is overly complex and fits the training data too closely, leading to high variance. The optimal model structure lies in the sweet spot where the total error (bias + variance) is minimized.

Figure 11. The interpretability–accuracy trade-off in in data-driven modelling: as model complexity increases, accuracy often improves but interpretability suffers.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Badreldin, N.; Cheng, X.; Youssef, A. An Overview of Software Sensor Applications in Biosystem Monitoring and Control. Sensors 2024, 24, 6738. https://doi.org/10.3390/s24206738

AMA Style

Badreldin N, Cheng X, Youssef A. An Overview of Software Sensor Applications in Biosystem Monitoring and Control. Sensors. 2024; 24(20):6738. https://doi.org/10.3390/s24206738

Chicago/Turabian Style

Badreldin, Nasem, Xiaodong Cheng, and Ali Youssef. 2024. "An Overview of Software Sensor Applications in Biosystem Monitoring and Control" Sensors 24, no. 20: 6738. https://doi.org/10.3390/s24206738

APA Style

Badreldin, N., Cheng, X., & Youssef, A. (2024). An Overview of Software Sensor Applications in Biosystem Monitoring and Control. Sensors, 24(20), 6738. https://doi.org/10.3390/s24206738

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

An Overview of Software Sensor Applications in Biosystem Monitoring and Control

Abstract

1. Introduction

2. Model-Based Monitoring and Controlling of Biosystems Concepts

3. Software Sensors

3.1. The Emergence of Software Sensor

3.2. State Observers and Kalman Filters

3.3. Software Sensor Design

3.3.1. Modelling Approaches

3.3.2. Data-Driven Software Sensors

4. Applications and Biosystem Case Studies

4.1. Early Warning System: Software Sensor for Real-Time Monitoring off Animal Respiratory Infection

4.1.1. Model I: Feature Extraction and Coughing Sound Recognition

4.1.2. Model II: Detection of Infected Animals

4.2. Sensor Fusion: Software Sensor Application for Indirect Estimation of Soil Organic Matter

4.3. Biosystem Control: Software Sensor Application for Automated Greenhouse Control

5. Considerations for Designing Biosystem Software Sensors

5.1. Challenges Related to the Biological System

5.1.1. Complexity

5.1.2. Time Variability

5.1.3. Individual Variability

5.2. Challenges Related to the Modelling Step

5.2.1. Model Complexity

5.2.2. Model Interpretability

6. Summary

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI