Next Article in Journal
Forest Land Cover Mapping at a Regional Scale Using Multi-Temporal Sentinel-2 Imagery and RF Models
Next Article in Special Issue
Sentinel-1 and 2 Time-Series for Vegetation Mapping Using Random Forest Classification: A Case Study of Northern Croatia
Previous Article in Journal
A Concept of 2U Spaceborne Multichannel Heterodyne Spectroradiometer for Greenhouse Gases Remote Sensing
Previous Article in Special Issue
Modeling and Simulation of Very High Spatial Resolution UXOs and Landmines in a Hyperspectral Scene for UAV Survey
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

An Interval Temporal Logic for Time Series Specification and Data Integration

by
Piotr Kosiuczenko
Institute of Information Systems, Military University of Technology, 00-908 Warsaw, Poland
Remote Sens. 2021, 13(12), 2236; https://doi.org/10.3390/rs13122236
Submission received: 15 April 2021 / Revised: 28 May 2021 / Accepted: 2 June 2021 / Published: 8 June 2021
(This article belongs to the Special Issue Multi-Sensor Systems and Data Fusion in Remote Sensing)

Abstract

:
The analysis of temporal series—in particular, analysis of multisensor data—is a complex problem. It depends on the application domain, the way the data have to be used, and sensors available, among other factors. Various models, algorithms, and technologies have been designed for this goal. Temporal logics are used to describe temporal properties of systems. The properties may specify the occurrence and the order of events in time, recurring patterns, complex behaviors, and processes. In this paper, a new interval logic, called duration calculus for functions (DC4F), is proposed for the specification of temporal series corresponding to multisensor data. DC4F is a natural extension of the well-known duration calculus, an interval temporal logic for the specification of process duration. The adequacy of the proposed logic is analyzed in the case of multisensor data concerning volcanic eruption monitoring. It turns out that the relevant behavior concerns time intervals, not only accumulated history as it is described in other kinds of temporal logics. The examples analyzed demonstrate that a description language is required to specify time series of various kind relative to time intervals. The duration calculus cannot be successfully applied for this task. The proposed calculus allows one to specify temporal series and complex interval-dependent behaviors, and to evaluate the corresponding data within a unifying logical framework. It allows to formulate hypotheses concerning volcano eruption phenomena. However, the expressivity of DC4F comes at the cost of its decidability.

1. Introduction

Remote sensing allows one to acquire information from a distance from cameras, sensors, microphones, and other external devices. The data may originate from satellites, aeroplanes, and sonar systems, among other sources. Satellite-based instruments are commonly used to monitor various parameters of the Earth’s surface, such as temperatures in various infrared frequencies, and to take images. Global coverage is offered with frequency being as low as once per day. Stationary satellites are even capable of providing continuous monitoring of specific locations. The main problems associated with satellite data include their heterogeneity and large volumes. The data need to be analyzed on a daily basis, sometimes in real time, if timely response is needed.
In general, multisensor data are voluminous, heterogeneous, and sometimes incomplete. Prior to being analyzed, they need to be processed and represented in a specific form. They may be represented as discrete data by temporal series, i.e., by sequences of data usually taken in equal time intervals, or as continuous time-dependent functions and stochastic processes. Numerous algorithms have been developed, for example, concerning volcano monitoring (cf., e.g., [1]).
In the literature, heterogeneous means are used, such as means diagrams, functions, tables, and textual descriptions, to specify those heterogeneous types of phenomena. Dependencies between factors of different kind, such as similarities, regularities, and periodicity, are quite often described with the use of text only, meaning that various problems stemming from such an approach, such as imprecision and the lack of a unifying framework for specification, validation, and reasoning need to be dealt with. Textual specifications are inherently imprecise, as this is the feature of natural languages. They do not rely on formal semantics in the form of comprehensive mathematical models. Consequently, they do not allow for precise evaluation or a formal reasoning, and thus the reasoning lacks due formality.
Logics provide uniform formal languages, models, and reasoning methods, and thus provide a solution enabling to address above mentioned problems. Logics are associated with well-defined classes of models constituting their semantics. They provide precise formal languages for the description of the models and capabilities for correct reasoning abut properties of the models. In general, a logic is a system consisting of a formal language for specification, a class of models corresponding to the language, and a set of sound rules for a correct reasoning about the about the models.
Temporal logics are used to express change over time, properties of behaviors, and sequences of actions. Their languages provide temporal modalities for specifying future or past events. They are used also to define and synthesize system controllers. On the other hand, they provide rules for correct reasoning about temporal properties. Various kinds of temporal logics exist (see [2] for an overview).
Interval temporal logics (cf., e.g., [2,3]) are used for specifying time-dependent processes relative to time intervals. Duration calculus (DC) is an interval logic that is widely used for the specifying, modeling, and reasoning about discrete and continuous processes. It allows to specify propositional functions with Boolean values changing over time. There is an operator corresponding to the integral (cf., e.g., [4]) that measures how long such a propositional function remains true. It may be used to study periodicity of system states.
In this paper, we present duration calculus for functions (DC4F), a natural extension of the duration calculus for dealing with general integrable functions, not only Boolean-valued ones, as is the case of DC. The idea is simple; we take the integral operator on Riemann integrable functions and use it within a frame of an interval logic such as DC. Thus, the integral operator is used within a well-suited logical framework. DC4F, in addition to the expressive capabilities of DC, allows one to characterize the behavior of functions over time intervals in terms of their integrals. Consequently, we can characterize not only the duration of a certain property, as in the case of DC, but characterize the behavior of functions over time intervals in terms of integrals. The proposed extension is conservative in the logical sense: the DC part is unchanged meaning that its valid formulas remain valid and all its invalid formulas remain invalid. Even though the extension of DC is natural, we are, to our best knowledge, the first ones who propose it.
To evaluate the proposed logic, we investigated various phenomena, multisensor data, and facts concerning volcano monitoring, as this is a popular research topic and the degree of complexity of data is significant. Periodic degassing and temperature increases are common characteristic of active volcanoes. Distinct periodicity patterns concerning measurable parameters of volcanoes’ activity have been widely identified. The timescales are ranging from seconds to weeks and months. The development of temperature and gas measurement techniques is aimed at enabling a robust quantification of high-frequency processes. Paper [5] presents an overview of the current state of knowledge regarding periodic volcanic degassing and evaluates the methods aiming at detecting periodicity. It summarizes and statistically analyzes published studies. Periodicity analysis of volcano activity (cf., e.g., [5]) is one of the challenges.
It turns out that such phenomena and their dependencies may be conveniently specified using DC4F, as it provides a expressive and uniform language for expressing various phenomena in a precise way, models for various data, and evaluation and reasoning capabilities. It provides a convenient specification language to express hypothesis concerning expected temporal properties. If data (in particular, multisensor data) are provided, then DC4F formulas may be validated for them and, thus, their truthfulness may be checked. On the other hand, the reasoning rules provide for convenient reasoning possibilities. Thus, DC4F may be perceived as a unifying logical framework for data integration and for formulating hypotheses, their evaluation, and reasoning.
However, high expressivity of languages always comes at the cost of complexity of reasoning rules. The higher the expressivity, the more difficult the reasoning. More precisely, the question if a formula follows from a set of other formulas has high computational complexity. DC4F, like DC, is an expressive language and is, therefore, undecidable, i.e., there is no general algorithm for deciding the question mentioned above. Nonetheless, this does not hinder its use for specification, nor does it hinder the validation of formulas for concrete data.
The paper is organized as follows. In Section 2, we discuss related works. Section 3 contains a brief presentation of duration calculus and the way it is used. In Section 4, we present the main idea of DC4F and some examples of its application. In Section 5, we define its formal syntax and outline its semantics in an informal and exemplary way; we also list some of its properties and show how it applies to the multidimensional case. Section 6 is devoted to applications of DC4F in the area of volcano monitoring, in particular to the validation of the proposed concepts. In particular, we present an exemplary reasoning in DC4F. We conclude the paper with Section 7. The paper also contains an Appendix in which we define the formal semantics of DC4F.

2. Related Works

Data fusion means the integration of data and knowledge from different sources. It is a widely studied topic (cf. the overview papers [6,7] and the references there). Data fusion approaches may be classified in various ways—for example, data association, state estimation, and decision fusion. Data fusion has been studied in the context of temporal series as well. In paper [8], the authors study the fusion of long-term data in the form of dense time series from the Moderate Resolution Imaging Spectroradiometer (MODIS) and Landsat imagery. They investigate a spatiotemporal adaptive fusion algorithm in a regionalization study in which MODIS was used. They show the correlation of the time series achieved by different observation methods. In paper [8], the authors propose a data fusion method for producing high spatiotemporal resolution values for the normalized difference vegetation index in the case of time series.
We approach this topic from the perspective of temporal logic, rather than from the practical angle, as in the papers mentioned above and various others. Temporal logic (TL) describes behavior over time (cf., e.g., [2]). Their languages offer temporal modalities for specifying past or future events, as well as their temporal relations, such as: event A will happen or event A may happen in the future, property A has to be always present, property A holds until property B will be satisfied, and so on. They are also used to define and synthesize system controllers; the controllers are then guaranteed to monitor and control the underlying systems according to the specific requirements. On the other hand, they are used to ensure that conclusions drawn based on the assumptions made are correct, that the implementation satisfies the specifications, and so on.
There are different types of temporal logics (see [2] for the overview). The linear temporal logic describes behaviors by referring to linear sequences of events. The computation tree logic (CTL) describes the possibilities and, more precisely, the branching time structures. The interval temporal logic (ITL) (cf., e.g., [2,3]) describes behaviors relative to time intervals, stating, for example, that a property is valid during the entire time period or that it occurs within a certain subperiod. It is also possible to express the fact that a property holds within a given time period, which is then followed by another time period over which another property holds.
Duration calculus (DC) is a popular and widely studied kind of interval logic (cf., e.g., [4]). Duration calculus may be considered to be a form of ITL. In addition to the above-mentioned potential of ITL, DC contains an integral operator that allows one to express durations of properties in a quantitative manner.
In [9], automata-based semantics of DC have been defined covering data, real-time, and communication-related aspects. A model-checking algorithm has been presented for a subset of DC that may be model-checked. The algorithm has been implemented and its use demonstrated.
DC and ITL are expressive but not decidable, i.e., no procedure exists for figuring out whether a given formula is always valid or not. Thus, as usual, expensivity is at the cost of complexity—in this case, decidability. A restricted form of DC, known as RDC, has decidable inference relation [10]. In fact, formulas of this type may be reduced to the regular expressions. The problem of DC model checking is a topic of ongoing research (cf. e.g., [11] and the references there). In [12], the authors propose a method for solving minimal and maximal reachability problems for the multipriced timed automata. The automata are an extension of timed automata with multiple cost variables that may evolve according to specified rates.
The range of applications varies from the above-mentioned real-time systems (cf. e.g., [13] and the references there) to air traffic control (cf. e.g., [14]) and hybrid systems [15].
Duration calculus is often used for the synthesis of real-time system controllers (see, for example, [16]) and the references there). It is capable of specifying time constraints of dynamic systems. Various methods are used—for example, integer linear problem-solving and optimization problem-solving methods (cf. [16]). DC is used also for traffic system specification (cf., e.g., [13]). There is also a wide range of tools supporting DC (see, for example, [10] and the references there).
In [17], a variant of duration calculus was presented for system discounting, i.e., the idea that something happening earlier is more important than similar events happening later. The idea was introduced earlier into temporal logics such as LTL and CTL. The authors demonstrated decidability of the model-checking for timed automata and a dedicated fragment of discounted duration calculus.
We illustrate our ideas with data concerning volcano monitoring. In fact, volcano monitoring is one of the prime examples for the application of multisensor systems placed in satellites, aeroplanes, balloons, and on the ground. They provide huge amounts of data that have to be preprocessed, analyzed, evaluated, and reasoned about. Thus, it is a proper area to test ideas such as ours. Of particular interest are the Strombolian effects, i.e., periodic volcanic activity phenomena such as temperature picks, gas bursts, and lava eruptions (cf., e.g., [18,19,20]).
In [1], Koeppen et al. propose a fully automated interactive algorithm called MODVOLC for the analysis of thermal satellite time-series data. The algorithm is aimed at detecting and quantifying the excess energy radiated from the thermal anomalies such as active volcanoes. The algorithm enhances the previously developed approaches (see the references in [1]). It is characterized by the law rate of false positives (see [1,21]). It flags thermal anomalies—in particular, volcanic eruptions. The algorithm was tested for different localizations such as the Anatahan volcano, the Kīlauea volcano, and the Cantarell oil field in the Gulf of Mexico.
In [22], Laiolo et al. compared thermal satellite images of the Etna eruption that took place in 2018 with ground-based geophysical data of summit craters. The Moderate Resolution Imaging Spectroradiometer (MODIS) provided infrared images and helped to identify pixels including the possible hot spots.
Temporal series corresponding to Strombolian effects may be compared using similarity measures. In fact, various similarity measures are used to compare temporal series and functions in general (cf., e.g., [23] and the references there). One can use the dynamic time warping algorithm (cf., e.g., [24] and the references there). This algorithm is used in temporal series analysis for measuring similarity between temporal series that may vary in terms of speed. Algorithms of this type were applied to compare graphical data representing temporal series, but also in comparisons of audio and video materials (cf., e.g., [25,26]).
Dyea et al. [19] proposed a method of detecting Strombolian eruptions based on training a convolutional neural network. The method automatically categorizes eruptions based on infrared images taken at the rim of a crater atop Mount Erebus. The authors show that machine learning may be effectively used to classify the characteristics of Strombolian eruptions, to facilitate the process of studying their origins, and to assess the hazards posed by volcanic eruptions.

3. Duration Calculus

In this section, we present the basic features of duration calculus (DC). We present the basic idea behind temporal specification relative to time intervals. We also list and explain the temporal modalities it provides and explain how they can be used.

3.1. Using Duration Calculus

In this subsection, we show the basic properties of duration calculus (DC) in its original form. DC allows to specify Boolean-valued functions, which play the role of time-dependent formulas, and to use the DC integral operator to measure the time, or equivalently the duration, when they hold. We also consider an operator for splitting intervals.
DC allows to treat durations of processes. More precisely, it expresses integrals of Boolean-valued functions with values 1 and 0 representing true and false, respectively. Such functions are called state functions since they indicate whether the system is in a given state or not. It should be noted that the properties are defined not in respect to specific points in time but in respect to intervals and in terms of durations.
We start with a simple example of system with states modeled by the sin ( t ) function and lasting for the time interval [ 0 , 2 π ] (see Figure 1). This function is first positive on [ 0 , π ] subinterval and then it is negative on subinterval [ π , 2 π ] . The fact that the function is first positive may be characterized in DC using the Boolean-valued function sp ( t ) = 1 { x | 0 sin ( x ) } ( t ) , which returns 1 if t { x | 0 sin ( x ) } and 0 otherwise. In the first case, the value is 1, and in the second case, it equals 0; we identify true with 1 and false with 0. sp ( t ) is called the characteristic function of the set { x | 0 sin ( x ) } . To describe that the value of s i n is negative, we can use the characteristic function sn ( t ) = 1 { x | sin ( x ) < 0 } . This function may be obtained from the first one using logical negation sn = ¬ sp , which swaps 1 and 0. It should be noted that DC does not allow direct access to s i n and its integral, but only via Boolean-valued functions characterizing its values.
Duration calculus enables one to express such properties as the duration of states—for example, the period of time over which s i n remains positive, with the help of state function sp . DC formulas are evaluated always in respect to an underlying time interval. Length function returns the length of the underlying interval. It is defined using the integration operator ∫ of DC: = 1 . The semantics of this operator is defined relatively to an underlying interval [ a , b ] —in this case, 1 = a b 1 d t .
For a Boolean-valued function, the integral operator measures the time when the function is true. For intervals of length 2 π , the length of subinterval where sin ( t ) is positive is equal to the length of the subinterval in which it is negative. This fact may be expressed using the length function :
= 2 π s p = s n
In this formula, we use the integral s p , which, for a given interval [ a , b ] , returns the value a b s p d t , i.e., the value of the integral for the interval, and similarly for sn . The formula states that if the length of an interval equals to 2 π , then the time when the sp is true equals to the time, the duration, when sn is true. We do not have to consider the fact that the function is equal 0 at the end of intervals, since the functions are characterized in terms of integrals and for subintervals of length 0, the integral value has no influence on the entire integral.
For interval [ 0 , 2 π ] , it holds that s p = π , since the duration of sp being true is π . In fact, this expression is true for every interval of the form [ 0 , x ] , where π x 2 π , but it is false for intervals of the form [ x , 2 π ] . Similarly, we can express the fact that the duration of s i n being negative is π : s n = π . This expression is true on every interval of the form [ y , 2 π ] where 0 y π . In both cases, we have to use the auxiliary functions s p , s n to characterize s i n in terms of being positive or negative, respectively.

3.2. Temporal Modalities

In this subsection, we present three temporal modalities definable in DC: the chop, the after-modality, the sometime-modality, and the always-modality. The chop is denoted by (cf., e.g., [10]); it applies to two neighboring intervals. It allows one to express the fact that a property holds in the first interval and another property holds in the second interval. The last two modalities occur in all kinds of modal and temporal logics (cf., e.g., [2]), but they have a specific meaning in DC. The sometime-modality is denoted by and requires a property to hold for some time interval. The always-modality is denoted by □ and means that a property holds for all time periods.
Formally, for two formulas F , G , the formula F G means that for an interval [ a , b ] in question, there exists a number x such that a x b , F holds for interval [ a , x ] and G holds for interval [ x , b ] . For example, the fact that sin is initially positive and then negative (see the previous subsection) can be described in respect to two subintervals dividing the interval in question. We may express it by formula ( s p = π ) ( s n = π ) . It is true on the interval [ 0 , 2 π ] as the interval can be split into two subintervals [ 0 , π ] and [ π , 2 π ] ; thus, we chose x = π .
Note that the operator is associative; thus, the formula of the form F ( G H ) is equivalent to formula ( F G ) H , since the formulas depend on splitting an underlying interval [ a , d ] into three parts [ a , b ] , [ b , c ] , [ c , d ] where the subformulas F , G , H have to hold, respectively. Consequently, we will drop the brackets when using chop.
Given a formula G and an interval [ a , b ] , the fact that G holds for interval [ a , d ] means that it has a subinterval [ b , c ] , i.e., a b c d , such that G holds in [ b , c ] . The may-modality may be expressed using the chop operator t r u e G t r u e , which means that we can divide the underlying interval into three subintervals and that G holds for the middle subinterval, whereas we do not require anything from its adjacent subintervals on the left- and right-hand sides. For example, we have the following property:
π < 0 < s p =
This formula says that if the length of the underlying interval is larger than π , then there is a subinterval of length larger than 0 where the property s p holds all the time, i.e., sin is positive.
The modal formula G means that for all subintervals of an underlying interval, the formula G holds. For example, for all subintervals of interval [ 0 , π ] , the function s p is true all the time: s p = . Modality □ can be expressed using the sometime-modality : ¬ ¬ G ; the formula means that no subinterval exists where G does not hold. Vice versa, G can be expressed as ¬ ¬ G . Thus, both modalities are dual. Both are definable in terms of the chop operator, but we will use them as syntactic sugar to facilitate readability of the formulas.

4. The Idea of Duration Calculus for Functions

In this section, we present the idea of duration calculus for functions (DC4F). The objective is to treat Riemann integrable functions with values of type real. Such functions can represent continuous as well as discrete time series. We informally present features of the proposed calculus and illustrate them with simple examples demonstrating its basic characteristics, the way it can be used, and the capabilities of the logic.

4.1. DC Versus DC4F

DC allows one to treat Boolean-valued functions only and to reason about their durations. It does not allow to integrate general functions. In this subsection, we propose an extension of DC supporting all Riemann integrable functions. We present the basic features of the proposed logic. We also present examples that are specific to DC4F and, thus, cannot be expressed in DC.
In DC, a state function f, we can consider the time intervals, or periods, when this states holds. Such a case corresponds to considering the corresponding characteristic functions: 1 { y | 0 < f ( y ) } ( t ) . The integrals of the characteristic functions correspond to the duration of the characterized state or property. For example, as explained in the previous section, the fact that s i n is first negative and then positive can be expressed in DC only via the auxiliary characteristic functions s p and s n , but not directly. In DC, we can integrate function s p ( x ) and obtain the duration over which it remains true, but we cannot integrate s i n or any other function that is not Boolean. Thus, for example, we can express the property that for intervals of length 2 π , s i n is positive as equally long as it is negative = 2 π sp = sn .
However, not all properties of integrals can be expressed in this way. For example, we cannot express properties such as = 2 π sin = 0 , i.e., that if the length of the underlying interval is 2 π , then the integral is equal to 0. Thus, DC is not expressive enough to treat time series in general.
DC4F is a natural extension of DC as allows all integrable functions. For example, we can consider s i n . In logical terms, it is a conservative extension of DC. Conservativity means that a formula of DC is a tautology in the sense of DC if, and only if, it is a tautology of DC4F; equivalently, it is satisfied in a DC model if, and only if, there is a DC4F extension of the model in which the formula is satisfied. In general, DC can be expressed in DC4F by restricting the set of integrable functions so that it contains Boolean-valued functions only.

4.2. Monotonicity

We formulate the definition of monotonicity in terms of intervals and integrals. The property that a function is monotone over an interval can be expressed in DC4F by the requirement that for two neighboring subintervals of the same length, the integral over the left one has smaller or equal value to the integral over the right one. It can be expressed by formula M of the form f = x = y f = z = w y = w x z . This property must hold for all subintervals of this kind; thus, we have the formula: M . Figure 2 shows a monotonically increasing function f.
Alternatively, we can define the monotonicity property in the following way:
x , y { ( > 0 f / = x ) ( > 0 f / = y ) x y }
This formula, though it may seem complicated, is rather simple. It states that for a given interval and for all its adjacent subintervals, the normalized value of the integral for the left subinterval is lower than or equal to the normalized value of the integral for the right subinterval. It is illustrated in Figure 2. Intervals [ b , c ] and [ c , d ] are adjacent. The values of the integrals for these intervals have to be normalized by their length, as their lengths are different. We will abbreviate this property of function f as M I n c r e a s i n g ( f ) . Analogously, we can define the property of monotonic decrease, M D e c r e a s i n g ( f ) . We use a capital letter to indicate that both functions have a functional argument instead of a time value.
Although length operator occurs in the characteristics of the first and of the second interval, it returns independent values. Due to transitivity of smaller or equal relation ⩽, the monotonicity property holds for all subintervals, not only for the adjacent ones. Thus, a b f / d t b c f / d t c d f / d t (see Figure 2).

4.3. Limits and Amplitude

In this subsection, we show how to define lower limits of functions and their minimal amplitudes. The definitions are based on temporal modalities. Upper limits and maximal amplitudes can be defined analogously. We use those definitions below when dealing with volcanic Strombolian effects.
We start with the case when a function g ( t ) is an upper approximation of a function f ( t ) or, equivalently, function f ( t ) is a lower approximation of function g ( t ) (see Figure 3). This property can be expressed by demanding that for each subinterval of a given interval, the integral of f is smaller than or equal to the integral of g:
f g
The property that g increases c times faster than c can be expressed in DC4F in a simple way: c f g . It should be noted that we cannot directly express it in DC.
The diagram presented in Figure 4 shows function h, which is above value a. The property that, for a given interval, values of function h are above a certain limit a all the time can be expressed as follows:
a h
The integral of a is equal to the value a . Consequently, we demand that it remains smaller than the integral of h for all subintervals of the underlying interval. The fact that function h exceeds threshold y for some time can be expressed as follows:
t r u e ( 0 < y h ) t r u e
This formula states that for some subinterval of a nonzero length, the value of the integral is not lower than the value of a multiplied by the length of the subinterval. The function shown in Figure 4 extends beyond line y for a time period. Equivalently, the property can be expressed as follows:
( 0 < y h )
The fact that an amplitude of a function is at least d can be expressed as follows:
x , y ( ( 0 < h x ) ) ( 0 < y h ) ) d y x
Formula (8) says that there exist two subintervals of nonzero length. The integral of h for the first one is smaller than or equal to x , as it was expressed by formula (7). Analogously, the integral of h for the second one is larger than or equal to y . Formula (8) requires also that the difference y x is equal to at least d.
In Figure 4, the intervals correspond to the parts of diagrams of h located below the x and above the y line, respectively, are indicated by bold lines. Below, we abbreviate formula (8) by A m p l i t u d e ( h , d ) . Formula P e a k A b o v e ( h , y ) is a shorthand of Formula (6) and expresses the fact that function h is above threshold y for some time period.

5. Syntax and Semantics of DC4F

In this section, we define the formal syntax of DC4F and describe its informal semantics. We list some basic properties of the proposed logic and show how to apply it in the multidimensional case, such as those involving two-dimensional images.

5.1. Syntax of DC4F

Every logic has its proper language, which has to be formally defined. In this subsection, we define the formal syntax of DC4F. This syntax is close to that of DC. The crucial difference consists in the fact that we allow all unary integrable functions, not only the Boolean-valued ones, and their integrals. Basically, there are constants, variables, and function symbols corresponding to the integrable functions, which can be added and multiplied by constants. There is also an integral operator. Formulas are either of the atomic form, when real numbers are compared, or are composed from other formulas.
To define the syntax formally, designated sets of variables, functions, and relation symbols are required. There are global variables, which remain unchanged. We assume that VSym is an unbound set of real-valued variables denoted by letters x , y , z , … We also assume that there is a set CSym = { 0 , 1 , , true , false , } of constant symbols. The set FSym contains unary functions symbols f , g , h , corresponding to integrable functions. The set FlSym contains functional symbols X , Y , Z , corresponding to functionals with arguments in the form of time intervals and values of type real R (cf. Appendix A).
The syntax of the logic comprises three categories of expressions: functions F n containing integrable functions, terms R e a l containing expressions of type R , and formulas F o containing logical formulas in the proper sense of the word. We assume in the definition below that f FSym , C CSym , x VSym , X FlSym .
  • F n : : = f | C | C · F n | F n F n , for { + , , }
  • R e a l : : = C | x | X | F n | R e a l R e a l , for { + , , · , / }
  • F o : : = t r u e | f a l s e | R e a l R e a l | ¬ F o | F o F o | x F o | F o F o | F o * , for { < , = }
The first category includes constants and functions of type real; the constants are treated here as functions with arity equal to 0. The functions can be multiplied by constants and added. There is also the maximum operator ∨; it denotes their maximum and is a reminiscent of DC (cf., e.g., [4]). In DC, there is an indefinable negation operator ¬ that applies to state functions (cf., e.g., [4]). It is characterized by the following property: f ( x ) = 1 if, and only if, ¬ f ( x ) = 0 . We can define the negation ¬ on integrable functions such that it coincides with negation on state functions: ¬ f = d e f 1 f ( x ) f ( x ) 1 . Consequently, if f ( x ) = 1 , then ¬ f ( x ) = d e f 1 1 1 1 = 0 . If f ( x ) = 0 , then ¬ f ( x ) = 1 0 0 1 = 1 , since ∨ is the maximum.
The second category includes constants, variables, and applications of the integral operator. There are also auxiliary operators X on intervals with values of type real. Complex terms can be constructed from simpler ones using arithmetic operations.
The third category comprises formulas as such. The atomic formulas include constants true and false ; the atomic formulas are also formed from relational symbol applied to terms, such as u < v . Complex formulas are composed from other formulas by the application of logical operators: logical negation, alternative, existential quantification, concatenation, the chop operator, and iteration. Thus, if F and G are formulas, then the alternative F G , the chop F G , the iteration F * , and the quantification x F are formulas as well.
As an example of the above syntax, we can present formula s i n = 2 ( s i n = 2 ) , provided that s i n belongs to FSym . The formula is obtained by the application of chop to two atomic subformulas. The first subformula is obtained by the application of equation = to terms s i n and 2; the first term is obtained from function s i n by application of the integral operator. The other subformula is obtained in a similar way.
It should be noted that relations u v and u > v can be defined in terms of < and = using logical operators ¬ and ∨, e.g., in the first case, we have u < v u = v . Similarly, the conjunction F G is defined as ¬ ( ¬ F ¬ G ) . The implication F G is defined as ¬ F G . The general quantifier x F o is defined as ¬ x ¬ F o . As mentioned in Section 3.2, modalities □, are defined using chop and negation. F is defined as ¬ ( t r u e ( ¬ F o t r u e ) ) and F as t r u e F t r u e . The length operator is defined as 1 (see Section 3.1). We assume that binds stronger than ∨ and ∧. ∨ and ∧ bind stronger than quantifiers ∃ and ∀. Quantifiers bind stronger than implication ⇒. Implication binds stronger than modalities and □.

5.2. Informal Semantics of DC4F

In this subsection, we present the intuitive semantics of the previously defined syntax. The presentation is informal, but it can be easily formalized, as shown in the Appendix A. We discussed it for the three syntactic categories defined above: F n , R e a l and F o .
We assume here that the time domain T ime is the set of non-negative real numbers R + . In the case of discrete time modeled by natural numbers N , for every discrete function f, we define the corresponding step function g on R + by extending the values of f to the corresponding unit interval: g ( t ) = f ( t ) , where t is the largest integer n N such that n t . Thus, we can treat discrete functions, or discrete time series, as if they were defined for continuous time.
Elements belonging to category F n are interpreted as time series, and more precisely as Riemann integrable functions with the domain R + . The functions may be multiplied by constants, added, and subtracted; thus, the set corresponds to a linear space. The function f ( t ) g ( t ) returns the maximum of f ( t ) and g ( t ) .
The category R e a l contains terms of type R . The constants, e.g., 0 or 1, are interpreted as the corresponding real numbers. Variable symbols are evaluated by functions mapping them into the set R . For f F n and an interval [ a , b ] , where a , b R + and a b , term of the form f is interpreted as the integral a b f ( t ) d t . This operator is linear in respect to functions: c f = c f and f + g = f + g .
A term t depends on variable x FSym if t contains it. In general, the value of terms containing variables depend on the values of its variables. Valuations are mappings of variables into R . We can, for example, define valuation v a l 1 as mapping of the form x 1 , y 2 , z 5 . Let term p be of the form x + y + z + 2 . The value of p for v a l 1 is 10; it does not depend on an underlying interval. The value of a term depends on an underling interval if contains operator ∫. In the case of term q of the form ( 1 ) + z , the value depends on the underlying interval and variable z; for interval I 1 = [ 0 , 2 ] and v a l 1 , the value is 7.
The category of formulas F o plays a crucial role, as formulas are the proper objects of logical reasoning. In fact, formulas can be understood as Boolean-valued terms. The satisfaction of formulas, i.e., the fact whether formulas are true or false, is defined in respect to an underlying interval and a valuation of the variables they include. The values of these terms may be compared using <, =, given the underlying interval and the valuation. The result of this comparison is either true or false, depending on the value of the corresponding terms. For example, for interval I 1 and valuation v a l 1 , formula q < p is true but formula q = p is false.
Let I = [ a , b ] be an interval and v a l be a valuation. Formula ¬ F is satisfied in interval I for valuation v a l , if formula F is false in interval I for v a l . Similarly, the alternative F G is satisfied if F or G is satisfied in I for v a l .
In case of chop, F G is satisfied in interval I for valuation v a l if there exists c [ a , b ] such that F is satisfied in [ a , c ] for v a l and G is satisfied in [ c , b ] for v a l . Iteration F * of formula F is satisfied in I for v a l if there is an n such that we can split the interval into subintervals [ a , a 1 ] , [ a 1 , a 2 ] , …, [ a n 1 , b ] such that F is satisfied in each subinterval for v a l (cf. [4] Section 4). An example of the iteration formula is presented below and its formal semantics are presented in the Appendix A.
Quantified formula x F is satisfied in interval I for valuation v a l if there exists a valuation v a l which differs from v a l only for variable x such that F is satisfied in I for v a l . The fact that v a l and v a l differ only at x means that for every variable y different from variable x, it holds that v a l ( y ) = v a l ( y ) . For example, x x + y > 25 is satisfied for valuation v a l 1 defined above; in fact, we can define v a l as v a l 1 but v a l ( x ) = 30 .

5.3. Basic Properties

In this subsection, we list some basic properties of the logical operators, such as the linearity and monotonicity of the integral. We use them in an exemplary reasoning.
The DC4F calculus has the following logical properties:
  • 0 = 0 , c = c 1 = c
  • c f = c f , f + g = f + g
  • If f ( x ) g ( x ) , for every x R + , then f g
  • f = x f = y f = x + y
  • F F ( = 0 ) , F ( = 0 ) F
  • F t r u e , ( F G ) ( F H G H ) , F G G F ,
  • ( F G ) ( F H G H ) , ( F G ) ( H F H G )
  • If the formulas F , F G hold in an interval I for a valuation v a l , then G holds in an interval I for a valuation v a l .
Point (1) concerns integration of constants. Point (2) expresses the fact that integrals are linear operators, i.e., they commute with multiplication by constants and are distributive in respect to addition. Point (3) expresses the monotonicity of integral operators. If the antecedent inequality holds for all non-negative reals, then the consequent inequality holds for all time intervals. Property (4) can be expressed as a b f d t + b c f d t = a c f d t . The above-mentioned points are specific to DC4F because it allows integrable functions, not only propositional ones as in the case of DC. The following points are common with DC (cf. [4], Section 2). Property (5) states that every interval can be split into an interval of length 0 and the rest. Point (6) specifies three exemplary propositional tautologies (cf., e.g., [27]) that we use later in an exemplary derivation. The first tautology says that truth is implied by any formula. The second tautology expresses the monotonicity of the conjunction operator. The third one expresses the commutativity of conjunction. It should be noted that all propositional tautologies hold in case of DC as well as DC4F. Point (7) specifies the monotonicity of the chop operator. Property (8) is the modus ponens reasoning rule. It says that if the antecedent of an implication is true and the implication is true, then the consequent of the implication is true as well. The rule is used in many kinds of logic (cf., e.g., [2,27]).

5.4. Multidimensional Case

The functions considered in DC and DC4F are unary, i.e., they are of the form f ( t ) , where t is the time parameter. In this subsection, we consider integrals over multidimensional sets and spaces. Such cases may occur when, for instance, images of an area are considered and measurements are performed. The development of values over time may concern specific areas or multidimensional spaces. In this case, the integrated function has a number of variables, apart from the time variable. We show that in fact such a multidimensional case can be reduced to the case of unary functions. In fact, we utilize such integrals in the examples presented in the following section.
An image may be represented by a number of pixels with two coordinates x, y. Thus, the area of an image may be considered as a set A R 2 . In the multidimensional case, it has the form A R n . If the images are used to measure certain values, then the measurement can be modeled by a function. The measurements may be time-dependent. Thus, we consider time-dependent integrable functions of the form f ( t , x 1 , , x n ) and multidimensional sets of the form R + × A . For an interval I, the corresponding integral has the form: I × A f d ( t , x 1 , x n ) , where d ( t , x 1 , , x n ) is a product measure on the space R × R n . Equivalently, we can present the integral using the characteristic function 1 A corresponding to set A, the function has value 1 for the elements of A and 0 for all other arguments: 1 A f ( t , x 1 , , x n ) d ( t , x 1 , , x n ) .
This integral can be presented also in the form I A f d ( x 1 , , x n ) d t . Thus, the integral I × A f d ( t , x 1 , x n ) has the form I g ( t ) d t , where g ( t ) = A f ( t , x 1 , , x n ) d ( x 1 , , x n ) . Consequently, it can be presented as an integral of unary function g ( t ) depending on the time parameter t only.

6. Applications and Validation

In the case of multisensor systems, the data generated may be of various types, such as temperature, pressure, or density measurements, videos, or sound, among others. It is hard to bring them into a uniform and consistent model. Thus, quite often, textual specifications and informal validations are used, which is inherently imprecise. Specification, interpretation, comparison, and reasoning about such heterogeneous data pose a nontrivial problem. In this section, we apply the DC4F calculus to express measurements and complex behaviors. The goal is to illustrate the way DC4F can be applied and to demonstrate its capabilities for specifying temporal series. We use examples from the area of volcanic activity monitoring by multisensor systems because of their complexity and, thus, the challenge that they pose. We aim to show that the extension of DC that we propose in this paper provides a uniform language, models, and a reasoning system to integrate, describe, and reason about such data. The examples are used to show the practical applicability of DC4F and to demonstrate how it can be used. However, it is not our goal to adequately describe those phenomena per se. It should be also pointed out that the units of measurement do not play an essential role for the illustration. Our concepts concern mathematical properties and, thus, are independent of the units used.

6.1. Volcano Monitoring

To demonstrate the applicability of DC4F, we looked at several examples of signal processing and the corresponding temporal series. It turned out that volcano monitoring is an interesting and challenging task due to the complexity of the process, various monitoring methods used, and the heterogeneity of the acquired data. In this subsection, we present a brief presentation of volcano monitoring processes, as they are instrumental in illustrating our ideas.
As pointed out by Corradini et al. in [18,28], volcano monitoring processes require the correlation of satellite data, provided that satellites are present over the specific target. Satellite data are heterogeneous in nature and relate to temperatures measured in different wavelengths, among other measurements. Several phenomena, such as temperature picks, gas explosions, and magma eruptions, need to be considered (cf., e.g., [19,20]).
Strombolian volcano eruptions are a widely studied topic. They are characterized by specific cyclical patterns. Therefore, their specification is a challenging task. A Strombolian volcano eruption is characterized by regular, relatively mild blasts. It consists of the ejection of fragments of solidified lava, material deposited by previous volcanic eruptions, and masses of molten rock to altitudes ranging from tens to hundreds of meters (cf., e.g., [18,19,20]). Eruptions of this type are named after the Stromboli volcano. They are commonly observed in volcanoes fed by low to moderate viscosity magma. The explosions are caused by the bursting of gas slugs, which rise to the surface faster than the surrounding magma. Quakes associated with such explosions occur at low depths (cf. e.g., [20]). Spectroscopic measurements performed on the Stromboli volcano were used to demonstrate that gas slugs originate from the volcano–crust interface at the depth of approximately 3 kilometers, which may promote slug coalescence. In the case of the Stromboli volcano, quantitative constraints for the depth of conglomerates of high-pressure gas bubbles inducing Strombolian activity can be identified on the basis of spectroscopic measurements of the magmatic gas phase driving the explosions.

6.2. Dealing with Estimates

Various parameters are considered when a volcano eruption is monitored. These parameters need to be measured and compared. In this subsection, we show that functional limits may be dealt with in CD4F.
Thermal satellite images of the Etna eruption that took place in 2018 were compared with ground-based geophysical data of summit craters using the Moderate Resolution Imaging Spectroradiometer (MODIS) (see [22]). The technology provides infrared images and makes the identification of hot spots including pixels possible. The thermal anomalies may be quantified in terms of volcanic radiative power (VRP), expressed in watts (see [22] and the references there). The VRP value is calculated as follows:
V R P p I X = A p I X × 18.7 × ( L 4 a l e r t L 4 b k )
where A p I X is the pixel area of approximately 10 6 m 2 , L 4 a l e r t is the recorded the middle infrared radiance (W m 2 sr 1 m 1 ), and L 4 b k are the background pixels. These values allow to estimate the average lava erupted volume, i.e., the time average discharge rate (TADR), through a unique parameter called radiant density c r a d (J m 3 ):
T A D R = V R P c r a d
This parameter represents capacity of the lava body to radiate heat (see [22] and the references there). It allows to estimate the erupted lava volume by assuming the appropriate parameter c r a d for the lava flow in question.
The fact that the actual lava flow volume T A D R is between limits L A p p and U p A p p (see Figure 5) can be specified by the following formula, which is analogous to Formula (5) from Section 4.3. This fact cannot be expressed in DC, since DC only allows to consider the duration properties on intervals but not the values of integrals:
L A p p T A D R U p A p p
The inequalities express the desired property in terms of intervals. The formula means that the inequalities hold for every subinterval of the given interval (see Section 3.2).

6.3. Dealing with Two-Dimensional Spaces

There exist algorithms for detecting thermal anomalies in two-dimensional pictures related to volcano eruptions. In this subsection, we discuss the application of DC4F to such cases. We specify the units of measurement, but it should be noted that for the presentation, the units of measurement and exact numeric values obtained do not play a significant role in the presentation data, since in our framework we can deal with values of type R independently of their interpretation.
MODVOLC is a fully automated algorithm for the analysis of thermal satellite time-series data [1]. The algorithm is based on spectral wavelengths centred at 4 and 11–12 µm emitted by high-temperature volcanic sources and the surrounding Earth surface. In the case of a volcanic eruption, the radiance is measured from the corresponding pixel. One pixel corresponds to approximately 4 km. Images are collected every 15 min. In pixels that show a thermal anomaly, the radiance measured in the mid-infrared (4 µm) is significantly higher than in surrounding pixels where no thermal anomalies are depicted [1,21].
Figure 6 shows an excerpt of Figure 6 from [1]. It contains results obtained by the MODVOLC algorithm detecting thermal anomalies for different thresholds 2.0, 2.6, and 3.0, respectively, in case of the Anatahan volcano monitoring data. The following formula presents the Local Index of Change of the Environment v ( x , y , t ) representing the degree to which each pixel deviates from its normal behavior normalized by its variability (cf. [1,29]).
v ( x , y , t ) = V ( x , y , t ) V R E F ( x , y ) σ v ( x , y )
An individual observation is modeled by the function V ( x , y , t ) specifying the state of a pixel ( x , y ) at time t. Its normal behavior is expressed by function V R E F ( x , y ) and its variability by σ v ( x , y ) . One can then identify pixels ( x , y ) for which the function exceeds a given threshold T r v ( x , y , t ) . The number of such pixels can be then computed for the entire area of interest A using its characteristic function 1 A , i.e., ( x , y ) A 1 A ( x , y ) = 1 in the discrete case when single pixels are considered and each pixel has measure 1. A thermal anomaly is a pixel with a temperature measurement value above a defined threshold. The total number of thermal anomalies occurring at time t with values above threshold T r is modeled by function a n o m a l i e s ( T r , t ) defined by the following integral:
a n o m a l i e s ( T r , t ) = A 1 { ( x , y ) | T r v ( x , y , t ) } d x d y = R × R 1 A 1 { ( x , y ) | T r v ( x , y , t ) } d x d y
Thus, the number of anomalies is the integral of the product of the characteristic function of set A, corresponding to the area of interest, and of the characteristic function identifying points with temperature above threshold T r . In the continuous case, the value of the integral is equal to the measure of the set of thermal anomalies in the area of interest.

6.4. Specification of Strombolian Effects

Strombolian effects are characterized by cyclic periods of repetitive behaviors. Repetitive gas explosions and temperature peaks occurring in a periodic manner are one of such aspects. These explosions are caused by deep slag-driven explosive activity (cf., e.g., [20]). Those effects include also consecutive magma eruptions. In this subsection, we demonstrate that these kinds of effects can be specified in DC4F. We utilize the temporal properties formulated in the preceding section. The data that we use were obtained on 9 April 2002 during hours of passive and explosive degassing on the Stromboli volcano that was most active at the time [20]. The spectrometer was placed at a distance of 240 m.
Repetitive temperature increases are shown in Figure 7. It shows the diagram presented in [20] (see Figure 1 and Figure S1 there) representing peaks in radiation corresponding to source temperature in Celsius. Figure 7 shows a function g ( t ) with cyclic periods of consecutive value peaks and drops representing the level of radiation. The peaks can reach different maximal values, the drops can reach different minimal values, and their numbers are not fixed in a cycle. The time scale of Strombolian effects is usually hours and days (cf., e.g., [18,20,28]). In Figure 7 and below, we use only numeric time values, as the exact timescale does not play any role in the demonstration and may be changed in an arbitrary manner.
Each peak formation is characterized by a monotone increase of g followed by its monotone decrease. In consequence, a peak can be specified in DC4F by formula M I n c r e a s i n g ( g ) ; M D e c r e a s i n g ( g ) (see Section 4.2). For example, Figure 7 shows a peak corresponding to interval [ a 1 , a 2 ] .
We can also require that the peak values be above a certain threshold t r and the amplitude be above the value of d. These requirements can be formulated as P e a k A b o v e ( g , t r ) and A m p l i t u d e ( g , d ) , respectively. The first formula states that function g reaches a value above threshold t r , and the second one states that its amplitude is at least equal to d (see Section 4.3). In Figure 7, the first peak is above 400 and the amplitude is above 100; of course, these parameters can be adjusted arbitrarily. The fact of reaching such a peak with the constraints concerning values of threshold and amplitude can be expressed by the formula M I n c r e a s i n g ( g ) M D e c r e a s i n g ( g ) P e a k A b o v e ( g , t r ) A m p l i t u d e ( g , d ) . This formula is the logical conjunction of the previous formulas; we abbreviate it as P e a k ( g , t r , d ) . We use capital letters to indicate that functions have functional arguments, not only arguments of type R .
Now, the peaks and the corresponding drops form a repetitive behavior. The left-hand side of Figure 7 shows three peaks, modeling COS bursts, which correspond to the intervals [ a i , a i + 1 ] , for i = 1 , , 3 . We can write the corresponding formula as P e a k ( g , t r , d ) P e a k ( g , t r , d ) P e a k ( g , t r , d ) . It means that the interval may be split into three subintervals such that each satisfies the formula P e a k ( g , t r , d ) . We abbreviate the formula as P e a k ( g , t r , d ) 3 .
In general, for a formula F, n repetitions of the form F F are abbreviated as F n . This formula means that the interval may be split into n subintervals in which F is satisfied. If the number of repetitions is arbitrary, then we write F * . If it equals at least n, then we write F n + , which is equivalent to F n F * .
The peaks occur in cyclic series separated by periods of low volcanic activity. Let formula B e l o w ( g , l t r , l b , u b ) be the abbreviation of ( g l t r ) l b u b . This means that g is always below threshold l t r in a given time interval and that the length of the interval is between time bounds l b and u b (cf. Section 4.3). We can compose the formulas again to specify the behavior shown in Figure 7; for m = 3 and n = 3 , the formula has the form:
{ P e a k ( g , t r , d ) m + B e l o w ( g , l t r , l b , u b ) } n
This Strombolian activity consists of three cycles; each cycle is composed of at least three bursts followed by a period of a low volcanic activity. The three cycles of repetitive bursts are indicated in Figure 7 by brackets below the time axis. The periods of limited volcanic activity are indicated in a similar way.
Strombolian activities may be also detected using satellites equipped with thermal imaging cameras, which sometimes are the source of more reliable indicators than sensors on the ground (cf., e.g., [18,28]). However, if the satellites are not stationary or if visibility is reduced by clouds, then data from thermal imaging cameras cannot be used. We can specify the conditional visibility of thermal peaks by formula v i s i b i l i t y = P e a k ( f , t r , d ) . In this formula, v i s i b i l i t y is a Boolean-valued time-dependent function modeling visibility; it is true if there is visibility and false otherwise. Function f models temperature measurements at a certain frequency. Parameter t r is the corresponding temperature threshold and parameter d is the corresponding amplitude. The formula is conditional and reads as follows: if there is visibility, then a thermal peak is observed. We abbreviate this formula by C o n d P e a k ( f , t r , d ) . Analogously, we can define C o n d B e l o w ( b , l t r , l b , u b ) for the case in which the thermal radiance measurement is below a certain threshold. If the peaks in thermal anomalies coincide with the peaks in COS bursts, then we can transform Formula (14), taking the thermal anomalies into account as well:
{ C o n d P e a k ( g , t r , d ) m + C o n d B e l o w ( g , l t r , l b , u b ) ) } n { C o n d P e a k ( f , t r , d ) m + C o n d B e l o w ( b , l t r , l b , u b ) ) } n
The sometime-modality is used here to specify that we do not require anything for the time periods before and after the characteristic behavior. We may also specify here the minimum and maximum lengths of the periods of low activity, if needed.
Of course, the formulas become more and more complicated as we specify more kinds of behaviors and data, e.g., multisensor data, and constraints. However, the point is that the formulas may be specified in DC4F. Thus, we may precisely formulate fine hypotheses concerning complex behaviors. Furthermore, if implemented, the hypotheses may be verified in respect to existing data. They may also be used to automatically mine the available data, historic or incoming.

6.5. Reasoning in DC4F

In the previous subsections, we presented the capabilities of DC4F for the specification of periodic behaviors and the hypothesis framing. In this subsection, we present a relatively simple example of reasoning in DC4F utilizing the properties of DC4F defined in Section 5.3. The example is somewhat artificial, as we did not find proper examples in the literature, and we do not claim that the formulas truly express actual volcanic behavior. We simplify the proof, as its full version would be rather long. As in the previous section, we skip units of measurement as they do not play any role for the illustration.
Suppose that we know that a Strombolian effect is characterized by two properties occurring within 24 h: a series of COS bursts and an observation of thermal anomalies. The data is on COS burst is collected by sensors on the ground and the data on thermal anomalies are collected by a sensor placed on a satellite. The behavior is specified by functions f C O S ( t ) and f a n o m representing the emission level of COS and the number of observed anomalies, respectively. We use abbreviation C O S B for the formula obtained from formula (14) by substituting f C O S for g, 1 for n, and by setting other parameters somehow. We assume that f a n o m ( t ) = a n o m a l i e s ( t , T r ) for a certain threshold t r (cf. Section 6.3).
Let the Strombolian effect be characterized by at least 10 COS bursts and thermal anomalies of value 5 occurring within 24 h. Moreover, let it be an established fact that 12 COS bursts are always accompanied by thermal anomalies of value 7. The following formulas specify these two properties:
24 C O S B 10 5 f a n o m
24 C O S B 12 7 f a n o m
Now, let the observation made by sensor on the ground be that within 24 h, there were 14 COS bursts, as specified by the following formula:
24 C O S B 14
Suppose that the data on thermal anomalies are not available due to bad weather and, thus, the integral f a n o m cannot be computed. We will show that, despite that obstacle, the Strombolian behavior can be proved.
The formula C O S B 14 can be presented in the form t r u e C O S B 12 C O S B 2 t r u e (we apply here the definition of and unfold the exponent). The formulas C O S B 2 t r u e and t r u e t r u e t r u e are tautologies of the predicate calculus (see point (6) in Section 5.3). Due to these two tautologies and the monotonicity of the chop operator (see point (7) in Section 5.3), we deduce the implication
t r u e C O S B 12 C O S B 2 t r u e t r u e C O S B 12 t r u e
Further, applying the monotonicity property of conjunction (see point (6) two times in Section 5.3) to the above implications, we derive the following implications:
24 C O S B 14 24 C O S B 12 , 24 C O S B 12 7 f a n o m
Moreover, 7 f a n o m 5 f a n o m and C O S B 12 C O S B 10 . From the above implications, we derived the following implication:
24 C O S B 14 L e 24 C O S B 10 5 f a n o m
Antecedent of this implication is the observation (18). The consequent of this implication is the formula to be proved. The consequent follows from the antecedent and the implication by the application of modus ponens rule (see point (8) in Section 5.3). Thus, observation (18) and fact (17) imply that the Strombolian property holds.

6.6. Detecting Similar Behaviors

In this subsection, we consider the methods used in different areas, volcano monitoring included, for detecting similarities between functions and, in particular, between temporal series. Various similarity measures can be used in this case (cf., e.g., [23,24,25,26]). Different types of volcanic activity may be characterized not only in absolute terms, i.e., by general characteristics satisfied by all behaviors, but also relative to one another, or in relation to known examples of behaviors. We consider neither the application of specific similarity measures to specific cases, nor their adequacy, as we are not aiming to identify similarities in volcanic behavior. We are not in the position to judge which similarity measures would be most suitable for the Strombolian eruption patterns. We merely intend to stress that such measures and the corresponding algorithms can be used within the framework of DC4F.
Similarity measures may be formalized by functionals FlSym , which are interpreted as real-valued functions defined on intervals (see Section 5.1 and also the Appendix A). Similarity measures m can be seen as functionals that, for a given interval I and two functions f and g to be compared, return a real value being the measure of similarity. Thus, we can consider the application of a measure functional m to the two measured functions f and g and an interval of interest I: m ( f , g , I ) . Consequently, the result can be presented as a real-valued function on intervals X f , g FlSym , which returns a real value r for the interval I (see Section 5.1 and also Appendix A). The measured level of similarity can be then used in framing descriptions and hypotheses concerning the time series—in particular, concerning volcanic eruptions.
Similarly, the standard Pearson correlation coefficient used in statistics may be integrated into CD4F as well. The correlation coefficient ρ f , g of two functions f, g is specified by integral ( f f ) ( g g ) / ( ( f f ) 2 ( g g ) 2 ) . The integral returns for an interval I a value of type real, provided that the denominator is different from zero. Thus, coefficient ρ f , g depends on the interval in question, and as was the case in the example presented above, it may be represented as a real-valued function on intervals.
In general, some of the similarity functions can be defined directly in DC4F. Others, such as algorithms computing the similarity measures, can be defined simply by the interval mapping operators corresponding to functionals FlSym . Thus, if there is an algorithm computing a similarity measure for intervals, then we can associate interval operator X FlSym with the algorithm: X ( [ a , b ] ) = x if x is the value computed by the algorithm for interval [ a , b ] . The algorithm proposed in [19] can be treated along these lines.

7. Conclusions

In this paper, we proposed a duration calculus for integrable functions (DC4F). It is a natural, conservative extension of the well-known and widely used duration calculus. DC was used in numerous applications to specify various kinds of real-time and hybrid systems, to synthesize controllers, and so on. However, it allows to deal only with state functions, i.e., functions with Boolean values only. It did not aim to specify signals and measurements. The theoretical value of the proposed extension seems to be rather modest; however, its applications appear to be interesting. We are wondering why DC was not developed in this way from the beginning, as this would naturally broaden its application range and would be a consequent thing to do.
DC4F, like DC, is a type of interval logic providing a formal language, uniform mathematical models, and a reasoning system to deal with time series. Both calculi provide axioms and inference rules allowing to reason about properties of systems and to draw conclusions. However, they differ in expressivity, the scope of their applications, and, of course, maturity. Expressivity is achieved, in most cases, at the cost of complexity of the proof system and methods. This is the case of DC and DC4F as well. As the general form of DC is not decidable, i.e., there is no algorithm that decides whether a given formula is provable or not; the extended expressivity of DC4F relative to DC does not change much in that respect. DC, in contrary to DC4F, is aimed exclusively at the specification of durations of properties. The properties are specified by Boolean-valued functions. A great amount of research has been conducted about DC. Model-checking algorithms were created for its restricted versions and various tools were developed. It has been used to synthesize controllers. DC4F allows to integrate arbitrary Riemann integrable functions. Consequently, it provides a consistent general model for general form time series. The models can be handled using integral calculus, computer algebra, and so on. It allows one to specify several factors, parameters, and signals in one coherent language, to model them in a uniform manner, and to reason about them. Consequently, DC4F extends the ideas of DC to a qualitatively new area of applications.
We demonstrated that DC4F can be used to specify various temporal properties of time series, such as monotonicity, boundedness, and periodicity. We used it to specify different aspects of volcano monitoring activities, with a particular emphasis placed on Strombolian effects. DC4F proved to be a framework that is useful for integrating various types of data expressed in terms of time series and diagrams. Usually, in other papers, such types of data are informally related and reasoned about using informal textual descriptions in natural languages, such as English, and some mathematical formulas. This way of handling data is, of course, imprecise and error-prone. Thus, DC4f provides a remedy for this problem in the form of a unifying logic.
DC4F can be used to formulate specifications and hypotheses, reason about them, and validate them. Informally speaking, the difference between validation and deduction is that validation is performed for a concrete data set, on a specific model, semantic deduction concerns all possible data sets and all concrete models, and syntactic deduction relies solely on the application of sound deduction rules. The proposed logic can be combined with other methods, such as algorithms, and in particular with trained neural networks and other methods used in artificial intelligence (cf., e.g., [19]).

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Formal Semantics

To make the paper self-contained, we present in this section the formal semantics of the proposed DC extension. The semantics were already outlined in Section 5.2, but the formal definitions, thought a bit tedious, offer full mathematical precision.
These semantics are closely related to the original semantics of DC. The main difference is that we use general Riemann integrable functions instead of functions with Boolean values. DC4F generalizes the scope of integral operator, but it does not add new logical operators to DC. From the logic point of view, DC4F is a conservative extension of DC. Thus, a formula of DC is a tautology of DC4F if, and only if, it is a tautology of DC. We define the semantics of the formulas defined in Section 5.1. The definition has three parts corresponding to the three syntactic categories defined there: functions F n , terms R e a l , and formulas F o .
The time domain T ime is the set R + consisting of all non-negative real numbers. The discrete case, when the time domain is the set of natural numbers N , can be dealt with as a special case of R + , as was shown in Section 5.2. In particular, if T ime = N , then we assume that the set of IF un consists of functions that change their values at most at natural numbers and are constant elsewhere. Set IF un contains some unary Riemann integrable functions on positive reals. The symbols from the set FSym are interpreted as functions from IF un . For a function symbol f FSym , the corresponding interpretation is an integrable function I I I I [ f ] IF un of one variable of type R .
The semantics of F n is defined in a standard way. Interpretation function I I I I is the key here. Let f, f 1 , f 2 FSym and c CSym . For the terms of category F n , it is defined as follows:
  • I I I I [ 0 ] = 0 , I I I I [ 1 ] = 1 , I I I I [ t r u e ] = 1 , I I I I [ c ] R ,
  • I I I I [ f ] IF un
  • I I I I [ c f ] = I I I I [ c ] I I I I [ f ] ,
  • I I I I [ g ( f 1 , f 2 ) ] = I I I I [ g ] ( I I I I [ f 1 ] , I I I I [ f 2 ] ) , for g { + , , } ; here, the operations + , , are interpreted as addition and subtraction, respectively.
The set I nte consists of all intervals: I nte = { [ a , b ] | a , b T ime a b } . The set V al = { v a l | v a l : VSym R } contains all valuations of variables. Let F ls be a set containing functionals, i.e., functions that map time intervals to real numbers. The functional symbols FlSym (cf. Section 5.1) are interpreted as elements of F ls . We assume that d is the usual measure on R .
  • I I I I [ f ] ( v a l , [ a , b ] ) = a b f d x
  • I I I I [ X ] ( v a l , [ a , b ] ) F ls , for X FlSym
  • I I I I [ t 1 t 2 ] ( v a l , [ a , b ] ) = I I I I [ ] ( I I I I [ t 1 ] ( v a l , [ a , b ] ) , I I I I [ t 2 ] ( v a l , [ a , b ] ) ) , where t 1 , t 2 are terms, { + , , , / } , and I I I I [ ] are interpreted as the addition, subtraction, maximum, or division, respectively.
We say that two valuations v a l , v a l differ at most on variable x if v a l ( y ) = v a l ( x ) for every variable y is different from x. To define the semantics terms, the interpretation function Int needs pairs consisting of a valuation function v a l V al and an interval [ a , b ] I nte .
We define now the satisfaction relation ⊧ for the elements of category F o . The relation has three arguments: a valuation, an interval, and a formula. It is defined as follows:
  • I I I I ( v a l , [ a , b ] ) t 1 t 2 iff the values I I I I [ t 1 ] ( v a l , [ a , b ] ) , I I I I [ t 2 ] ( v a l , [ a , b ] ) ) are in relation I I I I [ ] , where t 1 , t 2 are terms, { < , = } and I I I I [ ] is the inequality or equality, respectively
  • I I I I ( v a l , [ a , b ] ) ¬ F o iff I I I I ( v a l , [ a , b ] ) F o does not hold
  • I I I I ( v a l , [ a , b ] ) F o 1 F o 2 iff I I I I ( v a l , [ a , b ] ) F o 1 or I I I I ( v a l , [ a , b ] ) F o 2
  • I I I I ( v a l , [ a , b ] ) F o 1 F o 2 iff I I I I ( v a l , [ a , m ] ) F o 1 and I I I I ( v a l , [ m , b ] ) F o 2 , for some m [ a , b ]
  • I I I I ( v a l , [ a , b ] ) x F o iff I I I I ( v a l , [ a , b ] ) F o for some valuation v a l which differs from valuation v a l at most on variable x
  • I I I I [ F o n ] ( v a l , [ a , b ] ) if, and only if, either n = 0 and a = b or 0 < n and there exist numbers a 0 , , a n T ime such that a = a 0 < a 1 < < a n = b and I I I I [ F o ] ( v a l , [ a i , a i + 1 ] ) , for i = 0 , , n 1

References

  1. Koeppen, W.C.; Pilger, E.; Wright, R. Time series analysis of infrared satellite data for detecting thermal anomalies: A hybrid approach. Bull. Volcanol. 2011, 73, 577–593. [Google Scholar] [CrossRef]
  2. Gabbay, D.M.; Guenthner, F. (Eds.) Handbook of Philosophical Logic; Springer: Berlin, Germany, 2001–2005. [Google Scholar]
  3. Della Monica, D.; Goranko, V.; Montanari, A.; Sciavicco, G. Interval Temporal Logics: A Journey. Bull. Eatcs 2013, 105, 73–99. [Google Scholar]
  4. Hansen, M.R.; Van Hung, D. A Theory of Duration Calculus with Application. in Chris George, Zhiming Liu, Jim Woodcock, Domain Modeling and the Duration Calculus. Lect. Notes Comput. Sci. 2007, 4710, 119–176. [Google Scholar]
  5. Pering, T.D.; Ilanko, T.; Liu, E.J. Periodicity in Volcanic Gas Plumes: A Review and Analysis. Geosciences 2019, 9, 394. [Google Scholar] [CrossRef] [Green Version]
  6. Castanedo, F. A Review of Data Fusion Techniques. Sci. World J. 2013, 2013, 704504. [Google Scholar] [CrossRef] [PubMed]
  7. Bleiholder, J.; Naumann, F. Data fusion. ACM Comput. Surv. 2009, 41, 1–41. [Google Scholar] [CrossRef]
  8. Liu, M.; Yang, W.; Zhu, X.; Chen, J.; Chen, X.; Yang, L.; Helmer, E.H. An Improved Flexible Spatiotemporal Data Fusion (IFSDAF) method for producing high spatiotemporal resolution normalized difference vegetation index time series. Remote. Sens. Environ. 2019, 227, 74–89. [Google Scholar] [CrossRef]
  9. Meyer, R.; Faber, J.; Hoenicke, J.; Rybalchenko, A. Model checking Duration Calculus: A practical approach. Formal Asp. Comput. 2008, 20, 481–505. [Google Scholar] [CrossRef] [Green Version]
  10. Hansen, M.R.; Brekling, A.W. On Tool Support for Duration Calculus on the Basis of Presburger Arithmetic. In Proceedings of the 2011 Eighteenth International Symposium on Temporal Representation and Reasoning, Lubeck, Germany, 12–14 September 2011; pp. 115–122. [Google Scholar] [CrossRef]
  11. An, J.; Zhan, N.; Li, X.; Zhang, M.; Yi, W. Model Checking Bounded Continuous-time Extended Linear Duration Invariants. In Proceedings of the 21st International Conference on Hybrid Systems: Computation and Control (part of CPS Week) (HSCC ’18), HSCC 2018, Porto, Portugal, 11–13 April 2018; pp. 81–90. [Google Scholar] [CrossRef]
  12. Larsen, K.G.; Rasmussen, J.I. Optimal conditional reachability for multipriced timed automata. Theor. Comput. Sci. 2008, 390, 197–213. [Google Scholar] [CrossRef] [Green Version]
  13. Xiangjun, C.; Peng, D. Formalization Model and Safety Analyses of High Speed Train in CTCS-3 Control Mode. In Proceedings of the 2013 International Conference on Mechanical and Automation Engineering, Jiujang, China, 21–23 July 2013; pp. 161–167. [Google Scholar] [CrossRef]
  14. Ramos, D.B.; e Silva, R.A.B.; Costa, I.C.; Colonese, E.M.; de Oliveira, J.M.P. Modeling conflicts resolution of Unmanned Aircraft System using a lightweight Duration Calculus. In Proceedings of the IEEE/AIAA 30th Digital Avionics Systems Conference, Seattle, WA, USA, 16–20 October 2011; pp. 5A6-1–5A6-9. [Google Scholar] [CrossRef] [Green Version]
  15. Petnga, L.; Austin, M. Ontologies of Time and Time-based Reasoning for MBSE of Cyber-Physical Systems. Procedia Comput. Sci. 2013, 16, 403–412. [Google Scholar] [CrossRef] [Green Version]
  16. Dole, K.; Gupta, A.; Krishna, S.N. Robust Controller Synthesis for Duration Calculus. In International Symposium on Automated Technology for Verification and Analysis; Springer: Berlin/Heidelberg, Germany, 2020. [Google Scholar] [CrossRef]
  17. Ody, H.; Fränzle, M.; Hansen, M.R. Discounted Duration Calculus. In International Symposium on Formal Methods; Springer: Berlin/Heidelberg, Germany, 2016; pp. 577–592. [Google Scholar] [CrossRef]
  18. Corradini, S.; Montopoli, M.; Guerrieri, L.; Ricci, M.; Scollo, S.; Merucci, L.; Marzano, F.; Pugnaghi, S.; Prestifilippo, M.; Ventress, L.; et al. A Multi-Sensor Approach for Volcanic Ash Cloud Retrieval and Eruption Characterization: The 23 November 2013 Etna Lava Fountain. Remote. Sens. 2016, 8, 58. [Google Scholar] [CrossRef] [Green Version]
  19. Dyea, B.C.; Morrab, G. Machine learning as a detection method of Strombolian eruptions in infrared images from Mount Erebus, Antarctica. Phys. Earth Planet. Inter. 2020, 305, 106508. [Google Scholar] [CrossRef]
  20. Burton, M.; Allard, P.; Muré, F.; La Spina Magmatic, A. Gas Composition Reveals the Source Depth of Slug-Driven Strombolian Explosive Activity. Science 2007, 317, 227–230. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  21. Wright, R.; Flynn, L.; Garbeil, H.; Harris, A.; Pilger, E. MODVOLC: Near-real-time thermal monitoring of global volcanism. J. Volcanol. Geotherm Res. 2004, 153, 29–49. [Google Scholar] [CrossRef]
  22. Laiolo, M.; Ripepe, M.; Cigolini, C.; Coppola, D.; Della Schiava, M.; Genco, R.; Innocenti, L.; Lacanna, G.; Marchetti, E.; Massimetti, F.; et al. Space- and Ground-Based Geophysical Data Tracking of Magma Migration in Shallow Feeding System of Mount Etna Volcano. Remote. Sens. 2019, 11, 1182. [Google Scholar] [CrossRef] [Green Version]
  23. Vlachos, M.; Kollios, G.; Gunopulos, D. Discovering similar multidimensional trajectories. In Proceedings of the 18th International Conference on Data Engineering, San Jose, CA, USA, 26 February–1 March 2002; pp. 673–684. [Google Scholar] [CrossRef]
  24. Raket, L.L.; Sommer, S.; Markussen, B. A nonlinear mixed-effects model for simultaneous smoothing and registration of functional data. Pattern Recognit. Lett. 2013, 38, 1–7. [Google Scholar] [CrossRef]
  25. Olsen, N.L.; Markussen, B.; Raket, L. Simultaneous inference for misaligned multivariate functional data. J. R. Stat. Soc. C 2018, 67, 1147–1176. [Google Scholar] [CrossRef] [Green Version]
  26. Koenig, L.; Lucero, J.; Perlman, E. Speech production variability in fricatives of children and adults: Results of functional data analysis. J. Acoust. Soc. Am. 2008, 124, 3158–3170. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  27. Barwise, J. (Ed.) Handbook of Mathematical Logic; Studies in Logic and Foundations of Mathematics; North Holland: Amsterdam, The Netherlands; New York, NY, USA; Oxford, UK, 1982. [Google Scholar]
  28. Corradini, S.; Guerrieri, L.; Stelitano, D.; Salerno, G.; Scollo, S.; Merucci, L.; Prestifilippo, M.; Musacchio, M.; Silvestri, M.; Lombardo, V.; et al. Near Real-Time Monitoring of the Christmas 2018 Etna Eruption Using SEVIRI and Products Validation. Remote. Sens. 2020, 12, 1336. [Google Scholar] [CrossRef] [Green Version]
  29. Pergola, N.; Marchese, F.; Tramutoli, V. Automated detection of thermal features of active volcanoes by means of infrared AVHRR records. Remote. Sens. Env. 2004, 93, 311–327. [Google Scholar] [CrossRef]
Figure 1. s i n -function and the corresponding state function 1 { x | 0 s i n ( x ) } .
Figure 1. s i n -function and the corresponding state function 1 { x | 0 s i n ( x ) } .
Remotesensing 13 02236 g001
Figure 2. Monotone function f ( t ) .
Figure 2. Monotone function f ( t ) .
Remotesensing 13 02236 g002
Figure 3. Functional limit.
Figure 3. Functional limit.
Remotesensing 13 02236 g003
Figure 4. Constant limits and amplitude.
Figure 4. Constant limits and amplitude.
Remotesensing 13 02236 g004
Figure 5. TADR with its lower and upper approximation.
Figure 5. TADR with its lower and upper approximation.
Remotesensing 13 02236 g005
Figure 6. Number of thermal anomalies detected by the hybrid algorithm over MODVOLC using thresholds of 2.0, 2.6, and 3.0, respectively.
Figure 6. Number of thermal anomalies detected by the hybrid algorithm over MODVOLC using thresholds of 2.0, 2.6, and 3.0, respectively.
Remotesensing 13 02236 g006
Figure 7. Periodic peaks in radiation corresponding to source temperature.
Figure 7. Periodic peaks in radiation corresponding to source temperature.
Remotesensing 13 02236 g007
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Kosiuczenko, P. An Interval Temporal Logic for Time Series Specification and Data Integration. Remote Sens. 2021, 13, 2236. https://doi.org/10.3390/rs13122236

AMA Style

Kosiuczenko P. An Interval Temporal Logic for Time Series Specification and Data Integration. Remote Sensing. 2021; 13(12):2236. https://doi.org/10.3390/rs13122236

Chicago/Turabian Style

Kosiuczenko, Piotr. 2021. "An Interval Temporal Logic for Time Series Specification and Data Integration" Remote Sensing 13, no. 12: 2236. https://doi.org/10.3390/rs13122236

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop