1. Introduction
The sense of touch plays an essential role in our lives and allows us to safely interact and control our contacts with our surroundings. The sense of touch not only allows us to characterize and evaluate contacts, but it also allows us to locate contacts. It provides us with a tactile image of our interactions with the world. The human brain represents this tactile image in the somatosensory cortex combining proprioceptive information with cutaneous information and thus assembling an internal model, that is, the homunculus that associates the postural information of the body with the spatial location and tactile information of the skin receptors [
1,
2].
A deeper study of the sense of touch reveals its fundamental differences in comparison to our other senses. These differences eventually break down to two facts. The sense of touch is a highly
distributed sense. The sense of touch spreads out its 5 million cutaneous receptors [
2] (mechanoreceptors, thermoreceptors, nociceptors) through the whole body in large areas up to around 2 m
[
3] and conveys tactile information through around 1.1 million ascending nerve fibers [
4] to the somatosensory cortex. In contrast to the sense of touch, vision is a very concentrated sense. The human eye accommodates approximately 137 million receptors (130 million rods and 6.5 million cones per retina) [
5] and approximately 1 million nerve fibers [
6] in an area of around 1100 mm
. In comparison, the sensing area of the human skin is around 1000 times larger than the sensing area of both eyes. Consequently, the term
large-scale can be attributed to large resolutions in vision and to
large-areas in tactile sensing. Secondly, the sense of touch acquires its information through
physical contacts. In contrast to vision or audition which protect their receptors against physical contacts, the sense of touch depends on these contacts with the environment to acquire information. Tactile receptors cannot acquire tactile information through distant observations, they need to access the information in the area of the contact. These two facts profoundly impact how the sense of touch organizes sensing in nature [
7] and thus provide the guidelines to effective electronic skin (e-skin) designs [
8,
9].
The developments of e-skin currently focus on two different kinds of skins similar to the two found in humans [
10,
11]. One skin is mainly located in the inner sides of the hands and the foot-soles while the other skin covers the remaining parts of the body. The skin of our hands and foot-soles covers rather small regions and targets very high spatial resolution, supersensitive sensing, shear-force, and vibration sensing, and slip detection [
10]. Research towards realizing this kind of skin in technical systems deemphasizes the challenges of distributed sensing systems and focuses on high sensing density and the challenges connected with supporting physical contacts. Such as the development of fingertip e-skins that have been investigated in the works in References [
12,
13,
14]. Developing e-skin to cover large areas emphasizes the distributed nature of the sense of touch. While large area skin may slightly deemphasize high spatial resolution, it has to specifically focus on efficient and feasible methods to deploy, connect, and determine the poses (location and orientation in 3D space) of a
large number of spatially distributed
tactile sensors over
large areas. We define these e-skin system as large-area skin systems (LASSs).
The sense of touch employs specialized receptors for sensing mechanical, thermal and noxious (potentially dangerous/destructive) stimuli [
4,
10,
11,
15,
16,
17]. These receptors are tuned to sense specific stimulus features which focus on deciphering distinct pieces of contact/object properties. The dominant stimulus features are normal pressure (Merkel cell receptors [
10,
11,
15]), horizontal motions and slip (Meissner corpuscle receptors [
10,
11,
15]), vibrations (Pacinian corpuscle receptors [
10,
11]), stretch (Ruffini endings [
10,
11]) and proximity/approach (tylotrich-hair receptors [
10,
11]). The receptors’ stimulus feature selectivity is influenced by the location of skin receptors in different dermal layers, by the deployment pattern, and by mechanical filter mechanisms. The receptors’ selectivity samples complex multi-modal stimuli to simple distinct uni-modal stimulus features allowing for the encoding of complex tactile information and selective attention. Peripheral axons connect these tactile receptors to the nerve cell bodies in the dorsal root ganglion next to the spinal cord [
4,
10,
18,
19,
20], forming together the tactile part of the peripheral somatosensory system. Throughout all its parts, the somatosensory system maintains the somatotopic order of the conveyed and relayed information, that is, the relative spatial structure of its receptors is reflected in the order of its nerve fibers and nerve cells [
4,
10]. Ascending along the spinal cord towards the somatosensory cortex, the somatotopically ordered information of different body parts is assembled to a comprehensive sensory representation of the whole body, the homunculus [
1,
4,
10,
20]. These two dominant principles of biology for realizing the complex sense of touch, namely decomposing complex contact features of physical interactions to simple uni-modal stimulus features, and maintaining and assembling relative spatial information, naturally impacted the development of large-area e-skin systems [
8,
21,
22].
Research in the last decade focused on different scalable multi-modal e-skin systems suited for large-area applications [
23,
24,
25,
26,
27,
28,
29]. Some of these works [
25,
28,
29] led to viable solutions targeting the challenges of distributed e-skin systems. However, these works have not yet sufficiently addressed the major challenge to handle a large amount of tactile information that an upscaling of e-skin would produce to cover large areas. The lack of a systematic approach for solving this remaining challenge explains why LASSs are not yet as available and widely utilized as other sensing systems such as auditory or visual.
Neuromorphic systems that employ event-driven information handling to increase the processing efficiency and to reduce the latency of systems have already been introduced more than two decades ago in the works of References [
30,
31]. Over the years, different implementations and event representations emerged optimizing the event-driven approach for different applications. The most notable approaches towards Event-Driven Systems (EDSs) are the neuromorphic Address-Event-Representation (AER) [
30,
31], the Send-on-Delta Principle (SoDP) [
32,
33,
34], and more recently, the Asynchronous Encoded Skin (ACES) [
35]. Some of these EDSs have been used in applications with e-skin [
35,
36,
37], supporting their effectiveness and efficiency. However, none of these event-driven e-skin approaches fully consider the implications and challenges of effective deployment over large areas and its eventual system integration. This work does not target to introduce yet another principle for representing and handling events, it rather consolidates the findings of these previous works, homogenizes their underlying theory, and assesses their applicability in LASSs. Besides the introduction of the realization principles that we propose for flexible, feasible, and efficient event-driven LASSs, this work also aims to foster the exchange of perspectives and requirements of the different research fields, especially within the context of event-driven information handling and large-area tactile sensing.
Our initial works [
38,
39,
40] demonstrated the effectiveness of the event-driven approach for handling the large amount of tactile information of LASSs in various experimental setups. Now, this work intends to provide a solid foundation for realizing and understanding event-driven LASSs in general with an emphasis on three points. First,
flexibility/deployability, that is, neuromorphic hardware may be utilized but is not strictly required and the system can be adjusted with a reasonable amount of effort. Second,
feasibility/effectiveness, that is, the presented principles are implementable, scale, and enable real-world applications. Third,
efficiency, that is, the event-driven system outperforms its clock-driven counterpart with respect to network traffic and CPU load. A summary of our implementations and the experimental results of our work delivers the impacts and validation of the presented design and realization principles.
Outline
Section 2 presents the challenges of LASSs, surveys existing solutions and design concepts, and introduces the remaining challenges.
Section 3 presents the concept of efficient event-driven information handling, and analyzes existing approaches for EDSs and their applicability in LASSs.
Section 4 presents the designs for realizing event-driven sensing in e-skin systems, including the design of event generators and their correct parameterization.
Section 5 presents the designs for realizing event-driven information handling for LASSs in standard computing systems.
Section 6 summarizes and connects the designs to the challenges they tackle, and how their realization impact the efficiency and effectiveness in our e-skin implementation. Finally, we conclude in
Section 7.
3. Efficient Event-Driven Information Handling for Large-Area Skin Systems
This section introduces the concepts of efficient event-driven information handling and examines EDSs regarding their applicability in LASSs. Complex systems, artificial or biological, combine sensation, communication, processing, and actuation to achieve desired system behaviors; they need to
handle information. Handling information not only refers to processing information, it rather addresses the complete information flow in a perception-action loop [
46] or a system control loop, that is, acquiring, transmitting, processing, and acting on information [
47]. In this sense, achieving the desired system behavior fundamentally depends on the fast, efficient, and loss-less representation, processing, and exchange of information.
The representation and conveyance of information in biology follows schemes quite different to the principles utilized in technical systems. The representation in biology could neither be described as analog nor digital. Biology uses binary action potentials, often also termed spikes or events, to represent and convey information between neurons [
47]. These action potentials alone convey only a very limited amount of information. Action potentials in nerve fibers are either present or not, they do not convey any additional information, for example, in their shape and so forth. Information in biology is encoded in the spatio-temporal activity patterns in massively parallel nerve bundles or populations of neurons [
11,
17]. These neural codes employ a set of different information representation principles, which are: (1) type code, (2) spatial code, (3) rate code, (4) temporal code, and (5) latency code [
11,
17,
48]. All these principles show that biology uses structure and time (spatio-temporal features) to encode and represent information. Although there has been a long debate if biology employs rate coding or temporal coding [
48,
49], the nervous system employs both. All the previously discussed information representation principles can be observed in the somatosensory system, and thus also in the sense-of-touch.
The limitations of traditional approaches, especially in applications which need to handle a large amount of information within short periods, triggered the development of spike-based bio-inspired and neuromorphic systems to mirror the incredibly high information handling efficiency of biological systems. These neuromorphic systems employ spike-based information representation principles in sensing, communication, and processing [
30,
31,
36,
37,
50,
51,
52,
53,
54,
55,
56,
57,
58,
59,
60,
61,
62,
63,
64,
65,
66,
67,
68] and report major improvements in efficiency and speed, that is, the systems require less power, and handle information with higher temporal resolution and less latency. Spike-based neuromorphic systems exploit all neural codes found in biology and are realized on customized hardware with special asynchronous circuits, optimized for spike-based signals and processing, and mimicking neural computation principles. In this work, we focus less on mimicking biology in all its aspects of information representation and computation principles, we rather focus only on two basic principles which we belief contribute significantly to improve the efficiency and speed of systems, and connect well to traditional information theory. First, rather than concentrating on different neural codes to convey information with spikes, we summarize and simplify the concept to the principle of
event-driven information handling, that is, only novel information, the events, drive the whole system. This simplified principle can still be considered as biologically inspired. Even employing different coding principles, the activity in populations of neurons is usually triggered (or inhibited) by the arrival of stimuli. This is particularly true for the afferents of sensory neurons. Their peripheral axons only generate action potentials when their receptors register stimuli and are otherwise silent, regardless to the neural code they utilize for conveying information [
11,
47]. Following this line of thought, we furthermore neglect rate coding and equalize event-driven systems with novelty-driven systems. We solely concentrate on the sparsity aspect of spiking neural networks since we target to exploit its
temporal redundancy reduction and
saliency enhancement capabilities. Temporal redundancy reduction and saliency positively contribute to system efficiency since less information has to be handled.
The biological information representation and handling principles have inspired the development of bio-inspired technical systems which might prove feasible for tackling the challenge of efficiently handling information in LASSs. In the following sections, we first formally describe the characteristics of Clock-Driven Systems (CDSs) (
Section 3.1) and Event-Driven Systems (EDSs) (
Section 3.2) to clarify terminology and definitions, and to eventually provide a homogenized presentation which simplifies the comparison of both systems. Then, we discuss the limits of clock-driven systems (CDSs) in
Section 3.3. Afterward, we formalize and homogenize the common novelty-based event generation principle found in neuromorphic spike-driven systems and in systems that implement the time-discrete Send-on-Delta Principle (SoDP), in
Section 3.4. Event generation principles do not differ between different implementations of event-driven systems, but event representations do. Therefore,
Section 3.5 surveys the most prominent approaches towards representing events in EDSs. Based on this survey, we proceed with a comparative study and select the most applicable event representation for LASSs in
Section 3.6.
3.1. Clock-Driven Systems (CDS)
Time-discrete systems follow the Nyquist-Shannon sampling theorem that defines constraints for the lossless conversion of time-continuous signals to time-discrete signals. The Nyquist-Shannon sampling theorem [
69] states that any bandwidth limited time-continuous signal
with
can be represented by a time-discrete signal
with
and
as long as the sampling frequency
surpasses the bandwidth
B of
by at least a factor of two:
Consequently, time-discrete systems ensure that a clock with at least a frequency of
drives the information handling such that the constraint of the Nyquist-Shannon sampling theorem is fulfilled at all times and information loss is zero through all stages of the system. These systems are termed Clock-Driven Systems (CDS). Standard computing systems are CDSs and usually either implement the von Neumann [
70] or the Harvard architecture [
71].
3.2. Event-Driven Systems (EDS)
At the beginning of
Section 3, we outlined that many works introduced spike-based neuromorphic systems mimicking the information representation and neural computation principles found in nature [
30,
31,
36,
37,
50,
51,
52,
53,
54,
55,
56,
57,
58,
59,
60,
61,
62,
63,
64,
65,
66,
67,
68]. Since these systems are spike-driven, it is valid to describe them as event-driven systems (EDSs). In this work, however, we focus only on one particular kind of EDSs, that is, novelty-driven systems [
51,
52,
59,
72]. In contrast to the general group of spike-based neuromorphic systems, which use all neural coding principles, novelty-driven systems focus on sparse information representation, that is, neural time coding, and follow the idea that only novel information should drive a system. For simplicity, in this work whenever we refer to EDSs, we refer to the subgroup of novelty-driven systems (NDSs) and solely focus on their dominant characteristics.
Novelty-driven systems relate very well to one core statement in information theory [
69]. In many applications, following the guideline of the Nyquist-Shannon sampling theorem, that is, realizing CDSs, results in a stream of samples containing a huge amount of uncontrolled redundant information. Temporally redundant information is especially apparent when the system continuously samples the same value. Systems can avoid temporal redundancy when they only handle novel information, that is, when they are only active when sensors register activity. Shannon’s information entropy and his source coding theorem formally describes the information rate of information sources, and thus how much information, or respectively redundancy, a signal contains [
69]. The information entropy
evaluates the probabilities
of symbols
, that encode the information produced by the information source
and is measured in bits. Thus, if an information source
continuously emits the same signal level, then all probabilities
besides one are zero and the information entropy
is zero. Consequently, the signal does not contain information and repeatedly sampling it just produces uncontrolled redundancy and wastes resources. On the other hand, if
is constantly changing, then the probabilities
are more distributed and the information entropy
is well beyond zero. Thus, sensors that register substantial changes in
produce a considerable amount of information.
In summary, novelty corresponds to activity and is expressed by changes. Thus, systems driven by events, where events solely express novel information, avoid uncontrolled temporal redundancy throughout all stages, and gain efficiency simply due to the fact that information is represented more sparsely and less information has to be gathered, transmitted, and processed.
3.3. Handling Large Amount of Information
Up to date, most technical systems are clock-driven and handle information strictly following the Nyquist-Shannon sampling theorem. CDSs have been successfully applied in many different applications such as the high-speed precision motor control in hard drives [
73], control of robot arms [
74], and many more. These systems not only prove that CDS provide viable solutions, they usually achieve excellent performance.
Nevertheless, CDSs that require fast reaction times in real-time applications depend on high-bandwidth information resulting in high sample rates. High sample rates only marginally impact systems that handle a limit amount of information of a few sensors, for example, the position control of electrical motors. However, when systems have to handle a large amount of information with high sample rates, their realization may become challenging or even infeasible [
8,
75,
76,
77,
78,
79]. The challenge of handling a large amount of information with high sample rates emerges in systems that need to react fast to visual, auditory, or tactile (cutaneous) information. All these senses employ a large number of sensors. To handle a large amount of information with high sample rates, CDSs have to employ very powerful transmission and computing systems with severe demands on power and space [
75,
76,
79]. While power and space mainly cause monetary and environmental disadvantages in stationary systems, both factors tremendously impact systems in mobile applications [
32,
33,
76,
77,
79].
3.4. Event Generation/Event-Driven Sensing
EDSs, or more specifically NDSs, gain their efficiency by coupling their activity to the information rate of information sources, see
Section 3.2. By this means, EDSs succeed in canceling the temporal redundancy observed in the sampled information of CDSs. Thus EDSs are more efficient than CDSs. On average, EDSs need to process less information, thus induce less latency, and consume less power.
This section focuses on novelty detection, that is, on the procedure and formalisms to decide if signals, for example, of sensors, provide valuable information. Since the amount of information a signal provides correlates with the amount of its changes, see
Section 3.2, a novelty detector is basically a change detector that triggers activity, or respectively the generation of events. NDSs and change detectors have been introduced in two different research fields, in the field of energy efficient sensing, signal processing and control from the information and control theory point of view [
32,
33,
34], and in the field of neuromorphic systems from the mimicking neural codes and neural computing principles point of view [
30,
31,
36,
50,
51,
52,
55,
61].
Neuromorphic systems realize the change-detectors for their event generators in analog circuits [
51,
52,
55,
59,
80]. These circuits directly detect changes in analog and thus in time- and range-continuous signals
and convert them to events
. The change detectors of the other field [
32,
33,
34] are realized in
compound architectures [
33], that is, the information source, for example, a sensor, is first sampled with a sample rate of
at time instances
,
. Then, a digital, time-discrete change detector transforms the samples
to events.
Analog change detectors monitor a signal
and track the change of this signal until the accumulated change exceeds a predefined threshold
. Therefore, the change detector integrates the derivative
of the input signal
until the integration reaches or passes the threshold
at time instance
. At this time instance
, the information of the monitored signal is classified as novel and the change detector triggers the creation of an event
that contains this novel information. The more precise the occurrence time
of the event matches with the time instance of the actual signal change, the higher the
temporal precision of the event generator is. Thus, any non-deterministic or non-constant delay between the actual signal change and the occurrence of the event reduces the temporal precision. Considering the properties of Riemann integrals, we can derive a relationship between an integral of a signal
and its average
in an interval
:
Combining Equations (
3) and (
4) to
indeed shows that the change detector evaluates the accumulated average change of the input signal since the occurrence of last event
at time instance
, simplifying Equation (
3) to:
Thus, a change detector in fact triggers the creation of events whenever the difference between the signal that caused the last event and the currently monitored input exceeds a specified limit . Because the analog change detector can trigger events at any time, its temporal resolution is only limited by the bandwidth of the analog circuits and can theoretically achieve an almost infinite equivalent sampling rate.
Digital change detectors monitor the samples
with
,
of a signal
. The underlying principle for detecting changes is similar to the analog change detector. A digital change detector also integrates the derivative
of the sampled input signal
but the integration process is digital and clock-driven. The integration continues until it passes the threshold
at time instance
or respectively at the sample
. At time instance
, the sample is classified as novel and the change detector triggers the creation of an event
. Similarly to Equation (
4), we can derive a time-discrete relationship between the average and the integral of a signal
which combined with Equation (
7) leads to
and simplifies to:
Thus, the digital time-discrete change detector has to remember the signal sample of the previous event and compare it to the current sample . Then, when the absolute difference between these two samples exceeds the threshold, the change detector triggers the creation of the event and updates the memory with the current sample. In comparison, digital change detectors naturally exhibit a lower temporal resolution than their analog counter parts and consume more power since the monitoring is clock-driven. The temporal resolution of digital change detectors is limited by the sampling rate of the digital system.
3.5. Event Representation / Information Encoding
This section focuses on the different event representations and transmission techniques found in the most notable approaches towards realizing EDSs. The surveyed representations include the most common representation applied in neuromorphic systems [
30,
31], the representation applied in the event-driven sensing and control community [
32,
33,
34], and one recently proposed representation technique developed for e-skin systems [
35]. Event representation is tightly coupled with encoding information in events, since these events eventually carry information in EDSs. This subsequent study will discuss the applicability of the different approaches towards realizing an effective novelty-driven event handling system for LASSs.
3.5.1. Address Event Representation (AER)
The Address Event Representation (AER) [
30,
31] is one of the first bio-inspired systems that has been developed for representing and conveying events in technical systems. Originally, AER has been developed for the communication between spiking artificial neurons in VLSI ICs (Very Large Scale Integrated Circuits) [
30,
31] and rigorously takes advantage of high-speed digital asynchronous parallel bus systems that are readily available on such devices. AER realizes event-driven point-to-point connections between event generators and consumers. But instead of encoding the information source by individually wiring each event generator to an event consumer, as nature does, the AER employs addresses to identify sources and time-multiplexes these addresses onto a common asynchronous parallel bus. A valid address on this bus represents an event and this address identifies the event generator of that event. The AER exploits the superior communication speed of technical systems per wire in integrated systems (>100 MBit/s) in comparison to nerve fibers (≈1 kBit/s) to reduce the number of wires and still achieve a comparably high temporal resolution. Besides the address bus, the AER employs a request, and an acknowledge line to realize a self-timed bus arbitration mechanism that avoids any clock resynchronization. AER represents events through addresses. To convey information, AER event generators can employ encoding principles that are similar to neural codes. The AER can encode the type of events in additional address lines such that an AER event generator can create change events, events that indicate an increase or decrease of the observed signal (up and down events), respectively [
51,
52]. To encode absolute values instead of increments, AER event generators can employ low and high events where the time between these two events represents the encoded value [
55]. More recent research introduced serial AER [
81] to reduce the wiring complexity in more distributed EDSs [
37]. Serial AER packs the event address into a datagram on a serial bus, which reduces the number of wires at the cost of reducing the temporal resolution. The AER is an established bio-inspired protocol for representing and transferring events and could successfully demonstrate its use in auditory [
50], visual [
51,
52], and force [
36] sensing applications and in event-driven processing hardware such as SpiNNacker [
57], TrueNorth (IBM) [
58], BrainScaleS [
62], ROLLS [
63], DYNAP [
66], Loihi (Intel) [
67], and BrainDrop [
68].
3.5.2. Send-On-Delta Principle (SoDP)
The Send-on-Delta-Principle (SoDP) [
32,
33,
34] is a hybrid system which exploits standard digital hardware to realize EDSs. The SoDP has been first proposed for efficiently reducing the number of transmissions in wireless, battery powered, and widely distributed sensors networks [
32,
33]. In these application scenarios, the reduction of the number of transmissions is essential to increase the life time of the distributed sensors. SoDP systems employ time-discrete digital change detectors, that can even be implemented in software, to trigger the creation of events. The SoDP represents events by packets (event packets) that are transported in asynchronous arbitrated networks. An event packet usually contains the ID of its information source and the absolute value of the signal at the creation time of the packet. Similar to the AER, the presence of an event packet signifies the availability of novel information and drives the information handling of the system. Since the SoDP not only conveys the information source but also the absolute magnitude of the signal, the bit rate of SoDP events is higher than the bit rate of AER events. Thus, the temporal resolution of SoDP systems is lower than that of AER systems when both system employ the same transmission rate. However, decoding the information of SoDP events is by far less complex than for AER events whenever an application enforces clock-driven information, for example, in low-level control of closed hardware/software systems, such as in robots. Furthermore, SoDP systems do not require specialized hardware and can be realized with off-the-shelf sensors and well established information transport layers. Actually, CDSs which possess the flexibility to modify their information handling procedures and can employ asynchronous transmission and processing capabilities can be turned into EDSs without the need of any hardware modifications. The SoDP provides great flexibility and availability for realizing low cost and scalable event-driven applications. However, the hardware of SoDP systems is clock-driven such that SoDP cannot reach the temporal precision and energy efficiency of systems that employ event-driven neuromorphic hardware.
3.5.3. Asynchronous Encoded Skin (ACES)
The Asynchronous Encoded Skin (ACES) [
35] is an event-driven hardware system that has been recently proposed to realize a neuro-inspired artificial nervous system. The ACES implements a many-to-one protocol for transmitting and representing events. Rather than time-multiplexing events to a common transportation medium such as in the AER or SoDP, the ACES fuses events as pulse signatures onto one single common wire. A pulse signature is a sequence of pulses within a constant time window, where the relative timing of the pulses encode the signature. Similar to the addresses in AER and the IDs in SoDP, the pulse signature identifies the information source and represents the event. Interestingly, the ACES manages to fuse these pulse signatures on one single wire without applying time-multiplexing or requiring an arbitration method. The ACES superimposes all pulse signatures by applying a logical OR operation on the pulses. In order to minimize the probability that pulse signatures cannot be separated, the set of pulse signatures has to have minimal auto-correlation and cross-correlation. Theoretically, ACES could support up to 138,000 information sources per wire, when a pulse signature has a time window of 1 ms, consists of 10 pulses, and each pulse lasts for 100 ns. In such a setup, ACES events have a latency of at least 1 ms when employing up and down events, or respectively a latency of at least 2 ms when employing low and high events for time-coding absolute values. However, the temporal precision of ACES is extremely high (in the range of the pulse length) since no arbitration mechanisms impair the temporal precision with non-deterministic uncertainties in delay which correlate with the utilization of a shared communication medium. Additionally, since the ACES event transmission is arbitration-less, connection redundancy could be introduced by adding wires as long as the propagation speed and the reflection of high speed connections do not degrade the transmission quality. While the hardware for encoding and representing events in ACES has a low complexity, acquiring a set of pulse signatures is more demanding and the demerging of ACES events is very complex. An ACES event demerger has to repeatedly correlate the currently observed pulse pattern of superimposed events with all pulse signatures of the set. Therefore, the ACES event demerger has to keep a history of received pulses which matches the length of a pulse signature. To preserve the temporal information of the events, the demerger has to perform this correlation continuously for each potential event in parallel within the time length of a pulse. For the example numbers mentioned earlier, the demerger would at least have to perform continuously 138,000 correlations with a bit length of 10,000 bits (assuming a pulse can be represented by one bit) within 100 ns. The ACES event decoder is clearly not event-driven since the decoding has to be driven by the pulse time, and the information in the superimposed pulse stream is not salient. Nevertheless, the demerged events can drive the information handling in subsequent stages.
3.6. A Comparative Study of Effective Event Representation for Large-Area Skin Systems
Section 3.5 introduced and described existing event representations and their realization in EDSs. These realizations have been successfully validated and proved their efficiency in various applications. To assess which EDS approach suits best for LASSs, we examine and discuss their performance within the relevant properties, see
Table 1. All properties are assessed considering the challenges of LASSs summarized in
Section 2.1.
The predominant factor for proposing EDSs for LASSs is their information handling efficiency. Next to efficiency and latency, an effective EDS for LASSs has to consider also robustness, deployability, wiring complexity, and sensor poses. To tackle these challenges an EDS should support the principles of modularity and self-organization.
Table 1 summarizes the most important properties of EDSs. It also includes the properties of nerve bundles and CDSs to enable comparisons with the biological reference and with the state-of-the-art approach in technical systems. In the following assessment, we focus on the properties’ most important implications for LASSs before selecting the most suitable EDS.
3.6.1. Connection, Bandwidth, and Arbitration
Standard AER employs parallel asynchronous buses with a handshaking mechanism. This bus can provide very high bandwidths but is unidirectional and utilizes many wires. Therefore serial-AER has been introduced to reduce the wire count at the cost of a slight reduction in bandwidth. The AER time-multiplexes events on a common bus and thus has to employ very complex arbitration mechanisms to ensure fair sharing and to optimize temporal precision. The arbitration latency depends on the bus utilization which is non-deterministic and correlates with the global information rate. The complex handshaking mechanisms require special hardware and are rather inflexible and hard to change.
The communication protocol of ACES has been specifically designed to reduce the complexity of merging the events of multiple information sources to a common transport medium. ACES has clear advantages over AER with respect to wire count and flexibility. ACES exploits the uniqueness of its events to completely avoid any arbitration. Avoiding arbitration, ACES achieves a lower circuit complexity, and a higher temporal precision than AER. Furthermore, information sources and wires can be added/removed in ACES without the need to consider and adjust a complex arbitration system. This ability greatly increases the robustness and flexibility of ACES in comparison to AER. However, the bandwidth of ACES is several orders of magnitude lower than in AER.
In contrast to AER and ACES, SoDP does not rely on a specifically designed transport medium for conveying SoDP events. Any protocol and hardware that asynchronously conveys packets is suitable for SoDP. The hardware independency allows for the extreme flexibility, robustness, and the rapid implementation of SoDP-based EDSs. Nevertheless, the SoDP has to time-multiplex and arbitrate events to share a common communication medium. But in contrast to AER, the arbitration is much more flexible, less complex and can be achieved by standard network protocols. Naturally, the temporal precision of SoDP is lower than in AER and ACES, since SoDP events require more bits and thus occupy a shared bus for a longer time. The higher bit count per event in SoDP reduces the overall communication bandwidth below the one of AER but still well beyond ACES.
3.6.2. Representation and Encoding of Events
In AER and ACES the events solely encode the source and the type of information while the information itself is encoded in the timing/occurrence pattern of the events. As a result, an event can be represented by few bits and only demands a tiny part of the transport capacity on a bus rendering these systems highly efficient. Nevertheless, the event conveyance system has to exert a high temporal precision since the information is encoded in the timing of events. To achieve such high temporal precision, both systems rely on specialized hardware.
On the other hand, SoDP does not rely on neural codes and does not only encode the source and type of information into and event but also the information itself, that is, an absolute value. Consequently, the temporal precision is less critical than in AER and ACES but still important. The occurrence time of a SoDP event still encodes the occurrence time of the information. The downside of SoDP events is that they require more bits and thus more communication bandwidth. While SoDP still constitute a major improvement towards tackling efficiency and low-latency, however, it cannot achieve the efficiency, latency and temporal precision of AER or ACES.
3.6.3. Decoding of Events
Ideally, for handling information, EDSs should never experience the need to decode events to other representations such as samples of absolute values. Research in EDSs actually advances into that direction and progress in event-driven hardware and event-driven information handling develop to an emerging new research field [
30,
31,
36,
37,
50,
51,
52,
53,
54,
55,
56,
57,
58,
59,
60,
61,
62,
63,
64,
65,
66,
67,
68] that will provide highly efficient information handling systems. However, many applications still, and for the foreseeable future will, rely on clock-driven information handling algorithms. Thus, to really profit from EDSs in applications, EDSs have to provide efficient event decoding mechanisms.
Decoding events in AER and ACES is complex and requires special hardware to convert events and their time encoded information to a format that can be processed by standard computer systems, that is, tagging high precision time stamps to AER events [
51,
52,
55], or gray scale values [
52,
55]. While decoding AER events is complicated, demerging ACES events is really challenging. In AER and SoDP, the events on a common bus are salient and may directly drive the decoding or the handling of information in event-driven algorithms. However, ACES events are not salient and an event demerger has to constantly monitor and detect events in a massively parallel clock-driven process, even if the subsequent information handling stages are event-driven. The necessity of always decoding ACES events constitutes a negative impact on the information handling efficiency of ACES. In general, decoding AER or ACES events negatively impacts the efficiency.
Since SoDP events already encode absolute values, their decoding is simple. SoDP events are salient and their decoding can be event-driven. Thus, the decoding of SoDP events is more efficient than in AER or ACES.
3.6.4. Effective Event Representation for Large-Area Skin Systems
Overall, the SoDP emerges as the most suitable EDS for tackling all the challenges towards realizing LASSs. While AER and ACES have advantages in achieving more efficient solutions than SoDP, they have also deficiencies. They require special hardware, complex decoding, and a complex setup, thus hinge on overall deployment. The clear advantage of SoDP lies in its great flexibility since it does not depend on specific hardware and can thus exploit standard hardware for the rapid realization of complex but yet efficient EDSs.
6. Results
This section connects the background, theory, and realization principles towards event-driven LASSs introduced in this work.
Our first realization of an e-skin system focused on providing a scalable, flexible, and robust platform with the multi-modal sensing capabilities to enable applications requiring tactile sensation similar to the human sense of touch [
29,
82]. Although, the early realization follows the principles of modularity and self-organization, it reaches its limits in large-area applications. Extending these initial works with the event-driven principles for LASSs presented in this work (
Section 3,
Section 4 and
Section 5). Hence, we succeeded in the realization of a new LASS that is low-latency while being computationally efficient [
38,
39,
40].
Table 3 relates the challenges of LASSs with the implemented design and realization principles, and then summarizes and highlights the impacts of the principles on the implementation. For example, modularity reduces the number of connection at least by a factor of 50, or the event-driven system effectively reduces the communication traffic by around 90%.
The implementation of our event-driven LASS has been first verified on our robot platform TOMM [
83,
92]. TOMM has both of its arms and grippers covered with e-skin. The validation of the e-skin has been performed with one of TOMM’s UR5 arms. This experimental approach proved as good trade-off since the UR5 arm is covered with a reasonable amount of skin cells (253 skin cells, 2024 sensors) while a CDS is at the same time still capable to handle the intermediate amount of tactile information. The ability to perform experiments in clock-driven and event-driven mode with identical systems enables fair and comprehensive evaluations and comparisons of EDSs with their clock-driven counterparts [
39]. Actually, the design of a fully clock-driven control system operating with clock-driven tactile information of 253 skin cells is still feasible. This allows us to fully assess the performance of a complete EDS with sophisticated event consumers and controllers in meaningful applications [
93].
The effectiveness of the EDS in comparison to its clock-driven counterpart in various experimental evaluations and applications [
38,
39,
40,
83,
87,
88,
89,
93], including control [
39,
87,
88,
89,
93], can be assessed by analyzing the indicators: 1) the
network traffic between the deployed skin system and the information handling system; and 2) the
CPU load of the perception module in the information handling system. While deeper analysis and evaluations with many additional indicators (e.g., performance of control) have been performed previously, this work focuses solely on the indicators network traffic and CPU load to provide a comprehensive overview of all results in the context of validating the presented design and realization principles.
The evaluation results of the e-skin system on the UR5 arm are presented in
Figure 7. Both indicators show a substantial reduction for the event-driven approach in comparison to the clock-driven reference. While both indicators are constant in CDSs, even when the system is idle and the information rate is zero, in EDSs, both indicators depend on the information rate or respectively on the event rate.
Figure 7 additionally summarizes our evaluation results for the scaled-up event-driven e-skin system we later deployed on our humanoid robot H1 [
40,
83]. This large-area e-skin system incorporates 1260 skin cells and 10,080 sensors [
83] on 0.87 m
, fitting well into the field of large-scale tactile sensing where other works [
28,
94] at most deploy 2,208 sensors of one modality on an area of around 0.07 m
. The evaluation of the LASS on H1 demonstrates that the superior efficiency of the event-driven is now required to avoid the loss of information. In clock-driven mode, the LASS on H1 overloads the information handling system and around one quarter of the tactile information is lost. The clock-driven LASS on H1 already fails in perceptive information handling and a further behavioral information handling to realize applications is definitely not feasible. Thus, effective LASSs are only feasible in event-driven setups.
7. Conclusions
This work presented the foundations for realizing, designing, and understanding large-area event-driven e-skin systems for effective applications. Homogenizing the perspectives on event-driven systems of the different research fields and consolidating the challenges of large-area skin systems provided the basis for assessing existing event-driven approaches. This assessment identified the send-on-delta principle (SoDP) as the most applicable method for large-area skin systems (LASSs). The send-on-delta principle offers a high system flexibility combining well with the measures for improving the deployability of large-area skin systems. The subsequent presentation of designs, supported with the previously consolidated theory, include a novel set of guidelines for tuning the novelty-threshold of event generators, modular event generators, event decoders, and a novel systematic design approach towards realizing event-driven information handling systems on standard computing systems. The presented design principles have been validated by outlining their impacts on our large-area skin implementations and by consolidating their experimental results. The experimental evaluations compared the event-driven large-area skin system with the networking and computational performance of its clock-driven counterpart. The event-driven large-area skin system outperforms the clock-driven one on average by a reduced network load of 94% and a reduced CPU load of 81%. In its large-area setup with 10,080 sensors, the clock-driven large-area skin system computationally saturated the computer system and could not operate without information loss (25% of all information was lost). Whereas the same system driven by events did not saturate and experienced very little losses, even under the same experimental condition with major tactile stimulation, that is, covering a humanoid robot with a cloth and moving and pressing the cloth, in total only 80 events (⋘0.1% information loss). Although, both systems observed information losses, these losses are largely originating from overflowing queues. Computer systems store arriving information in queues until the operating system schedules a thread to retrieve the information. Thus, when a computer system saturates, that is, more information arrives than can be processed per time instance, then these queues overflow and information is lost. Consequently, the continuous saturation of a clock-driven system causes continuous information loss. Rather than saturation, event rate peaks can cause sporadic information losses in event-driven systems. Event-driven computer systems lose information when more events arrive at the same time than events are fitting into the queues. The information loss in a saturated clock-driven computer system cannot be mitigated, but the information loss in event-driven computer systems can be reduced by increasing the queue sizes at the cost of an increased latency at event rate peaks. Overall, the presented foundations lead to scalable, efficient, and flexible e-skin systems, capable of handling large amounts of information, and improving the feasibility of complex large-area tactile applications, for instance in robotics.