Convective–Stratiform Identification Neural Network (CONSTRAINN) for the WIVERN Mission

Mustich, Federico; Battaglia, Alessandro; Manconi, Francesco; Kollias, Pavlos; Parodi, Antonio

doi:10.3390/rs17152590


by Federico Mustich ¹, Alessandro Battaglia ¹,²,*, Francesco Manconi ¹, Pavlos Kollias ³,⁴ and Antonio Parodi ⁵

¹ Department of Environment, Land and Infrastructure Engineering (DIATI), Politecnico di Torino, 10129 Turin, Italy
² Earth Observation Science Group, Department of Physics and Astronomy, University of Leicester, Leicester LE1 7RH, UK
³ School of Marine and Atmospheric Sciences, Stony Brook University, Stony Brook, NY 11794, USA
⁴ Department of Atmospheric and Oceanic Sciences, McGill University, Montreal, QC H3A 0G4, Canada
⁵ CIMA Foundation, 17100 Savona, Italy
* Author to whom correspondence should be addressed.

Remote Sens. 2025, 17(15), 2590; https://doi.org/10.3390/rs17152590

Submission received: 9 June 2025 / Revised: 12 July 2025 / Accepted: 16 July 2025 / Published: 25 July 2025

(This article belongs to the Special Issue AI Applications to Remote Sensing of Cloud and Precipitation: Monitoring, Modeling, and Prediction)


Abstract

The WIVERN mission promises to deliver the first global observations of the three-dimensional wind field and the associated cloud and precipitation structure in a wide range of atmospheric phenomena, including isolated thunderstorms, tropical cyclones, mid-latitude frontal systems, and polar lows. A critical element in the development of the mission’s wind products is the differentiation between stratiform and convective regions. Convective regions are defined as those where the magnitude of the vertical wind velocity exceeds 1 m/s. This work introduces CONSTRAINN, a family of U-Net-based neural network models that utilise all WIVERN observables (vertical profiles of reflectivity and Doppler velocity, as well as brightness temperatures) to reconstruct convective wind activity within the Earth’s atmosphere. Results show that the retrieved convective/stratiform masks are well reconstructed, with an equitable threat score exceeding 0.6. Ablation experiments further reveal that Doppler velocity signals are the most informative for the reconstruction task.

Keywords:

convective/stratiform separation; Doppler radar; U-Net

1. Introduction

This work aims to exploit recent advances in deep learning classification to develop a convective/stratiform (C/S) classification algorithm specific to the WIVERN (Wind Velocity Radar Nephoscope) mission. The mission’s conically scanning Doppler radar provides a 2D curtain of reflectivity and Doppler velocity data, as well as collocated 1D measurements of brightness temperature. The goal is to identify pixels that are convective, i.e., those with vertical velocity magnitudes exceeding 1 m/s, with downdrafts and updrafts combined into a single probabilistic score. Our main contribution is the first deep learning framework trained on end-to-end WIVERN simulations that fuses Doppler velocity, reflectivity, and brightness temperature measurements within a U-Net adapted to the conically scanned W-band geometry and regresses a continuous, physically interpretable convective–stratiform index suitable for direct ingestion by the mission’s Level-2 retrieval chain.

The WIVERN mission concept, one of the two candidate missions competing for selection as Earth Explorer 11 within the European Space Agency’s FutureEO programme, promises to revolutionize the study of clouds with its fast conically scanning 94 GHz Doppler radar, which covers an 800 km swath at an incidence angle of about 42 degrees [1,2,3]. This configuration allows WIVERN to measure in-cloud winds at the native horizontal resolution of 1 km along track, with approximately 600 m vertical resolution. WIVERN Doppler velocity measurements provide information on the motion of the cloud and precipitation particles along the line of sight (LoS). Because WIVERN observes the atmosphere at a slant incidence angle, the Doppler signal represents a combination of both horizontal and vertical air motions, as well as the hydrometeors’ sedimentation velocity.
In regions where vertical motions are negligible and the hydrometeor fall speed can be accurately estimated, it is possible to retrieve the horizontal wind component projected along the horizontal line of sight ($V_{HLoS}$). This information can be used, for example, in data assimilation systems to improve numerical weather prediction [4,5]. To derive this wind product, it is essential to distinguish between atmospheric regimes where the vertical velocity (w) can be considered negligible (defined as $|w| \leq 1$ m/s), referred to as stratiform, and those where w is significant, known as convective. In stratiform regions, $V_{HLoS}$ can be directly derived under the assumption that vertical motion is minimal. Conversely, in convective regions, if $V_{HLoS}$ can be reconstructed from nearby stratiform regions, it may then be possible to estimate the vertical wind component. These unique in-cloud wind products will then further provide the following:

Full vector wind estimates within clouds over the 800 km swath, by combining the forward and backward radar looks, offering an unparalleled perspective on cloud dynamics [6,7].
Insights into convective organisation and anvil morphology, by combining the LoS winds with radar reflectivity to derive convective mass fluxes and assess radiative impacts [3].
Advanced understanding of the processes governing the formation, organisation, and intensification of mesoscale convective systems, tropical cyclones, and mid-latitude windstorms [8].

The convective/stratiform (C/S) classification is also of general interest for scientific purposes as stratiform and convective regimes differ in two fundamental ways, as follows:

1. Formation mechanisms: convective precipitation is associated with strong vertical motions and the growth of hydrometeors via coalescence and/or riming, whereas stratiform precipitation occurs in regions with much weaker vertical motion, dominated by vapour deposition and aggregation.
2. The distinct microphysical processes associated with each regime result in differing diabatic heating structures, which, in turn, influence large-scale atmospheric circulation in different ways [9].

Over the past 30 years, many methodologies have been developed to separate convective and stratiform regimes. Drawing on data from missions such as TRMM, GPM, CloudSat, and EarthCARE (for details on atmospheric radars, see Battaglia et al. [10]), the scientific community has gained considerable experience in classifying deep hydrometeor layers as either stratiform or convective using only the reflectivity measured by spaceborne radars, typically through echo–object classification schemes. For low-frequency radars (such as the Ku-band radar on board the TRMM and GPM observatories), the fundamental concept is that, in stratiform conditions, there is a smooth transition from the solid to the liquid phase at the freezing level, which is marked by a bright band in the radar reflectivity [11]. In contrast, convective profiles are characterized by high reflectivities, often exceeding 50 dBZ, that extend to altitudes above 10 km without any evidence of a transition region at the freezing level [12].

Radars operating at the frequency adopted by WIVERN (94 GHz) are subject to saturation of the return signal due to non-Rayleigh effects (a 94 GHz radar rarely detects echoes above +20 dBZ [13]), significant signal attenuation [14], and multiple scattering effects that can distort the signal [15]. Despite these challenges, several studies use features of the CloudSat CPR reflectivity profile near the cloud top to identify convective cores [16,17]. The underlying rationale is that high radar reflectivities at overshooting altitudes indicate large particles lofted to those heights, which is only possible in the presence of strong updrafts. A key limitation of the spaceborne C/S classifications proposed so far is that they are mainly based on vertical profiles of reflectivity. These approaches generally lack direct dynamical information such as Doppler velocities, and make limited use of the spatial texture of the reflectivity field.

In addition to spaceborne approaches, substantial efforts have been made to classify convective and stratiform (C/S) regions using ground-based radar observations. Early attempts relied on rule-based heuristics, using radar reflectivity thresholds and pattern recognition techniques. The Steiner–Houze–Yuter (SHY95) algorithm identifies convective cores based on peak reflectivity and local neighbourhood contrast [18], while later fuzzy-logic methods extend the idea to three-dimensional volumes [19]. These methods are simple and fast, but struggle with bright-band artifacts and varying radar geometries.

In recent years, supervised models have gradually replaced fine-tuned threshold methods. Ref. [20] trained a k-nearest-neighbour classifier on WSR-98D Doppler fields and achieved a 10–15% skill boost over SHY95. Neural networks are now frequently used for C/S classification from geostationary observations. For example, ref. [21] built a convolutional neural network (CNN) that detects overshooting tops linked to severe convection and is able to discriminate between intense and ordinary convection. Ref. [22] showed that gradient-boosted trees fed with spectral visible and infrared data outperform traditional texture metrics for convective region detection.

For passive microwave observations, ref. [23] showed that a suite of machine-learning models trained on GPM Microwave Imager (GMI) brightness temperatures can already separate convective, stratiform, and mixed precipitation with 90–94% global accuracy. Ref. [24] applied a Bayesian ResNet to GPM–GMI microwave brightness temperatures, achieving >90% accuracy in distinguishing convective and stratiform precipitation while also providing per-pixel uncertainty estimates. Both results underscore the value of data-driven approaches for precipitation-type retrievals.

The U-Net encoder–decoder backbone has become the de facto standard for pixel-level classification tasks. For example, Hoeller et al. [25] applied a vanilla U-Net to identify convective cold pools, while Han et al. [26] reported similar improvements when using a U-Net-based nowcasting model to forecast 30-min radar precipitation. More recently, Zhang and He [27] proposed an ensemble of lightweight U-Nets for processing FY-4B geostationary satellite imagery, achieving inference latencies below 100 ms per frame while maintaining a probability of detection (POD) greater than 0.70.

Beyond convective/stratiform (C/S) discrimination, the U-Net architecture has been successfully adapted for a variety of geophysical classification tasks, including cloud typing, land-cover mapping, and severe weather prediction. For instance, the 1D-CloudNet, a one-dimensional nested U-Net, combines Himawari-8 radiance data with CloudSat-derived labels to classify nine cloud categories at the nadir [28].

Hybrid encoder designs have also enhanced the accuracy of land-cover segmentation in multispectral imagery [29], while multi-feature fusion U-Nets have improved overall classification performance across diverse surface imaging scenes [30].

Remarkably, U-Net variants have even been applied to generate spatiotemporal tornado risk maps, by leveraging multivariate fields from numerical weather prediction (NWP) models [31]. A U-Net backbone has also been adopted to retrieve tropical cyclone inner-core wind fields from combined microwave and infrared imagery, achieving aircraft-like skill in reconstructing inner-core winds [32]. Additionally, Cao et al. [33] proposed Nowcastformer, a Transformer-augmented U-Net that exploits multi-resolution radar and satellite inputs to enhance precipitation nowcasting, while Zhang et al. [34] integrated residual and attention mechanisms into a lightweight U-Net to deliver real-time nowcasts.

2. Simulations of WIVERN Observables

The WIVERN instrument, with its conically scanning wide-swath radar (Figure 1a), represents a major technological innovation, unifying the following three advanced satellite sensing capabilities into a single system: range-resolved Doppler velocity, reflectivity measurements, and passive microwave observations. These are integrated through a unique radar–radiometer concept, enabling co-located active and passive measurements to maximize scientific synergy.

During the ESA Phase-0 and Phase-A studies, an instrument simulator was developed that simulates all three WIVERN observables from both atmospheric and surface targets, based on successive refinements [2,35,36] of the backbone simulator proposed in [37]. In brief, the simulator takes output from cloud-resolving models, which provide 3D fields of winds, hydrometeors, temperature, and water vapor, and translates it into 94 GHz stimuli (i.e., scattering properties such as the 94 GHz extinction, scattering, and backscattering coefficients, single-scattering albedo, and asymmetry parameter). Then, each scene is illuminated by the WIVERN antenna and scanning pattern for any given orbit. The radar observables are simulated accounting for the sampling rate, the sensitivity, and the specific pulse scheme of the instrument (details in [2]).

WIVERN’s most innovative measurement will be the LoS Doppler velocity ($V_{LoS}$). Because of the slant angle of observation, this quantity will be affected by:

1. The horizontal line of sight (HLoS) wind velocity ($V_{HLoS}$), i.e., the horizontal wind along the horizontally-projected LoS direction;
2. The vertical wind velocity, w;
3. The radar reflectivity weighted terminal velocity of the hydrometeors ($V_{T}^{D}$) [38].

While the latter contribution can generally be estimated based on the temperature and strength of the radar backscattered signal, the first two contributions are generally entangled. The WIVERN fundamental equation linking the line of sight (LoS) Doppler velocity ($V_{LoS}$) with the other three variables is given by

V_{LoS} = V_{HLoS} \sin(\theta_{I}) + \underbrace{(w + V_{T}^{D})}_{V_{z}^{D}} \cos(\theta_{I})

(1)

and is illustrated in Figure 1b.
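As a minimal sketch of how Equation (1) can be used, the snippet below evaluates the forward projection and then inverts it for $V_{HLoS}$ under the stratiform assumption w = 0. The function names, the sign convention for the fall speed, and the numerical values are illustrative assumptions and are not taken from the WIVERN processing chain.

```python
import numpy as np

THETA_I_DEG = 42.0  # nominal incidence angle quoted in the text


def v_los(v_hlos, w, v_t, theta_deg=THETA_I_DEG):
    """Forward model of Eq. (1): project horizontal and vertical motion on the LoS."""
    theta = np.deg2rad(theta_deg)
    return v_hlos * np.sin(theta) + (w + v_t) * np.cos(theta)


def v_hlos_stratiform(v_los_meas, v_t, theta_deg=THETA_I_DEG):
    """Invert Eq. (1) for V_HLoS, assuming w = 0 (stratiform assumption)."""
    theta = np.deg2rad(theta_deg)
    return (v_los_meas - v_t * np.cos(theta)) / np.sin(theta)


# Illustrative (made-up) values in m/s; the sign of v_t is an assumed convention.
v_t = -1.0
stratiform_obs = v_los(v_hlos=20.0, w=0.0, v_t=v_t)
recovered = v_hlos_stratiform(stratiform_obs, v_t)

# If the pixel is actually convective (w = 2 m/s), the same inversion is biased
# by w / tan(theta_I), which is why a C/S mask is needed before the retrieval.
convective_obs = v_los(v_hlos=20.0, w=2.0, v_t=v_t)
bias = v_hlos_stratiform(convective_obs, v_t) - 20.0
```

In this example an unflagged vertical velocity of 2 m/s maps into an error in the retrieved horizontal wind of 2/tan(42°) m/s, slightly larger than the vertical velocity itself.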

A Dataset of Tropical Cyclone Simulations

The training data used in this study were generated using the WIVERN end-to-end simulator. Simulations were driven by atmospheric conditions derived from a mesoscale numerical weather prediction system. Specifically, a WRF (Weather Research and Forecasting) model simulation of Hurricane Milton was used to create the dataset. These data include the complete three-dimensional thermodynamic state of the atmosphere (profiles of temperature, pressure, water vapor, and winds) and the three-dimensional structure of the mass contents of the different hydrometeor species (snow, rain, graupel, hail, cloud, and ice). These variables are converted into radar stimuli (backscattering coefficient, extinction coefficient, and asymmetry parameter) via pre-computed 94 GHz scattering look-up tables (details in [37]). The WRF dataset spans the period from 6 October 2024 at 10:00 UTC to 8 October 2024 at 00:00 UTC, with output intervals of one hour (39 hourly snapshots in total).

During this time, the cyclone evolved from a tropical storm to a Category 5 hurricane. Each hourly snapshot captures a domain of approximately 1250 × 1250 × 20 km³ centred around the cyclone eye, with a horizontal resolution of roughly 1.5 km and a vertical resolution of approximately 500 m (finer below 3 km). For each snapshot, 200 s simulations (equivalent to 40 full antenna rotations) were performed, with the time window centred on the moment when the satellite’s ground track passed closest to the hurricane eye. For each snapshot, overpasses were placed by translating an ascending ground track passing exactly over the eye from −6 to +6 degrees in longitude in steps of 1 degree (13 tracks in total). Figure 2 shows a representation of the WIVERN sampling strategy for Hurricane Milton. The final dataset was produced by randomly selecting combinations of these 13 tracks across the 39 hourly snapshots, yielding approximately 80 simulation runs in total. This approach ensured a rich diversity of atmospheric scenarios across a wide range of hurricane intensities.

From the simulation output, the key variables that compose the model inputs are extracted: the V-channel brightness temperature $T_{B}^{V}$ (one-dimensional), the H-channel brightness temperature $T_{B}^{H}$ (one-dimensional), the Doppler velocity $V_{LoS}$ (two-dimensional), and the reflectivity $Z_{m}$ (two-dimensional), with a horizontal spacing of 1 km and a vertical spacing of roughly 70 m. In addition, the simulator outputs the “true” vertical component of the wind field, $w_{LoS}$, projected along the instrument LoS. To obtain the physical vertical velocity, $w_{LoS}$ is divided by $\cos(42^{\circ})$, reflecting the nominal incidence angle of the conically scanning beam (Figure 1a).

An example of a simulated overpass is illustrated in Figure 3 and Figure 4. Figure 3 is a zoomed view that highlights the dense WIVERN sampling strategy. The total water path (TWP) is indicative of areas with deep hydrometeor layers and of vertically extended liquid water columns due to strong vertical air motions. The WIVERN scan is superimposed, color-coded by the H-channel brightness temperature $T_{B}^{H}$. Magenta contours highlight the actual convective regions, where the maximum vertical wind speed along the column exceeds 3 m/s.

Figure 2. Representation of a WIVERN overpass over Hurricane Milton in the Gulf of Mexico on 7 October 2024 at 23:52 UTC. The track of Hurricane Milton is shown, color-coded by storm intensity, from 5 October 2024 at 18:00 UTC to 10 October 2024 at 00:00 UTC. Clouds are shown in white, based on geostationary infrared data for 8 October 2024 at 00:00 UTC. The square marker indicates the sub-satellite point closest to the cyclone eye (diamond marker). The cyan dashed line is the satellite ground track, while the gray dotted line represents the conical scan (the whole swath is highlighted by the cyan shaded region). The curtains of relevant quantities output by the simulator along the sector highlighted in yellow are plotted in Figure 4.

Figure 4 shows the three main radar products ($Z_{m}$, $V_{LoS}$ and $T_{B}$) and corresponding relevant quantities (TWC, $V_{HLoS}$, and $V_{z}^{D}$) along a segment of the WIVERN slanted vertical cross-section. In Figure 4a,b, the eye and eyewall of the hurricane are clearly identifiable at the centre of the plot. The eye is characterised by a column of clear air with near-zero wind velocity, while the surrounding eyewall is marked by high TWC, which leads to signal extinction in the $Z_{m}$ field. Generally, $Z_{m}$ correlates well with TWC: above the freezing level (approximately 5 km altitude), large anvil clouds containing high ice water content are visible, while rain bands with abundant low-level hydrometeors are evident at lower altitudes near the eyewall regions. Even within the eye, low clouds occasionally contribute to weak radar returns at lower levels.

Figure 4c,d show that the Doppler velocity signal is dominated by horizontal winds, displaying the characteristic dipole pattern of cyclonic circulation—winds of opposite sign appear on either side of the eye. In Figure 4f, convective vertical motions are apparent in the eyewall region, while below approximately 5 km, the vertical velocity is enhanced by precipitation fall speeds. Finally, Figure 4e reveals that highly convective regions are typically associated with lower brightness temperatures, consistent with deep, cold cloud tops.

In total, the dataset spans a distance of nearly $5.5 \times 10^{6}$ km along the satellite ground track. For storage and mini-batching purposes, the data curtain is divided into 10,912 non-overlapping segments, each covering 500 km along-track. All variables are stored in NetCDF4 format using 32-bit floating-point precision. Figure 4 illustrates an example of a single chunk extracted from the full dataset.
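The segmentation step can be sketched as follows, assuming a curtain stored as a (height, along-track) array with the 1 km horizontal spacing stated above, so that a 500 km segment holds 500 columns. The function name and the choice to drop a trailing remainder are assumptions made for illustration.

```python
import numpy as np


def split_curtain(curtain, seg_len=500):
    """Split a (height, along_track) curtain into non-overlapping segments.

    A trailing remainder shorter than seg_len is dropped (assumption).
    Returns an array of shape (n_segments, height, seg_len).
    """
    n_height, n_track = curtain.shape
    n_seg = n_track // seg_len
    trimmed = curtain[:, : n_seg * seg_len]
    # (height, n_seg, seg_len) -> (n_seg, height, seg_len)
    return trimmed.reshape(n_height, n_seg, seg_len).transpose(1, 0, 2)


# Toy curtain: 4 range gates, 1250 km along-track, 32-bit floats as in the text.
demo = np.arange(4 * 1250, dtype=np.float32).reshape(4, 1250)
segments = split_curtain(demo)

# Consistency check on the figures quoted in the text: 10,912 segments of 500 km.
total_km = 10_912 * 500  # 5,456,000 km, i.e. nearly 5.5 million km
```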

Although the dataset, based on simulations of Hurricane Milton, provides a meteorologically rich test bed with prominent convective activity, convective regions remain a minority class within the full 5.5-million-kilometre dataset. Across the entire record, only 0.62% of the data correspond to areas with vertical wind speeds exceeding 3 m/s, with an additional 3.67% falling within the range of 1–3 m/s. This reflects the well-documented sparsity of strong updraughts and downdraughts compared with widespread stratiform regions. To counter the resulting class imbalance, we adopt a weighted binary cross-entropy loss, assigning each convective pixel ten times the weight of a stratiform pixel. A complementary probability-threshold calibration at inference further balances detection and false-alarm rates (see Section 3.5).
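A weighted binary cross-entropy of this kind can be sketched as below. The text does not specify how the factor of ten is applied to a soft target, so this sketch makes an assumption: the weight multiplies the positive term of the loss, the same convention as the `pos_weight` argument of PyTorch's `BCEWithLogitsLoss`. NaN targets (no radar echo) are excluded, and the function name is hypothetical.

```python
import numpy as np


def weighted_bce(pred, target, pos_weight=10.0, eps=1e-7):
    """Class-weighted BCE over valid pixels.

    pred   : predicted probabilities in [0, 1]
    target : soft mask in [0, 1], NaN where there is no echo
    The positive (convective) term is scaled by pos_weight (assumed convention).
    """
    valid = ~np.isnan(target)
    p = np.clip(pred[valid], eps, 1.0 - eps)  # avoid log(0)
    y = target[valid]
    loss = -(pos_weight * y * np.log(p) + (1.0 - y) * np.log(1.0 - p))
    return loss.mean()


# One convective pixel, one stratiform pixel, one pixel without echo.
target = np.array([1.0, 0.0, np.nan])
pred = np.array([0.5, 0.5, 0.3])
loss_weighted = weighted_bce(pred, target)               # (10 ln 2 + ln 2) / 2
loss_plain = weighted_bce(pred, target, pos_weight=1.0)  # ln 2
```

With this weighting, a missed convective pixel costs ten times as much as an equally wrong stratiform pixel, which pushes the network away from the trivial all-stratiform solution.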

3. Methodology

3.1. Link Between WIVERN Observables and Convective Identification

All three WIVERN observables (94 GHz reflectivities, line-of-sight Doppler velocities and 94 GHz brightness temperatures) contain valuable information related to the presence of convection. This statement is supported by insights gained from previous missions such as CloudSat and ongoing missions like EarthCARE, both of which employ cloud radars operating at the same frequency.

94 GHz Reflectivity Profile. Numerous studies have demonstrated that high radar reflectivity values near cloud tops observed by the CloudSat CPR (e.g., values exceeding 10 dBZ above 10 km altitude) are effective indicators of convective cores [16,17]. Within CloudSat data, three commonly applied criteria are used to identify deep convection:

1. The CPR cloud mask (2B-GEOPROF product) must exceed a value of 20.
2. There must be a continuous radar echo extending from below 2 km to above 10 km in altitude.
3. The echo-top height of the 10 dBZ reflectivity contour must exceed 10 km. A reflectivity of 10 dBZ is typically considered a proxy for the presence of precipitation-sized particles in convective clouds [39]. The extent to which such large particles are lifted towards the cloud top serves as an indirect measure of updraught intensity [40].
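The three criteria above can be sketched for a single profile as follows. This is an illustrative reading of the list rather than the procedure of the cited studies: the function name is hypothetical, the strict comparison for the cloud mask follows the wording "exceed", and "continuous echo" is interpreted as every range bin between 2 and 10 km being flagged as cloudy.

```python
import numpy as np


def is_deep_convection(height_km, cloud_mask, z_dbz,
                       mask_min=20, z_min=10.0, low_km=2.0, high_km=10.0):
    """Apply the three deep-convection criteria to one vertical profile."""
    height_km = np.asarray(height_km, dtype=float)
    z_dbz = np.asarray(z_dbz, dtype=float)

    # Criterion 1: bins whose cloud-mask value exceeds the threshold.
    cloudy = np.asarray(cloud_mask) > mask_min

    # Criterion 2: echo below low_km, above high_km, and in every bin in between.
    between = (height_km >= low_km) & (height_km <= high_km)
    continuous = (cloudy[height_km < low_km].any()
                  and cloudy[height_km > high_km].any()
                  and cloudy[between].all())

    # Criterion 3: highest cloudy bin with Z >= z_min must lie above high_km.
    strong = cloudy & (z_dbz >= z_min)
    echo_top = height_km[strong].max() if strong.any() else -np.inf

    return bool(continuous and echo_top > high_km)


# Synthetic profile with 1 km bins centred at 0.5, 1.5, ..., 14.5 km.
h = np.arange(0.5, 15.0, 1.0)
mask = np.where(h <= 12.5, 30, 0)       # cloudy up to 12.5 km
z_deep = np.where(h <= 11.5, 12.0, -5.0)     # 10 dBZ echo top at 11.5 km
z_shallow = np.where(h <= 8.5, 12.0, -5.0)   # 10 dBZ echo top at 8.5 km
mask_gap = mask.copy()
mask_gap[5] = 0                          # clear layer at 5.5 km
```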

An example extracted from CloudSat data is shown in Figure 5. The black dots indicate locations along the CPR track that meet the deep convection criteria. The CPR reflectivity profiles at two such locations are shown in Figure 5c (dashed lines). There are distinct differences between these profiles: the blue one shows a clear signature of multiple scattering [15], with no evident transition between the solid and liquid phase at the melting layer, whereas the cyan one has a sharp transition at about 4 km with a large reflectivity gradient between 1 and 4 km, a signature of rain attenuation [41]. Profiles in more stratiform regions (red and magenta continuous lines), by contrast, show a strong positive vertical gradient of reflectivity below the freezing level.

94 GHz Doppler Velocity. Observations from nadir-looking airborne radars (e.g., Heymsfield et al. [42]) and the recently launched EarthCARE mission [43] have revealed increased variability in vertical Doppler velocities in the presence of convective motions. In such environments, strong updraughts and downdraughts often occur in close proximity, resulting in significant spatial variability in the Doppler velocity measurements. The EarthCARE Doppler radar, which is nadir-pointing, provides direct measurements of the vertical Doppler velocity ($V_{z}^{D}$ in Equation (1)). Recent findings by Galfione et al. [43] confirm that this variability is a reliable indicator of convective activity.

In contrast, WIVERN performs conical scanning at an incidence angle of 42°. Although the vertical component of the wind is attenuated by a factor of 0.74 due to the projection onto the line of sight (as described in Equation (1)), the LoS Doppler velocities from WIVERN will still be sensitive to the rapid fluctuations in vertical velocity (w) commonly found in convective regions.

94 GHz Brightness Temperature. Previous studies employing microwave radiometers have demonstrated that the presence of precipitation-sized ice particles leads to a depression in brightness temperatures ($T_{b}$) at higher frequencies (≥37 GHz), relative to the warmer background [44,45,46]. Among the various types of ice particles, graupel plays a key role in causing this depression at 94 GHz. Its presence in the atmospheric column is linked to the riming process, which is typically intensified by strong updraughts [47,48].

This characteristic is evident in CloudSat $T_{b}$ observations: Figure 5a clearly shows a substantial drop in brightness temperature to below 200 K in the vicinity of the convective core. Although 94 GHz $T_{b}$ measurements from CloudSat have been used to advance understanding of ice microphysics [49], they have, surprisingly, not been widely exploited in studies of deep convection. Our simulations further support these findings, with $T_{b}$ depressions reaching values below 100 K in intense convective cores.

Taken together, these results highlight the significant potential of all three WIVERN observables—94 GHz radar reflectivity, Doppler velocity, and brightness temperature—for convection identification. A distinct advantage in WIVERN’s case is that all three measurements are beam-matched, ensuring spatial and angular consistency in their retrievals.

3.2. Convective/Stratiform Mask

In our simulation framework, the first step involves estimating the WIVERN sampling volume-averaged vertical air motion. This is achieved by applying the antenna pattern weighting function to the modelled vertical velocities within the radar’s sampling volume. The resulting averaged vertical air motion serves as the reference truth for subsequent analysis.

Importantly, our models are not trained to reproduce the exact magnitude of the vertical velocity. Instead, the reference vertical velocity is first transformed into a smoothed convection–stratiform mask, which serves as the training target. The models are then trained to learn this classification structure rather than predict precise velocity values.

For each pixel value w of the reference vertical velocity matrix, a corresponding target mask value m is obtained by mapping it to $[0, 1]$ with a piecewise-linear rule as follows:

m = \begin{cases} 0, & \text{if } |w| \leq 1 \text{ m/s} \\ 1, & \text{if } |w| \geq 3 \text{ m/s} \\ \frac{|w| - 1}{2}, & \text{otherwise.} \end{cases}

(2)

These thresholds were chosen because 1 m/s is the value commonly used in the literature to separate stratiform and convective motions [50], while 3 m/s can be considered representative of moderate convection. The range in between can thus be regarded as a transition region between the two regimes. Note that absolute values of w are considered, hence ignoring the direction of vertical motion (i.e., no distinction is made between updrafts and downdrafts). m is set to NaN in regions that produce reflectivities below the WIVERN sensitivity (−25 dBZ). The resulting mask is a floating-point image with values between 0 and 1, where values close to 1 mark vigorous convection.
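Equation (2) amounts to a clipped linear ramp in |w|, which can be sketched in a few lines. The function name and the example values are illustrative; the thresholds and the −25 dBZ sensitivity are those quoted in the text.

```python
import numpy as np


def cs_mask(w, z_dbz, w_lo=1.0, w_hi=3.0, z_min=-25.0):
    """Soft convective/stratiform mask of Eq. (2).

    w     : reference vertical velocity in m/s (sign is ignored)
    z_dbz : reflectivity in dBZ, used only to blank pixels below sensitivity
    """
    w = np.asarray(w, dtype=float)
    # Linear ramp from 0 at |w| = w_lo to 1 at |w| = w_hi, clipped outside.
    m = np.clip((np.abs(w) - w_lo) / (w_hi - w_lo), 0.0, 1.0)
    return np.where(np.asarray(z_dbz, dtype=float) < z_min, np.nan, m)


w_demo = np.array([0.5, -2.0, 4.0, 2.0, -1.0, 3.0])
z_demo = np.array([0.0, 0.0, 0.0, -30.0, 0.0, 0.0])
m_demo = cs_mask(w_demo, z_demo)
```

For these inputs the mask is 0 for the two weak pixels, 0.5 for the 2 m/s downdraft, 1 for the pixels at or above 3 m/s, and NaN for the pixel below the sensitivity limit.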

3.3. Pipeline Overview and Network Architectures

Each 500 km segment of data is stored as an individual NetCDF file, which contains the four input channels—vertical and horizontal brightness temperatures, reflectivity, and Doppler velocity—along with the computed target mask.

To ensure consistency in data representation, the brightness temperature variables are tiled into 2D tensors, providing a common spatial shape across all input channels.

The dataset is divided into a training set and a cross-validation set using a 9:1 ratio. Reflectivity values below −25 dBZ are capped at −25 dBZ, and min–max normalization is applied across all variables to standardize the input range for model training (Figure 6).
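The preprocessing described above (tiling the 1D brightness temperatures, capping reflectivity, min–max scaling, and a 9:1 split) might look roughly like the sketch below. The normalisation ranges in `RANGES`, the function names, and the random shuffling before the split are hypothetical choices made for illustration; the text does not state the actual ranges or the splitting procedure.

```python
import numpy as np

# Hypothetical per-variable (min, max) ranges used for scaling.
RANGES = {"tb_v": (100.0, 300.0), "tb_h": (100.0, 300.0),
          "z": (-25.0, 30.0), "v_los": (-60.0, 60.0)}


def build_input(tb_v, tb_h, z_dbz, v_los, ranges=RANGES, z_floor=-25.0):
    """Stack the four channels into a (4, height, along_track) float32 tensor."""
    n_height = z_dbz.shape[0]
    channels = {
        "tb_v": np.tile(tb_v, (n_height, 1)),   # 1D -> 2D by repeating in height
        "tb_h": np.tile(tb_h, (n_height, 1)),
        "z": np.maximum(z_dbz, z_floor),        # cap values below -25 dBZ
        "v_los": v_los,
    }
    scaled = [(channels[k] - lo) / (hi - lo) for k, (lo, hi) in ranges.items()]
    return np.stack(scaled).astype(np.float32)


def split_indices(n_samples, val_fraction=0.1, seed=0):
    """Shuffle sample indices and split them 9:1 into training and validation."""
    idx = np.random.default_rng(seed).permutation(n_samples)
    n_val = int(round(n_samples * val_fraction))
    return idx[n_val:], idx[:n_val]


x = build_input(tb_v=np.full(5, 200.0), tb_h=np.full(5, 250.0),
                z_dbz=np.array([[-40.0] * 5, [2.5] * 5]),
                v_los=np.zeros((2, 5)))
train_idx, val_idx = split_indices(10_912)
```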

Three encoder–decoder U-Net variants have been implemented. Details of their architectures are provided in Table 1.

All models employ bilinear up-sampling, skip concatenations, a 1 × 1 output convolution, and optional sigmoid activation for inference. Dropout is injected at the bottleneck to mitigate overfitting.

3.4. Training Setup

Overall, the training dataset consisted of 10,912 samples, for a total of approximately 5.5 million km and 45 GB of simulated track data. The learning rate was set to 0.0001. With a batch size of 20 on a single NVIDIA A40 GPU, training takes 3 to 4 h for the Mini configurations and up to 24 h for the large ones.

3.5. Inference and Evaluation Metrics

During evaluation, the same cleaning and normalization steps are applied. The network is then executed in sigmoid mode; the raw logits from its final $1 \times 1$ convolution are passed through the sigmoid function as follows:

\sigma(z) = \frac{1}{1 + e^{-z}}

(3)

turning every pixel into a calibrated probability in the interval $[0, 1]$. Running the model in sigmoid mode produces true probabilities that can be directly compared with the continuous target mask or thresholded for ETS, POD, FAR and $F_{1}$. For measuring performance, non-thresholded metrics were employed (Mean Absolute Error (MAE), Mean Squared Error (MSE), and Binary Cross Entropy Loss (BCE)), defined as:

MAE = \frac{1}{N} \sum_{i=1}^{N} |\hat{y}_{i} - y_{i}|

(4)

MSE = \frac{1}{N} \sum_{i=1}^{N} (\hat{y}_{i} - y_{i})^{2}

(5)

BCE = -\frac{1}{N} \sum_{i=1}^{N} \left[ y_{i} \ln \hat{y}_{i} + (1 - y_{i}) \ln (1 - \hat{y}_{i}) \right]

(6)

where N is the number of valid pixels in each image, $\hat{y}_{i} \in [0, 1]$ is the network output, and $y_{i} \in [0, 1]$ is the target mask.
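Equations (4) to (6) can be evaluated over the valid pixels of one image as in the sketch below. Treating NaN targets as invalid and clipping the predictions before taking logarithms are assumptions added for numerical safety; the function name is hypothetical.

```python
import numpy as np


def soft_metrics(pred, target, eps=1e-7):
    """Return (MAE, MSE, BCE) of Eqs. (4)-(6) over pixels with a valid target."""
    valid = ~np.isnan(target)          # N counts only pixels with a radar echo
    p, y = pred[valid], target[valid]
    mae = np.mean(np.abs(p - y))
    mse = np.mean((p - y) ** 2)
    pc = np.clip(p, eps, 1.0 - eps)    # keep the logarithms finite
    bce = -np.mean(y * np.log(pc) + (1.0 - y) * np.log(1.0 - pc))
    return mae, mse, bce


pred_demo = np.array([0.5, 0.5, 0.9])
target_demo = np.array([1.0, 0.0, np.nan])
mae, mse, bce = soft_metrics(pred_demo, target_demo)
```

For an uninformative prediction of 0.5 everywhere, this gives MAE = 0.5, MSE = 0.25 and BCE = ln 2, which are convenient reference values when reading the scores.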

For further performance assessment, the task is reformulated as a binary classification problem, and the following four thresholded metrics are computed: Probability of Detection (POD), False Alarm Ratio (FAR), Equitable Threat Score (ETS), and F1-score:

POD = \frac{TP}{TP + FN}

(7)

FAR = \frac{FP}{TP + FP}

(8)

ETS = \frac{TP - \frac{(TP + FP)(TP + FN)}{N}}{TP + FP + FN - \frac{(TP + FP)(TP + FN)}{N}}

(9)

F_{1} = \frac{2 TP}{2 TP + FP + FN}

(10)

A lightweight postprocessing routine is applied: for every pixel, the average of the mask values inside a $3 \times 11$ window (3 km wide, 0.7 km tall) is computed. If that local mean exceeds the cross-validated threshold $n = 0.05$, the pixel is flagged as convective; otherwise, it is labelled as stratiform. This returns a discrete representation in which each pixel is assigned a value of either 0 or 1 (or NaN). After thresholding, a $2 \times 2$ confusion matrix with true positives (TP), false positives (FP), false negatives (FN), and true negatives (TN) is obtained. All four metrics above are built from these counts.
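The postprocessing and the thresholded scores of Equations (7) to (10) can be sketched as follows for a mask stored as (height, along-track), so that the 3 × 11 window spans 11 range gates and 3 along-track pixels. How NaN pixels and image edges enter the local mean is not stated in the text; here they are simply left out of the average, which is an assumption, as are the function names.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view


def binarise(mask, thr=0.05, win=(11, 3)):
    """Local-mean thresholding; win is (vertical, horizontal) in pixels."""
    valid = ~np.isnan(mask)
    pad = [(w // 2, w // 2) for w in win]
    vals = np.pad(np.where(valid, mask, 0.0), pad)   # zeros outside image / at NaN
    cnt = np.pad(valid.astype(float), pad)           # number of valid pixels
    total = sliding_window_view(vals, win).sum(axis=(-2, -1))
    n = sliding_window_view(cnt, win).sum(axis=(-2, -1))
    local_mean = total / np.maximum(n, 1.0)
    return np.where(valid, (local_mean > thr).astype(float), np.nan)


def scores(pred_bin, true_bin):
    """POD, FAR, ETS and F1 from binary fields (assumes TP + FN > 0 and TP + FP > 0)."""
    ok = ~np.isnan(pred_bin) & ~np.isnan(true_bin)
    p, t = pred_bin[ok] == 1, true_bin[ok] == 1
    tp, fp = np.sum(p & t), np.sum(p & ~t)
    fn, tn = np.sum(~p & t), np.sum(~p & ~t)
    n = tp + fp + fn + tn
    rnd = (tp + fp) * (tp + fn) / n                  # hits expected by chance
    return {"POD": tp / (tp + fn), "FAR": fp / (tp + fp),
            "ETS": (tp - rnd) / (tp + fp + fn - rnd),
            "F1": 2 * tp / (2 * tp + fp + fn)}


# A one-pixel-wide convective column widens to three pixels after thresholding.
soft = np.zeros((11, 7))
soft[:, 3] = 1.0
soft[0, 0] = np.nan
hard = binarise(soft)

s = scores(np.array([[1, 0, 1, 0, 0, 0, 0, 0, 0, 0]], dtype=float),
           np.array([[1, 1, 0, 0, 0, 0, 0, 0, 0, 0]], dtype=float))
```

Because the threshold is low, the local mean spreads each convective pixel to its neighbours, which removes isolated filaments only when they are weak and tends to thicken solid cores.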

4. Case Studies

The performance of the model is illustrated through three case studies.

Figure 7 presents a 500 km slice through the simulated Hurricane Milton, capturing a broad stratiform shield with embedded convection between approximately 150 km and 320 km along-track. The brightness temperature field (upper-left panel) exhibits sharp drops only in narrow bands, suggesting the presence of isolated deep convective towers embedded within an otherwise extensive anvil. This structure is corroborated by the reflectivity panel (centre-left), which reveals a broad layer of 0–15 dBZ reflectivities spanning altitudes of 6–15 km. The Doppler velocity panel (bottom-left) displays a characteristically noisy pattern, yet clear upward motion signatures (yellow-red) can still be identified near the convective tower cores. The C/S mask (middle-right) translates these dynamical indicators into a continuous convective index, which peaks at unity in regions where $|w| > 3$ m/s.

The U-Net reconstruction (bottom-right) captures both the location and vertical extent of these convective cores with high fidelity. Notably, the major updraughts at 180–200 km and 240–270 km along-track are recovered with near pixel-perfect accuracy. Some minor discrepancies remain, however, as the network slightly over-dilates the convective areas.

Figure 8 illustrates the result of transforming the soft convection index into a binary classification field. The impact of this operation is evident in the upper-left panel: fine filaments visible in the raw mask (see Figure 7) are eliminated, while the principal convective core is consolidated into a solid, contiguous structure. Applying the same thresholding process to the U-Net output (upper-right panel) yields a similarly coherent reconstruction.

The lower panel combines the two post-processed masks into a four-colour confusion map. True positives (red) dominate the convective core, indicating that the network not only identifies the convective region correctly, but also captures its full vertical extent. True negatives (blue) are prevalent across the stratiform canopy, confirming that the model exhibits a low false alarm rate. Most classification errors manifest as a narrow yellow halo of false positives surrounding the edges of the convective towers. False negatives (green) are absent in this particular example and were observed only rarely across the entire test set.
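A four-category confusion map of this kind can be built by comparing the two binary masks pixel by pixel. The sketch below is a hypothetical illustration: the function name and the integer codes are assumptions, while the colour association (TP red, TN blue, FP yellow, FN green) follows the description above.

```python
import numpy as np

# Integer codes are an arbitrary choice; colours follow the text.
CODES = {"TN": 0, "FP": 1, "FN": 2, "TP": 3}
COLOURS = {"TN": "blue", "FP": "yellow", "FN": "green", "TP": "red"}


def confusion_map(truth, pred):
    """Return an integer map of TN/FP/FN/TP codes, with -1 where data are missing."""
    truth = np.asarray(truth, dtype=float)
    pred = np.asarray(pred, dtype=float)
    valid = ~np.isnan(truth) & ~np.isnan(pred)
    t = truth == 1  # NaN compares as False
    p = pred == 1
    out = np.full(truth.shape, -1, dtype=int)
    out[valid & ~t & ~p] = CODES["TN"]
    out[valid & ~t & p] = CODES["FP"]
    out[valid & t & ~p] = CODES["FN"]
    out[valid & t & p] = CODES["TP"]
    return out
```

The resulting integer array could then be rendered with any four-colour palette, for instance a discrete colormap in a plotting library.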

Figure 9 presents case study #2, which features a more fragmented convective structure compared to case study #1. The scene includes a chain of convective bursts embedded within a broad stratiform shield. The brightness temperature trace (upper-left panel) shows repeated dips between 130 km and 320 km, indicating the presence of multiple overshooting tops rather than a single, well-defined eyewall. The reflectivity field confirms this pattern, revealing narrow columns exceeding 15 dBZ embedded within an expansive 5–15 dBZ stratiform layer, which deepens from around 5 km on the left to over 15 km at 400 km along-track.

The Doppler velocity panel displays corresponding streaks of intense upward motion (yellow–red), flanked by weaker downdraughts—typical signatures of pulse-type convection. The C/S mask successfully isolates the convective cores, assigning the surrounding ice clouds to the stratiform category. The U-Net reconstruction accurately retrieves all major convective cores and even captures the wispy overshooting feature near 150 km. However, it also introduces several small “satellite” blobs that remain below the threshold in the ground truth.

After applying the sliding-window post-processing filter (Figure 10, top row), the predicted convective canopy appears smoother, and many of the spurious speckles disappear. The confusion map (bottom row) shows large true-positive regions (red) along the main convective towers, reflecting excellent recall. False positives (yellow) tend to appear around tower flanks and some mid-level anvil regions, while false negatives (green) are concentrated in a few narrow vertical spires, suggesting that the model occasionally underestimates the extent of very slender cores. Nevertheless, the prevalence of true positives and true negatives across the scene confirms that overall precision remains high, despite the scene’s structural complexity.

Finally, Figure 11 and Figure 12 present an additional case (case study #3), sampling a broad stratiform shield interrupted by a single intense convective tower, in contrast to the chain of smaller cells seen in previous cases. The brightness temperature panel reveals a sharp, V-shaped plunge of nearly 200 K centred around 70 km along-track. The reflectivity field confirms the presence of a narrow convective column extending above 17 km, with significant attenuation beneath it. The Doppler velocity panel supports this scenario, displaying a distinct needle-like vertical structure at the same location.

The U-Net reconstruction accurately predicts both the along-track position and the vertical extent of the core, while correctly identifying the downwind anvil as stratiform. Minor artefacts appear as faint streaks above 14 km, likely reflecting overconfident predictions of weaker convective activity. After post-processing, the filtered output maps are visually almost indistinguishable; however, the confusion image reveals subtle differences. The convective column is classified almost entirely as true positive (red). A thin halo of false positives (yellow) surrounds the top of the tower, indicating that the network is slightly more inclusive than the ground truth in classifying the anvil fringe. False negatives (green) are absent in this case.

Overall, the three case studies demonstrate the strong performance of the U-Net architecture in C/S classification, effectively handling both isolated and embedded convection scenarios.

To ensure comprehensiveness in the analysis, Figure 13 and Figure 14 illustrate an additional case study, evaluated across all the published model sizes: CONSTRAINN-Mini, CONSTRAINN-Medium, and CONSTRAINN-Large. It is noted that while all models exhibit comparable performance, an increase in model size results in a gradual reduction in reconstruction error.

5. Results and Discussion

Table 2 gives a complete picture of both the capacity-versus-skill curve and the contribution of each input channel. Moving from the mini to the medium and then to the large model reduces the pixel-wise losses monotonically (BCE: 0.031 → 0.028 → 0.023; MAE: 0.0153 → 0.0133 → 0.0098) and lifts ETS from 0.481 to 0.553, while FAR drops by roughly four percentage points overall. The returns, however, diminish: the medium model uses about 4× the parameters of the mini version for a gain of roughly 10–14% in the continuous metrics, whereas the large model costs an order of magnitude more parameters than the medium one for a further improvement of roughly 18–26%. In practice, therefore, the medium configuration may offer the best cost-benefit ratio for an operational setting, with the large model acting as a high-skill but GPU-hungry benchmark, especially in terms of training cost.
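The capacity-versus-skill trade-off can be recomputed from the tabulated values. The short script below is only a worked illustration: it takes the parameter counts from Table 1 and the continuous losses from Table 2, and the helper names are hypothetical. Since the table values are rounded, the resulting percentages are approximate.

```python
# Parameter counts in millions (Table 1) and continuous losses (Table 2).
PARAMS_M = {"Mini": 8, "Medium": 30, "Large": 500}
LOSSES = {
    "Mini": {"BCE": 0.0311, "MAE": 0.0153, "MSE": 0.0066},
    "Medium": {"BCE": 0.0281, "MAE": 0.0133, "MSE": 0.0057},
    "Large": {"BCE": 0.0230, "MAE": 0.0098, "MSE": 0.0042},
}


def relative_gain(old, new):
    """Percentage reduction of a loss when going from `old` to `new`."""
    return 100.0 * (old - new) / old


def step_summary(small, big):
    """Parameter ratio and per-metric relative gain between two model sizes."""
    ratio = PARAMS_M[big] / PARAMS_M[small]
    gains = {m: relative_gain(LOSSES[small][m], LOSSES[big][m])
             for m in LOSSES[small]}
    return ratio, gains


if __name__ == "__main__":
    for small, big in [("Mini", "Medium"), ("Medium", "Large")]:
        ratio, gains = step_summary(small, big)
        pretty = ", ".join(f"{m} {g:.1f}%" for m, g in gains.items())
        print(f"{small} -> {big}: {ratio:.1f}x parameters; {pretty}")
```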

The channel-ablation rows (fourth to eighth) refine this picture. Removing either brightness temperature or Doppler velocity changes each thresholded score by less than one percentage point, evidence that the network can largely substitute the missing signal with the other two. Dropping reflectivity is more nuanced: MAE and MSE improve slightly and FAR falls to 0.188, but POD drops to 0.944 as the network misses more weak convective examples, indicating that reflectivity adds recall at the expense of extra noise along core edges. The single-channel experiments underline the importance of Doppler velocity: a model driven only by Doppler velocity scores the highest ETS (0.604) and the lowest FAR (0.185), yet its POD slips to 0.961, confirming that velocity alone captures vertical-motion structure but sometimes underrates marginal convection. A model relying solely on reflectivity performs almost on par with the full model in POD but lags behind in every other metric, reinforcing the view that the three observables are synergistic, with Doppler velocity providing the bulk of structural skill and reflectivity acting as a recall booster.
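To make the ablation comparison easier to read, the thresholded scores of each ablated model can be expressed as differences, in percentage points, from the full CONSTRAINN-Large model. The snippet below simply restates the Table 2 values; the helper name `deltas_pp` is hypothetical.

```python
# Thresholded scores from Table 2 (full CONSTRAINN-Large and its ablations).
FULL = {"ETS": 0.553, "FAR": 0.217, "POD": 0.983, "F1": 0.845}
ABLATIONS = {
    "No Br. Temperature": {"ETS": 0.547, "FAR": 0.220, "POD": 0.980, "F1": 0.842},
    "No Doppler Velocity": {"ETS": 0.548, "FAR": 0.220, "POD": 0.984, "F1": 0.844},
    "No Reflectivity": {"ETS": 0.580, "FAR": 0.188, "POD": 0.944, "F1": 0.855},
    "Only Doppler Velocity": {"ETS": 0.604, "FAR": 0.185, "POD": 0.961, "F1": 0.865},
    "Only Reflectivity": {"ETS": 0.543, "FAR": 0.222, "POD": 0.981, "F1": 0.841},
}


def deltas_pp(row, reference=FULL):
    """Difference from the reference model, in percentage points."""
    return {k: round(100.0 * (row[k] - reference[k]), 1) for k in reference}


if __name__ == "__main__":
    for name, row in ABLATIONS.items():
        d = deltas_pp(row)
        print(name, {k: f"{v:+.1f}" for k, v in d.items()})
```

Positive differences are improvements for ETS, POD and F1, and degradations for FAR.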

6. Conclusions

This study introduces CONSTRAINN, a family of U-Net models trained on simulated data replicating the expected measurements from the WIVERN mission, to deliver a continuous, physically interpretable index of convective activity. By converting simulated vertical winds into a continuous convective/stratiform mask and by fusing Doppler velocity, reflectivity and brightness temperature information, the approach offers a reliable methodology to estimate vertical wind speed, as required by the mission’s Level-2 retrieval chain. On the Hurricane Milton benchmark, a mean squared error of 0.38% is achieved, with an ETS of 60%, a POD of 98% and a FAR of 18%. It is worth noting that, since the convective pixels exceeding 1 m/s make up only about 3.6% of our data, this FAR mostly reflects the network dilating real convection cores by a few pixels rather than fabricating artificial convective regions, an error that is considered acceptable within our application domain.
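The effect of the low convective base rate on the false alarms can be illustrated with a back-of-the-envelope calculation. The sketch below rests on two simplifying assumptions that are not stated in the text: that the 3.6% convective fraction also applies to the binary ground-truth mask, and that POD and FAR are pooled over all pixels. The function name is hypothetical.

```python
def false_alarm_budget(base_rate, pod, far):
    """Express TP, FN and FP as fractions of all pixels.

    Assumptions: `base_rate` is the convective fraction of the binary truth,
    and POD/FAR are pooled over pixels (FAR = FP / (TP + FP)).
    """
    tp = base_rate * pod
    fn = base_rate - tp
    fp = tp * far / (1.0 - far)  # rearranged from FAR = FP / (TP + FP)
    return {
        "TP": tp,
        "FN": fn,
        "FP": fp,
        "predicted_convective": tp + fp,
        "FP_share_of_stratiform": fp / (1.0 - base_rate),
    }


if __name__ == "__main__":
    budget = false_alarm_budget(base_rate=0.036, pod=0.98, far=0.18)
    for name, value in budget.items():
        print(f"{name}: {100.0 * value:.2f}% ")
```

Under these assumptions, the false alarms amount to roughly 0.8% of all pixels, which is consistent with a thin halo around real cores rather than large spurious regions.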

The current models are hurricane-focused and do not distinguish between downdraughts and updraughts. Future work might include generalizing the presented models to directly retrieve vertical wind velocity, including its sign. Moving toward a broader range of applicable scenarios, a natural improvement consists of testing the current architecture on outputs of different storm-resolving models (e.g., ICON, RAMS) for tropical cyclones and extending the architecture to observational scenarios such as mid-latitude systems, mesoscale convective systems, frontal systems, or polar lows. Furthermore, the performance of the U-Net-based CONSTRAINN model might be evaluated against other deep learning techniques.

A next step might involve applying CONSTRAINN to real satellite observations, including EarthCARE, and in the near future, INCUS. Anticipated challenges in this transition include managing instrument noise, calibration uncertainties, footprint mismatches, and ensuring robust domain adaptation from simulated to actual measurements. Successfully addressing these challenges would significantly advance the creation of a unified convection classifier for next-generation spaceborne atmospheric radars.

Source code, data, and trained checkpoints are available, open-source and open-weights, at https://github.com/Anatr1/CONSTRAINN, accessed on 20 November 2025.

Author Contributions

A.B. wrote part of the paper and defined and supervised the project. F.M. (Federico Mustich) drafted the paper, built the U-Net, and performed the data analysis. F.M. (Francesco Manconi) built the training dataset and contributed to Section 2. A.P. ran the WRF simulations. P.K. reviewed the paper and contributed to Section 3. All authors have read and agreed to the published version of the manuscript.

Funding

This research has been supported by the Italian Space Agency (ASI) project “Scientific studies for the Wind Velocity Radar Nephoscope (WIVERN) mission” (Project number: 2023-44-HH.0). This study was carried out within the Space It Up project funded by the Italian Space Agency (ASI) and the Ministry of University and Research (MUR) under contract no. 2024-5-E.0—CUP no. I53D24000060005.

Data Availability Statement

Data and code used for this research are available at the aforementioned GitHub repository.

Acknowledgments

This research used the Felipe High-Performance Computing Facility at the Politecnico di Torino. Computational resources were also provided by HPC@POLITO, a project of Academic Computing within the Department of Control and Computer Engineering at the Politecnico di Torino.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

References

1. Illingworth, A.J.; Battaglia, A.; Bradford, J.; Forsythe, M.; Joe, P.; Kollias, P.; Lean, K.; Lori, M.; Mahfouf, J.F.; Melo, S.; et al. WIVERN: A New Satellite Concept to Provide Global In-Cloud Winds, Precipitation, and Cloud Properties. Bull. Am. Meteorol. Soc. 2018, 99, 1669–1687.
2. Battaglia, A.; Rizik, A.; Sikaneta, I.; Tridon, F. I and Qs Simulation and Processing Envisaged for Spaceborne Polarization Diversity Doppler Radars. IEEE Trans. Geosci. Remote Sens. 2025, 63, 3529672.
3. ESA WIVERN Team. Report for Mission Selection: Earth Explorer 11 Candidate Mission WIVERN; Technical Report, ESA-EOPSM-WIVE-RP-4798; European Space Agency: Noordwijk, The Netherlands, 2025.
4. Sasso, N.; Borderies, M.; Chambon, P.; Berre, L.; Girardot, N.; Moll, P.; Payan, C.; Pourret, V.; Battaglia, A.; Illingworth, A.; et al. Impact of WIVERN wind observations on Arpege Numerical Weather Prediction model forecasts using an Ensemble of Data Assimilation method. Q. J. R. Meteorol. Soc. 2025, e4991.
5. Federico, S.; Torcasio, R.C.; Transerici, C.; Montopoli, M.; Cambiotti, C.; Manconi, F.; Battaglia, A.; Pourshams, M. Assimilating WIVERN Winds in WRF model: An application to the Outstanding Case of the Medicane Ianos. Atmos. Meas. Tech. 2025. submitted.
6. Battaglia, A.; Cambiotti, C.; Carbone, A.F.; Da Silva, S. Reconstruction of the Horizontal Wind Field Inside Weather Systems from the Sparse Sampling Envisaged for the Wind Velocity Radar Nephoscope (WIVERN) Mission. In Proceedings of the IGARSS 2024 - 2024 IEEE International Geoscience and Remote Sensing Symposium, Athens, Greece, 7–12 July 2024; pp. 8925–8927.
7. Da Silva, S.; Battaglia, A.; Cambiotti, C.; Carbone, A.F. Sparse Sampling Reconstruction of Wind Fields for Space-Borne Doppler Radars. IEEE Trans. Geosci. Remote Sens. 2025. submitted.
8. Tridon, F.; Battaglia, A.; Rizik, A.; Scarsi, F.E.; Illingworth, A. Filling the Gap of Wind Observations Inside Tropical Cyclones. Earth Space Sci. 2023, 10, e2023EA003099.
9. Schumacher, C.; Funk, A. Assessing Convective-Stratiform Precipitation Regimes in the Tropics and Extratropics with the GPM Satellite Radar. Geophys. Res. Lett. 2023, 50, e2023GL102786.
10. Battaglia, A.; Kollias, P.; Dhillon, R.; Roy, R.; Tanelli, S.; Lamer, K.; Grecu, M.; Lebsock, M.; Watters, D.; Mroz, K.; et al. Spaceborne Cloud and Precipitation Radars: Status, Challenges, and Ways Forward. Rev. Geophys. 2020, 58, e2019RG000686.
11. Houze, R.A., Jr.; Rasmussen, K.L.; Zuluaga, M.D.; Brodzik, S.R. The variable nature of convection in the tropics and subtropics: A legacy of 16 years of the Tropical Rainfall Measuring Mission satellite. Rev. Geophys. 2015, 53, 994–1021.
12. Mroz, K.; Battaglia, A.; Lang, T.J.; Cecil, D.J.; Tanelli, S.; Tridon, F. Hail-Detection Algorithm for the GPM Core Observatory Satellite Sensors. J. Appl. Meteorol. Climatol. 2017, 56, 1939–1957.
13. Kollias, P.; Szyrmer, W.; Zawadzki, I.; Joe, P. Considerations for spaceborne 94 GHz radar observations of precipitation. Geophys. Res. Lett. 2007, 34, L21803.
14. Haynes, J.M.; L’Ecuyer, T.S.; Stephens, G.L.; Miller, S.D.; Mitrescu, C.; Wood, N.B.; Tanelli, S. Rainfall retrieval over the ocean with spaceborne W-band radar. J. Geophys. Res. Atmos. 2009, 114.
15. Battaglia, A.; Tanelli, S.; Kobayashi, S.; Zrnic, D.; Hogan, R.J.; Simmer, C. Multiple-scattering in radar systems: A review. J. Quant. Spectrosc. Radiat. Transf. 2010, 111, 917–947.
16. Stephens, G.; Polcher, J.; Zeng, X.; van Oevelen, P.; Poveda, G.; Bosilovich, M.; Ahn, M.H.; Balsamo, G.; Duan, Q.; Hegerl, G.; et al. The First 30 Years of GEWEX. Bull. Am. Meteorol. Soc. 2023, 104, E126–E157.
17. Takahashi, H.; Luo, Z.J.; Stephens, G.L. Level of neutral buoyancy, deep convective outflow, and convective core: New perspectives based on 5 years of CloudSat data. J. Geophys. Res. 2017, 122, 2958–2969.
18. Steiner, M.; Houze, R.; Yuter, S. Climatological characterization of three-dimensional storm structure from operational radar and rain gauge data. J. Appl. Meteorol. 1995, 34, 1978–2007.
19. Yang, Y.; Chen, X.; Qi, Y. Classification of convective/stratiform echoes in radar reflectivity observations using a fuzzy logic algorithm. J. Geophys. Res. Atmos. 2013, 118, 1896–1905.
20. Yang, Z.; Liu, P.; Yang, Y. Convective/Stratiform Precipitation Classification Using Ground-Based Doppler Radar Data Based on the K-Nearest Neighbor Algorithm. Remote Sens. 2019, 11, 2277.
21. Cintineo, J.L.; Pavolonis, M.J.; Sieglaff, J.M.; Wimmers, A.; Brunner, J.; Bellon, W. A deep-learning model for automated detection of intense midlatitude convection using geostationary satellite images. Weather Forecast. 2020, 35, 2567–2588.
22. Lee, Y.; Kummerow, C.D.; Ebert-Uphoff, I. Applying machine learning methods to detect convection using Geostationary Operational Environmental Satellite-16 (GOES-16) advanced baseline imager (ABI) data. Atmos. Meas. Tech. 2021, 14, 2699–2716.
23. Das, S.; Wang, Y.; Gong, J.; Ding, L.; Munchak, S.J.; Wang, C.; Wu, D.L.; Liao, L.; Olson, W.S.; Barahona, D.O. A Comprehensive Machine Learning Study to Classify Precipitation Type over Land from Global Precipitation Measurement Microwave Imager (GPM-GMI) Measurements. Remote Sens. 2022, 14, 3631.
24. Orescanin, M.; Petković, V.; Powell, S.W.; Marsh, B.R.; Heslin, S.C. Bayesian deep learning for passive microwave precipitation type detection. IEEE Geosci. Remote Sens. Lett. 2021, 19, 4500705.
25. Hoeller, J.; Fiévet, R.; Engelbrecht, E.; Haerter, J.O. U-Net Segmentation for the Detection of Convective Cold Pools From Cloud and Rainfall Fields. J. Geophys. Res. Atmos. 2024, 129, e2023JD040126.
26. Han, L.; Liang, H.; Chen, H.; Zhang, W.; Ge, Y. Convective Precipitation Nowcasting Using U-Net Model. IEEE Trans. Geosci. Remote Sens. 2022, 60, 1–8.
27. Zhang, J.; He, M. Methodology for Severe Convective Cloud Identification Using Lightweight Neural Network Model Ensembling. Remote Sens. 2024, 16, 2070.
28. Deng, M.; Han, Y.; Liu, Y.; Dong, L.; Zhou, Q.; Zhang, Y.; Deng, X.; Lu, T. Development of a Novel One-Dimensional Nested U-Net Cloud-Classification Model (1D-CloudNet). Remote Sens. 2025, 17, 519.
29. Ramos, L.T.; Sappa, A.D. Leveraging U-Net and selective feature extraction for land cover classification using remote sensing imagery. Sci. Rep. 2025, 15, 784.
30. Yan, C.; Fan, X.; Fan, J.; Wang, N. Improved U-Net Remote Sensing Classification Algorithm Based on Multi-Feature Fusion Perception. Remote Sens. 2022, 14, 1118.
31. Meza, J.; Anderson, N.; Tsultrim, T. Weather-Forecasting UNET: Leveraging a U-Net Architecture for Probabilistic Tornado Prediction from Multivariate Reanalysis Data; Stanford University: Stanford, CA, USA, 2024; Volume 60, pp. 1–8.
32. Wimmers, A.J.; Griffin, S.; Velden, C. A U-Net Retrieval of Tropical Cyclone Inner-Core Wind Fields from Microwave and Infrared Satellite Imagery. Artif. Intell. Earth Syst. 2024, 3, 1–18.
33. Cao, Y.; Chen, L.; Wu, J.; Feng, J. Enhancing Nowcasting with Multi-Resolution Inputs Using Deep Learning: Exploring Model Decision Mechanisms. Geophys. Res. Lett. 2024, 52, e2024GL113699.
34. Zhang, Z.; Song, Q.; Duan, M.; Liu, H.; Huo, J.; Han, C. Deep Learning Model for Precipitation Nowcasting Based on Residual and Attention Mechanisms. Remote Sens. 2025, 17, 1123.
35. Rizik, A.; Battaglia, A.; Tridon, F.; Scarsi, F.; Kötsche, A.; Kalesse-Los, H.; Maahn, M.; Illingworth, A. Impact of cross-talk on reflectivity and Doppler measurements for the WIVERN Polarization Diversity Doppler Radar. IEEE Trans. Geosci. Remote Sens. 2023, 61, 2004814.
36. Manconi, F.; Martire, P.; Stesina, F.; Battaglia, A. High accuracy attitude determination of a spacecraft with a fast-rotating Doppler radar reflector. Acta Astronaut. 2025, 233, 66–81.
37. Battaglia, A.; Martire, P.; Caubet, E.; Phalippou, L.; Stesina, F.; Kollias, P.; Illingworth, A. Observation error analysis for the WInd VElocity Radar Nephoscope W-band Doppler conically scanning spaceborne radar via end-to-end simulations. Atmos. Meas. Tech. 2022, 15, 3011–3030.
38. Fabry, F. Radar Meteorology: Principles and Practice; Cambridge University Press: Cambridge, UK, 2015.
39. Stephens, G.L.; Kummerow, C.D. The Remote Sensing of Clouds and Precipitation from Space: A Review. J. Atmos. Sci. 2007, 64, 3742–3765.
40. Luo, Z.; Stephens, G.L.; Emanuel, K.A.; Vane, D.G.; Tourville, N.; Haynes, J. On the Use of CloudSat and MODIS Data for Estimating Hurricane Intensity. IEEE Geosci. Remote Sens. Lett. 2008, 5, 13–16.
41. Matrosov, S.Y. Potential for attenuation-based estimations of rainfall rate from CloudSat. Geophys. Res. Lett. 2007, 34, L08517.
42. Heymsfield, G.M.; Tian, L.; Heymsfield, A.J.; Li, L.; Guimond, S. Characteristics of Deep Tropical and Subtropical Convection from Nadir-Viewing High-Altitude Airborne Doppler Radar. J. Atmos. Sci. 2010, 67, 285–308.
43. Galfione, A.; Battaglia, A.; Puigdomènech Treserras, B.; Kollias, P. First insights into deep convection by the Doppler velocity measurements of the EarthCARE’s Cloud Profiling Radar. EGUsphere 2025, 2025, 1–25.
44. Spencer, R.W.; Howland, M.R.; Santek, D.A. Severe Storm Identification with Satellite Microwave Radiometry: An Initial Investigation with Nimbus-7 SMMR Data. J. Appl. Meteorol. Climatol. 1987, 26, 749–754.
45. Hong, Y.; Kummerow, C.D.; Olson, W.S. Separation of Convective and Stratiform Precipitation Using Microwave Brightness Temperature. J. Appl. Meteorol. 1999, 38, 1195–1213.
46. Battaglia, A.; Mroz, K.; Lang, T.; Tridon, F.; Tanelli, S.; Tian, L.; Heymsfield, G.M. Using a multiwavelength suite of microwave instruments to investigate the microphysical structure of deep convective cores. J. Geophys. Res. Atmos. 2016, 121, 9356–9381.
47. Leppert, K.D.; Cecil, D.J. Signatures of Hydrometeor Species from Airborne Passive Microwave Data for Frequencies 10–183 GHz. J. Appl. Meteorol. Climatol. 2015, 54, 1313–1334.
48. Battaglia, A.; Mroz, K.; Cecil, D. Chapter 9 - Satellite hail detection. In Precipitation Science; Michaelides, S., Ed.; Elsevier: Amsterdam, The Netherlands, 2022; pp. 257–286.
49. Battaglia, A.; Panegrossi, G. What Can We Learn from the CloudSat Radiometric Mode Observations of Snowfall over the Ice-Free Ocean? Remote Sens. 2020, 12, 3285.
50. Atlas, D.; Ulbrich, C.W.; Marks, F.D., Jr.; Black, R.A.; Amitai, E.; Willis, P.T.; Samsury, C.E. Partitioning tropical oceanic convective and stratiform rains by draft strength. J. Geophys. Res. Atmos. 2000, 105, 2259–2267.

Figure 1. Panel (a): Conically scanning geometry envisaged for the WIVERN mission. The width of the footprint is exaggerated for illustration purposes. Panel (b): vector diagram explaining the WIVERN LoS equation, showing how to relate $V_{HLoS}$, the vertical wind ($w$), and the Doppler terminal velocity ($V_{T}^{D}$) to the $V_{LoS}$ measurement.

Figure 3. Detail of the simulated scan example, highlighting convective regions. In greyscale, the total water path (TWP), i.e., the TWC integrated over the height. Superimposed, the WIVERN scan, color-coded with the H channel brightness temperature $T_{B}^{H}$. In magenta, contours of regions where the maximum of the windspeed in the column is over 3 m/s, which is the criterion used to identify convective cells.

Figure 4. Curtains of the sector of the scan highlighted in yellow in Figure 2. In the left column, from top to bottom, the following observables: (a) measured reflectivity, (c) measured Doppler velocity, and (e) vertical and horizontal brightness temperature. In the right column, from top to bottom, the antenna weighted “true” quantities obtained directly from the WRF model: (b) total water content (TWC), (d) horizontal component of the LoS wind, and (f) vertical component of the LoS wind plus the hydrometeor terminal velocity.

Figure 5. Precipitation events on 11 January 2008 over Bolivia and Argentina observed in an ascending CloudSat overpass. (a) CloudSat 94-GHz $T_{b}$ in K. (b) CPR reflectivity. (c) Example of CPR reflectivity profiles: two in deep convective cores meeting the criteria from Takahashi et al. [17] (dashed lines) and two in a stratiform region (continuous lines).

Figure 6. Network architecture for the CONSTRAINN medium network. Numbers indicate each layer size. Orange: convolutional layers. Red: pooling layers. Blue: unpooling layers. Purple: softmax layer.

Figure 7. Case study #1 with input data and output for the large CONSTRAINN network. Top-left: brightness temperatures, center-left: reflectivity, bottom-left: Doppler velocity, top-right: raw wind vertical velocity, center-right: target mask, bottom-right: reconstructed image.

Figure 8. Binary thresholded output from the same sample of Figure 7. Top-left: postprocessed C/S mask, top-right: postprocessed reconstructed image, bottom: overlapped images, highlighting true-positive, true-negative, false-positive, and false-negative regions.

Figure 9. Case study #2 with input data and output for the CONSTRAINN large network. Top-left: brightness temperatures, center-left: reflectivity, bottom-left: Doppler velocity, top-right: raw wind vertical velocity, center-right: target mask, bottom-right: reconstructed image.

Figure 10. Binary thresholded output from the same sample of Figure 9. Top-left: postprocessed C/S mask, top-right: postprocessed reconstructed image, bottom: overlapped images, highlighting true-positive, true-negative, false-positive, and false-negative regions.

Figure 11. Case study #3 with input data and output for the CONSTRAINN large network. Top-left: brightness temperatures, center-left: reflectivity, bottom-left: Doppler velocity, top-right: raw wind vertical velocity, center-right: target mask, bottom-right: reconstructed image.

Figure 12. Binary thresholded output from the same sample of Figure 11. Top-left: postprocessed C/S mask, top-right: postprocessed reconstructed image, bottom: overlapped images, highlighting true-positive, true-negative, false-positive, and false-negative regions.

Figure 13. Case study #4 analyzed with all the three CONSTRAINN models. Left column: input data and vertical velocity ground truth. Right column: at the top, the target mask obtained from the vertical velocity ground truth, below the output from each model.

Figure 14. Binary thresholded output from the same sample of Figure 13 for each CONSTRAINN model size, highlighting true-positive, true-negative, false-positive, and false-negative regions.

Table 1. Architectural summary of the three CONSTRAINN variants. “Parameters” is the total number of trainable parameters (expressed in millions), “Initial Filters” is the number of feature maps in the first convolution, and “Depth” is the number of down-sampling (encoder) blocks.

Model Size	Parameters (M)	Initial Filters	Depth (Down Blocks)
Mini	8	32	4
Medium	30	64	5
Large	500	128	6

Table 2. Validation performance of the three CONSTRAINN variants. Arrows indicate whether lower (↓) or higher (↑) values are better. Bold face characters are used to highlight the best achieved score.

Model	BCE↓	MAE↓	MSE↓	ETS↑	FAR↓	POD↑	$F_{1}$ ↑
CONSTRAINN-Mini	0.0311	0.0153	0.0066	0.481	0.253	0.978	0.815
CONSTRAINN-Medium	0.0281	0.0133	0.0057	0.512	0.238	0.982	0.829
CONSTRAINN-Large	0.0230	0.0098	0.0042	0.553	0.217	0.983	0.845
CONSTRAINN-Large (No Br. Temperature)	0.0238	0.0104	0.0044	0.547	0.220	0.980	0.842
CONSTRAINN-Large (No Doppler Velocity)	0.0243	0.0106	0.0046	0.548	0.220	0.984	0.844
CONSTRAINN-Large (No Reflectivity)	0.0238	0.0089	0.0038	0.580	0.188	0.944	0.855
CONSTRAINN-Large (Only Doppler Velocity)	0.0240	0.0098	0.0042	0.604	0.185	0.961	0.865
CONSTRAINN-Large (Only Reflectivity)	0.0248	0.0110	0.0048	0.543	0.222	0.981	0.841

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
