1. Introduction
Preliminary breakdown (PB), also referred to as initial breakdown (particularly in cloud flashes), is one of the most mysterious lightning processes. Clarence and Malan [
1] suggested that PB in negative cloud-to-ground flashes is a vertical discharge between the main negative charge region and the lower positive charge region in the cloud. However, Krehbiel et al. [
2], from eight-station electric field measurements, found that breakdown events before the stepped leader involved considerable horizontal extent. Rakov and Uman [
3] stated that the PB process can be viewed as a sequence of channels extending in seemingly random directions from the cloud charge source with one of them evolving into the stepped leader propagating to ground. More recent studies of preliminary breakdown by Stolzenburg et al. [
4,
5], who used high-speed video and electric field records, indicated that the PB luminosity bursts were correlated with the PB pulses in the corresponding electric field records and suggested that each PB pulse was caused by a substantial current surge traversing a channel segment of the order of hundreds of meters in length. Nag and Rakov [
6], via modeling, estimated the PB pulse peak currents to be of the order of tens of kiloamperes, comparable to or even exceeding the corresponding return-stroke peak current.
Preliminary breakdown in negative cloud-to-ground lightning flashes at relatively large distances produces a sequence of bipolar electric and magnetic field pulses, typically occurring a few tens of milliseconds before the first return stroke pulse. The typical PB pulse-train duration is 2–3 ms [
7,
8], and typical PB pulse width is 20–40 µs [
9]. The peaks of the largest PB pulses can be comparable to or even exceed that of the following return stroke (RS) pulse [
7,
8,
10]. Using the electric field records obtained in 2006 at the Lightning Observatory in Gainesville (LOG), Florida, Nag and Rakov [
11] found that only 18% of negative cloud-to-ground flashes had detectable PB pulse trains. However, Baharudin et al. [
8] and Marshall et al. [
12] found that each of the flashes recorded by them in Florida had a PB pulse train. The peaks of PB pulses were found [
12,
13,
14] to be significantly higher in higher-latitude (temperate) regions than their counterparts in lower-latitude (tropical or subtropical) regions. Gomes et al. [
13] and Nag and Rakov [
14] attributed this difference to the existence of a more significant lower positive charge region (LPCR) in higher-latitude locations.
Brook [
10] found that PB pulses produced in winter storms were more intense than in summer storms. Also, he found the time interval between the peaks of the first PB pulse and the first-return-stroke pulse (PB-RS interval) in winter storms to be a factor of four shorter than in summer storms. The shortest PB-RS interval in his study was 2.5 ms, observed in winter. He attributed the disparity to the difference in precipitation mixes in summer and winter thunderclouds. However, many researchers observed [
15,
16,
17,
18,
19] PB-RS intervals as short as a few milliseconds in summer storms. For the short PB-RS interval events, Heavner et al. [
15] and Zhu et al. [
17] inferred leader speeds of the order of 10
6 m/s, which is an order of magnitude greater than typical speeds of negative stepped leaders. Nag and Rakov [
14] and Kolmasova et al. [
16] suggested that such unusually fast stepped leaders are produced by unusually strong negative charge sources. According to Nag and Rakov [
14], only 5% of stepped leaders in Florida have durations shorter than 5 ms. The events with short PB-RS intervals were recently found by Kotovsky et al. [
18] to be associated with long-lasting disturbances in the upper mesosphere and lower ionosphere.
In this paper, distant (50 to 500 km) electric field waveforms of PB pulses in negative cloud-to-ground lightning flashes characterized by relatively high (≥50 kA) first-RS peak currents are analyzed by using an automated data processing algorithm. The following parameters are examined: PB-RS interval, the ratio of peaks of the largest PB pulse and the corresponding first-return-stroke pulse (PB/RS pulse peak ratio), range-normalized peak of the largest PB pulse, and PB pulse train duration. The sample size in this study is an order of magnitude greater than those in previous studies.
3. Data and Methodology
In order to process LOG electric field data more efficiently, an automated procedure was developed to match NLDN data and LOG electric field records (all GPS-time-stamped), process the electric field data, detect the PB pulse trains and corresponding first RS pulses, and measure parameters of those trains, as well as the PB-RS time interval. The automated procedure is described below and illustrated in
Figure 1.
Step 1. Find LOG records containing NLDN-reported negative first strokes
For each LOG electric field record, we define a 3-ms time window centered at the GPS time stamp of the record. Then we search the NLDN negative flash dataset containing information only on the first stroke in each flash (subsequent strokes in multiple-stroke flashes are not included in this dataset) in the 50 to 500 km range of distances from the LOG to see if the NLDN detected a first stroke in that 3-ms window. If no NLDN-reported first stroke is found, the electric field record is discarded. If the NLDN reported a first stroke within the 3-ms time window (within ±1.5 ms of the LOG trigger) and within 50 to 500 km of LOG, we go to step 2. During step 1, LOG records triggered by subsequent strokes, cloud discharges, and close (<50 km) lightning events (which may contain significant non-radiation field components that make the automated detection of pulses difficult), as determined by the NLDN, are excluded from this study.
Step 2. Exclude LOG records that are saturated or contain other strokes prior to the first stroke of interest
If an NLDN-reported first stroke is found in step 1, we further check, in a 60-ms window (within −50 ms to +10 ms relative to the record GPS time stamp) if (1) the field record is saturated or (2) one or more strokes are found up to 50 ms prior to the first return stroke. This is accomplished by using another NLDN dataset which includes information on both first and subsequent strokes in each flash. The 50-ms pre-return-stroke interval was selected because 95% of PB-RS intervals in Florida negative CGs are less than 50 ms [
8,
17]. Occurrence of other return strokes (e.g., from a different flash) less than 50 ms prior to the first return stroke would not allow us to adequately examine the PB-RS time interval. If the record is saturated or there are other return strokes within 50 ms prior to the first return stroke of interest, the record is discarded. Otherwise, we go to step 3.
Step 3. Reduce noise
At this point, ideally, we have a 60-ms (−50 ms to +10 ms) electric field records with the first return stroke at t = 0 and possibly cloud pulses before and after the first return stroke. Additionally, we allowed the occurrence of subsequent strokes (using the NLDN temporal and spatial criteria to group strokes into flashes) between 0 and +10 ms, which would not interfere with our analysis. The 60-ms records were passed through the moving-average and power-line-noise-removal filters to remove 60-Hz power-line noise and to improve the signal/noise ratio. The power-line-noise-removal filter generates a 60-Hz sinusoidal waveform, based on the phase and amplitude of the power-line signal found from the 60-ms record, and subtracts that sinusoidal waveform from the original record. The moving-average window was 0.6 µs (31 points). After reducing noise we go to step 4.
Step 4. Identify and characterize individual pulses
From the beginning of the 60-ms record to 1.4 ms prior to the first return stroke (from −60 ms to −1.4 ms), we search each record to identify all unipolar pulses wider than 2.5 µs and all bipolar pulses wider than 5 µs with amplitudes greater than three times the residual noise level. Bipolar pulses were defined as composites of two consecutive monopolar pulses of opposite polarity separated by a time interval shorter than one-tenth of the width of the first pulse. The criteria used at this step were optimized by the trial-and-error method to achieve the best discrimination of lightning-generated pulses from residual noise. The −1.4 ms upper limit for the pulse search window was needed to exclude pulses generated by the stepped leader near ground; also, no PB-RS intervals shorter than 1.4 ms were reported for Florida lightning in [
17,
21]. At the end of this step, we have amplitudes and widths of all pulses in the search window. Then we go to step 5.
Step 5. Identify PB pulse trains (if any) and measure parameters of the train
We consider as a pulse train a sequence of at least three detected bipolar pulses. Pulses that are separated from the last pulse of the train by more than 2 ms are considered not belonging to that train, which is the same criterion as that used in [
11,
17]. If no PB pulse train is identified, the flash is labeled as having no detectable PB pulses and not included in further analysis. If one or more PB pulse trains are identified, the flash is considered as a flash with detectable PB pulses. In the following, we consider only pulses that belong to PB pulse trains. If a record contains one or more detected PB pulse trains, the automated data processing code will output a plot showing the 60-ms filtered electric field record, including the first-return-stroke pulse and all detected PB pulses, each marked by a box, along with all the pertinent information (given in a table above the plot) about this event, including the filename, date, location, NLDN-reported peak current, and all the PB pulse train parameters (including the PB-RS interval). An example of the output plot is shown in
Figure 2.
From August to October 2014, during 31 storms, thousands of electric field records were obtained and processed using the automated algorithm described above. For a total of 5498 flashes with NLDN-reported first return strokes, 3496 (64%) had detectable PB pulse trains. The NLDN-reported locations of all the 5498 first strokes are shown in
Figure 3. Since (1) the algorithm excluded all flashes that occurred within 50 km of LOG and (2) the average distance to LOG for the 5498 accepted flashes was 235 km, our dataset is biased toward flashes with higher peak currents. Specifically, the arithmetic mean (AM) and geometric mean (GM) of NLDN-reported peak currents are 119 kA and 104 kA, respectively, for all the 5498 events, and 124 kA and 109 kA, respectively, for 3496 events with detectable PB pulse trains, which are considerably higher than the GM value of 30 kA for first strokes in negative lightning [
3]. A histogram of NLDN-reported peak currents for the 3496 events with detectable PB pulse trains is shown in
Figure 4. In this study, we excluded 133 events with multiple PB pulse trains and considered only high-intensity events with the NLDN-reported first-stroke peak currents ≥50 kA, for which the sample size was 3077 and the GM peak current was 122 kA.
4. Characteristics of PB Pulse Trains
We found that 133 (3.8%) of 3496 flashes with detectable PB pulses had multiple PB pulse trains. In order to avoid ambiguity, these 133 events were excluded from the analysis presented in this section. Further, to limit our analysis to high-intensity events, we also excluded 286 (8.2%) events with NLDN-reported peak current <50 kA, so that the sample size here is reduced to 3077 with the total number of bipolar pulses being 43,010. Statistics for the PB-RS interval, PB/RS pulse peak ratio, PB pulse train duration, and bipolar pulse width, as well as for the NLDN-reported current and distance are given in
Table 1.
The AM PB/RS pulse peak ratio of 0.15 in
Table 1 is comparable to the AM values of 0.20 to 0.29 reported in [
8,
12,
17]. The AM PB-RS interval of 8.8 ms in
Table 1 is more than a factor of two smaller than previously reported, this disparity being likely due to the very high AM peak current of 134 kA (GM = 122 kA) in our dataset. The AM and GM values of PB pulse train duration in
Table 1 are somewhat longer than their counterparts, 2.2 and 1.7 ms, respectively, given in [
17]. The AM and GM bipolar pulse widths in
Table 1 are 25 µs and 21µs, respectively, which are consistent with typical widths of 20–40 µs reported by Rakov et al. [
9]. Histograms of the parameters of PB pulses summarized in
Table 1 are shown in
Figure 5,
Figure 6,
Figure 7 and
Figure 8.
Figure 9a shows a scatter plot of PB-RS interval vs. NLDN-reported RS peak current. Clearly, the PB-RS interval tends to decrease with increasing peak current. The best fit curve,
y = 1560 ×
x−1.12 − 0.694, obtained by using the power law function in Matlab is also shown. The Spearman correlation coefficient between the PB-RS interval and the NLDN-reported peak current was determined to be −0.80 with the
p-value less than 0.001; this indicates a strong negative correlation which is statistically significant at the 99.9% confidence level. Note that Spearman correlation coefficient is a non-parametric measure of rank correlation [
22], which is used to assess monotonic relationships (not necessarily linear).
Zhu et al. [
23] reported nine negative flashes with short PB-RS intervals, all of which exhibited high (AM = 131 kA) NLDN-reported peak currents. For five of the nine events, the corresponding Lightning Mapping Array (LMA) data were available and showed that the first LMA source heights ranged from 4.8 to 6 km. Based on these observed flash initiation heights and corresponding PB-RS intervals (stepped leader durations), Zhu et al. [
23] estimated the average 1D stepped-leader speed to be 1.2 × 10
6 m/s, which (although an underestimate) is almost an order of magnitude higher than 2 × 10
5 m/s thought (e.g., [
3], Chapter 4) to be typical for negative stepped leaders. Thus, the strong correlation seen in
Figure 9 suggests that negative flashes with faster stepped leaders tend to have higher first RS peak currents. Jordan et al. [
24] found that the subsequent (dart or dart-stepped) leader speed and return-stroke peak current are positively correlated for both natural and rocket-triggered lightning. It is likely that this trend also holds for first strokes initiated by stepped leaders. Note that the sample size in Zhu et al. [
23] is small and further observations are needed to estimate more reliably the correlation between the first-stroke peak current and the speed of the corresponding stepped leader.
Scatterplot of the peak of the largest PB pulse normalized to 100 km versus NLDN-reported peak current is shown in
Figure 10a. Normalization was performed assuming that the electric field peak in the 50 to 500 km distance range varies as the inverse inclined distance from the elevated PB source to the LOG. The inclined distance was roughly estimated using the NLDN reported horizontal distance and the assumed source height of 6 km. One can see that the maximum normalized PB pulse peak tends to increase with increasing RS peak current. The Spearman correlation coefficient between the maximum normalized PB pulse peak and NLDN-reported peak current is 0.48 and it is statistically significant at the 99.9% confidence level. It is difficult to discern any trend in
Figure 10a where the individual data points are shown. However, there is a clear trend (see
Figure 10b) for the AM normalized PB pulse peak corresponding to individual 50-kA bins to increase with increasing RS peak current.
Scatterplot of the normalized PB pulse peak vs. PB-RS interval is shown in
Figure 11a. The normalized PB pulse peak tends to decrease with increasing PB-RS interval and the corresponding Spearman coefficient is −0.55. It is statistically significant at the 99.9% confidence level. Similar trends were previously observed in Florida [
12], Japan [
25], and China [
26].