1. Introduction
Atlantic salmon (
Salmo salar L.) is found across a wide range, including the northern Atlantic Ocean and the Baltic Sea. In the latter, the most important rivers for reproduction are located in the Gulf of Bothnia [
1]. Salmon displays an anadromous life cycle where mature individuals migrate from the sea up the natal river during spring and summer to spawn in the autumn. The embryos hatch in April–May, and the free embryos remain within the redd until the yolk sac is absorbed approximately one month later [
2,
3]. The salmon, now referred to as fry, seek out nearby areas with slower currents, often close to the shoreline, where they initiate external feeding [
2,
3]. In late summer, the salmon develop into the parr stage and move into nearby faster-flowing habitats where they establish territories [
2,
3]. The parr remain stationary in the river for around three years before undergoing smoltification, which includes several morphological, physiological, and behavioural changes enabling survival at sea [
2,
3,
4]. Thereafter, the smolt rapidly migrate out of the river into the sea in mid-May–early July and continue migrating southward to reach the primary feeding grounds [
2,
5]. After having spent 1–4 years at the feeding grounds, the mature salmon undertake the migration back to the natal river for spawning [
2]. As Atlantic salmon is an iteroparous species, individuals may repeatedly spawn for several years [
6].
The Baltic Sea stock of salmon is considered vulnerable by the Baltic Marine Environment Protection Commissions threat assessment of Baltic Sea species, mainly as a consequence of the loss of spawning grounds due to the damming of rivers for hydropower production and dredging of rapids to benefit log driving [
1]. Fishing in the sea has also been a major factor limiting the stock, but following implementations of stricter regulations in the late 1990s, the negative impact has been limited [
7,
8]. Since the turn of the millennium, there has been a steady increase in the natural smolt production in rivers in the Gulf of Bothnia, going from ~1.3 million smolts produced in the year 2000 to ~2.3 million in 2009 [
9]. By 2016, most of the rivers in the area had actualised the management target set at 75% of the potential smolt production capacity [
10].
River Simojoki is a 175 km long river discharging into the northernmost part of the Gulf of Bothnia, Baltic Sea. The river has remained undammed and unobstructed, but its rapids were extensively dredged for log driving in the 1950s and 1960s [
11]. Restoration projects of the river were initiated in the 1970s and were finalised in the early 2000s [
11]. The mean discharge rate of the river is 38 m
3 s
−1, but as a consequence of ditching by the forestry industry within the 3160 km
2 catchment area, it is highly intra- and interannually variable, changing from a minimum rate of <10 m
3 s
−1 in late winter to a maximum rate of ˃400 m
3 s
−1 during the spring flood [
8,
11,
12,
13]. During summer, salmon ascend River Simojoki up to 110 km from the sea for spawning, with occasional spawning recorded up to Lake Simojärvi, located 175 km upstream [
8,
11,
14,
15]. The smolt production of the local salmon stock has varied considerably throughout the years with an estimated output during 1977–2020 of 23,300 ± 17,700 smolts/year (
± SD), range 1300–67,000 [
7]. In 1984, during a decade of particularly low smolt production at 10,100 ± 6500 smolts/year (
± SD), range 1500–19,000, a supplementary stocking programme of parr and smolts was implemented, continuing over the turn of the century [
7,
8,
16,
17]. Since 2011, only minor stocking for research purposes has occasionally been conducted in River Simojoki [
7]. The highly variable smolt production of the local salmon stock has been hypothesised to be a consequence of the high intra- and interannual variability in the discharge rate [
12]. During spawning seasons with especially high discharge, salmon gain access to plenty of suitable spawning areas along the shoreline. However, as the discharge decreases to its minimum rate in late winter, many of these spawning areas end up above the water level, exposing and killing the embryos [
12]. The low discharge rates in late winter–early spring are also thought to negatively affect the survival of smolt by limiting the size of the available habitats [
12].
The salmon stock in River Simojoki has been the subject of annual surveys since 1972 [
11]. The number of ascending mature fish has been monitored using sonar, the parr densities in the nursery areas have been surveyed by electrofishing, and the smolt production has been estimated by trapping smolts migrating to the sea (e.g., [
11,
12,
15,
16,
17,
18,
19,
20]). Despite previous efforts, it remains unclear to what degree the different rapids contribute to total salmon production. The distance from the rapids to the sea might affect the survival of migrating smolt, especially in the upper reaches of the river where there are several long pool sections and lakes that they have to pass through which, in general, is considered to increase the mortality due to exposure to avian and fish predators [
14,
21]. However, a study releasing externally tagged, hatchery-reared smolts in different parts of the river in 1996–1999 found no significant effect of distance to the sea at the release site on smolt recapture rate in the river mouth, as it varied greatly interannually [
14].
The application of otolith microchemistry as a tool for identifying the spawning areas of fish and to distinguish fish originating from different natal rivers is well established (e.g., [
22,
23,
24,
25,
26,
27,
28]). To a lesser extent, otolith microchemistry has also been used to distinguish fish originating from different spawning areas within the same river system, although here, greater challenges have generally been encountered due to smaller variations in otolith elemental composition between samples (e.g., [
29,
30,
31,
32,
33,
34,
35]). Otoliths are paired calcified structures located in the inner ear of Osteichthyes, constituting a part of the auditory and vestibular system [
36]. The sagitta, the largest of the otoliths, is built up starting from a primordium through biomineralisation of calcium carbonate (~96%) within a protein matrix (~3%) [
37]. The biomineralisation process is considered to be affected by both extrinsic factors, such as ambient ion concentration, oxygen level, and temperature, as well as intrinsic ones, such as ontogeny, sexual maturation, spawning, somatic growth rate, and feeding rate [
38,
39]. During the biomineralisation process, other elements (~1%) are incorporated into the otolith as well, either through the substitution of calcium in calcium carbonate, by trapping in crystal defects, or by binding to the protein matrix [
37]. The incorporation rate of elements not under strict physiological regulation often reflects the availability in the ambient environment [
37,
38]. Thus, the concentration of these elements in the otolith may be analysed to deduce the fish’s environment at specific points in its life [
37,
38]. Especially the concentration of strontium (Sr), which generally is positively correlated with salinity in marine and brackish environments, has often been used to track migration over salinity gradients and to detect anadromy [
37]. An alternative to studying the concentrations of elements in otoliths is to analyse the isotopic ratio of elements, such as the
87Sr/
86Sr ratio [
40]. The strontium isotope
87Sr is formed by the radioactive decay of the rubidium isotope
87Rb with a half-life of approximately 4.9 × 10
10 years. In contrast,
86Sr is non-radiogenic and thus stable over time. As a consequence, the
87Sr/
86Sr ratio varies with the age of minerals, with higher ratios found in older minerals or minerals with initially higher Rb/Sr ratios [
41]. Depending on the geology in the drainage area, rivers entering the Baltic Sea differ in the concentration of Sr and the
87Sr/
86Sr ratio. Generally, the concentration of Sr is lower in the northern rivers compared to the southern rivers (15–50 µg L
−1 vs. 100–500 µg L
−1), and the
87Sr/
86Sr ratio is higher (0.73 vs. 0.71) [
41]. The
87Sr/
86Sr ratio also differs at a finer spatial scale, and many rivers draining into the Gulf of Bothnia can be differentiated based on their isotopic ratio [
42]. The
87Sr/
86Sr ratio incorporated into otoliths is expected to reflect the ratio found in the ambient environment, as no or minimal selection against specific isotopes is assumed to occur during the biomineralisation process [
40]. However, analysing the
87Sr/
86Sr ratio in otoliths to distinguish fish originating from different spawning grounds in the Baltic Sea remains relatively rare, and a Google Scholar search on 12 August 2022 using the keywords otolith,
87Sr,
86Sr, isotope, and Baltic Sea returned only three such studies published [
22,
43,
44].
To statistically analyse the elemental composition of otoliths to determine the spawning ground of origin for fish, methods such as quadratic discriminant analysis and discriminant function analysis (DFA) are predominantly used [
45]. However, machine learning classification methods such as random forest (RF) are showing great promise as they make less restrictive assumptions and often have a higher correct classification rate [
45,
46,
47]. In RF, many classification trees are built based on the random bootstrapping of data. In each classification tree, the bootstrapped data are recursively split into binary groups by testing a set number of dependent variables and selecting the variable most successful in maximising the within-group homogeneity at each split. The process is ended when further splitting does not increase within-group homogeneity. The classification ability of each tree is then tested by classifying individuals not used to build the tree in question, and the combined classification ability of the RF is presented as out-of-bag (OOB) error rates. The RF can be applied to classify unknown individuals by running them through each classification tree, which then votes for a class. In the end, the class that receives the most votes from the forest is selected as the classification. In contrast to DFA, RF places no assumption on the multivariate distribution of the data and is robust against outliers; furthermore, RF is better suited when small sample sizes with many dependent variables are analysed [
46,
47].
The present study aimed to preliminary investigate the feasibility of distinguishing salmon originating from different nursery areas in River Simojoki by analysing the elemental composition of otoliths. The assumption was that there exists a gradient in the elemental composition in the main stem of River Simojoki, caused by inflow from tributary rivers with different elemental composition. This difference in the ambient environment between the nursery areas in the river was hypothesised to be reflected in the otolith elemental composition for parr originating from these areas. Parr were considered to remain in the vicinity of the nursery areas before undergoing smoltification, which would allow the elemental composition of their otoliths to be analysed to establish elemental profiles characteristic of the different areas [
2]. This, in turn, would enable determining the nursery area of origin for smolt caught during the migration to the sea by analysing the elemental composition in the region of their otoliths formed during the first summer and comparing these to the previously established profiles. If the method proved successful, it could provide a valuable tool for tracking the origin of smolt and adult salmon, which would be useful for management and conservation purposes.
2. Materials and Methods
Parr were caught by electrofishing in August 2019 using a transformer (type ELT 60 NGI, Hans Grassl, Schönau am Königssee, Germany) connected to a 1 kW aggregate (Honda, Tokyo, Japan). The sampling was undertaken at three nursery areas in the main stem of River Simojoki, henceforth referred to as the lower, middle, and upper areas. For the upper and the lower areas, the uppermost and the lowermost commonly used spawning areas were selected [
8,
11]. The middle area was selected with the criteria that major tributary rivers should join the main stem of the river between the three sampled areas, increasing the probability that the water chemistry between the areas, and hence the otolith elemental composition for the sampled parr, would differ. From each nursery area, parr were sampled at two rapids, totalling ten parr/area (
Figure 1,
Table A1). The distance to the river mouth for the two lowermost rapids was 7 and 9 km, the two middle rapids 36 and 38 km, and the two uppermost rapids 98 and 105 km. Smolts, for which the nursery areas of origin were unknown, were sampled on three occasions, 24 and 31 May and 10 June 2019, covering the peak migration to the sea occurring in late May–early June in River Simojoki [
5]. The sampling was undertaken using a trap net near the river mouth, and at each sampling date, ten smolts were randomly selected for the study (
Figure 1,
Table A1). The fish were killed by anaesthetic overdose in accordance with Directive 2010/63/EU, Annex V.
The total body length of the fish was measured to the nearest millimetre, and the fish were weighed to the nearest gram (
Table A1). Scales were removed from slightly above the lateral line between the dorsal and adipose fins, and the age of the fish was visually determined by counting annuli in a microfiche reader (
Table A1). The sagittal otoliths were removed from the fish, rinsed in Milli-Q
® water, and left to dry for 24 h in a laminar flow cabinet. Thereafter, the otoliths were placed in silicone moulds, sulcus side facing downward, and embedded in epoxy (EpoFix, Struers, Ballerup, Denmark). Using a grinder–polisher (RotoPol 35, Struers, Ballerup, Denmark), the embedded otoliths were wet-ground on the anti-sulcus side with 800 p sandpaper (Carbimet, Buehler, Lake Bluff, IL, USA) until the edges were exposed. Subsequently, the grinding was conducted in stages with regular visual inspections under a microscope (transmitted light) until the first annuli became visible. At this stage, the otoliths were wet-ground by hand using 2500 p sandpaper (Carbimet, Buehler, Lake Bluff, IL, USA) until the primordium was exposed as determined under a microscope (transmitted light). The final polishing was conducted using a soft polishing disk (MD Nap, Struers, Ballerup, Denmark) and a 1-micron polish suspension (liquid diamond type WX, Kemet, Maidstone, UK) until the surface of the otolith was completely smooth, as inspected under a light microscope using reflective light, and the primordium was clearly visible as inspected using transmitted light. The polished otoliths were rinsed in ethanol, followed by Milli-Q
® water, and wiped with a lens-cleaning tissue. Finally, the prepared otoliths were mounted on glass slides using double-sided tape. Primarily, the right-side otolith was prepared in this manner, but in cases where it was damaged or the preparation was unsuccessful, the left-side otolith was prepared instead.
In situ, Sr isotope analyses of the otoliths were performed at the Geological Survey of Finland (GTK) by means of laser ablation multi-collector inductively coupled plasma mass spectrometry (LA-MC-ICP-MS) using a Nu Plasma High-Resolution MC-ICP-MS (Nu Instruments Ltd., Wrexham, UK) and an Analyte G2 laser microprobe (Photon Machines, San Diego, CA, USA). The MC-ICP-MS was equipped with nine Faraday detectors and amplifiers with 10
11 Ω resistors. The ablation was performed in helium (He) gas (gas flow 0.4 and 0.1 L/min) within a HelEx ablation cell [
50]. All analyses were performed in the static ablation mode using the following parameters: beam diameter 110 μm, pulse frequency 10 Hz, and beam energy density 3.0 J/cm
2. The otoliths were analysed in a spot placed on the dorsal–ventral axis with the innermost edge of the spot touching a 100 µm radius from the primordium. The location was selected to analyse the environmental history of the fish during the first summer, avoiding interference from the pre-hatch history and the maternally derived region found in the primordium [
51,
52]. During the laser ablation, the data were collected for
84Sr-Kr,
85Rb,
86Sr-Kr,
87Rb-Sr, and
88Sr. The measured isotope ratios were corrected for instrumental fractionation using an exponential law and an
86Sr/
88Sr ratio of 0.1194. The isobaric interference of
87Rb on
87Sr was monitored and corrected using the
85Rb ion signal and a ratio of 0.38571 for
87Rb/
85Rb. Preceding each ablation, the isobaric interference of
86Kr on
86Sr was corrected by measuring the background for 30 s. The accuracy of the laser ablation protocol was verified throughout the day of measurement by repeated analysis of an in-house plagioclase standard from a megacryst of the Cameroon volcanic chain ([
53], sample Mir a) and a pressed standard made from the reference material FEBS-1 (certified otolith standard for trace metals, NRCC, Ottawa, ON, Canada) [
54]. As the mean total Sr signal obtained for the plagioclase standard was 3.36 V, an ablation length of 120 s was needed to achieve an internal precision of ≤±0.000020 (2σ). The depth of the ablated spot was ~120 µm. The same laser ablation parameters were used for the standards as for the samples. During the course of the analysis, the mean
87Sr/
86Sr ratio obtained for the plagioclase and otolith standards were 0.703067 ± 0.000093 (2σ,
n = 5) and 0.709154 ± 0.000052 (2σ,
n = 4), respectively, similar to their respective reference solution ratios of 0.703057 ± 0.000014 (2σ) and 0.709176 ± 0.000032 (2σ) [
54]. Throughout the measurements, the mean
84Sr/
86Sr was 0.05653 ± 0.00068 (2σ,
n = 9), which is similar to the accepted ratio of ~0.0565 ± 0.0000 (2σ) [
55].
Laser ablation single-collector (SC) ICP-MS analyses of the otoliths were performed at GTK using a Nu AttoM SC-ICP-MS (Nu Instruments Ltd., Wrexham, UK) and an Analyte 193 ArF laser-ablation system (Photon Machines, San Diego, CA, USA). The ablation was performed in He gas (gas flow 0.4 and 0.1 L/min) within a HelEx ablation cell [
50]. The concentrations of the elements Sr, barium (Ba), lithium (Li), silicon (Si), nickel (Ni), lead (Pb), arsenic (As), and beryllium (Be) were measured at low resolution (∆M/M = 300) using the fast scanning mode with the following parameters: beam diameter 50 μm, pulse frequency 10 Hz, and a pulse energy of 5 mJ at 30% attenuation to produce an energy flux of 2.2 J/cm
2 on the sample surface (
Table A2). As with LA-MC-ICP-MS, the otoliths were analysed in a spot placed on the dorsal–ventral axis with the innermost edge touching a 100 µm radius from the primordium. The ablation length was set to 40 s for signal acquisition, followed by 20 s of background measurement. The depth of the ablated spot was ~40 µm. The analyses were performed using time-resolved analysis with continuous data acquisition for each set of points (3 standards, 12 unknown). The standard NIST612 (certified solid synthetic silicate standard, NIST, Gaithersburg, MD, USA) was used to standardise the elements Be and As, and the standard JCp-1 (certified coral standard, myStandards, Kiel, Germany) to standardise Li, Si, Ni, Sr, Pb, and Ba. The FEBS-1 standard was used for quality control, and
43Ca was used as an internal standard for quantification to increase the precision. Data reduction was handled using the software GLITTER
TM which enables baseline subtraction, integration of the signal over a selected time resolve area, and quantification using known concentrations of external and internal standards [
56]. As ≥10% of the measurements for Li, Pb, and Be were below the limit of quantification (LOQ), these elemental variables were excluded from all further statistical analysis.
The dependent variables total length, weight, and age were statistically analysed to determine whether these differed for parr sampled at the three nursery areas. The variables were checked for normal distribution using Shapiro–Wilk’s test (alpha level 0.05) and homogeneity of variance using Levene’s test (alpha level 0.05). The tests returned non-significant results except for the variables total length, which deviated slightly from the assumption of homogeneity of variance (p = 0.04) but was considered close enough for further parametric statistical analysis, and age, which did not conform to a normal distribution (p ≤ 0.05) and was subsequently analysed using a Kruskal–Wallis H test (alpha level 0.05). There was also an outlier for the variable weight, but as it was non-extreme; positioned only at 1.75 interquartile ranges (IQR) from the nearest quartile, it was included in the subsequent parametric statistics. The variables total length and weight were checked for multivariate outliers using Mahalanobis’ distance producing a maximum value of 10.00, which was less than the critical value of 13.82 for two dependent variables and no significant multivariate outliers were detected (p ≥ 0.001). The dependent variables were tested for multicollinearity using linear regression, revealing no strong collinearity (variance inflation factor = 6.6). Additionally, the Box’s M test (alpha level 0.001) produced an M statistic of 20.8 which was non-significant (p = 0.005), indicating the covariance matrices were homogeneous. A one-way MANOVA (alpha level 0.05) was conducted on the dependent variables total length and weight to determine whether there were differences for parr sampled at the three nursery areas. As not all assumptions for the MANOVA were strictly met, Pillai’s Trace was used to interpret the results.
The otolith elemental composition for parr sampled in River Simojoki was statistically analysed to determine whether there were differences between the nursery areas. The elemental variables 87Sr/86Sr, Sr, Ba, Si, Ni, and As were checked for normal distribution using Shapiro–Wilk’s test (alpha level 0.05) and homogeneity of variance using Levene’s test (alpha level 0.05). Ni and As failed to meet the assumption of normal distribution and were analysed using Kruskal–Wallis H tests (alpha level 0.025) with Bonferroni corrected Dunn’s post hoc tests (alpha level 0.05). Ba deviated slightly from the assumption of homogeneity of variance (p = 0.03) but was deemed close enough to be included in further parametric statistical analysis. For Si, there was an outlier in the sample from the upper nursery area, but as it was non-extreme; positioned only at 1.66 IQR from the nearest quartile, it was included in the subsequent parametric statistics. The data were checked for multivariate outliers using Mahalanobis’ distance, producing a maximum value of 11.01, which was less than the critical value of 18.47 for four dependent variables and no significant multivariate outliers were detected (p ≥ 0.001). The assumption of no multicollinearity of the dependent variables was tested using linear regression, revealing no strong collinearity (Variance Inflation Factor ≤ 5). Additionally, the Box’s M test (alpha level 0.001) produced an M statistic of 28.1 which was non-significant (p = 0.35), indicating the covariance matrices were homogeneous. A one-way MANOVA (alpha level 0.05) was conducted on the variables 87Sr/86Sr, Sr, Ba, and Si. As not all assumptions for the MANOVA were met, Pillai’s Trace was used to interpret the results. The MANOVA was followed up by multiple one-way ANOVAs (alpha level 0.05) to determine whether there were differences in single elemental variables between the sampled areas. For the elemental variables for which the ANOVAs returned a significant result, Tukey HSD tests (alpha level 0.05) were applied to pinpoint the difference.
A Mantel test was conducted to study how the geographic distance between the six sampled rapids related to the difference in otolith elemental composition for the corresponding parr samples. For the test, the elemental variables 87Sr/86Sr, Sr, Ba, Si, Ni, and As were used. The geometric centres of the otolith elemental variables, or centroids, for the six samples were determined and the Euclidean distances between these were calculated.
DFA was applied on parr caught at the three nursery areas (n = 10/area) based on a combination of the elemental variables 87Sr/86Sr, Sr, Ba, and Si. To measure the rate of correct classification for the DFA, leave-one-out cross-validation was used. The DFA was then applied to classify the smolts as originating from either the lower, middle, or upper nursery areas in River Simojoki.
In addition to the DFA, RF using the R package randomForest was trained on the parr using the elemental variables
87Sr/
86Sr, Sr, Ba, Si, Ni, and As [
46,
57]. The number of variables tested at each split (mtry) was set to the square root of the number of elemental variables. The number of trees used in the forest (ntree) was optimised by plotting the OOB error rates and selecting an ntree after which it stabilised. The RF was applied with the optimised mtry and ntree and the importance of each elemental variable was ranked according to the Mean Decrease in Gini. To further optimise the combination of elemental variables used in the forest, the variable least important for the classification was omitted, and the RF was rerun as previously described. The process above was iterated until only two variables were left, resulting in five random forest configurations: RF1–RF5 (
Table 1). The second and third iterations, RF3 and RF4, obtained the lowest OOB error rates at 36.7%. To classify the smolts as originating from either of the three sampled nursery areas, RF3 based on the elemental variables
87Sr/
86Sr, Sr, Ba, and As was used.
3. Results
The MANOVA detected no significant difference on the linear combination of the variables total length and weight between parr sampled at the three nursery areas (F
4, 54 = 1.283,
p = 0.29, Pillai’s Trace = 0.174, partial η
2 = 0.087,
Figure 2). Similarly, the Kruskal–Wallis H test for the variable age was non-significant (H
2 = 0.405,
p = 0.82,
Figure 2).
The results from the elemental analysis of parr and smolt otoliths using single- and multi-collector LA-ICP-MS are presented in
Figure 3 and
Table A2. The MANOVA on the linear combination of the otolith elemental variables
87Sr/
86Sr, Sr, Ba, and Si detected a statistically significant difference between parr sampled at the three nursery areas (F
8, 50 = 4.293,
p < 0.001, Pillai’s Trace = 0.814, partial η
2 = 0.407). The follow-up ANOVAs detected significant differences in the variables
87Sr/
86Sr (F
2, 27 = 16.104,
p < 0.001, η
2 = 0.544), Sr (F
2, 27 = 13.139,
p < 0.001, η
2 = 0.493), and Ba (F
2, 27 = 16.383,
p < 0.001, η
2 = 0.548), but not for Si (F
2, 27 = 2.831,
p = 0.77, η
2 = 0.173). The post hoc Tukey HSD tests detected a significantly higher (
p < 0.001)
87Sr/
86Sr ratio for parr sampled at the upper area (
± SD, 0.735300 ± 0.002513) compared to the middle (0.730939 ± 0.002485) and the lower (0.729649 ± 0.001960) areas. No significant difference was detected between the middle and the lower area samples (
p = 0.44). The same pattern was repeated for Sr and Ba, where the concentrations were significantly higher (
p < 0.001) for the upper area sample (
± SD, Sr: 1115 ± 142 µg/g, Ba: 38.9 ± 7.58 µg/g) compared to the middle (Sr: 805 ± 155 µg/g, Ba: 24.9 ± 3.76 µg/g) and lower area samples (Sr: 775 ± 191 µg/g, Ba: 24.4 ± 7.27 µg/g). Again, no significant differences were detected for Sr and Ba between parr sampled at the middle and the lower areas (
p = 0.91 and
p = 0.98, respectively). The Kruskal–Wallis H tests for the elements which did not meet the assumptions of normal distribution detected significant differences between the parr samples for As (H
2 = 10.68,
p = 0.01) but not for Ni (H
2 = 0.53,
p = 0.76). The Dunn’s post hoc test for As detected a significant difference between the samples from the upper area (
± SD, 3.70 ± 0.315 µg/g) and the middle area (3.26 ± 0.448 µg/g) (
p = 0.05), and between the upper and the lower area (3.07 ± 0.457 µg/g) (
p = 0.01). Again, no significant difference was detected between parr sampled at the middle and the lower nursery areas (
p = 1.00).
The Mantel test revealed a strong positive correlation between the Euclidean distance between otolith elemental centroids for parr sampled at the six rapids (
Table 2) and the geographic distance between the rapids (r
13 = 0.79,
p = 0.05, 999 runs,
Figure 4).
The two discriminant functions created for the DFA were statistically significant together (Λ = 0.22, X
28, 30 = 38.37,
p < 0.001), where discriminant function 1 had a much higher predictive ability than discriminant function 2 (
Table 3). For discriminant function 1,
87Sr/
86Sr and Sr had the highest loading, while Si and
87Sr/
86Sr had the highest loading for discriminant function 2 (
Table 3). The discriminant functions correctly classified the parr to the nursery area they were sampled from in 53.3% of cases. The highest correct classification rate was for parr sampled at the upper area (90% correctly classified), followed by parr sampled at the middle (40%) and the lower areas (30%) (
Table 4).
The classification of the smolts using DFA indicated that the sample overwhelmingly was made up of fish originating from the middle nursery area (80%,
n = 24), while the lower and the upper areas contributed an equally small proportion (10%,
n = 3) (
Figure 5,
Table 5). Not surprisingly, considering the marked difference in the correct classification rates for the parr samples from the different areas, the smolts classified as originating from the upper area had the highest probability for the classification (
± SD, 0.94 ± 0.07), followed by the smolts classified as originating from the middle (0.71 ± 0.12) and the lower areas (0.63 ± 0.16). Smolts classified as originating from the middle and lower areas had a next to zero probability of being classified as originating from the upper area (
± SD, 0.01 ± 0.02, and 0.00 ± 0.00, respectively), but a quite high probability of being classified as each other (0.28 ± 0.13 and 0.37 ± 0.16, respectively). The probability of smolts classified as originating from the upper area being classified as originating from the middle or the lower areas was very small (
± SD, 0.05 ± 0.06 and 0.01 ± 0.01, respectively).
RF3 correctly classified the parr to the nursery area they were sampled from in 63.3% of cases. Similar to the DFA, the highest correct classification rate for RF3 was for parr sampled at the upper nursery area (100% correctly classified), followed by parr sampled at the middle (50%) and the lower areas (40%) (
Table 4). Unlike the DFA, RF3 classified the largest proportion of the smolt sample as originating from the lower area (53.3%,
n = 16), followed by the middle area (36.7%,
n = 11), and last, the upper area (10%,
n = 3) (
Table 5). The probability for the classifications was also overall lower for RF3 compared to DFA, with the highest probability for smolts classified as originating from the lower area (
± SD, 0.69 ± 0.13), followed by smolts classified as originating from the middle (0.67 ± 0.13) and the upper areas (0.65 ± 0.22). Again, smolts classified as originating from the middle and lower areas had a very low probability of being classified as originating from the upper area (
± SD, 0.09 ± 0.09, and 0.01 ± 0.03, respectively) but quite a high probability of being classified as each other (0.24 ± 0.12 and 0.30 ± 0.12, respectively). The probability of smolts classified as originating from the upper area being classified as originating from the middle or the lower areas was roughly equal (
± SD, 0.20 ± 0.10, and 0.15 ± 0.11, respectively).
Applying DFA and RF3 to classify the smolt sample yielded quite different results, with an overall classification agreement of 57% between the two methods (
Table 5). As expected, based on the overall low correct classification rate for parr sampled at the middle and the lower nursery areas, the disagreement between the methods was found in classifying smolts as originating from these areas. Excluding smolts classified as originating from the upper area where the two methods were in complete agreement, the methods only agreed in 47% of the cases.
4. Discussion
Significant differences in the otolith elemental composition for parr sampled at the three nursery areas were found in the concentrations of Sr, Ba, and As, and in the
87Sr/
86Sr ratio, which all were significantly higher for parr sampled at the upper area compared to the middle and lower areas. No significant difference could be demonstrated between the parr sampled at the middle and the lower areas for any elemental variable. Support for the assumption that there exists a gradient in the elemental composition in the main stem of River Simojoki, which is reflected in the elemental composition of otoliths, was found in the strong correlation of the geographic distance between the six sampled rapids to the distance between centroids for otolith elemental composition for the corresponding parr samples (
Figure 4). Interestingly, comparing the results for Ba, As, and Sr to geological maps compiled for the drainage area of River Simojoki, an unexpected pattern is revealed [
58]. According to water samples from the drainage area, the concentration of Ba and As is lower in the area associated with the upper stretches of the river compared to the lower. These elements would therefore be expected to form a gradient in the river with increasing concentrations downstream, which would be reflected in the otolith elemental composition of parr sampled at the nursery areas, in other words, opposite to the pattern demonstrated in the present study. The geological maps also reveal only minor differences in concentrations of Sr in the drainage area, which would be expected to be reflected by a close to uniform concentration in the river and, consequently, in the otoliths of the sampled parr. Again, this runs contrary to results in this study, where the concentration of Sr was significantly higher in otoliths from parr sampled at the upper nursery area compared to the middle and lower areas. A final unexpected pattern was found for the element Ni for which no significant difference in the otolith concentration of parr sampled at the three nursery areas could be detected. Again consulting the geological maps, there were higher concentrations of Ni in water samples from the drainage area associated with the lower stretches of the river compared to the upper stretches. All things being equal, this pattern would have been expected to be reflected in the elemental composition of the parr otoliths. However, it is here important to note that the elemental uptake of otoliths is affected by the elemental composition of the ambient environment as a whole and, as such, is not always predictable by the availability of single elements [
37].
DFA and RF3 were moderately successful in correctly classifying parr to the nursery areas from which they were sampled, with 53.3% and 63.3% correctly classified, respectively. This moderate correct classification rate still represents a marked improvement compared to random classification, which would only correctly classify the parr in 33.3% of cases. Not surprisingly, considering the significantly higher concentrations of Sr, Ba, and As and the higher
87Sr/
86Sr ratio in otoliths of parr sampled at the upper area, DFA and RF3 were much better at classifying parr sampled from this area (90% and 100% correctly classified, respectively) compared to parr sampled at the other areas (30–50% correctly classified). The similarity in the otolith elemental composition for parr sampled at the middle and lower areas, leading to an overall low correct classification rate, could be interpreted as indicating the intermixing of parr between the two areas. Reliable classification requires that parr remain relatively stationary in the river and obtain an elemental composition of the otolith characteristic of the specific nursery areas before undergoing smoltification. In a study of the neighbouring River Tornionjoki, stable isotopes were successfully utilised to track smolts to their spawning grounds as no large movements of parr within the river were observed [
4]. This might, however, not be entirely the case for River Simojoki as a small number of one-year-old parr are caught each spring during the annual sampling of smolt migrating to the sea [E. Jokikokko 2022, pers. comm.]. As the smolt sampling is undertaken in a pool section not suitable as a nursery for parr, and the nearest nursery areas upstream and downstream are found at a distance of ~600 m and ~300 m, respectively, it is likely that these parr are caught when relocating in the river [E. Jokikokko 2022, pers. comm.]. If parr does relocate within River Simojoki before smoltification, this will lead to the intermixing of parr originating from different nursery areas, making the otolith elemental profiles based on these parr less specific for the nursery area they were sampled from. Displacement during the parr stage would be expected to generally follow the direction of the current, gradually increasing the intermixing of the samples downstream. This could explain the higher correct classification rate for parr sampled at the upper nursery area compared to the middle and lower areas, as this sample would be the least intermixed. However, if the displacement of parr occurred frequently, the samples would have been expected to show a skew towards older and larger individuals downstream. The similarity in total length, weight, and age for parr sampled at the three nursery areas could therefore indicate that if parr did relocate before undergoing smoltification, at least it was not a prevalent phenomenon. Furthermore, as this study sampled parr at nursery areas in the river with a relatively large geographic separation, any weakening of the area specificity of the otolith elemental profiles as a consequence of intermixing of parr from nearby nursery areas would be expected to be overshadowed by the difference in the ambient environments between the sampled areas due to the large geographic distance between them. The issue of whether the similarity in the elemental composition for parr sampled at the middle and lower areas was a consequence of similarities in the ambient environments or intermixing, for now, has to remain unsettled as no water samples were gathered from the nursery areas in this study.
Determining the origin of the smolt sample using DFA and RF3 yielded quite discrepant results with an overall low classification agreement. Not surprisingly, considering the low correct classification rate for parr sampled at the lower and middle areas, the disagreement was found in classifying smolts as originating from either one of these areas. This low agreement of classification is likely a consequence of the similar otolith elemental composition for parr sampled at these nursery areas, increasing the randomness in the classification. Additionally, it is important to note that the DFA and RF3 were only trained on parr sampled at three different nursery areas, which only partially covers all the nursery areas found in the river. Therefore, smolts—which, as parr grew up at unsampled nursery areas— are likely to occur in the sample, adding error to the classification. As a consequence of the limitations mentioned above, no broader conclusions regarding the relative contribution from different nursery areas to overall smolt production could be made based on these results.
RF has been considered a less constrained and generally more powerful method for classification than traditional statistical methods such as DFA [
45]. This viewpoint is supported by the present study, where the optimised RF3 had a 10% higher correct classification rate than the DFA. The study also supports the idea that when it comes to variable selection for RF, “the more” is in fact not always “the merrier” [
45]. When performing the first RF with six elemental variables (
87Sr/
86Sr, Sr, Ba, Si, Ni, and As), a correct classification rate of 56.7% was achieved, but after subsequent removal of the least informative variables, the rate was increased to 63.3% using only four variables (
87Sr/
86Sr, Sr, Ba, and As) (
Table 1). This goes to show that not all elemental variables contribute equally to classification success, and some variables might actually contribute more noise than signal [
45]. Furthermore, the analysis also loses power as variables are added, i.e., the more variables, the more observations are needed.