Improving the Utilization of STRmix™ Variance Parameters as Semi-Quantitative Profile Modeling Metrics

Duke, Kyle; Myers, Steven; Cuenca, Daniela; Wallin, Jeanette

doi:10.3390/genes14010102



California Department of Justice, Richmond, CA 94804, USA


Genes 2023, 14(1), 102; https://doi.org/10.3390/genes14010102

Submission received: 11 November 2022 / Revised: 17 December 2022 / Accepted: 24 December 2022 / Published: 29 December 2022

(This article belongs to the Special Issue Forensic DNA Mixture Interpretation and Probabilistic Genotyping)


Abstract

Prior distributions of the STRmix variance parameter values are developed during the validation process. Comparisons of these prior distributions to the run-specific average are one measure used by analysts to assess the reliability of a STRmix deconvolution. This study examined the behavior of three different STRmix variance parameters under standard amplification and interpretation conditions, as well as under a variety of challenging conditions, with the goal of making comparisons to the prior distributions more practical and meaningful. Using information found in STRmix v2.8 Interpretation Reports, we plotted the log₁₀ of each variance parameter against the log₁₀ of the template amount of the highest-level contributor (Tc) for a large set of mixture data amplified under standard conditions. We observed nonlinear trends in these plots, which we regressed to fourth-order polynomials, and used the regression data to establish typical ranges for the variance parameters over the Tc range. We then compared the typical variance parameter ranges to log₁₀(variance parameter) v log₁₀(Tc) plots for mixtures amplified and interpreted under a variety of challenging conditions. We observed several distinct patterns to variance parameter shifts in the challenged data interpretations in comparison to the unchallenged data interpretations, as well as distinct shifts in the unchallenged variance parameters away from their prior gamma distribution modes over specific ranges of Tc. These findings suggest that employing empirically determined working ranges for variance parameters may be an improved means of detecting whether aberrations in the interpretation were meaningful enough to trigger greater scrutiny of the electropherogram and genotype interpretation.

Keywords: STRmix; variance; diagnostic; profile modeling; probabilistic genotyping

1. Introduction

The forensic genetics community currently relies on PCR-amplified short tandem repeat (STR) DNA profiles to assess the source of biological evidence associated with criminal activity. Such biological material is often presented as a mixture of DNA from different individuals, necessitating deconvolution of the STR electropherogram results [1]. An integral part of forensic DNA mixture deconvolution is modeling STR peak amplitude, which varies with DNA input [2,3]. Binary DNA interpretation procedures often address this peak height variation through the application of threshold-based heuristics that establish lower bounds for per-contributor intra-locus peak height balance [4,5]. In contrast, continuous probabilistic genotyping approaches to interpretation rely on probability distributions to represent variance expectations [6,7].

In the case of STRmix™ Probabilistic Genotyping Software, which applies a continuous profile model to forensic DNA mixture data, peak height variation is integrated into STR profile analysis through the application of dynamic variance parameters that are inversely proportional to peak height [8,9]. Higher peak heights are accompanied by smaller variance parameters, leading to less allowance for differences between observed peak heights and those modeled by the software. More specifically, the spread of the lognormal distribution of [observed RFU height]/[modeled RFU height] increases with an increase in a variance parameter and decreases with an increase in peak height, as represented mathematically below:

\log_{10}\left(\frac{\text{Observed RFU}}{\text{Modeled RFU}}\right) \sim N\left(0,\ \frac{\text{Variance parameter}}{\text{Relevant peak height}}\right) \qquad (1)

If the peak being modeled is an allele, the relevant peak height is always the height of the allele itself modeled by STRmix v2.8 (i.e., the expected height). Alternatively, if the modeled peak is stutter, the relevant peak height is set by the software user and can be either the expected height of the stutter peak itself or the observed height of the allele giving rise to the stutter. Given a hypothesized set of genotypes, a particular peak may also be composed of signal from both allele and stutter peaks, and in these cases, a shifted lognormal model that combines the relevant allele and stutter lognormals together is utilized [9].
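As a numerical illustration of Equation (1), the following minimal Python sketch computes the standard deviation of log₁₀(observed/modeled) for a given variance parameter and relevant peak height. It assumes that the second argument of N(·,·) is a variance, so that the standard deviation is its square root. The function name and the example peak heights are hypothetical, and the value 3.270 is simply the allele prior mode listed in Table 10.

```python
import math


def log_ratio_sd(variance_parameter: float, relevant_peak_height: float) -> float:
    """SD of log10(observed RFU / modeled RFU) under Equation (1).

    Assumption: N(0, v) is parameterized by its variance v, so SD = sqrt(v).
    """
    if variance_parameter <= 0 or relevant_peak_height <= 0:
        raise ValueError("both inputs must be positive")
    return math.sqrt(variance_parameter / relevant_peak_height)


if __name__ == "__main__":
    # Hypothetical peak heights; a taller peak gives a tighter distribution.
    for height in (100, 1000, 10000):
        print(f"{height:>6} RFU: SD = {log_ratio_sd(3.270, height):.4f}")
```

Under this reading, a tenfold increase in the relevant peak height narrows the standard deviation by a factor of √10, which is the sense in which taller peaks leave less allowance for deviation from the modeled height.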

In the preliminary stages of STRmix v2.8 implementation, a large training set of single-source profiles with known types is utilized to construct prior gamma distributions of the variance parameters (see Figure 1). These prior gamma distributions, created by the Model Maker module in STRmix, are reproduced on every STRmix Interpretation Report, along with the average variance parameters for the interpretation. The creators of STRmix define the genotype probability distributions (GPDs), mixture proportions, and per-locus likelihood ratios (LRs) as primary diagnostics of an interpretation, whereas the average variance parameters are among the secondary STRmix diagnostics that have less well-defined acceptable ranges but provide information about profile modeling efficacy [10]. Comparison of the average variance parameter for a particular interpretation to the mode of the prior gamma distribution provides a point of reference as to whether greater allowance for variation was required to model the data than was needed for the training set.

However, the comparison of average variance parameters to the prior probability distributions appears to have the same need for threshold calculations that is so central to binary interpretation paradigms. Namely, in order to utilize the average variance parameter values as semi-quantitative diagnostics, it would be important to estimate at what values the average allele and/or stutter variance parameters are far enough away from their prior distribution modes that the STRmix output deserves closer inspection.

Here, we present a systematic examination of STRmix v2.8 allele, reverse stutter (−1 STR repeat), and forward stutter (+1 STR repeat) variance parameters. The goals were to characterize any apparent trends in the variance parameters across a variety of single-source and mixture data, as well as to develop a typical range of parameter values by comparison under standard (i.e., pristine template DNA) and challenging amplification and interpretation conditions. It should be noted that, while both the lower and upper range of variance parameter values are relevant metrics of system performance, STRmix v2.8 has a default lower bound for variance parameters of (0.5) × (prior distribution mode), so we focused exclusively on the upper range in this work.

This study aims to increase the usefulness of Markov chain Monte Carlo (MCMC) summary diagnostics for the forensic DNA community utilizing probabilistic genotyping software. Knowing more about how STRmix variance parameters behave across a large mixture dataset supports empirically grounded, reliable assessments of data for forensic case interpretation and addresses related concerns about factor space and likelihood ratio “trustworthiness” outlined in the recent NISTIR 8351-DRAFT [11].

2. Materials and Methods

2.1. Construction, Amplification, Capillary Electrophoresis, and Analysis of Unchallenged DNA Samples

Buccal cell DNA was collected with informed consent from healthy, unrelated laboratory volunteers and extracted using the PrepFiler™ DNA Extraction Kit (Life Technologies, South San Francisco, CA, USA). Samples were quantified for extraction yield using Applied Biosystems™ Quantifiler™ Trio DNA Quantification kit, amplified using Applied Biosystems™ GlobalFiler™ PCR Amplification kit with the manufacturer’s 28-cycle thermal cycling protocol and capillary electrophoresed (CE) on an Applied Biosystems™ 3500 Genetic Analyzer (1.2 kV injection for 15 s) using POP-6™ Polymer (Life Technologies). Raw data from all amplified samples were subsequently analyzed in GeneMapper ID-X v1.6 Software (Life Technologies) at channel-specific analytical thresholds of 51 RFU (blue), 71 RFU (green), 35 RFU (yellow), 41 RFU (red), and 61 RFU (purple) and analyzed with STRmix v2.8 Probabilistic Genotyping Software (NicheVision, Akron, OH, USA) using laboratory-validated settings, which included stutter models for −1/+1, −2/+2, −0.5/+0.5, and −1.5 STR repeats. The −1 repeat stutter variance was set to be inversely proportional to the observed height of the parent allele; all other stutter variances were set to be inversely proportional to the expected stutter peak height (personal communication with STRmix support staff, 1 September 2020). MCMC accepts per each of the 8 chains were set to 10,000 burn-in/50,000 post burn-in for single-source, 2-person mixtures and 3-person mixtures, and to 200,000 burn-in/1,000,000 post burn-in for 4-person mixtures [12]. The Gelman-Rubin autocontinue option was activated for all runs, and was set to add 10,000 post burn-in accepts to any MCMC initially producing a Gelman-Rubin convergence diagnostic in excess of 1.2. See Supplementary Table S1 for the relevant STRmix interpretation data.

The unchallenged data consisted of single-source samples and 2-, 3-, and 4-person mixtures. Single-source samples (Table 1) included 26 amplifications from 16 individuals, ranging from 8 ng to 63 pg input template DNA for PCR. Unchallenged mixtures at a variety of DNA input amounts and mixture ratios (Table 2) were constructed by combining the indicated amounts of template. See Supplementary Table S2 for anonymized contributor genotypes.

2.2. Construction, Amplification, CE, and Analysis of Challenged DNA Samples

Data from mixtures challenged by inhibition, degradation, signal saturation, and underestimated number of contributors (NOC) were included for comparison to the unchallenged data. These challenged samples were processed as described in Section 2.1, with any exceptions noted below.

Inhibited mixtures were constructed by combining the mixture components as indicated in Table 3 and spiking the subsequent 28-cycle GlobalFiler amplification with the amount of inhibitor (either hematin or humic acid) necessary to achieve the concentration indicated in the table.

To prepare degraded mixtures, degraded single-source DNA was first produced by subjecting buccal samples on BODE buccal collectors to a dry heat bath set to 90 °C for the amounts of time indicated in Table 4. After extraction and quantification of the degraded single-source mixture components, the extracts for each component were amplified, electrophoresed, and paired to each other such that the paired components were degraded to a similar extent, as indicated by an exponential fitting of peak heights. The paired components were then combined in the ratios indicated in Table 5. Note that the inhibited and degraded data sets have the same mixture donors, although this set of donors is different from the donor sets used for the unchallenged data set.

Signal-saturated mixtures were produced by combining the extracts from the unchallenged dataset in the ratios and amounts indicated in Table 6.

A subset of the unchallenged mixtures in Table 2 were re-analyzed with STRmix set to one or two contributors fewer than the true number, as indicated in Table 7. These mixtures were qualitatively categorized by an experienced DNA analyst as ambiguous in contributor number due to intralocus peak imbalances and/or sub-threshold peak information.

An additional set of challenged mixtures was constructed by combining commercial preparations of cell line DNA in the proportions indicated in Table 8. The cell line DNA preparations used to generate this data were HL60 (NIST, Gaithersburg, MD, USA), CEPH 1347-02 (Thermo Fisher Scientific, Waltham, MA, USA), and 2800 M (Verogen, San Diego, CA, USA). This set of mixtures was intended for inclusion in the unchallenged data set, but unintuitive results were observed upon interpretation of the mixtures that indicated they did not fit the STRmix profile model.

Table 1. Unchallenged single-source samples. Upon examination of the CE data, a signal-saturated peak was observed in one of the 8 ng replicate amplifications; for purposes of plotting, data for this amplification was grouped with the saturated mixtures in Table 6.

Study	Number of Samples	Input Amounts	Replicate Amplifications	Replicate STRmix Interpretations
Single-source, nominal-input	14	2 ng	1	10
Single-source dilution series (higher level)	2	8 ng, 4 ng, 2 ng, 1 ng, 500 pg, 250 pg	2	1
Single-source dilution series (lower level)	2	125 pg, 63 pg	4	1

Table 2. Composition of unchallenged mixtures. The input amounts listed are for total DNA. Note that, to include STRmix run-to-run variation in the dataset, ten replicate interpretations were performed for each mixture, at each template level.

Donor Number (Donor Set)	Mixture Ratio	Input Amounts	Replicate Amplifications
2-person (set 1)	9:1	2 ng, 1 ng, 870 pg, 750 pg, 500 pg, 380 pg, 250 pg, 125 pg, 63 pg	2
2-person (set 1)	49:1	2.5 ng, 1.9 ng, 1.25 ng, 625 pg, 313 pg	2
2-person (set 1)	99:1	2.5 ng, 1.25 ng, 625 pg	2
2-person (set 2)	1:1	800 pg, 400 pg, 200 pg, 100 pg, 50 pg, 25 pg	1
2-person (set 2)	3:1	800 pg, 400 pg, 348 pg, 300 pg, 200 pg, 152 pg, 100 pg, 50 pg, 25 pg	1
3-person (set 1)	3:2:1	1.2 ng, 600 pg, 522 pg, 450 pg, 300 pg, 228 pg, 150 pg, 75 pg, 38 pg	2
3-person (set 1)	10:5:1	3.2 ng, 1.6 ng, 1.4 ng, 1.2 ng, 800 pg, 608 pg, 400 pg, 200 pg, 100 pg	2
3-person (set 1)	100:100:4	1.28 ng, 625 pg, 325 pg	2
3-person (set 2)	1:1:1	1.2 ng, 600 pg, 300 pg, 150 pg, 75 pg, 38 pg	1
3-person (set 2)	3:2:1	1.2 ng, 522 pg, 300 pg, 150 pg, 38 pg	2
3-person (set 2)	10:5:1	3.2 ng, 1.4 ng, 800 pg, 400 pg, 100 pg	2
3-person (set 2)	100:100:4	1.28 ng, 638 pg, 319 pg	2
4-person (set 1)	4:3:2:1	2 ng, 1 ng, 870 pg, 750 pg, 500 pg, 380 pg, 250 pg, 125 pg, 63 pg	2
4-person (set 1)	10:5:2:1	3.6 ng, 1.8 ng, 1.6 ng, 1.4 ng, 900 pg, 684 pg, 450 pg, 225 pg, 113 pg	2
4-person (set 1)	100:100:100:6	1.28 ng, 625 pg, 325 pg	2
4-person (set 2)	1:1:1:1	1.6 ng, 800 pg, 400 pg, 200 pg, 100 pg, 50 pg	1
4-person (set 2)	4:3:2:1	2 ng, 870 pg, 500 pg, 250 pg, 63 pg	2
4-person (set 2)	10:5:2:1	3.6 ng, 1.6 ng, 900 pg, 450 pg, 113 pg	2
4-person (set 2)	100:100:100:6	1.28 ng, 638 pg, 319 pg	2

Table 3. Sample composition of inhibited mixtures. All inhibited mixtures were amplified and interpreted once at a total DNA input amount of 3 ng.

Donor Number	Mixture Ratio	Treatment
2-person	3:1	Hematin: 400 µM, 475 µM, 550 µM, 625 µM, 700 µM
	10:1	Hematin: 400 µM, 475 µM, 550 µM, 625 µM, 700 µM
	3:1	Humic acid: 200 ng/µL, 250 ng/µL, 300 ng/µL, 350 ng/µL, 400 ng/µL
	10:1	Humic acid: 200 ng/µL, 250 ng/µL, 300 ng/µL, 350 ng/µL, 400 ng/µL
3-person	3:2:1	Hematin: 400 µM, 475 µM, 550 µM, 625 µM, 700 µM
	10:5:1	Hematin: 400 µM, 475 µM, 550 µM, 625 µM, 700 µM
	3:2:1	Humic acid: 200 ng/µL, 250 ng/µL, 300 ng/µL, 350 ng/µL, 400 ng/µL
	10:5:1	Humic acid: 200 ng/µL, 250 ng/µL, 300 ng/µL, 350 ng/µL, 400 ng/µL
4-person	4:3:2:1	Hematin: 400 µM, 475 µM, 550 µM, 625 µM, 700 µM
	10:5:2:1	Hematin: 400 µM, 475 µM, 550 µM, 625 µM, 700 µM
	4:3:2:1	Humic acid: 200 ng/µL, 250 ng/µL, 300 ng/µL, 350 ng/µL, 400 ng/µL
	10:5:2:1	Humic acid: 200 ng/µL, 250 ng/µL, 300 ng/µL, 350 ng/µL, 400 ng/µL

Table 4. Dry heat exposure times for degraded mixture components.

Dry Heat Treatment Number	Dry Heat Exposure Time
1	5.75 h
2	12.13 h
3	19.42 h
4	27.73 h
5	37.32 h
6	48.50 h
7	61.70 h
8	77.52 h
9	96.83 h
10	120.93 h
11	151.85 h
12	192.97 h
13	250.32 h
14	335.88 h

Table 5. Sample composition of degraded samples. Dry heat treatment numbers refer to those in Table 4 and, for mixtures, are listed in order according to which treated components were paired together. All degraded single-source samples were amplified at a DNA input amount of 2 ng, and all degraded mixtures were amplified at a total DNA input of 8 ng.

Donor Number	Mixture Ratio	C1 Dry Heat Treatments	C2 Dry Heat Treatments	C3 Dry Heat Treatments	C4 Dry Heat Treatments
Single source #1	-	1,3,4,5,6,7,9,13,10	-	-	-
Single source #2	-	1,2,3,4,5,6,8,10,14	-	-	-
Single source #3	-	1,2,4,5,6,9,10,11,12	-	-	-
Single source #4	-	1,2,4,5,8,9,11,12,13	-	-	-
2-person	3:1	1,3,4,5,6,7,9,13,10	1,2,3,4,5,6,8,10,14	-	-
2-person	10:1	1,3,4,5,6,7,9,13,10	1,2,3,4,5,6,8,10,14	-	-
3-person	3:2:1	1,3,4,5,6,7,9,13,10	1,2,3,4,5,6,8,10,14	1,2,4,5,6,9,10,11,12	-
3-person	10:5:1	1,3,4,5,6,7,9,13,10	1,2,3,4,5,6,8,10,14	1,2,4,5,6,9,10,11,12	-
4-person	4:3:2:1	1,3,4,5,6,7,9,13,10	1,2,3,4,5,6,8,10,14	1,2,4,5,6,9,10,11,12	1,2,4,5,8,9,11,12,13
4-person	10:5:2:1	1,3,4,5,6,7,9,13,10	1,2,3,4,5,6,8,10,14	1,2,4,5,6,9,10,11,12	1,2,4,5,8,9,11,12,13

Table 6. Sample composition of signal-saturated mixtures. Note that the sets of donors used for these amplifications are the same as the donor sets in Table 2.

Donor Number (Donor Set)	Mixture Ratio	Input Amounts (Total DNA)
2-person (set 1)	9:1	28 ng
2-person (set 1)	99:1	25.5 ng
2-person (set 2)	1:1	29.3 ng
2-person (set 2)	3:1	20.9 ng
3-person (set 1)	3:2:1	24 ng
3-person (set 1)	10:5:1	29 ng
3-person (set 1)	100:100:4	28.5 ng
3-person (set 2)	1:1:1	20.4 ng
3-person (set 2)	3:2:1	17.8 ng
3-person (set 2)	10:5:1	12.7 ng
3-person (set 2)	100:100:4	17.1 ng
4-person (set 1)	4:3:2:1	32.5 ng
4-person (set 1)	10:5:2:1	29.4 ng
4-person (set 1)	100:100:100:6	30 ng
4-person (set 2)	1:1:1:1	20.3 ng
4-person (set 2)	4:3:2:1	15.4 ng
4-person (set 2)	10:5:2:1	11.9 ng
4-person (set 2)	100:100:100:6	22.0 ng

Table 7. Sample composition of mixtures analyzed with STRmix set to one or two contributors fewer than the ground truth number, as indicated.

Ground Truth Donor Number (Donor Set)	STRmix Donor Number Setting (NOC-1 or NOC-2)	Mixture Ratio	Input Amounts (Total DNA)
2-person (set 1)	1	49:1	313 pg
2-person (set 1)	1	99:1	2.5 ng, 1.25 ng
3-person (set 1)	2	3:2:1	1.2 ng, 600 pg, 522 pg, 450 pg, 300 pg, 228 pg, 150 pg, 75 pg, 38 pg
3-person (set 2)	2	3:2:1	38 pg
3-person (set 1)	2	10:5:1	800 pg, 608 pg, 400 pg, 200 pg, 100 pg
3-person (set 2)	2	10:5:1	100 pg
3-person (set 1)	2	100:100:4	1.28 ng, 625 pg, 325 pg
3-person (set 2)	2	100:100:4	1.28 ng, 625 pg, 325 pg
4-person (set 1)	2	4:3:2:1	125 pg, 63 pg
4-person (set 1)	2	10:5:2:1	113 pg
4-person (set 2)	2	10:5:2:1	113 pg
4-person (set 1)	3	4:3:2:1	2 ng, 1 ng, 870 pg, 750 pg, 500 pg, 380 pg, 250 pg
4-person (set 2)	3	4:3:2:1	63 pg
4-person (set 1)	3	10:5:2:1	3.6 ng, 1.8 ng, 1.6 ng, 1.4 ng, 684 pg, 450 pg, 225 pg
4-person (set 2)	3	10:5:2:1	450 pg
4-person (set 1)	3	100:100:100:6	1.28 ng, 638 pg, 319 pg
4-person (set 2)	3	100:100:100:6	1.28 ng, 625 pg, 325 pg

Table 8. Sample composition of cell line DNA mixtures. The cell lines listed in the first column correspond to the mixture ratios listed in column 2.

Donor Number	Mixture Ratio	Input Amounts	Replicate Amps
2-person (CEPH 1347-02, HL60)	9:1	2 ng, 1 ng, 870 pg, 750 pg, 500 pg, 380 pg, 250 pg, 125 pg, 63 pg	2
	49:1	2.5 ng, 1.9 ng, 1.25 ng, 625 pg, 313 pg	2
	99:1	2.5 ng, 1.25 ng, 625 pg	2
	1:1	800 pg, 400 pg, 200 pg, 100 pg, 50 pg, 25 pg	1
	3:1	800 pg, 400 pg, 348 pg, 300 pg, 200 pg, 152 pg, 100 pg, 50 pg, 25 pg	1
3-person (2800 M, HL60, CEPH 1347-02)	3:2:1	1.2 ng, 600 pg, 522 pg, 450 pg, 300 pg, 228 pg, 150 pg, 75 pg, 38 pg	2
	10:5:1	3.2 ng, 1.6 ng, 1.4 ng, 1.2 ng, 800 pg, 608 pg, 400 pg, 200 pg, 100 pg	2
	100:100:4	1.28 ng, 625 pg, 325 pg	2
	1:1:1	1.2 ng, 600 pg, 300 pg, 150 pg, 75 pg, 38 pg	1
	3:2:1	1.2 ng, 522 pg, 300 pg, 150 pg, 38 pg	2
	10:5:1	3.2 ng, 1.4 ng, 800 pg, 400 pg, 100 pg	2
	100:100:4	1.28 ng, 638 pg, 319 pg	2

3. Results

3.1. Trends in Allele and Stutter Variances with Increasing Peak Height

The inverse proportionality between the lognormal variances in Equation (1) and peak heights (in RFU) would suggest that plots of the average variance parameters against the average “Template (rfu)” values, both of which are found in the STRmix Interpretation Report, hold promise in understanding variance behavior. Given the ranges of the variance and template levels that may be observed in profiles, we found it easier to visualize trends between these variables by plotting their base-10 logarithms against each other. Additionally, in order to avoid skewing plots in favor of DNA mixtures with more contributors, we plotted only the “Template (rfu)” value for the highest-level contributor (which we have termed Tc).

Figure 2a–c show the results of plotting log₁₀(variance parameter) against log₁₀(Tc) for the unchallenged data set in relation to allele variance, reverse stutter variance, and forward stutter variance, respectively. To varying degrees, the data trends across the full Tc range are nonlinear for all three variances. Using the LINEST function of Excel, we found that a fourth-order polynomial regression provided the best visual fit to each dataset. Quantitative assessments of how well the regression predicts the log₁₀(variance) can be found in Figure S1, Table 9 and Table S3. No significant deviations from normality around the regression lines were observed in the residuals of the log₁₀(variance) data when evaluated using the Jarque-Bera test [13] (see Table 9; all p > 0.01). Figure S1 displays the 99% 2-sided confidence intervals around the regression lines. Table S3 includes the coefficients of determination (R²) and F statistic p-values for the overall regressions, as well as the 99% 2-sided confidence intervals and T statistic p-values for the individual polynomial regression coefficients (βᵢ, i = 0 to 4). All R² values are below 0.5, but the F statistics suggest they are all significantly different from 0 (p < 0.01). The individual coefficients for the allele and reverse stutter regressions are also significantly different from 0 (p < 0.01; 99% confidence intervals never included 0). For the forward stutter, only coefficient β₄ is significantly different from 0. This is consistent with a visual assessment of the forward stutter data, where a clear bias away from the gamma mode is observed, but Tc appears to have no effect on the variance for log₁₀(Tc) < 3.
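For readers unfamiliar with the Jarque-Bera test, the sketch below shows one standard way to compute the statistic from a list of regression residuals using sample skewness and kurtosis. It is an illustration only and not necessarily the procedure used to obtain the p-values in Table 9. The function name is hypothetical, and the p-value relies on the asymptotic chi-square distribution with two degrees of freedom, whose survival function is exp(−x/2); that approximation can be poor for small samples.

```python
import math


def jarque_bera(residuals):
    """Return (JB statistic, asymptotic p-value) for a sequence of residuals."""
    n = len(residuals)
    if n < 3:
        raise ValueError("need at least three residuals")
    mean = sum(residuals) / n
    # Central moments using the population (divide-by-n) convention.
    m2 = sum((r - mean) ** 2 for r in residuals) / n
    m3 = sum((r - mean) ** 3 for r in residuals) / n
    m4 = sum((r - mean) ** 4 for r in residuals) / n
    if m2 == 0:
        raise ValueError("residuals have zero variance")
    skewness = m3 / m2 ** 1.5
    kurtosis = m4 / m2 ** 2
    jb = n / 6.0 * (skewness ** 2 + (kurtosis - 3.0) ** 2 / 4.0)
    # Chi-square with 2 degrees of freedom: P(X > x) = exp(-x / 2).
    p_value = math.exp(-jb / 2.0)
    return jb, p_value


if __name__ == "__main__":
    # Hypothetical residuals, for demonstration only.
    jb, p = jarque_bera([-0.12, 0.05, 0.01, -0.03, 0.09, -0.07, 0.04, 0.02])
    print(f"JB = {jb:.3f}, p = {p:.3f}")
```

A p-value above the chosen significance level (0.01 in this study) means that normality of the residuals is not rejected.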

The 99th percentiles were calculated based upon the fitted polynomial regressions + 2.326 standard deviations (SD) of the residuals (Table 9). In Figure 2a–c, the regression and 99th percentile lines are plotted in red. In addition, a second set of horizontal black lines was plotted to show bands of expected variation that are based on the shape of the prior gamma distributions for the modeled allele and stutter types (see Table 10 for prior gamma distribution information). Specifically, the solid black lines are the modes of the prior gamma distributions. A similar calculation was performed to produce the dotted black lines, given a static variance parameter equal to the 99th percentile of the corresponding prior gamma distribution.

Comparing the black and red bands of expected variation in Figure 2 provides information about whether the variances from completed interpretations tend to mirror the prior gamma distributions of the variance parameters. While the allele variance regression in Figure 2 had a very similar trend to the prior gamma distribution mode line, the reverse and forward stutter variance parameters were very often elevated above their prior gamma modes. It is worth noting, however, that 100 percent of the unchallenged data fell below the lines defined by the 99th percentile of each prior gamma distribution.

Table 9. A summary of fourth-order polynomial regression information for the allele, reverse stutter, and forward stutter variance plots from Figure 2 (corresponding to red lines).

	Allele	Reverse Stutter	Forward Stutter
Polynomial regression formula	y = 0.1122x⁴ − 1.2206x³ + 4.8535x² − 8.2618x + 5.5312	y = −0.2892x⁴ + 3.3258x³ − 13.619x² + 23.456x − 13.404	y = −0.0348x⁴ + 0.313x³ − 1.0891x² + 1.7016x − 0.1678
Jarque-Bera test for normality of the residuals	p = 0.1503	p = 0.2395	p = 0.0275
99th Percentile (+2.326 SD)	+0.2156	+0.2755	+0.1779
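The Table 9 entries can be turned into a simple screening calculation, sketched below in Python. The coefficients and 99th percentile offsets are copied from Table 9, with x = log₁₀(Tc) and y = log₁₀(variance parameter). The function names and the example inputs are hypothetical. Because the published coefficients are rounded and the fit is a fourth-order polynomial, the results are approximate and should not be extrapolated beyond the Tc range of the underlying data.

```python
import math

# Table 9 coefficients, highest order first: y = b4*x^4 + b3*x^3 + b2*x^2 + b1*x + b0
REGRESSIONS = {
    "allele": (0.1122, -1.2206, 4.8535, -8.2618, 5.5312),
    "reverse_stutter": (-0.2892, 3.3258, -13.619, 23.456, -13.404),
    "forward_stutter": (-0.0348, 0.313, -1.0891, 1.7016, -0.1678),
}
# Table 9 offsets: 2.326 SD of the regression residuals, in log10 units.
OFFSETS_99 = {"allele": 0.2156, "reverse_stutter": 0.2755, "forward_stutter": 0.1779}


def predicted_log_variance(kind: str, tc_rfu: float) -> float:
    """Regression estimate of log10(variance parameter) at a given Tc (RFU)."""
    x = math.log10(tc_rfu)
    y = 0.0
    for coefficient in REGRESSIONS[kind]:  # Horner's method
        y = y * x + coefficient
    return y


def exceeds_working_range(kind: str, tc_rfu: float, average_variance: float) -> bool:
    """True if the average variance lies above the unchallenged 99th percentile line."""
    upper = predicted_log_variance(kind, tc_rfu) + OFFSETS_99[kind]
    return math.log10(average_variance) > upper


if __name__ == "__main__":
    tc = 1000.0  # hypothetical Tc in RFU
    for kind in REGRESSIONS:
        typical = 10 ** predicted_log_variance(kind, tc)
        upper = 10 ** (predicted_log_variance(kind, tc) + OFFSETS_99[kind])
        print(f"{kind}: typical ~{typical:.2f}, 99th percentile ~{upper:.2f}")
```

At Tc = 1000 RFU, for example, the allele regression gives log₁₀(variance) ≈ 0.559, so an average allele variance would need to exceed roughly 10^(0.559 + 0.2156) ≈ 6.0 to fall above the unchallenged 99th percentile line.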

Table 10. A summary of prior gamma distribution information for the allele, reverse stutter, and forward stutter variance parameters used to generate the data plotted in Figure 2 (corresponding to black lines).

	Allele	Reverse Stutter	Forward Stutter
α	3.891	1.557	1.526
β	1.131	6.436	4.552
Mode	3.270	3.585	2.394
99th Percentile	11.16	37.24	26.06
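The modes in Table 10 can be reproduced from α and β if α is read as the gamma shape parameter and β as the scale parameter, in which case the mode is (α − 1)β for α > 1. This parameterization is an inference from the tabulated values rather than something stated in the text, and the function name below is hypothetical.

```python
def gamma_mode(alpha: float, beta: float) -> float:
    """Mode of a gamma distribution with shape alpha and scale beta (alpha > 1)."""
    if alpha <= 1:
        raise ValueError("the (alpha - 1) * beta formula requires alpha > 1")
    return (alpha - 1.0) * beta


# (alpha, beta, tabulated mode) from Table 10
PRIORS = {
    "allele": (3.891, 1.131, 3.270),
    "reverse_stutter": (1.557, 6.436, 3.585),
    "forward_stutter": (1.526, 4.552, 2.394),
}

if __name__ == "__main__":
    for name, (alpha, beta, tabulated) in PRIORS.items():
        print(f"{name}: computed {gamma_mode(alpha, beta):.3f}, tabulated {tabulated}")
```

The computed and tabulated modes agree to within rounding of the published α and β values.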

3.2. Trends in Allele and Stutter Variance under Challenging Amplification/Interpretation Conditions

The bands of expected variation in the unchallenged plots from Figure 2 were used as a benchmark from which to assess the effects of challenging amplification and interpretation conditions on allele and stutter variance parameters. Overlaying the bands from Figure 2 onto similar plots of log₁₀(variance parameter) v log₁₀(Tc) for the challenged datasets allowed for a direct visualization of variance shifts.

Figure 3 is an overlay of the unchallenged bands of expected variation from Figure 2 onto log₁₀(variance parameter) v log₁₀(Tc) plots for inhibited mixtures. From these plots it is apparent that exposure to an inhibitor increased the reverse stutter variances above the 99th percentile line from the unchallenged plot for a majority (~56.67%) of treated mixtures, while leaving the allele variance and forward stutter variance largely unchanged or even lower than the regression line. A very small percentage (3.33%) of reverse stutter variance parameters exceeded the prior gamma 99th percentile line, and no allele or forward stutter variance data fell above this line.

Figure 4 shows the log₁₀(variance parameter) v log₁₀(Tc) plots for mixtures with an underestimated NOC, overlaid with the bands of expected variation from the unchallenged plots. Increased allele variance was observed for a substantial number of these mixtures; overall, ~28.30% of the mixtures had diagnostic values above the unchallenged 99th percentile line, mostly for higher level contributors, while the reverse and forward stutter variance plots tracked the unchallenged regression closely, with only ~3.77% of each data set above the unchallenged 99th percentile line. The only data above the line defined by the 99th percentile of the prior gamma distributions was a small percentage (~1.89%) of the allele variance diagnostic values.

Figure 5a–c are an overlay of the unchallenged bands of expected variation onto log₁₀(variance parameter) v log₁₀(Tc) plots for degraded mixtures. A moderate proportion of all three variance parameters for these mixtures (~21.11% for alleles, ~23.33% for reverse stutters, and ~12.22% for forward stutters) were above the unchallenged 99th percentile line. Much smaller proportions of each variance parameter exceeded the 99th percentile of the prior gamma distribution (6.67% for alleles, 2.22% for reverse stutters, and 1.11% for forward stutters).

Figure 6a–c are an overlay of the unchallenged bands of expected variation onto log₁₀(variance parameter) v log₁₀(Tc) plots for signal-saturated mixtures. These mixtures showed the most striking deviation from expectation, with ~40.00% of allele variance parameters, ~80.00% of reverse stutter variance parameters, and ~70.00% of forward stutter variance parameters exceeding the unchallenged 99th percentile line. Many of these variances also exceeded the prior gamma distribution 99th percentile line (~45.00% for allele, ~75.00% for reverse stutter, and ~5.00% for forward stutter).

Figure 7a–c are an overlay of the unchallenged bands of expected variation onto modified variance diagnostic plots for mixtures of cell line DNA. As mentioned in the Methods section, these mixtures were excluded from the unchallenged data set due to unintuitive analysis outcomes that indicated STRmix was struggling to fit the observed data to the profile model. The plots indicated that allele modeling issues gave rise to the unsatisfactory analyses, demonstrating a population-wide shift upward in allele diagnostics while the reverse and forward stutter diagnostics remained largely unaffected. This was similar to the trend observed with underestimated NOC.

Table 11 summarizes the percentage of data points from each set that exceeded the unchallenged regression line, unchallenged 99th percentile line, prior gamma distribution mode line, and prior gamma distribution 99th percentile line. As a frame of reference, a data set closely tracking unchallenged expectations would have about half of its data points exceeding the line representing typical variances (regression line or prior gamma distribution mode line) and a small percentage of data points in excess of the line representing elevated variances (unchallenged 99th percentile line or prior gamma distribution 99th percentile line, respectively).
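The kind of tally summarized in Table 11 can be sketched as follows: given a set of (log₁₀(Tc), log₁₀(variance parameter)) points and a reference line, count the share of points lying above the line. The function name, the data points, and the flat reference line in the example are all hypothetical; in practice the line would be a regression line, a 99th percentile line, or a prior gamma mode or 99th percentile line.

```python
def percent_exceeding(points, line):
    """Percentage of (x, y) points with y above line(x).

    points: iterable of (log10_tc, log10_variance) pairs
    line:   callable mapping log10_tc to the reference log10_variance
    """
    points = list(points)
    if not points:
        raise ValueError("no data points supplied")
    above = sum(1 for x, y in points if y > line(x))
    return 100.0 * above / len(points)


if __name__ == "__main__":
    # Hypothetical data and a hypothetical flat reference line at y = 0.5.
    data = [(2.5, 0.40), (3.0, 0.60), (3.5, 0.70), (4.0, 0.30)]
    print(f"{percent_exceeding(data, lambda x: 0.5):.2f}% above the line")
```

A data set that tracks expectation would return a value near 50% against a line representing typical variances and a value near 1% against a 99th percentile line.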

4. Discussion

The three STRmix variance parameters we have characterized in this paper fluctuate with increasing template amount, as well as with challenging amplification and/or interpretation conditions. The complexity of the log₁₀(variance parameter) v log₁₀(Tc) plots for the unchallenged data demonstrates the value of establishing an empirically determined working range for STRmix variance parameters instead of assuming that the observed average variance parameter values will always align with the prior gamma distribution.

While this work focused primarily on pragmatic applications of variance parameter data, it is also useful to theorize about what biological or profile modeling factors may have produced the observed patterns in the data. For instance, one might ask why the allele variance parameters for the unchallenged interpretations remained relatively centered on the prior gamma mode across the Tc range, while most of the unchallenged reverse and forward stutter variance parameters (95.28% for reverse stutter and 99.76% for forward stutter) were above the prior mode. These contrasting trends can be attributed to differences in peak detection for alleles, reverse stutters, and forward stutters. While peak height for all detected allelic data is considered during deconvolution, most reverse stutters at low Tc levels will be undetectable, requiring STRmix to model reverse stutter dropout and thus potentially causing the reverse stutter variance to land above the prior mode. The major inflection point of the reverse stutter plot, at a log₁₀(Tc) value of ~2.79, or a Tc of ~617 RFU, corresponds to the approximate point at which stutters begin to be detected. Considering a typical range of reverse stutter ratios to be ~0.05 to 0.1, a Tc of ~617 RFU would equate to reverse stutter peak heights in the range of ~31–62 RFU, which straddles our detection thresholds of 35–71 RFU. Notice that beyond log₁₀(Tc) values of ~2.79, the reverse stutter variance parameters once again move upward, while the allele variance parameters stay relatively flat. A plausible explanation for this trend is that expected stutter peak heights in STRmix are determined by static per-allele stutter ratios and therefore do not have the same degree of model flexibility as expected allele peak heights, which vary with the STRmix template parameter during interpretation. 
Similar to the reverse stutter variance at a log₁₀(Tc) value of ~2.79, the forward stutter variance parameter values begin trending down markedly at a log₁₀(Tc) value of ~3, or a Tc value of ~1000 RFU, which would equate to forward stutter peak heights of ~1–50 RFU (typical forward stutter ratio range of ~0.001 to 0.05), again straddling our detection thresholds and suggesting a change in the modeling fit.
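The peak height arithmetic in the preceding paragraph can be checked with a short Python sketch. The stutter ratio ranges are the approximate "typical" values quoted in the text, the analytical thresholds are those listed in Section 2.1, and the function names are hypothetical. Here "straddles" is taken to mean that the expected stutter height range overlaps the range spanned by the channel-specific thresholds.

```python
# Channel-specific analytical thresholds in RFU (Section 2.1).
ANALYTICAL_THRESHOLDS = (35, 41, 51, 61, 71)


def expected_stutter_range(log10_tc: float, ratio_low: float, ratio_high: float):
    """Expected stutter peak heights (RFU) for a parent template of 10**log10_tc."""
    tc = 10 ** log10_tc
    return tc * ratio_low, tc * ratio_high


def straddles_thresholds(low: float, high: float, thresholds=ANALYTICAL_THRESHOLDS) -> bool:
    """True if the [low, high] height range overlaps the span of the thresholds."""
    return low < max(thresholds) and high > min(thresholds)


if __name__ == "__main__":
    cases = {
        "reverse stutter": (2.79, 0.05, 0.1),
        "forward stutter": (3.0, 0.001, 0.05),
    }
    for label, (log10_tc, ratio_low, ratio_high) in cases.items():
        low, high = expected_stutter_range(log10_tc, ratio_low, ratio_high)
        print(f"{label}: ~{low:.0f}-{high:.0f} RFU, "
              f"straddles thresholds: {straddles_thresholds(low, high)}")
```

Both inflection points therefore fall where some, but not all, expected stutter peaks would clear the analytical thresholds.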

The log₁₀(variance parameter) v log₁₀(Tc) plots of the various challenged data sets also show distinct data patterns that can be associated with biological causes or elements of the STRmix profile model. For the inhibited data set, the most affected of the three variance parameters was reverse stutter, likely because the inhibition required STRmix to model undetected reverse stutter peaks across the profile due to reduced locus yields; in contrast, most of the alleles were still detected at the affected loci. More moderate but significant effects on the reverse stutter variance parameters were observed in the degraded data, which also require STRmix to model undetected reverse stutter at higher molecular weight loci as peak heights decrease with degradation. However, moderate effects on the allele variance parameters were also observed; these effects are attributable to the difficulty of modeling high levels of degradation, particularly if the value of the exponential decay term in the STRmix degradation model approaches its user-defined ceiling (which occurred with many of the highly degraded mixtures in this set) [14]. The cell line and underestimated NOC data had similar effects on the variance parameters, in that the allele variance was the most affected of the three. This is a sensible result, given the apparent allele modeling issues with the cell line data and the intralocus allele imbalances that may result from NOC underestimation. Signal saturation, meanwhile, often had a pronounced effect on all three variance parameters. At the peak heights observed in saturated mixtures, tolerance for any peak height deviation from expectation is extremely low, and such deviation is more likely with the loss of linearity between peak height and template, leading to a cascade of effects on the variance parameters. However, not all of the saturated mixtures resulted in elevated variances, because there was variation both in the total number of off-scale peaks detected and in the degree of saturation. The more nuanced variance expectations we have presented here can be useful in determining whether the extent of signal saturation observed in a profile has had a discernible effect on its interpretation.

As an example of how a working range for variance parameters might be implemented, Figure 8 shows the prior gamma distributions for allele, reverse stutter, and forward stutter variance parameters, overlaid with the corresponding parameters for a cell line mixture that resulted in an LR of 0 for the true minor contributor. While the electropherogram for this mixture was unremarkable (see Supplementary Figure S2), the allele variance parameter was slightly elevated compared to the 99th percentile of our unchallenged log₁₀(allele variance parameter) v log₁₀(Tc) regression, which serves as a prompt for closer scrutiny of the interpretation, as well as contributing to an explanation for the aberrant LR result. Figure 9 and Figure 10 are two further examples of how the unchallenged regression data could be applied for routine use in the assessment of STRmix variance parameters from a case result. The interpretations assessed in both figures are from the inhibited mixture data set. In both cases, the allele variance parameter is below the 99th percentile of the unchallenged regression, but in Figure 9, neither stutter parameter is flagged as high, while in Figure 10 both are flagged. Notice in particular how similar the reverse stutter parameters are between the two interpretations; despite this similarity, a higher threshold for the reverse stutter variance parameter was applied to the 2-person data because it had a higher Tc than the 4-person data.
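A Tc-dependent flag of this kind could be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes the threshold is the fourth-order polynomial regression value at log₁₀(Tc) plus the one-sided 99th percentile of a normal distribution with a constant residual standard deviation. The coefficients, residual standard deviation, and the variance value of 7.8 are hypothetical placeholders; only the two Tc values are taken from the captions of Figures 9 and 10. Real values would come from a laboratory's own regression (see Supplementary Table S3 for the regression analyses used in this study).

```python
import math
from statistics import NormalDist

# HYPOTHETICAL placeholders, ordered c0..c4 for c0 + c1*x + ... + c4*x^4,
# where x = log10(Tc). These are NOT the regression results of this study.
HYPOTHETICAL_COEFFS = (0.5, 0.0, 0.02, 0.0, 0.0)
HYPOTHETICAL_RESID_SD = 0.05  # assumed constant SD of log10 residuals


def regression_log10_variance(log10_tc, coeffs):
    """Evaluate the polynomial at log10(Tc) using Horner's method."""
    result = 0.0
    for c in reversed(coeffs):
        result = result * log10_tc + c
    return result


def upper_threshold(tc, coeffs, resid_sd, percentile=0.99):
    """log10(variance) threshold: regression value plus the chosen
    normal percentile of the residual distribution."""
    z = NormalDist().inv_cdf(percentile)
    return regression_log10_variance(math.log10(tc), coeffs) + z * resid_sd


def is_flagged(variance, tc, coeffs, resid_sd, percentile=0.99):
    """True if the run-average variance parameter exceeds the threshold."""
    return math.log10(variance) > upper_threshold(tc, coeffs, resid_sd, percentile)


# The same hypothetical variance value assessed at the Tc values of
# Figure 9 (Tc = 6882) and Figure 10 (Tc = 4097).
for tc in (6882, 4097):
    thr = upper_threshold(tc, HYPOTHETICAL_COEFFS, HYPOTHETICAL_RESID_SD)
    flagged = is_flagged(7.8, tc, HYPOTHETICAL_COEFFS, HYPOTHETICAL_RESID_SD)
    print(f"Tc = {tc}: log10 threshold = {thr:.3f}, flagged = {flagged}")
```

With these placeholder numbers, an identical variance value falls under the threshold at the higher Tc and over it at the lower Tc, which mirrors the behavior described for the reverse stutter parameters in Figures 9 and 10.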

Despite the focus in this study on more precisely defining typical ranges for variance parameters, it is important to point out that the observation of a variance parameter outside of the norm does not by itself invalidate a STRmix interpretation; rather, it indicates that more variation than usual was needed for profile modeling. While the final LR is not necessarily a direct measure of how well an interpretation reflects the true contributors’ genotypes, it is notable that the variance outliers of the 3308 interpretations conducted in our study included only one instance of a false exclusion (i.e., an LR of 0 for a true contributor where Hp = true contributor + N-1 unrelated unknown contributors and Hd = N unrelated unknown contributors). The data giving rise to the false exclusion was for the 870 pg 9:1 mixture from the cell line data set featured in Figure 8. In this case, higher variances, more consistent with the trend in Figure 7, would have been necessary to capture the allele peak height variation observed in this mixture and avoid the false exclusion. This points to cell line DNA mixtures as potentially inappropriate validation samples for mixture data calibrated to casework-type samples.
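For reference, the likelihood ratio and propositions described above can be written in the generic form below. The symbol E, standing for the observed profile data, is introduced here for illustration and does not appear in the original text.

```latex
\[
\mathrm{LR} = \frac{\Pr(E \mid H_p)}{\Pr(E \mid H_d)}, \qquad
\begin{aligned}
H_p &: \text{true contributor} + (N-1)\ \text{unrelated unknown contributors},\\
H_d &: N\ \text{unrelated unknown contributors}.
\end{aligned}
\]
```

Written this way, an LR of 0 corresponds to a numerator of zero: the interpretation gave no support to any explanation of the profile that includes the true contributor.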

While the variance parameter thresholds presented here easily lend themselves to the imposition of binary definitions of “good” and “bad” data, these labels are not appropriate to apply in such a rigid manner. As secondary diagnostics, variances are intended to encourage closer inspection of the input peak data as well as the results of the interpretation, i.e., in this instance, the genotype combinations that STRmix determined to be acceptable explanations of the electropherogram in question. However, despite the utility of secondary diagnostics as indicators of challenged input data, analyst appraisal of the electropherogram data and primary diagnostics can and should be the key measures by which interpretation reliability is assessed.

5. Conclusions

We have presented information relevant to the utilization and interpretation of three important secondary diagnostics contained in the output of the probabilistic genotyping software STRmix v2.8, specifically the allele variance parameter, reverse stutter variance parameter, and forward stutter variance parameter. Despite each of these being universal model parameters that are applied to data at all loci simultaneously, they behave differently with changes in DNA template, as well as challenged amplification and interpretation conditions. We found that the effect of increasing Tc on variance parameters, which was more readily visualized by taking the logarithm of both variables, was nonlinear and required fourth-order polynomial regression to achieve a satisfactory fit. These nonlinear regressions of variance parameters to Tc allowed us to semi-quantitatively compare the variance parameters of a STRmix interpretation to thresholds developed with unchallenged data in order to make more finely tuned appraisals of whether the average variance parameter values on a particular STRmix report are elevated. However, regardless of the benchmarks used to assess whether the variances for a particular interpretation are typical or elevated, as secondary diagnostics they are not intended to be hard analysis stop points. Instead, they are supplementary information to assist in examining the core of the STRmix output, which is the distribution of genotype combination weights.

Supplementary Materials

All supplementary materials can be downloaded at: https://www.mdpi.com/article/10.3390/genes14010102/s1, Supplementary Table S1: STRmix Data, Supplementary Table S2: Profiles, Supplementary Figure S1 and Supplementary Table S3: Regression Analyses, Supplementary Figure S2: EPG.

Author Contributions

Conceptualization and methodology, K.D., J.W., S.M. and D.C.; writing—original draft preparation, K.D.; writing—review and editing, K.D., J.W., S.M. and D.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The data presented in this study are available in the Supplementary Materials listed above.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Clayton, T.M.; Whitaker, J.P.; Sparkes, R.; Gill, P. Analysis and interpretation of mixed forensic stains using DNA STR profiling. Forensic Sci. Int. 1998, 91, 55–70.
2. Taylor, D.; Bright, J.A.; Buckleton, J. The interpretation of single source and mixed DNA profiles. Forensic Sci. Int. Genet. 2013, 7, 516–528.
3. Taylor, D.; Buckleton, J.; Bright, J.A. Factors affecting peak height variability for short tandem repeat data. Forensic Sci. Int. Genet. 2016, 21, 126–133.
4. Bille, T.W.; Weitz, S.M.; Coble, M.D.; Buckleton, J.; Bright, J.A. Comparison of the performance of different models for the interpretation of low level mixed DNA profiles. Electrophoresis 2014, 35, 3125–3133.
5. Bieber, F.R.; Buckleton, J.; Budowle, B.; Butler, J.M.; Coble, M.D. Evaluation of forensic DNA mixture evidence: Protocol for evaluation, interpretation, and statistical calculations using the combined probability of inclusion. BMC Genet. 2016, 17, 125.
6. Buckleton, J.; Bright, J.A.; Gittelson, S.; Moretti, T.; Onorato, A.J.; Bieber, F.R.; Budowle, B.; Taylor, D. The Probabilistic Genotyping Software STRmix: Utility and Evidence for its Validity. J. Forensic Sci. 2019, 64, 393–405.
7. Bleka, Ø.; Storvik, G.; Gill, P. EuroForMix: An open source software based on a continuous model to evaluate STR DNA profiles from a mixture of contributors with artefacts. Forensic Sci. Int. Genet. 2016, 21, 35–44.
8. Bright, J.A.; Taylor, D.; Curran, J.; Buckleton, J. Developing allelic and stutter peak height models for a continuous method of DNA interpretation. Forensic Sci. Int. Genet. 2013, 7, 296–304.
9. STRmix v2.8 User’s Manual (September 2020); Institute of Environmental Science and Research Limited: Wellington, New Zealand, 2020.
10. Russell, L.; Cooper, S.; Wivell, R.; Kerr, Z.; Taylor, D.; Buckleton, J.; Bright, J. A guide to results and diagnostics within a STRmix report. WIREs Forensic Sci. 2019, 1, e1354.
11. Butler, J.M.; Iyer, H.; Press, R.; Taylor, M.K.; Vallone, P.M.; Willis, S. DNA Mixture Interpretation: A NIST Scientific Foundation Review. NISTIR 8351-DRAFT; 2021. Available online: https://nvlpubs.nist.gov/nistpubs/ir/2021/NIST.IR.8351-draft.pdf (accessed on 17 November 2022).
12. Duke, K.; Cuenca, D.; Myers, S.; Wallin, J. Compound and conditioned likelihood ratio behavior within a probabilistic genotyping context. Genes 2022, 13, 2031.
13. How to Perform a Normality Test in Excel (Step-by-Step). Available online: https://www.statology.org/normality-test-excel/ (accessed on 12 December 2022).
14. Duke, K.; Myers, P. Systematic evaluation of STRmix™ performance on degraded DNA profile data. Forensic Sci. Int. Genet. 2020, 44, 102174.

Figure 1. Examples of prior gamma distributions for the allele variance (a), reverse stutter variance (b), and forward stutter variance (c) parameters found in a STRmix v2.8 Interpretation Report. The average value of each variance parameter for the completed interpretation is indicated with a black dot on each distribution.

Figure 2. Plots of log₁₀(variance parameter) v log₁₀(Tc) for allele variance (a), reverse stutter variance (b), and forward stutter variance (c) of the unchallenged data set, overlaid with fourth-order polynomial regression lines (red solid lines) and the prior gamma modes (solid black lines). Additionally, shown are the 99th percentiles of the normal distributions around the fourth-order polynomial regressions (dotted red lines) and the 99th percentiles of the prior gamma distributions for the variance parameters (dotted black lines).

Figure 3. Plots of log₁₀(variance parameter) v log₁₀(Tc) for allele variance (a), reverse stutter variance (b), and forward stutter variance (c) of the inhibited data set.

Figure 4. Plots of log₁₀(variance parameter) v log₁₀(Tc) for allele variance (a), reverse stutter variance (b), and forward stutter variance (c) of the underestimated NOC data set.

Figure 5. Plots of log₁₀(variance parameter) v log₁₀(Tc) for allele variance (a), reverse stutter variance (b), and forward stutter variance (c) of the degraded data set.

Figure 6. Plots of log₁₀(variance parameter) v log₁₀(Tc) for allele variance (a), reverse stutter variance (b), and forward stutter variance (c) of the signal-saturated data set.

Figure 7. Plots of log₁₀(variance parameter) v log₁₀(Tc) for allele variance (a), reverse stutter variance (b), and forward stutter variance (c) of the cell line DNA mixture set.

Figure 8. Prior gamma distributions for allele (a), reverse stutter (b) and forward stutter (c) variance parameters, along with the prior modes and the STRmix Interpretation Report variance parameter values for an 870 pg 9:1 mixture that resulted in a 0 LR for the true minor contributor. The allele variance parameter for the interpretation is flagged as high in this case.

Figure 9. Prior gamma distributions for allele (a), reverse stutter (b) and forward stutter (c) variance parameters, along with the prior modes and the STRmix Interpretation Report variance parameter values for a 3 ng 2-person 10:1 mixture spiked with 475 µM hematin (Tc = 6882). None of the three variance parameters is flagged as high in this case.

Figure 10. Prior gamma distributions for allele (a), reverse stutter (b) and forward stutter (c) variance parameters, along with the prior modes and the STRmix Interpretation Report variance parameter values for a 3 ng 4-person 10:5:2:1 mixture spiked with 625 µM hematin (Tc = 4097). Both the reverse and forward stutter variance parameters are flagged as high in this case.

Table 11. Percentage of variance parameter values exceeding bands of expected variation for allele variance, reverse stutter variance, and forward stutter variance under unchallenged and challenged amplification and/or interpretation conditions. The first two benchmarks for each variance type are based on information in Table 9, while the third and fourth benchmarks are based on information in Table 10.

Variance Parameter	% Greater Than…	Unchallenged	Inhibited	NOC −1 or −2	Degraded	Saturated	Cell Lines
Allele Variance	Polynomial Regression	48.90%	21.67%	67.92%	53.33%	55.00%	90.50%
Allele Variance	Regression 99th Percentile	1.05%	3.33%	28.30%	21.11%	40.00%	21.84%
Allele Variance	Prior Gamma Mode	47.09%	41.67%	58.49%	84.44%	100.00%	80.38%
Allele Variance	Prior Gamma 99th Percentile	0.00%	0.00%	1.89%	6.67%	45.00%	0.00%
Reverse Stutter Variance	Polynomial Regression	50.91%	100.00%	39.62%	57.78%	85.00%	45.60%
Reverse Stutter Variance	Regression 99th Percentile	1.15%	56.67%	3.77%	23.33%	80.00%	1.92%
Reverse Stutter Variance	Prior Gamma Mode	95.28%	100.00%	96.23%	98.89%	100.00%	97.37%
Reverse Stutter Variance	Prior Gamma 99th Percentile	0.00%	3.33%	0.00%	2.22%	75.00%	0.00%
Forward Stutter Variance	Polynomial Regression	53.15%	48.33%	28.30%	44.44%	100.00%	37.71%
Forward Stutter Variance	Regression 99th Percentile	1.00%	16.67%	3.77%	12.22%	70.00%	0.51%
Forward Stutter Variance	Prior Gamma Mode	99.76%	100.00%	100.00%	96.67%	95.00%	100.00%
Forward Stutter Variance	Prior Gamma 99th Percentile	0.00%	0.00%	0.00%	1.11%	5.00%	0.00%

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
