1. Introduction
The estimation of the postmortem interval (PMI) presents itself as a significant challenge within the realm of forensic sciences. This parameter holds a critical role in establishing the forensic relevance of human remains. Hence, the importance of conducting this evaluation becomes evident, as establishing the forensic origin of skeletal remains would demand a distinct investigative approach compared to archaeological remains. However, its precision is intrinsically linked to the time since death, as the accuracy of PMI estimation becomes increasingly intricate in longer postmortem intervals [
1,
2]. Despite the growing number of studies aiming to develop suitable methodologies for estimating the PMI, the effort to date skeletonized remains persists as a complex task for forensic anthropologists [
3].
When dealing with forensic situations, a range of issues emerges involving expenses, time constraints, the quantity and dimensions of the fragments, their origin, the state of preservation and the decomposition conditions, and the feasibility of employing certain techniques over others, among other factors [
4]. Considerable attention has been directed toward investigating the luminol technique for estimating the PMI in skeletonized remains as a preliminary assessment. This is due to its perceived simplicity, affordability, and applicability to visual interpretation, even without sophisticated equipment [
5,
6,
7].
On account of its high sensitivity, luminol is commonly employed in forensic contexts, primarily for detecting blood traces imperceptible to the naked eye, turning them visible due to its chemiluminescent properties [
8]. More recently, its application in forensics has expanded to estimating the postmortem interval of skeletal remains, by correlating the time since death with the persistence of hemoglobin traces within the bone tissue [
7,
9,
10,
11]. Previous studies have showed that even minute concentrations of hemoglobin (ranging from 1:100,000 to 1:5,000,000) can induce a detectable chemiluminescent reaction [
10]. Nonetheless, it is crucial to remember that this method is susceptible to potential interference such as household bleach (sodium hypochlorite), turnips, horseradish or parsnips, furniture polishes and enamel paints, or even motor vehicles’ interior fabrics [
12,
13]. Thus, by applying luminol (5-amino-2,3-dihydro-1,4-phthalazinedione) to traces of hemoglobin within bone tissue, a reaction of chemiluminescence occurs through the production of molecular nitrogen and an excited 3-aminophthalate dianion, which is the emission of chemiluminescent light attributed to the radiative deactivation of this dianion from its singlet excited state (see
Figure 1) [
14,
15]. The occurrence of this reaction demands a redox catalyst, specifically the iron ions in the hemoglobin composition found within bones [
13,
16]. This phenomenon leads to the emission of light in the visible spectrum ranging from violet to blue, arising from the chemical reaction [
5,
9,
15,
17]. Over time, the quantity of hemoglobin present in skeletonized remains reduces [
5,
10,
15], resulting in an anticipated reduction in the intensity of the chemiluminescence reaction as the postmortem interval increases [
7,
10,
11].
The aim of this study is to evaluate the feasibility of luminol chemiluminescence testing as a method that is both simple and economical for estimating time since death, undertaking an assessment of the chemiluminescence repeatability within a significant and well-sized sample, including a broad range of PMI intervals, through intra- and interobserver error calculation.
2. Materials and Methods
Our study included human bones from 266 adult individuals, with ages between 20 and 100 years old, of both sexes, without visible or known bone pathology or antemortem/perimortem trauma, with different and known postmortem intervals, including subsamples from different origins: fresh bones from autopsies performed at the Portuguese National Institute of Legal Medicine and Forensic Sciences (INMLCF.IP.), South (n = 12) and Centre Branches (n = 8); dry skeletonized remains from 21st Century Identified Skeletal Collection [
18] (n = 208); dry skeletonized remains from autopsies performed in the 20th century at the INMLCF.IP., South Branch (n = 29); remains from the Valle da Gafaria’s osteoarcheological collection [
19] (n = 9). Among the subsamples, the PMI ranged between 2655 days and 450 years. To ensure sample standardization, the study focuses solely on a single bone type, namely, the clavicle (commonly known as the collar bone).
The fresh clavicles from autopsy were formerly prepared and reduced to powder by our team in a previous study [
5]. They were obtained from fresh cadavers during the autopsy (after consulting RENNDA—National Register of Non-Donors—and receiving approval from the Ethics Commission of the Portuguese National Institute of Legal Medicine and Forensic Sciences – CE-010/2018) and preserved by freezing at 4 °C (to meet the mandatory conditions for follow-up work) in labelled zip-lock freezer bags. These bones were defrosted and macerated in water and cleaned manually with fresh sandpaper, brushes, and scalpels.
The sample treatment, conducted entirely by the same operator, involved manual cleaning with sandpaper and brushes, and for the reduction into powder, the bones were scraped with a scalper (
Figure 2). For each clavicle, 50 mg of powder was collected in each Eppendorf tube (three replicas per individual) for the luminol technique. The Luminol 16 oz solution was prepared according to the directions included in the reagent (pre-prepared kit). Each sample was placed on a clean white paper sheet and luminol was applied using a spray bottle at room temperature, in a dark room. As a positive control, a drop of blood was used.
The results were examined in situ at the moment of the chemiluminescence reaction with the naked eye by two different observers with different levels of expertise (Observer 1 had a higher level of expertise and Observer 2 had no previous experience), and photographed with a Canon EOS 500D (F/5, 30sec, ISO 3200, 27 mm) (Canon Inc., Tokyo, Japan) in a total absence of light. The intensity of the reaction was measured using a binary scale (positive or negative) and a 5-level scale according to the increasing degree of chemiluminescence (see
Table 1, and
Figure 3, for details). The observations by the two observers were carried out simultaneously and blindly when the reaction occurred, and the recording of the results occurred independently, with no influence from each other. For the intraobserver error, three replicas of each sample were blindly analyzed by each observer. At the time of the observations, the samples underwent anonymization, which meant that the postmortem interval remained undisclosed to both of the distinct observers.
Table 1.
Interpretation of the 5-level scale for the chemiluminescence reaction (adapted from Ermida et al., 2017 [
5]).
Table 1.
Interpretation of the 5-level scale for the chemiluminescence reaction (adapted from Ermida et al., 2017 [
5]).
Result | Interpretation |
---|
Negative (−) | No chemiluminescence. The reaction is not visible to the naked eye. |
Barely positive (+) | The reaction is barely visible to the naked eye. |
Weak positive (++) | The reaction is visible to the naked eye but with low intensity. |
Positive (+++) | The reaction is easily visible to the naked eye. |
Strong positive (++++) | The reaction is strongly visible to the naked eye. |
Figure 3.
Chemiluminescence reaction. (a) Strong positive reaction; (b) weak positive reaction.
Figure 3.
Chemiluminescence reaction. (a) Strong positive reaction; (b) weak positive reaction.
The Kappa statistic is frequently used when data requires statistical methods to assess the technique’s repeatability. It calculates the proportion of agreement, taking chance agreement into account. It is calculated as:
. Kappa is equal to 1 when two measurements perfectly agree and to 0 when two measurements only agree at the chance level [
20]. In the present research, Cohen’s Kappa statistic was used for the assessment of the intra- and interobserver agreement, with the usage of the unweighted values for the binary scale results and the weighted values for the 5-level scale (ordinal scale), evaluating the agreement between observations that takes chance into account, in order to improve the assessment of the luminol’s chemiluminescence repeatability.
3. Results
The obtained results, recorded by both observers, are displayed in the plot illustrated in
Figure 4. In this plot, it is possible to examine the overlap between what was observed by Observer 1 and Observer 2. Based on the obtained results, it is evident that the intensity of chemiluminescence reactions diminishes in individuals with an extended postmortem interval.
As previously mentioned, Cohen’s Kappa statistic test was used for the assessment of the relationship between the error variables (please refer to
Tables S1–S14 in the Supplementary Material). The subjectivity inherent in this technique was considered for two different circumstances: between the two observers responsible for evaluating the chemiluminescence reaction—interobserver error (
Table 2 and
Table 3), and between repetitions of the same sample by the same observer—intraobserver error (
Table 4,
Table 5 and
Table 6). Both inter- and intraobserver error were conducted for the binary scale and 5-level scale.
Table 2 and
Table 3 display the Kappa statistic quantification of the two different observers’ disagreement between chemiluminescence reactions after luminol. Concerning the interobserver error, the binary scale obtained results that indicate a K = 1 (95% CI=1–1). The 5-level scale, on the other hand, has a K = 0.98 with a 95% confidence interval ranging from a minimum of 0.97 to a maximum of 0.99. Regarding the 5-level scale, as an ordinal scoring of luminol chemiluminescence, linear weights were considered, given it is important for the degree of disagreement to be reflected, introducing a greater emphasis on large differences between categories and less on small differences [
20].
Additionally,
Table 4,
Table 5 and
Table 6 show the quantification, through Kappa statistics, of the disagreement between two chemiluminescence analysis performed by the same observer, obtained from the same sample (two replicas). Although three replicas were conducted for each sample, no statistically significant differences were found among the three repetitions (see the
Supplementary Material for more information), so the authors decided to only present the results for the intraobserver error between the first two replicas. Only in a single circumstance did the repetition of the luminol testing lead to a different result regarding Observer 1′s three replicas. The less experienced observed, Observer 2, had a different result in only three cases. Both observers obtained the exact same calculated intraobserver bias regarding the binary scale, with a 100% agreement (
Table 4). Regarding the ordinal 5-level scale, Observer 1 attained a K = 1 with a 95% confidence interval between 0.99 and 1, and Observer 2 a K = 0.99 with a 95% confidence interval between 0.99 and 1.
4. Discussion and Conclusions
The assessment of luminol’s chemiluminescence intra- and interobserver bias supports the potential of this method as a presumptive test for estimating time since death, and the obtained results are in line with the ones obtained in previous investigations [
5,
7]. It is important to highlight the robustness of the sample used, both in size and in its composition, including a wide range of PMI intervals, further emphasizing the importance of the results arising from this investigation. The Kappa statistical analysis illustrated an intraobserver and interobserver agreement superior to 0.9 in all the experiments, which, according to the literature, is suggestive of a nearly perfect agreement (>0.80) [
21]. These results support the idea that even a less experienced observer, or one with no experience at all (as was the case with Observer 2), can use this technique with a reliable outcome. Thus, the current investigation can corroborate luminol chemiluminescence, not only as an inexpensive and simple technique, but also as easy to interpret, confirming its validity.
The process of reducing the fresh clavicles to powder from autopsy displayed some variances. Nevertheless, these small disparities in sample preparation have no discernible impact on the results obtained in the current study, as it is exclusively based on the assessment of chemiluminescent intra- and interobserver error.
It is also really important to emphasize our study’s outcome, as it contributes to bringing the use of this method closer to its practical application. In a forensic case, standards of quality and reliability must be met in the methods applied [
22]. According to the Daubert Standards (US federal standards specified in 1993 in Daubert vs. Merrell Dow Pharmaceuticals), in order for a methodology to be considered valid in a court of law, it must fulfil the following conditions: (1) it is capable of being tested and has undergone testing; (2) it has undergone peer review and publication; (3) its existing or potential error rate is known; (4) there are established standards for the control of the technique’s operation; and (5) it has gained significant acceptance within the relevant scientific community [
23,
24,
25,
26]. These criteria intend to ensure the reliability, replicability, and relevance of scientific testimony in court. It also led the forensic disciplines to recognize the need to reconsider some of the techniques used in their analysis and follow these standards as good practice, validating their statements with scientifically tested methods substantiated by probability assessments [
23,
27]. Regarding these criteria, our findings bring the application of the luminol chemiluminescence method closer to potential acceptance in the legal setting, as the intra- and interobserver assessment repeatability has an extremely high level of consistency.
However, for this technique to meet all the mandatory Daubert Standards, there is still a lot of work to be done. Although luminol chemiluminescence testing is considered a promising technique for PMI estimation, since it is simple, inexpensive, and easy to interpret, it is still not possible to properly relate the reaction intensity with PMI-specific intervals. Therefore, this method is mainly used as a presumptive test to identify human remains of forensic interest, since the intensity of the chemiluminescence reactions clearly decreases in archaeological remains, which can be supported by the results of this study. So, it cannot be used as a single method to date time since death (the obtained results need further confirmation from methods such as radiocarbon analyses). In order to achieve more accurate results, further research is required. Subsequent studies should delve into the quantification of the light derived from the chemiluminescence reaction, as well as on the influence of the taphonomic factors in the decomposition process that affect the results obtained through this method. However, the call for future research is substantiated by promising outcomes achieved by studies, as the one presented in this work, which highlight the value of this technique in terms of its repeatability.
In conclusion, according to our study, the luminol technique is suitable as a rapid diagnostic test, given the intra- and interobserver error obtained, revealing it as practically independent of observation bias. In future research, it would be advantageous to reproduce this experiment with a larger number of observers.