1. Introduction
Sex estimation of an individual is one of the most important elements in establishing a biological profile and an essential step for identifying skeletal remains [
1]. Forensic anthropologists widely use morphologic and metric approaches to determine the sex of human remains. Almost every skeleton component has been utilized in developing sex estimation techniques. However, the pelvis, universally recognized as the most precise predictor of sex, stands out with its distinct sexual dimorphism pattern that is unmatched by any other skeletal element. Moreover, its lack of population-specific characteristics makes it a suitable tool for estimating sex across diverse populations. While other skeletal elements can also be used for sex estimation, the pelvis is generally considered the best source of information for estimating sex [
2,
3,
4,
5,
6,
7]. Although it might not be as evident as in the pelvis, sexual dimorphism may also be visible in the cranium [
8,
9], sternum [
10], or even long bones [
11,
12]. The methods for determining sex can be broadly classified as morphologic or metric. Morphological approaches, which depend on visually evaluating sexually dimorphic characteristics [
13], offer valuable and promptly obtained preliminary results. The practicality of these approaches is evident, as they can be readily applied. However, the level of subjectivity strongly influences these qualities [
14,
15]. The accuracy of morphological assessment is higher when bones are intact, but decreases when dealing with shattered bones or incomplete skeletons. In contrast, metric studies, while more rigorous, rely on the fundamental concept of variation in male and female measurements and primarily employ various statistical techniques to develop models or equations that can be utilized to determine the sex of individuals [
1,
7].
A novel method of estimating sex based on the anthropometric analysis of the pelvis was devised to eliminate all of these drawbacks. Murail et al. [
16] developed the Probabilistic Sex Diagnosis (DSP: Diagnose Sexuelle Probabiliste) method. A large reference population group (over 2000 known sex individuals) from various continents has been used to validate the DSP tool in several community-specific samples. Dry bone samples from known ages and sexes have been utilized in numerous studies to validate the DSP method in various demographic groups, including European populations (like Belgian [
17] or French [
18]) and even Mexican [
19]. Machado et al. [
20] also analyzed the DSP tool and validated the method in Southeast Brazil using an osteological collection from a modern population. Additionally, the DSP method has been validated with great accuracy and reliability using 3D models of the os coxae obtained from CT scans of a living French population and 3D CT scans of a contemporary Danish population [
21,
22]. By comparing real, dry bones against virtual bones (CT scan), Chapman et al. [
17] reported that all of their collection’s coxal bones were correctly identified; the level of indetermination ranged from 2.56% to 46.15%, depending on how many variables were combined. In a recent study on a modern French population using the DSP tool for sex estimation on CT scans of the pelvis, the results showed 97.2% accuracy for women and 92.3% accuracy for men and an indetermination level ranging from 2.8 to 73.1% [
21].
Interest in using medical imaging in forensic anthropology has recently increased. According to numerous research studies, measurements derived from biomedical imaging techniques have shown similar accuracy in sex estimation to other classical methods [
23,
24,
25,
26,
27]. The studies supported the view that DSP is a reliable method of sex estimation based on CT images of the pelvis. Instead of traditional morphological approaches, CT imaging technologies represent a promising alternative for assessing sex in forensic cases [
23,
24].
According to a study by Fukuta et al. [
25], personal forensic identification may be possible using deep learning on pelvic depth images. His research demonstrated that sex estimation might be performed with high accuracy using deep learning and 2D depth pelvic pictures created from homologous models of reconstructed 3D CT images. As demonstrated by the study of Mostafa et al. [
26] among the adult Egyptian population, image processing techniques allow visualization of the bones regardless of how the remains are preserved and could be analyzed beyond contact with the actual bone. The findings of this study showed the value of using detailed virtual images of 3D CT pelvic scans to identify the sex of an individual, especially when the body is partially fleshed or burnt, as in cases found after major disasters where maceration is not a possibility. Furthermore, some research showed the value of using pelvic measurements obtained from adult 3D pelvic computed tomography images in developing a regression formula for adult sex identification. A high validity value of 91.05% accuracy was reported in pelvic anthropometric research of adult Indonesians using a logistic regression model, with 100% sensitivity to identify males and 81.1% specificity to identify females [
27].
This study aimed to evaluate the applicability of the DSP method in the Romanian population and test the accuracy of variables obtained from CT scans.
2. Materials and Methods
The sample consisted of computed tomography images of the pelvis taken from adult living individuals examined at the Department of Radiology, Emergency Municipal Clinical Hospital Timisoara. The clinical data were anonymized and collected following the Declaration of Helsinki.
A total of 80 pelvic CT scans were included in this study, with an equal distribution of 40 males and 40 females. It is essential to consider age when selecting samples. Bones undergo significant changes throughout a person’s lifetime, particularly during growth and development. The degree of sexual dimorphism is a complex phenomenon that varies with age. For example, certain skeletal features may fully differentiate between sexes in adulthood [
28,
29]. Moreover, different populations may exhibit unique patterns of skeletal variation related to sex [
30,
31]. These age-related factors could include changes in bone density, the rate of bone growth, or the timing of bone fusion, all of which can vary significantly with age and impact the accuracy of sex estimation techniques [
28,
29]. Considering all of this, the sample is chosen to ensure the diversity and representativeness of the population. This is achieved by equally dividing the sample based on both sexes and various age groups in order to accurately represent the population of interest. This balanced representation aims to ensure sex parity in the analysis, with participants’ ages ranging from 22 to 93 years and a mean age of 59.51 ± 22.7 years. The age range of male individuals varied from 23 to 93 years, with a mean age of 60.13 ± 22.80 years. The age range of female individuals varied between 22 and 92 years, with a mean age of 58.9 ± 22.89 years (
Figure 1). The selection criteria required all anatomical landmarks to be visible and the entire region of interest to be included. Scans with metal implants or fractures, which could potentially interfere with the accuracy of measurements, were excluded from this study. Also, we considered the practical and resource limitations, particularly with regards to pelvic CT scans, which can be demanding in terms of resources, necessitating both time and equipment.
The examinations were conducted using a Siemens Somatom Definition Edge (Erlangen, Germany) at 120 kV with 250 mAs, 0.6 mm slice thickness, 1 mm slice increment, and a B30f reconstruction algorithm, following a standardized scan protocol. OsiriX software version 11.0 (Pixmeo SARL, Bernex, Switzerland) was utilized for generating the 3D VR’s and conducting measurements, as it offers specific tools for accurate anatomical analysis.
Diagnose Sexuelle Probabiliste (DSP) software (available at
https://osteomics.com/DSP/, accessed on 12 May 2024) was used to estimate the sex [
32]. We select four measurements from the ten variables listed in Murail’s original study, acquired as recommended in the original paper [
16]. PUM, ISMM, and SCOX are suggested for sex estimation and likely showed strong correlations with biological sex in the original study by Murail or in subsequent research. Selecting variables with a known association with sex estimation increases the likelihood of accurate predictions. The chosen variables are noted for their simplicity in measurement and reproducibility on CT (computed tomography) images. This implies that the variables are easy to measure and interpret consistently across different observers or imaging setups. Simplicity and reproducibility are essential in ensuring that the measurements can be reliably obtained and compared across different samples or studies. By applying these criteria, the researchers aim to select a subset of variables that are not only theoretically relevant for sex estimation but also practical in terms of measurement and reproducibility. Repeatability analysis was performed on a subset of 20 randomly chosen samples, which were analyzed by a second observer and the same observer at two different times. The Intra Class Correlation Coefficient (ICC) test was used for the analysis. A two-way-random-effect model and absolute agreement with a mean estimation along with 95% confidence intervals (CI) were reported for each ICC. The results of the ICC test for intra-rater reliability ranged from 0.992 (PUM) to 1.000 (ISSM), indicating excellent intra-observer replicability for all measurements. For inter-observer replicability, the values ranged from 0.993 (PUM) to 1.000 (ISMM), which shows excellent inter-observer replicability.
All measurements obtained from CT images were analyzed using the DSP software (
https://osteomics.com/DSP/, accessed on 12 May 2024) to facilitate data processing. The measurements were introduced in an Excel sheet developed explicitly for the DSP method [
16]. A posterior probability equal to or higher than 0.95 was selected as the sex classification threshold based on established reliability criteria [
35,
36]. The software automatically generates and displays the probabilities for each specimen being male or female, along with the final classification outcome, indicating whether the subject was categorized as male, female, or undetermined. Data were analyzed using IBM
® SPSS
® Statistics version 26. Several statistical tests were performed to analyze the data.
3. Results
The descriptive statistics for all four variables are presented separately for males and females in
Table 2.
According to the t-test, female PUM (M = 72.54, SD = 2.02) is significantly higher than that of males (M = 70.30, SD = 2.70), t = −4.185, p < 0.032. The t-test showed significant differences between females and males for ISSM (female M = 101.12, SD = 2.46, male M= 111.20, SD = 3.92, t = 13.74, p = 0.021). A t-test showed no significant differences between males and females for the SCOX (t = 6.055, p = 0.707) and VEAC (t = 8.103, p = 0.803).
All variable measurements obtained from the CT scans were analyzed using DSP software. For sex classification, a posterior probability equal to or superior to 0.95 was used as a threshold. The resulting sex estimation from the output from DSP software was compared to the known sex of each individual. For the male cases, out of a total of 40 males, 22 were correctly identified as male, and only 1 male was misclassified as female. Conversely, for the female cases, out of a total of 40 females, 34 were correctly classified, and none were misclassified. The data are presented in
Table 3.
In this study, the frequency of cases in which sex estimation was possible (sex estimation) was possible in 71.25% of cases overall, with varying rates between males (57.50%) and females (85%). Despite encountering undetermined specimens, which comprised 42.5% males and 15% females, only one error in sex classification was observed, where a male was misclassified as a female.
In terms of accuracy (the frequency of samples in which sex estimation was performed correctly from the total possible to be estimated), the overall accuracy remained notably high at 98.24%. From the total sample of 40 females that could be estimated, all were classified correctly, with an accuracy of 100%, whereas for males, the accuracy was 95.65%. The data are shown in
Table 4.
4. Discussion
Sex estimation from bones has an important role in determining biological profile and subsequently in the identification process. Our primary aim in this study was to assess the utility of the Diagnose Sexuelle Probabiliste (DSP) method within a contemporary population sample from Romania. By doing so, we sought to evaluate the effectiveness of this software in the current practice of forensic anthropology within the country.
Many studies validate the DSP method as a valuable tool in the sex estimation of the pelvic bone [
17,
18,
21,
36]. Mestekova et al. [
21] and Rodriguez et al. [
18] tested the reliability of the DSP method using CT scan images from a sample of the modern French population, respectively, from the contemporary Danish population. The study by Franchi et al. [
37] aimed to determine whether the established standards for sex estimation are maintained in the 3D reconstruction of the image. His research focuses on the metric analysis of the pelvic bone through the DSP tool, one of the most distinctive anatomical skeletal structures between men and women. The findings indicated a close to 62% accuracy in estimating sex. However, there is still room for improvement because, in 37% of cases, sex cannot be identified. One of the challenges observed in this study is the lack of precision in placing landmarks in the 3D reconstruction of the pelvic images [
37].
In contrast to Machado et al.’s [
20] findings, which did not identify statistically significant differences between the sexes for PUM, the comparative analysis of the four pelvic variables between males and females revealed a significant difference between males and females for PUM and ISMM. In line with the results of earlier studies, the measurements for ISSM were greater in men (
p = 0.021), whereas the values for PUM were statistically higher in women (
p = 0.032) [
18,
21]. In contrast to Mestekova’s [
21] findings, which found that the measures of SCOX and VEAC were significantly higher for men, in our study, the VEAC and SCOX measures did not show a statistically significant difference between the sexes (
p = 0.707;
p = 0.803, respectively). When using this combination of four measurements (PUM, SCOX, ISMM, VEAC), we were able to estimate the sex in 71.25% of cases, with an accuracy rate of 70%. This is slightly less than what other studies have revealed. Using CT scans of the hip bones from 52 men and 54 women in a modern French population, Mestekova et al. [
21] evaluated the DSP tool’s accuracy and found that it was 97.2% accurate for women and 92.3% accurate for men, with an indetermination level ranging from 2.8 to 73.1%, whereas 88.34% accuracy was obtained in a validation study conducted on a mixed Brazilian population [
20]. While other studies’ values were higher in men than women, our results were greater in women [
18]. Females had a high sex estimation rate (85%, with an accuracy rate of 85%), whereas males had a lower rate (57.50%).
Acknowledging methodological limitations: The DSP method, for instance, is potentially limited by its reliance on ten variables for accurate sex estimation. The method’s accuracy significantly decreases if any of these variables are missing or incomplete. A disadvantage of the DSP tool is that it cannot always identify sex, and the accuracy rate for sex estimation is lower when not all ten variables are available [
18,
21]. According to the findings of a recent study on dry bones from a contemporary French population [
18], the degree of indeterminacy in fewer measurements affects sex estimation and accuracy. In forensic contexts, it is often preferred to classify individuals as ‘undetermined’ rather than misclassified. The term “undetermined” refers to cases where the sex of the specimens could not be accurately classified using the DSP software. In line with the findings of Almeida et al. [
23], whose indeterminate percentage ranged between 4.7% and 58%, depending on the number of variables available, our analysis revealed that 28.75% of the cases (42.5% of the men and 15% of the women) were not classified. Several factors could contribute to specimens being classified as undetermined. Some specimens may exhibit features not indicative of either male or female sex characteristics. Female pelvises appear easier to classify in the current study, as there was only one sex classification error. For females, the sex estimation rate was 85%, with an accuracy rate of 85% (the gross accuracy was 100%, meaning that all females from the entire sample that could be estimated were correctly classified). Therefore, the software may have difficulty classifying specimens that deviate from the norm or fall within a gray area between male and female characteristics. Although undetermined specimens were present, only one error in sex classification occurred: misclassifying a male specimen as female. This indicates that, despite encountering challenges in specific cases, the software consistently demonstrates high accuracy in sex estimation. In sex estimation, it is essential to balance sensitivity (correctly identifying true positives) and specificity (correctly identifying true negatives). In summary, the cautious and responsible approach of counting undetermined cases as correct, even if it artificially inflates accuracy, helps maintain specificity by avoiding false positives (misclassifications). The DSP software’s reliability in sex estimation for complete and fragmented os coxae achieved misclassification values below 2%. The success of the software in achieving such high accuracy is attributed to the use of a classification probability criterion of 0.95. While this criterion may result in a decreased number of individuals correctly classified, it significantly reduces the risk of misclassification. The selected cut-off level of 95% utilized in the DSP method eliminates the issue of misclassification, which is one of its strengths [
16,
35]. The software’s approach of classifying individuals with a probability lower than 0.95 as undetermined and prioritizing reliability over attempting to estimate sexes in all individuals is a practical and effective strategy. The high overall accuracy rate of 98.24% highlights the effectiveness of DSP software in sex estimation and emphasizes the crucial importance of accuracy in sex estimation. Notably, all female specimens that could be estimated were correctly classified, achieving a perfect accuracy rate of 100%. Likewise, males exhibited a commendably high accuracy rate of 95.65%. When applying a threshold of 0.95, some samples are not categorized as male or female; however, reducing misclassifications is more advantageous for forensic practice. Our findings are consistent with research that indicates a higher proportion of indeterminate sex among men [
16,
35], in contrast with the findings of Machado et al. [
20], who had 9.43% of pelvic bones misclassified as males and 14% as females. The importance of undetermined cases lies in their potential impact on the accuracy of sex classification. Even though the percentage of undetermined cases was higher among males, it is essential to recognize that even a minor misclassification rate could lead to significant consequences, particularly in forensic settings where precision is crucial. These findings emphasize the critical role of accuracy in forensic sex identification, as even a single misclassification could potentially lead to erroneous conclusions in legal proceedings.
While the use of CT scans and the DSP tool for sex estimation appear promising and comparable to traditional techniques utilizing dry pelvic bones, several potential limitations and areas for further research should be considered. Studies validating the DSP tool against traditional techniques may have been conducted on specific skeletal collections or populations, potentially limiting the generalizability of their findings [
6,
38,
39]. All of these studies’ findings—which are compared with those of the present study—indicate that the DSP tool is appropriate for use with measurements from CT scans [
17,
21]. While CT scanning offers advantages such as non-invasiveness and the ability to capture measurements that cannot be taken directly on the bone, the accuracy of CT imaging may be influenced by factors such as imaging resolution, artifacts, and soft tissue effects. Understanding the limitations and potential sources of error in CT imaging is crucial for interpreting results accurately [
40]. Sex estimation techniques may vary in their accuracy across different populations due to variations in skeletal morphology and demographic characteristics. Further research should investigate the performance of the DSP method in diverse populations, including individuals with varying ancestry, age, and geographic origins. In conclusion, while the DSP tool shows promise as a reliable method for sex estimation using CT scans, further research is needed to address potential limitations, validate its accuracy across diverse populations, and ensure its applicability in various clinical and forensic settings. Standardization of protocols, consideration of ethical considerations, and interdisciplinary collaboration are essential for advancing the field of skeletal sex estimation.
To the best of our knowledge, this study stands as the pioneering investigation into applying the DSP tool for sex estimation among the adult population of Romania. This unique endeavor fills a crucial gap in research, as previous studies in Romania have primarily relied on traditional techniques for sex estimation, overlooking the potential of modern imaging technologies like CT scans and the DSP tool. Despite the inherent limitations of our study’s relatively small sample size, our results demonstrate comparable accuracy and sex estimation rates to those observed in similar studies conducted across European populations. Building upon these initial findings, future research endeavors will prioritize including a more significant number of measurements per individual and a more extensive sample size to enhance the sex estimation and accuracy. Moreover, we recognize the importance of expanding the geographical scope of our study to encompass multiple regions within Romania. Such an expansion is vital for providing a more comprehensive understanding of the applicability and generalizability of the DSP method across the diverse population demographics present within the country. By incorporating data from various regions, we can ensure the robustness and reliability of the DSP tool for sex estimation in the context of the contemporary Romanian population.
5. Conclusions
In conclusion, this research evaluated the utility of the DSP method for sex estimation within the contemporary Romanian population, utilizing a sample of CT scans. Despite encountering undetermined specimens, the overall accuracy rate of sex estimation remained impressively high at 98.24%, highlighting the effectiveness of DSP software in this forensic application. Notably, all female specimens that could be estimated were correctly classified, achieving an accuracy rate of 100%. In contrast, for males, the accuracy rate was slightly lower at 95.65%, still demonstrating high reliability in sex identification. Even in the presence of undetermined cases, the minimal occurrence of misclassification further solidifies the robustness of DSP software in sex identification. These findings highlight the importance of accuracy in forensic sex estimation, particularly in legal and investigative contexts where precision is crucial. As such, the study underscores the confidence with which the DSP method can be employed in practical applications, contributing significantly to the advancement of forensic and anthropological research within Romania and beyond. Even though sex estimation was possible in 71.25% of cases overall, these results reaffirm the method’s efficacy and robustness, particularly when leveraging measurements derived from CT imaging. When dealing with unidentified remains and attempting to estimate sex as part of the biological profile, the DSP method is recommended, if not the only method to be used, at least as a preliminary or adjuvantly accurate technique for sex estimation in forensic anthropology.