1. Introduction
Social cognition (SC) refers to neurocognitive processes involved in the capacity “
to learn about the world from others, to learn about other people, and to create a shared social world” by processing and interpreting social signals [
1]. Recently, awareness of the role of social skills in mental health and well-being has grown exponentially, with a flourishing research interest in social cognitive functions [
2]. From a clinical perspective, the latest edition of the American Psychiatric Association’s Diagnostic and Statistical Manual for Mental Disorders, Fifth Edition, Text Revision (DSM-5-TR) describes SC as a core component of neurocognitive functioning along with attention, executive functions, memory, language, and visuospatial abilities. Even more relevant, SC disturbances contribute to the diagnostic criteria for numerous pathological conditions. Accordingly, recent evidence highlighted social cognitive disturbances as a signature of the cognitive phenotype in neurological, neurodegenerative, and psychiatric conditions [
3,
4,
5,
6,
7]. For example, SC impairment can be a disabling symptom following acute brain injuries, including traumatic brain injury or stroke [
6]. Moreover, altered SC competency has been described in the early stages of several chronic neurological disorders, such as the behavioral variant of frontotemporal dementia, Alzheimer’s disease, Parkinson’s disease, and multiple sclerosis [
4,
5]. Based on the role of SC in the diagnostic process of the aforementioned pathological conditions, a systematic evaluation of social abilities should be incorporated into the preliminary standard neurocognitive evaluation and in the subsequent routine clinical follow-up.
A core SC competency is Theory of Mind (ToM), which is the ability to understand mental states, such as intentions and feelings, of oneself and others [
2]. As a multidimensional psychological construct, ToM allows to make inferences both on cognitive mental states, such as beliefs and thoughts (cognitive ToM component), and affective mental states, such as desires and emotions (affective ToM component) [
2,
8,
9,
10]. Evidence of these distinct dimensions arose from neuroimaging studies, which reported the role of different brain regions mediating cognitive and affective ToM, such as the dorsomedial, dorsolateral, and ventromedial, orbitofrontal portions of the prefrontal cortex, respectively [
11]. Moreover, understanding mental states relies on a hierarchical level of meta-representational attribution, which includes first- and second-order ToM. From a developmental perspective, ToM capacities appear early in infancy, around eighteen months, with the development of shared attention and declarative pointing [
12]. By the age of two, a solid understanding of others’s desires and mental states begins to emerge [
13]. Then, by ages three to four, children recognize that people have different beliefs from their own [
14]. First-order ToM represents the first developmental milestone in the understanding of another’s mental state. This competence allows people to represent themselves and understand mental states of others (e.g., “
he/she thinks that…”) and develops around four years of age [
9,
14,
15,
16,
17]. The second level of recursive thinking (second-order ToM) begins around eight years of age, especially in the context of formal schooling, and requires individuals to represent the mental states of others with respect to different individuals (e.g., “
he/she thinks that he/she thinks that…”) [
9,
14,
15,
16,
17], with a substantial impact on intra- and inter-personal competence [
18].
Importantly, a partial dissociation between first- and second-order affective and cognitive ToM has been observed in healthy aging [
17,
19] and neurological and neuropsychiatric conditions including localized brain lesions [
20], schizophrenia [
21], multiple sclerosis [
22,
23,
24], Parkinson’s disease [
25,
26,
27], Alzheimer’s disease [
28], and Mild Cognitive Impairment [
27,
29]. Especially, greater impairment in affective compared to cognitive ToM has been reported in multiple sclerosis [
30], while second-order ToM tasks seem to be first affected in age-related pathological processes [
31,
32,
33,
34].
Given the relevance of a multidimensional assessment of ToM in the clinical setting and in line with its multicomponent nature, ToM needs a multifaceted assessment. However, ToM measures are traditionally conceived to evaluate components or levels of ToM and are extremely varied in their assessment, including static vs. dynamic and the use of visual vs. non-visual stimuli. For example, the False Belief task is based on written stories to assess cognitive first-order [
14,
35] and second-order ToM [
15]. Similarly, the Faux Pas Recognition test [
24,
36] and the Strange Stories test [
37] present complex real-life social situations (e.g., white lie, double bluff, irony, misunderstanding, metaphor) in written stories to evaluate ToM without considering the level of recursive thinking. Differently, the Story-based Empathy task [
38] and the Reading the Mind in the Eyes test (RMET [
39]) involve visual static stimuli for assessing affective and cognitive components of ToM. Finally, other tests show ecological dynamic stimuli (multimedia content) resembling complex real-life social interactions to assess both affective and cognition ToM and other social cognition abilities (e.g., emotion recognition, moral cognition), or social cognition in a broader sense, such as the The Awareness of Social Interest Test (TASIT [
40]), the Edinburgh Social Cognition Test (ESCoT [
41,
42,
43]), and the Movie for the Assessment of Social Cognition (MASC [
44]). However, their use is hardly suitable in different cultures due to the presence of verbal dialogs, gestures, prosody, and/or culture-dependent social norms. Three relevant issues regarding ToM assessment have emerged: first, most of the adopted measures evaluate only a subset of ToM domains (affective and/or cognitive ToM components and/or first and/or second ToM levels), making it difficult to obtain a global ToM assessment. Second, normative data are available for only a limited number of these measurements (see, for example, [
38,
41,
45,
46]). Finally, ToM tools, as well as most SC measures, are influenced by and dependent on cultural background and ethnicity, making it crucial to conduct population-based validation studies [
47].
The Yoni task [
8,
27,
48] is a digital test allowing to evaluate comprehension of mental states (affective mental states, i.e., emotions, and cognitive mental states, i.e., thoughts). The stimuli are static visual-spatial-colored cartoon-like pictures that are used to assess mental state understanding based on facial expression and direction of gaze. Our recent works [
48,
49] provided two short versions of the task (the 48-item version, the Yoni-48 task, and the 36-item version, the Yoni-36 task), developed from the 98-item version, suitable to be adopted for first-level neuropsychological assessment. Results from the validation study of the 48-item version of the Yoni task [
49] supported its validity and reliability, in line with the earlier study on the 98-item version. Moreover, correction grids and normative data have been provided for the Italian population to enhance the application of the tool in the clinical context. However, limited time for most clinical visits has motivated the development of a ToM assessment that can be included as part of a comprehensive neuropsychological assessment battery. As such, the aim of the present study consists of providing correction grids and normative data for an even shorter version of the test, the Yoni-36 task [
48,
49]. This latter version of the test maintains the multidimensional evaluation of ToM in a shorter administration time than the Yoni-48 task, with the further advantage of being well-balanced in terms of affective, cognitive, and first- and second-order ToM items (ToM components and levels). In fact, the Yoni-48 task score is based on a higher number of ToM second-order items than first-order ones, which allows the test to be more challenging but, simultaneously, leads to lengthy administration time. Therefore, the adoption of the more concise version could be an option due to time constraints or for ToM screening purposes.
This study aims to substantially contribute to the field of neurocognitive assessment. Specifically, it evaluates the psychometric properties of the Yoni-36 task, intending to offer a validated tool for the multidimensional evaluation of ToM, which is now regarded as one of the fundamental pillars of neurocognitive functioning, even in clinical settings. Specifically, these research goals were: (1) to test the validity of the Yoni-36 task (item discrimination ability) and its inter-item reliability (internal consistency and split-half reliability), and (2) to provide normative data and equivalent scores for the Italian population. Validation of the Yoni-36 task will offer a quick and reliable tool for assessing ToM in the diagnostic process within routine clinical practice, ensuring a comprehensive evaluation of ToM components and levels in shorter administration times, as necessitated by time constraints of the clinical setting.
4. Discussion
The goal of the present study was to investigate the psychometric properties of the Italian 36-item version of the Yoni task [
48] to propose a quick and reliable tool for assessing ToM within routine clinical practice. Following earlier studies [
48,
49], the validity and reliability of the Yoni-36 item were investigated in terms of discrimination ability, internal consistency, and split-half reliability. In addition, correction grids, normative data, and equivalent scores have been provided for the Italian population, contributing to the field of neurocognitive assessment by introducing a ToM screening measure for the diagnostic process in routine clinical practice.
Overall, the Yoni-36 showed good psychometric properties. In line with our previous study [
49], the second-order ToM items could distinguish between high and low performance in healthy adults. As expected, low discrimination ability was reported for the first-order ToM items. In fact, while the first-order level of ToM is already consolidated in early childhood [
50], mature ToM skills, mainly involving the second-order ToM abilities, grow over the early adulthood period [
18]. Nevertheless, neurological conditions, such as dementia, are usually characterized by a neurocognitive deficit extending also to the first-order ToM level [
51,
52], and the assessment of this ToM domain in the clinical setting might be useful. Concerning the validity of the Yoni-36 task, the internal consistency was high, supporting its adoption as a neurocognitive ToM measure for neuropsychological assessment. Moreover, the test demonstrated a good split-half reliability, in line with previous results [
48,
49]. Although future studies need to test additional psychometric evidence, such as the test-retest reliability and the diagnostic validity, these findings suggest satisfactory properties of the Yoni-36 task, differently from other common measures. It is worth noting that most ToM measures showed low or sub-optimal psychometric standards, and the validation of further tools or test versions is encouraged [
53,
54]. In fact, unlike the five traditional neurocognitive domains (learning and memory, executive function, complex attention, perceptual-motor function, and language), whose assessment tools have been subjected to a long history of validation, studies on the psychometric properties and normative data for ToM tests are limited. To our knowledge, correction grids and normative data are available for a limited number of tests, such as the Faux Pas test and Story-based Empathy task [
38,
45], and the Eyes Test [
39] remains the ToM test validated and adapted for different languages and age groups [
55,
56].
Concerning the influence of demographic variables on the Yoni-36 task scores, we observed that age and education mainly affected both performance and response time, as highlighted in previous studies on the Yoni task [
48,
49]. Greater performance appeared to be associated with younger age and a greater education level. Altogether, our findings confirm previous evidence on age-related changes across adulthood [
57,
58] and the role of education as a mediator on the link between age and ToM performance [
57]. Considering age, a meta-analysis [
59] including data from six types of ToM tasks, different ToM components, and administration modalities (verbal, visuo-spatial, and dynamic) confirmed a decline in ToM in old age, regardless of type of ToM task, component, and stimuli modality. Moving to education, a previous study [
60] highlighted the role of education on ToM ability, showing that people with greater educational attainment were more proficient in mentalizing tasks. Age and years of education have been demonstrated to influence the performance of other ToM tools, such as the Edinburgh Social Cognition test [
42], the Story-based Empathy Task [
38], and the Reading the Mind in the Eyes test [
46], commonly used ToM tests in the clinical context. Considering the Yoni-36 task sub-scores, the role of education on all the scores, including the ToM first-order score, was not unexpected. In fact, education attainment as well as age influence brain reserve and resilience and act as a cognitive reserve proxy by modulating the effect of disease-related brain changes on cognitive performance [
61,
62]. Namely, a high educational level may protect against the age-related ToM decline.
Finally, normative data and equivalent scores were provided for the Yoni-36 task. High accuracy levels, response time, and affective-cognitive ToM balance were observed in healthy Italian subjects. Our computed equivalent scores will allow the comparison between the Yoni-36 task performance and other neuropsychological measures with different natures and metrics [
63,
64]. Norming ToM measures is essential to make the task suitable for clinical purposes, allowing for an objective judgment regarding a patient’s neurocognitive profile [
65]. Additional studies are needed to confirm the diagnostic validity of the Yoni-36, investigating the sensitivity and specificity in discriminating clinical populations with ToM deficits from healthy conditions. Such work will allow for testing the suitability of the Yoni task in specific clinical settings. In fact, although the present study provides a cut-off to identify people with a ToM performance far from the norm, additional contributions on specific clinical populations will allow for a deeper understanding of social cognition phenotypes in clinical profiles.
This study is not without limitations. Further analyses on the validity and reliability of the Yoni task would have strengthened our conclusions. In more detail, testing the divergent validity on spatial perception, reasoning, or face recognition may add information on the relationship between the Yoni task and other cognitive functions. In fact, convergent and divergent validity evidence of the Yoni task has already been presented in previous studies [
48,
49] only on gender recognition tasks. Moreover, a future inter-time reliability study is needed to support the test adoption for patients undergoing treatment. Finally, it is worth mentioning that the Yoni task presents simple cartoon-like stimuli that do not consist of complex real-life scenarios such as other ToM tests (e.g., the Edinburgh Social Cognition test and the Moving for the Assessment of Social Cognition). Therefore, its application should be targeted towards certain clinical populations in which reduced cognitive load may facilitate a better characterization of specific ToM deficits (e.g., individuals with Alzheimer’s disease) [
8].