SRD Method: Integrating Autostereoscopy and Gesture Interaction for Immersive Serious Game-Based Behavioral Skills Training

Lyu, Linkai; Hu, Tianrui; Wang, Hongrun; Hou, Wenjun

doi:10.3390/electronics14071337

Open AccessArticle

SRD Method: Integrating Autostereoscopy and Gesture Interaction for Immersive Serious Game-Based Behavioral Skills Training

¹

School of Intelligent Engineering and Automation, Beijing University of Posts and Telecommunications (BUPT), Beijing 102206, China

²

School of Digital Media and Design Arts, Beijing University of Posts and Telecommunications (BUPT), Beijing 102206, China

³

Beijing Key Laboratory of Network System and Network Culture, Beijing 100876, China

⁴

Key Laboratory of Interactive Technology and Experience System, Ministry of Culture and Tourism, Beijing 100876, China

^*

Authors to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Electronics 2025, 14(7), 1337; https://doi.org/10.3390/electronics14071337

Submission received: 20 February 2025 / Revised: 18 March 2025 / Accepted: 20 March 2025 / Published: 27 March 2025

(This article belongs to the Special Issue Human-Computer Interaction and Artificial Intelligence in VR/AR/MR Application)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

This study focuses on the innovative application of HCI and XR technologies in behavioral skills training (BST) in the digital age, exploring their potential in education, especially experimental training. Despite the opportunities these technologies offer for immersive BST, traditional methods remain mainstream, with XR devices like HMDs causing user discomfort and current research lacking in evaluating user experience. To address these issues, we propose the spatial reality display (SRD) method, a new BST approach based on spatial reality display. This method uses autostereoscopic technology to avoid HMD discomfort, employs intuitive gesture interactions to reduce learning costs, and integrates BST content into serious games (SGs) to enhance user acceptance. Using the aluminothermic reaction in chemistry experiments as an example, we developed a Unity3D-based XR application allowing users to conduct experiments in a 3D virtual environment. Our study compared the SRD method with traditional BST through simulation, questionnaires, and interviews, revealing significant advantages of SRD in enhancing user skills and intrinsic motivation.

Keywords:

spatial reality display method; autostereoscopy; gesture interaction; extended reality application; behavioral skills training; serious games; user experience

1. Introduction

Behavioral skills training (BST) is a systematic teaching method that aims to improve individual behavior and skill levels [1,2,3], providing a safe training environment for professional fields such as experimentation teaching, industrial demonstration, and special training [4,5,6,7]. Currently, BST is gradually transitioning from traditional teaching methods to more informationization and immersive learning experiences [8]. The integration of human–computer interaction (HCI) and extended reality (XR) technologies plays a crucial role in this process [9,10]. HCI in XR optimizes user interfaces and interaction methods, enabling learners to engage with training systems in a more natural and intuitive way. XR technologies create immersive virtual environments for BST, allowing for the simulation of complex real-world scenarios, offering a controlled and safe platform for learners [11].

Despite these significant advantages, the application of HCI and XR in BST still faces several challenges. First, head-mounted displays (HMDs), such as VR headsets and AR glasses, are currently the most widely used XR devices, but long-term use of HMDs can lead to motion sickness [12], eye fatigue [13] and other health problems [14,15,16]. Moreover, the design of HMDs (such as their weight and wearability) can also cause discomfort for users [17]. Second, traditional BST methods still heavily depend on video tutorials and live demonstrations. However, these methods have several limitations, including the limitations of teaching resources, the risks involved in the operation, and the inefficiency of the teaching process [18]. Third, current BST evaluations focus mainly on improvements in behavior or skills, with insufficient research on subjective user experiences. In comparison, gamified education is more attractive to students [19]. To fully explore the benefits of HCI and XR technologies in BST, more attention is needed to the higher cognitive, memory, and emotional processes of users.

To address these challenges, we propose an innovative BST approach—the spatial reality display (SRD):

1.: Spatial reality display (SRD) technology has been introduced into educational methods. This cutting-edge XR technology allows users to experience realistic 3D virtual scenes and objects without the need for head-mounted devices, effectively avoiding the discomfort associated with HMDs. A simple and intuitive gesture-based control scheme for SRD is designed to make it easier for users to learn, with virtual objects matching their real-world counterparts. This helps users become more familiar with the operations and properties of various objects.
2.: BST content in the form of serious games (SGs) is designed by integrating knowledge and skills into specific scenarios and gamified tasks. We also provide guidance at key points to enhance users’ psychological acceptance and reduce unfamiliarity with new technologies. As shown in Figure 1, BST keeps students actively engaged in the learning environment and proposes four steps in the behavioral skills training process: instruction, modeling, rehearsal, and feedback [20,21]. Based on these technologies and design principles, we selected the thermite reaction experiment as a case study in one of the most typical BST application areas—experiment teaching. We developed a BST application for chemistry labs where users can learn and perform the thermite reaction experiment in a 3D virtual environment.
3.: The effectiveness evaluation of the SRD approach is assessed through written exams and simulated operations, with users’ skill acquisition and behavior data compared to their performance in traditional chemical experiment BST. The specific evaluation process is shown in Figure 2, which includes several stages such as pre-test, behavioral skills training, simulated operations, post-test, understanding other BST methods, and interviews. A more detailed description is provided in Section 4.

The following sections of this paper detail our research. Section 2 reviews related studies, covering various immersive XR technologies, common interaction methods, and concepts like BST and SGs. Section 3 introduces the hardware requirements, interaction methods, and design and implementation of the BST application in the chemistry laboratory. Section 4 and Section 5 present the evaluation, including user experiment design, evaluation metrics, and data analysis. In Section 6, we discuss the strengths and weaknesses of SRD compared to traditional BST methods, based on data and user feedback, and identify areas for future improvement. Finally, Section 7 concludes the paper.

2. Related Works

2.1. Immersive XR Technology

Immersive XR technology refers to the use of technical means to immerse users in a virtual environment, thereby creating a sense of presence. This technology is widely applied in various fields such as entertainment, education, healthcare, and engineering, significantly transforming the way users interact and their overall experience. When introducing existing immersive technologies, they are typically categorized based on their hardware types into several categories: head-mounted display (HMD), surrounding display, and spatial reality display [22,23].

Head-mounted Display: Head-mounted display (HMD) is a device worn on the head [24], providing an immersive experience for users by covering their field of view with a display screen. Depending on its usage environment and hardware configuration, HMDs can be further divided into two types: tethered HMDs and all-in-one HMDs. Tethered HMDs are typically connected to high-performance computers, capable of delivering high-resolution and low-latency images. These devices are usually equipped with precision sensors that can accurately capture the user’s head movements, thereby enabling natural perspective changes in the virtual environment. Typical examples of tethered HMDs include the Oculus Rift and HTC VIVE. With the development of integrated chips, the industry has gradually shifted its focus to all-in-one HMDs, such as Meta Quest and PICO4. These devices integrate high-performance chips into the headset itself. Relying on the powerful computing capabilities of these chips, users can connect the headset to a computer and enjoy a good experience without the constraints of cables. Additionally, this approach reduces costs, making virtual reality experiences more accessible and popular.

Surrounding Display: Surrounding display technology is a type of immersive visual experience provided by multiple screens or a large screen system that surrounds the user. Compared to head-mounted displays (HMDs), surrounding display technology is more suitable for fixed-location applications. Its main forms of presentation include dome screen [25], curved screen [26], and CAVE [27,28], among others. These technologies are typically used in scenarios that require a wide field of view and high immersion, such as simulation training, film and television production, and scientific research. In such environments, users can see the virtual environment without wearing HMDs. Compared to HMD solutions, this approach offers higher flexibility and comfort for users. Moreover, it can place multiple users in the same virtual environment, enhancing the social interaction and communication among them. In some specific scenarios, users can also wear 3D glasses, which can provide a more realistic experience.

Autostereoscopy: Autostereoscopy is a cutting-edge immersive display technology that combines the advantages of head-mounted displays (HMDs) and peripheral displays, allowing users to intuitively see three-dimensional virtual objects without wearing an HMD or any other device.

There are several display schemes for autostereoscopy displays [29], one of which is known as a volumetric 3D display [30,31,32]. This method displays every point of an object in the physical space, making it highly realistic and stable. It also supports multi-user viewing. However, the downside is that such devices require light sources or carriers to move in space, which results in relatively weak interactivity.

Another scheme is called the multiview 3D display [33], which is based on the optical principles of human binocular stereoscopic imaging [34], using the parallax effect to create a stereoscopic effect. Parallax refers to the visual difference that occurs when an object is observed by different eyes at different positions. By simultaneously presenting slightly different images to the left and right eyes on the display screen, the brain combines these images into a stereoscopic three-dimensional image. Relevant technical solutions include lenticular gratings, parallax barriers, and light field displays [35,36]. The technology of lenticular gratings utilizes convex lenses or mirrors to create a parallax effect. By controlling the refraction or reflection of light, different observers can see different images at various positions, thereby generating a sense of depth [37]. The parallax barriers technology involves covering the screen surface with a layer of special barrier structures at a certain distance. These barriers can make the left and right eyes perceive different pixels on the screen, creating a parallax effect and allowing observers to experience a stereoscopic effect. Light field display technology simulates the direction, intensity, and color properties of light in the real world to present images [38]. As a result, it produces a realistic stereoscopic effect and depth perception in the eyes of observers, making the images appear more real and three-dimensional. The 3D images generated by this method do not have a physical presence in space but still support multi-user viewing. However, rendering images for every viewing angle of a scene is computationally intensive, leading to reduced frame rates in real-time rendering. Additionally, there are still significant limitations in terms of viewing angles and screen resolution. To address these issues, developers have removed the multi-user viewing feature and instead tracked the user’s eye position, mapping it to the virtual scene’s camera. In rendering, only two images are generated—since a single user requires at least two images to achieve stereoscopic vision. On the screen, a technology called the parallax barrier is used [39]. It employs optical refraction to allow each eye to see different images without the need for special equipment. Although this method does not support multi-user viewing, it reduces the number of frames to be rendered per second, allowing the computer to focus on improving image quality. Additionally, fewer light-emitting diodes (LEDs) are required in the corresponding screen pixels, which means that higher-resolution screens can be produced at the same size, significantly enhancing the user experience.

In recent years, the multiview 3D display scheme has stood out among various methods due to its excellent stability and reasonable cost. Major manufacturers have gradually adopted the parallax barrier scheme in combination with other solutions to develop autostereoscopy display screens. The industry has already launched many commercial-grade display devices based on this scheme, such as Sony’s ELF-SR series, the Looking Glass series, and the Nintendo 3DS. Based on these devices, some researchers have begun to leverage their advantages in different fields. Taking the display of the complexity of molecular structures as an example, D. Svatunek [40] delves into the possibilities and limitations of autostereoscopic displays in the field of chemistry. His research indicates that multi-view autostereoscopic devices currently suffer from low resolution and high cost, while the single-view stereoscopic display with eye tracking used in the SRD method offers higher resolution at a lower cost, making it a potential mainstream technology in the future. In addition, Sony has made innovative attempts with their developed ELF-SR series displays aimed at movie teaching in schools and exhibition scenarios [41].

Compared to the application of serious games in education using VR and other technologies, the SRD method utilizing autostereoscopy holds the potential to make a difference in three aspects. First, the VR method requires the use of HMDs, whereas the SRD method can be experienced without wearing any devices. This effectively avoids issues such as visual fatigue and motion sickness, including dizziness and nausea, caused by HMDs. Second, the VR method creates a closed environment, while the SRD method provides an open environment. This makes the SRD method more interactive and facilitates communication with others during teaching, supporting more complex collaborative learning [42]. Third, the VR method is entirely virtual, whereas the SRD method integrates virtual and real elements. This allows the SRD method to leverage the real world to enhance the learning experience, such as using real-world teaching aids in conjunction with virtual content or providing learners with a familiar learning environment.

2.2. Common Interaction Methods in Immersive XR

Unlike the interaction with virtual environments using a keyboard and mouse in non-immersive scenarios, the interaction technology in immersive scenarios is primarily designed to simulate hand-based interactions and realistic physical effects. In VR head-mounted displays (HMDs), users typically interact through controllers [43]. The virtual environment assumes that users are holding the controllers and simulates hand operations in the virtual space through various sensors on the controllers. It also feeds back physical effects to the controllers, making the interaction more realistic and natural and enhancing the sense of immersion for users. In some special scenarios, designers may consider using specially designed controller devices to make the operators’ experience more realistic [43,44,45]. In recent years, with the development of interaction technology, more and more research and industrial solutions have begun to adopt gesture-based interactions [46,47], such as Meta Quest and Apple Vision Pro. This type of interaction leverages the flexibility of the human hand and directly places human gestures into the virtual environment through computer vision methods, making the interaction smoother and more effortless.

Considering that a crucial aspect of chemistry experiments is the mastery and handling of laboratory equipment, and given the limited display range of holographic screens, using controllers for interaction can be inconvenient. Therefore, in this study, we mainly focus on gesture-based interaction, allowing users to directly manipulate and adjust the experimental equipment and procedures with their hands.

2.3. Behavioral Skills Training

Behavioral skills training (BST) is used to teach a variety of behavioral skills across different contexts [20,21]. Additionally, BST is one of the effective methods for training individuals to take emergency measures in critical situations [48]. It is also employed to teach personal safety skills to individuals of various age groups and skill levels, including skills related to kidnapping, gun safety, sexual harassment, wildlife attacks, and fire safety [48,49,50]. As shown in Figure 1. The specific process includes the trainer demonstrating the correct behavior for the learner to imitate. Modeling provides a targeted description of the specific activity, while rehearsal requires the learner to practice the correct behavior after following the instructions and modeling. Ideally, BST offers as realistic a rehearsal as possible to force the user to perform the correct behavior. Finally, the feedback step provides the learner with the results of the rehearsal, which includes praise, correction of errors, and further guidance for improvement. Through this process, the learner’s correct behavioral responses to different scenarios and situations are continuously reinforced.

For behavior skills training (BST), the key to enhancing training effectiveness lies in the learner’s more active interaction within the environment and the authentic experiences encountered in various situations [20]. Therefore, it is crucial to create environments that closely resemble reality in BST-based training. However, it is extremely challenging to create realistic learning environments for training scenarios such as earthquakes, floods, fire explosions, and so on. As a result, BST has evolved into a computerized behavior skills training (Computerized BST, C-BST) method for such situations that are difficult to realistically simulate. In this approach, various visuals, videos, and interactive games are used for teaching, modeling, rehearsal, and feedback within a computer-generated virtual environment. Unlike the traditional face-to-face BST method, C-BST is more beneficial and time-saving for a broader range of participants [51]. C-BST also allows educators to take into account the individual characteristics of participants during the design process. It is also useful for verbal or performance-based assessments. On the other hand, the lack of a sense of reality in these environments can lead to some disadvantages [52]. In this sense, considering the potential of virtual learning environments, participants can feel more immersed, and they can try and make mistakes in a comfortable manner [53]. Çakiroğlu [54] proposed a BST model based on immersive VR technology (VR-BST) to teach children basic fire safety behavior skills. The BST process is a technical means that provides children with authentic experiences and opportunities for self-learning through in situ training (IST) and in situ assessment (ISA). Ten children were tested based on the VR-BST model. The results showed that the VR technology-based fire safety training conducted within the framework of the VR-BST method can improve children’s behavior skills. When VR-BST is combined with IST, this positive outcome is enhanced, with most children being able to transfer the skills they acquired in VR-BST training to the real world.

In summary, apart from the SRD method, existing BST methods mainly include on-site training [55,56], computer-assisted training (such as videos or websites) [8,57], and VR training [54,58]. On-site training offers a strong sense of realism but is often limited by cost, is difficult to scale up, and sometimes poses significant safety risks. Computer-assisted training has the advantage of reusable teaching materials and controllable costs, but it lacks immersion and interactivity, which are crucial factors for transferring skills from the virtual to the real world. VR training provides a strong sense of immersion, but the head-mounted devices can affect user experience, and switching between virtual and real scenes is not convenient, which hampers user interaction and learning. In response to the issues of the above BST methods, the SRD method aims to create a training method with strong immersion and interactivity, while also being safe and cost-effective for large-scale adoption, thus filling the gap in hybrid virtual and real training and teaching.

2.4. Serious Games

VR based on games is associated with entertainment, immersion, and interaction. Serious games (SGs) allow users to experience virtual worlds that are unlikely to be artificially constructed in the real world due to reasons of safety, cost, and time [59]. SGs have a positive impact on individual skill development, such as in education and healthcare [60]. Although SGs are related to entertainment, it is crucial to note that they are also applicable for purposes beyond entertainment, such as education. This specific purpose is referred to as “edutainment”, which means providing education through entertainment. The term refers to any form of education that can be enhanced and improved through entertainment involving video games [61]. SGs are considered game-based learning. For example, according to [62], the purpose of SGs is “to harness the power of computer games to engage and attract end-users for specific purposes, such as developing new knowledge and skills”. SGs have been used in many fields, such as helping individuals with autism. Research by Whyte et al. [60] shows that the design principles of SGs can effectively improve academic performance. Moreover, SGs can significantly reduce the difficulties and boredom associated with learning from printed textbooks by providing a more interactive, exciting, and engaging learning method. This approach enhances the retention of learning content [63], which is particularly important for the elderly. Therefore, SGs are an ideal method for implementing BST because they can enhance the learning experience through highly immersive interactions that simulate real environments. Additionally, compared to traditional methods such as two-dimensional images or video games, SGs based on XR technology can provide more realistic environments.

In recent years, SGs have become a key tool for skills training. For example, research by [64] shows that SGs improve the effectiveness of technical training for flight crews, with SGs having a significant advantage in accuracy. SGs have also been used to train caregivers for the elderly. Maskeliunas et al. [65] developed the iDO SG for caregivers of dementia patients to enhance their knowledge and capabilities. The results of SG training indicate that caregivers are more relaxed and experience less fear when implementing the care process. Similarly, Maskeliunas et al. [66] developed another SG called iTrain, aimed at training caregivers for stroke patients. They utilized professional medical experience and real-life data to create scenarios that improve caregivers’ behavior. Therefore, this study integrates HCI and XR technology, SGs, and BST theory to provide users with methods and applications for training in hazardous experiments.

3. Method

3.1. Hardware

The experimental system employs a Leap Motion gesture sensor operating in desktop mode as the input device, which captures the positional information of hand nodes in space as well as the orientation and position of the hand. The display device of the experimental system is a Sony ELF-SR2 autostereoscopic 3D display, with a screen size of 27 inches and a resolution of 3840 × 2160. It is capable of detecting the position of the pupils and rendering spatial images in real-time for each eye separately, allowing users to perceive stereoscopic images with the naked eye.

The audio parameters are equally important. The ELF-SR2 is equipped with dual 1 W built-in speakers and supports 3D surround sound field transformation technology. The auditory experience complements the visual experience. To achieve a better overall experience, this device can simulate a spatial sound field to enhance immersion. Additionally, based on Sony’s audio processing technology, the audio signals can be intelligently analyzed to determine the spatial positions of different sound sources, thereby reproducing a more realistic 3D surround sound effect. With this design, the direction and intensity of the sound dynamically adjust according to changes in the visual scene, allowing the audio to be closely integrated with the visual content and providing an immersive audio experience.

3.2. Interaction Methods

To enhance the realism of the experiment, the virtual environment used in this study employed Leap Motion to capture the operator’s hand gestures, allowing the operator to directly manipulate virtual instruments for skill training. In terms of gesture interaction, the Leap Motion Unity3D plugin was utilized to record gesture data within the virtual environment. Additionally, the study explored methods for determining whether the virtual hand has successfully grasped the equipment.

During the initial implementation and research testing, we experimented with several approaches. These included concealing the virtual hand and substituting it with a real human hand, utilizing the physical properties of fingers to interact with objects, and employing bounding boxes with gesture triggers. Our research findings indicate that using a real human hand to interact with the virtual environment, considering the imaging principles of the SRD screen, makes it difficult to align the human hand with the geometric relationships of objects in the virtual scene. Moreover, the hand often obstructs the view of the virtual scene, affecting the operator’s actions. Interacting with the virtual hand in the scene sacrifices some sense of immersion but simplifies the operator’s tasks to a certain extent. This approach gives rise to two types of operations: physical grasping of objects with fingers and gesture-triggered interactions based on bounding boxes. Physical grasping of objects with fingers involves all operations based on real physical collisions. Although this method enhances realism, it significantly increases the interaction difficulty under the hardware conditions of Leap Motion. The implementation of gesture-triggered interactions based on bounding boxes involves binding a bounding box to an object, allowing the virtual hand to grasp and interact with the object by performing the corresponding gestures within the bounding box. After preliminary testing, we found that this interaction method has the highest execution efficiency and thus adopted it as the gesture interaction method for this study.

In this study, developers used a cubic bounding box in the application to encapsulate the corresponding chemical experiment instrument models. As shown in Figure 3, when the virtual hand enters the range of the bounding box, pinching the thumb and index finger together triggers the gesture interaction. To enhance the coherence of the interaction and reduce difficulty, we disabled the physical effects of some experimental instruments. This measure ensures that when gesture recognition is accidentally interrupted and then restored, the operator does not need to re-grasp the object.

3.3. Application

The thermite reaction is an important topic in China’s college entrance examination (Gaokao). It is a chemical reaction that involves the reaction of aluminum powder with metal oxides to obtain metallic elements. The reaction is highly exothermic, with temperatures exceeding 1250 °C. Since the molten aluminum oxide generated during the reaction is prone to splashing, it can easily cause severe consequences such as burns. Given its high level of danger, teachers generally require students to learn about the thermite reaction through video demonstrations rather than conducting the experiment themselves.

The existing teaching of the thermite reaction mainly relies on watching videos, with a few teachers conducting live demonstrations. However, in the effectiveness evaluation studies of these two methods, Xian et al. [67] found that while watching videos is safe, it results in poor memory retention and lack of focus among students. Moreover, the live demonstration method is highly dependent on the teacher’s skill level, involves higher risks, and has a low tolerance for errors. There are also issues with unclear experimental phenomena and the inability to observe repeatedly. These problems can be addressed to a certain extent by the SRD teaching method.

Considering the above, the knowledge and procedural steps of the thermite reaction experiment are highly suitable for learning through a serious game-based BST application using the SRD method. To replicate the real thermite reaction experiment, we developed a BST application using the Unity3D (version 2021.3.6) graphics engine. The engine’s physics and lighting effects were utilized to construct a virtual experimental environment, simulating a real-world laboratory scene. During the experimental operations, corresponding information prompts will also appear in the scene to guide users in learning the relevant knowledge and operational methods of the experiment as shown in Figure 4.

In this application, we have implemented simulations of all the key operational steps of the thermite reaction experiment. Users can personally experience and complete the following operations using gesture interaction technology on a naked-eye stereoscopic reality device.

Wetting the funnel: In the thermite reaction experiment, to ensure that the molten material generated can smoothly drop from the funnel into the crucible, it is necessary to wet the inner funnel with water in advance. Therefore, we consider it crucial to design the simulation of this step in the application. As shown in Figure 5, users need to grasp a dropper with their hand and drop water onto the funnel. When the system recognizes that the step is completed, the UI will change, directing the user to the next experimental step.

Figure 5. Dropping water onto the funnel. The prompt information on the left includes the name of this step, a schematic diagram, and its purpose: moistening the inner funnel with water is to prevent the paper funnel from burning due to high temperatures.

Adding thermite and oxidizer: In this step, users will learn how to add thermite and oxidizer. As shown in Figure 6, users will learn from the UI that thermite is composed of aluminum powder and iron (III) oxide powder mixed in a 1:3 ratio. After using a spoon to take a measured amount of thermite, the operator needs to place it into the wetted funnel from the previous step, compact it, and then add a potassium chlorate compound on top of the thermite to aid combustion.

Figure 6. Adding thermite and oxidizer. (a) The prompt information on the left includes the names, ratios, and methods of adding the reagents for this step: In the thermite mixture, the ratio of iron oxide powder to aluminum powder is 3:1, and they should be thoroughly mixed to ensure a complete reaction; (b) Add the thermite mixture; (c) Add the oxidizer (potassium chlorate) to aid combustion.

Igniting and placing the magnesium strip: This step involves operations such as ignition, which are very dangerous in real experiments. In this step, the operator needs to turn on the alcohol burner and use crucible tongs to hold the magnesium strip and ignite it over the burner. As shown in Figure 7, this step includes many details, such as using crucible tongs to hold the magnesium strip and insert it upside down into the thermite. This is an important detail involving safety hazards in real experiments, and thus it is emphasized in this application.

Figure 7. Igniting and placing the magnesium strip. (a) The information prompt on the left includes the name of this step, a schematic diagram, and the method of operation; (b) Ignite the magnesium ribbon and insert it upside down into the mixture.

Reaction occurrence: As shown in Figure 8, after the magnesium strip is fully ignited and reacts within the thermite, the thermite burns and generates a large amount of heat, producing high-temperature, bright yellow molten iron that cools into iron balls and emits intense flames. This is the hallmark phenomenon of the thermite reaction. In this BST application, we have also well-reproduced the visual effects of the thermite reaction using Unity3D particle effects, allowing users to immerse themselves in understanding the experimental phenomena.

Figure 8. The experimental phenomena. The information on the left describes the phenomena of the reaction: 1. It emits a dazzling light and generates a large amount of heat; 2. The paper funnel is burned through, and molten droplets in a red-hot state fall onto the fine sand in the evaporating dish. After cooling, the droplets turn into a black solid. At the bottom is the button to restart the experiment.

3.4. Content Optimization

In the adjustment of experimental scene content and testing with holographic display devices, we found that in some cases, the stereoscopic effect of the displayed content was not as pronounced as expected. Through exploration, we have summarized that the following factors affect the holographic display effect, with some of the most significant ones being scene brightness, background depth, object material, and reference objects in the foreground and background. Additionally, factors such as object color, object proximity, object dynamics, and aspect ratio of the scene all influence the display effect. For instance, darkening the scene, using a dark background, and employing low-transparency materials with a frosted texture can enhance the overall stereoscopic and realistic feel of the displayed content. Specific influencing factors and adjustment explanations are provided in Table 1.

4. User Study

4.1. Design of User Study

To accurately evaluate the differences between the SRD method and the commonly used BST method in terms of objective effectiveness and subjective user experience, we designed a controlled experimental scheme based on the application background of the aluminothermic reaction experiment. One group of participants learned using the SRD method, while the other group learned using the most commonly used BST method—watching videos. The experimental scheme included multiple components such as examinations, filling out evaluation questionnaires, simulating experimental operations in virtual and real environments, and user interviews. This allowed us to comprehensively assess the SRD method from multiple perspectives using quantitative methods.

The BST assets used in the experiment included an application and a teaching video. The assessment materials consisted of two tests—pre-test and post-test. The pre-test contained seven questions, which were used to understand the participants’ prior knowledge before the BST. The post-test included 15 questions, which were used to evaluate the learning outcomes after the BST. In addition, some necessary experimental instruments were used to allow participants to replicate the steps of the thermite reaction in a real-world environment. Furthermore, a subjective experience evaluation questionnaire with 42 items was provided for the participants to complete.

4.2. Procedure

To accurately assess the differences in effectiveness between the SRD method and the traditional BST method for watching videos, participants were divided into two groups: the traditional group and the SRD group, to control for variables. Therefore, each group of participants was only exposed to one of the two BST methods. However, the subjective evaluation of user experience required that each participant be familiar with both BST methods to minimize the impact of their cognitive biases on the experimental data. To resolve this contradiction, we designed the experimental procedure as shown in Figure 2.

Before the experiment began, we introduced the background and procedure of the experiment, recorded the participants’ information, and conducted a pre-test to assess their current level of understanding of the training content. After that, participants in the traditional group would first learn using the traditional BST method, which involved watching detailed instructional videos on a PC monitor. Additionally, the video links are available in the Supplementary Materials section of the paper. In contrast, participants in the SRD group would first learn using the SRD method. After the learning phase, all participants were required to complete a simulation task and a post-test. The simulation task involved using chemical laboratory instruments and some substitute props to simulate the steps of the thermite reaction experiment in a real-world environment. This process was recorded to capture the participants’ behavioral data. At this point, all objective evaluation aspects related to the effectiveness of the BST methods had been completed. Subsequently, there was no need to control for variables anymore. Participants were then exposed to the other BST method they had not previously experienced and were asked to complete a subjective experience evaluation questionnaire. Following the experimental procedure described above, a semi-structured interview was conducted to gain insights from the participants regarding their views on traditional education methods and SRD education methods.

4.3. Evaluation Indicators

Objective Indicators: Objective indicators are used to assess the differences in effectiveness between the SRD method and the traditional BST method. These differences are mainly reflected in the participants’ level of knowledge about the thermite reaction experiment and their proficiency in operating it. The relevant indicators can be derived from pre-tests and post-tests, such as the accuracy rate of questions. Additionally, key steps from the simulated operation phase can be extracted as objective indicators. By analyzing the participants’ behavioral data and calculating the accuracy rate of completing these steps, an assessment can be made. We have extracted the following 11 indicators:

Place the crucible below the iron stand;
Fill the crucible with sand;
Use double-layer filter paper to hold the reactants;
Cut a small hole in the filter paper;
Moisten the filter paper with water;
Add the reactants using a spatula;
Perform steps 5 and 6 in sequence;
Add the aluminothermic agent and potassium chlorate successively;
Stir the aluminothermic agent and potassium chlorate to mix them evenly using a glass rod or spatula;
Use tweezers to hold the magnesium strip and ignite it;
Insert the ignited magnesium strip into the reactants upside down.

Subjective Indicators: Subjective indicators are used to assess the differences between the SRD method and the traditional BST method in terms of users’ cognitive, emotional, and memory-related subjective experiences. These differences are quantitatively evaluated by analyzing the ratings of scale items in the subjective experience assessment questionnaire. The subjective indicators are primarily derived from authoritative experience assessment scales in the field.

In this experiment, the Presence Questionnaire (PQ) [68], Intrinsic Motivation Inventory (IMI), and System Usability Scale (SUS) were used. Based on the research content, items from the original scales were selected to form a subjective experience assessment questionnaire. Each subjective indicator can be quantified by several questions in this questionnaire. The following 10 subjective indicators were extracted:

Involvement;
Sensory fidelity;
Adaptation/immersion;
Interface quality;
Interest/enjoyment;
Perceived competence;
Effort/importance;
Pressure/tension;
Value/usefulness;
SUS.

When conducting the evaluation, a comparative analysis can be performed between the performance of the SRD method and the traditional BST method in terms of the metrics on the PQ and IMI scales. Additionally, the user ratings of the two BST methods based on the SUS scale can be analyzed to assess their impact on user subjective experience.

4.4. Data Collection and Analysis

The collection of experimental result data originates from multiple stages of the experimental process, including pre-test scores, participants’ reactions during training, behaviors and performance in simulated operations, post-test scores, and interviews. This encompasses both subjective and objective data.

For data analysis, we employed difference analysis to explore the differences in effectiveness and subjective experience between the two BST methods. The difference analysis methods used were the t-test and the U test. The t-test was applied to data that followed a normal distribution, while the U test was used for data that did not follow a normal distribution. Given the small sample size, we utilized the Shapiro–Wilk test to assess whether the data conform to a normal distribution.

We also considered the potential impacts of the small sample size. First, regarding extrapolation, the results of a study with a small sample size may not be easily generalizable to a broader population. Due to the limited sample size, the study may not fully capture the variability among different individuals, thereby restricting the universality and applicability of the findings. Second, in terms of effect size, a small sample size may lead to inaccurate estimates of effect size, which in turn affects the interpretation of the results. To mitigate these negative impacts, we rigorously controlled the selection of experimental participants and chose appropriate analytical methods. The specific strategies are detailed in Section 5. The analysis results are at least somewhat representative of the target user group. However, to widely promote the SRD method to more scenarios and user groups, it is necessary to introduce more diverse experimental subjects for further research to explore the performance of the SRD method in a broader population.

5. Results

The target users of our study were first-year college students majoring in chemistry, who had just begun their professional chemistry studies but were required to conduct potentially hazardous experiments. To obtain accurate experimental results, we selected participants with the following attributes: aged 18–25, a balanced gender ratio, having taken high school chemistry courses, and possessing only very basic knowledge of chemistry experiments. These participants also had little to no experience with SRD-related devices. This ensures that their cognitive level and knowledge base align closely with our target users. Given the participants’ lack of experimental experience, we increased the experiment’s tolerance for errors. For example, in the SRD method, we enlarged the bounding boxes of interactive objects to reduce the difficulty of gesture interactions for selecting and moving virtual objects; when watching videos, participants were allowed to rewind by dragging the progress bar; and in simulated operations, we used salt, sugar, and sand instead of the reactants in the thermite reaction, without providing any ignition sources to avoid potential dangers caused by accidental operations. The actual experimental conditions are as follows.

We selected a portion of thermite reaction-related questions to conduct a pre-test on the candidate participants, ensuring that the participants were not familiar with the operation process of the thermite experiment. This step reduced the influence of the participants’ prior knowledge on the experimental test. After selecting the participants, we recruited 32 experimenters (17 females and 15 males), aged between 18 and 24 years old. Among them, 30 had never used an SRD display, while 2 had used it before. Prior to the experiment, we provided a detailed description of the experimental procedure and the matters that the participants needed to be aware of. We also had the participants sign an informed consent form to ensure that they were fully informed about the experimental information and the potential risks involved, and to ensure that they would not experience any discomfort from the hardware during the experiment. Upon completion of the experiment, each participant received a remuneration of USD 10.

5.1. Analysis of Pre-Test and Post-Test

Firstly, the normality of the correct answer rate data for the SRD group and the traditional group in the pre-test and post-test was tested. Since this is a small sample with a size of less than 50, the Shapiro–Wilk test was employed.

As shown in Table 2, the p-values for the accuracy of answers in the pre-test for the two groups of participants were 0.0612 and 0.3621, respectively. Since both p-values are greater than 0.05, it is concluded that the pre-test data for both groups conform to a normal distribution. Additionally, the p-values for the accuracy of answers in the post-test for the two groups were 0.0194 and 0.0165, respectively. Since both p-values are less than 0.05, it is concluded that the post-test data for both groups do not conform to a normal distribution.

Since the pre-test data of the two groups conform to the normal distribution, a t-test can be conducted on the data before training. As shown in Table 3, the homogeneity of variance was tested, with a p-value of 0.7311, which is greater than 0.05, indicating that the test for homogeneity of variance was passed. Further analysis revealed that the p-value in the independent samples t-test was 0.7233, which is also greater than 0.05. Therefore, there was no statistically significant difference between the post-test data of the SRD group and the traditional group. This result suggests that before the participants received BST, there was no significant difference in their understanding of the training content.

Since the post-test data of the two groups did not conform to a normal distribution, the Mann–Whitney U test was conducted separately on the pre-test and post-test data of each group. In Table 4, the results showed that the p-value for the SRD group was 0.0025, which is less than 0.01, indicating a highly significant difference between the pre-test and post-test data. In contrast, the p-value for the traditional group was 0.0669, which is greater than 0.05, indicating no significant difference between the pre-test and post-test data. This result suggests that the SRD method has a more significant effect on improving users’ theoretical knowledge of behavioral skills compared to the traditional method.

This conclusion can also be corroborated by the comparison of histograms in Figure 9. Before BST, the correct answer rate of the SRD group was slightly lower than that of the traditional group. However, after BST, the correct answer rate of the SRD group exceeded that of the traditional group.

5.2. Analysis of Simulation of the Operation

Firstly, a normality test was conducted on the data of operational accuracy for the SRD group and the traditional group in the simulation of the operation. Since this is a small sample with a size of less than 50, the Shapiro–Wilk test was employed.

Table 5 shows that the p-values of the two groups of participants’ operation data in the simulation of the operation are 0.1706 and 0.5863, respectively. Since both p-values are greater than 0.05, it is concluded that the simulation operation data of the two groups conform to a normal distribution.

Given that the simulation operation data of the two groups conform to a normal distribution, a t-test can be performed on the data, and the results are shown in Table 6. First, the homogeneity of variance was tested, with a p-value of 0.0735, which is greater than 0.05, indicating that the variance homogeneity test was passed. Further analysis revealed that the p-value in the independent samples t-test was 0.0106, which is less than 0.05. Therefore, there is a statistically significant difference between the simulation operation data of the SRD group and the traditional group. This suggests that the SRD method and the traditional method have distinct effects on the improvement of participants’ behavioral skills.

Combining the histogram can more intuitively corroborate the above conclusion. The horizontal axis of the histogram represents the key operation steps, which are the 11 objective indicators mentioned earlier. By subtracting the operation accuracy rates of the traditional group from those of the SRD group for each indicator, the green data in Figure 10 are obtained. It can be observed that the accuracy rate of the SRD group is higher than that of the traditional group in the majority of indicators. In conjunction with the results of the t-test, it can be concluded that the SRD method has a more significant effect on improving users’ operational behavioral skills compared to the traditional method.

After statistical analysis during the simulated operation phase, the SRD group took a total of 34 min and 48 s, while the traditional group took a total of 36 min and 50 s. Overall, the SRD group demonstrated higher proficiency. Examining each individual participant, when considering the results of Figure 11, it was found that the majority of participants in the SRD group were able to complete the simulated experiments well and quickly. In contrast, participants in the traditional group did not exhibit outstanding levels of accuracy or speed in their operations. It can be concluded that the SRD method is more effective in skill transfer from virtual to real scenarios.

5.3. Analysis of Subjective Indicators

Firstly, based on the data from the subjective experience evaluation questionnaires completed by users, we conducted normality tests for the 10 subjective indicators in both the SRD method and the traditional BST method.

The results are shown in Table 7 that in the questionnaire data of the SRD group, involvement, interest/enjoyment, perceived competence, effort/importance, pressure/tension, and value/usefulness did not exhibit normality characteristics. In contrast, sensory fidelity, adaptation/immersion, interface quality, and SUS did exhibit normality characteristics.

As shown in Table 8, in the questionnaire data of the traditional group, sensory fidelity did not exhibit normality characteristics. However, involvement, adaptation/immersion, interface quality, interest/enjoyment, perceived competence, effort/importance, pressure/tension, value/usefulness, and SUS did exhibit normality characteristics.

We then conducted homogeneity of variance tests for the subjective indicators and found that, according to the results in Table 9, the type samples did not exhibit homogeneity of variance for adaptation/immersion, effort/importance, and value/usefulness. The data fluctuation was significantly inconsistent. Therefore, we concluded that the following indicators could be analyzed using the t-test: interface quality, interest/enjoyment, perceived competence, pressure/tension, and SUS. The indicators that could not be analyzed using the t-test included involvement, sensory fidelity, adaptation/immersion, effort/importance, and value/usefulness. For these indicators, we employed non-parametric tests.

5.4. Analysis of Subjective Scale

5.4.1. Presence Questionnaire

After conducting normality tests and analysis of variance (ANOVA), we found that the interface quality in the Presence Questionnaire (PQ), interest/enjoyment, perceived competence, and pressure/tension in the Intrinsic Motivation Inventory (IMI), as well as the System Usability Scale (SUS) score, all approximately followed a normal distribution and exhibited homogeneity of variance. Therefore, we used the t-test to analyze these attributes. For other attributes, we used the Mann–Whitney U test for data analysis.

When analyzing the data from the PQ, the Mann–Whitney U test was conducted on involvement, sensory fidelity, and adaptation/immersion, with the results shown in Figure 12 and Table 10.

The results indicate that there were significant differences among the three factors across different experimental teaching methods. For the involvement factor, the SRD method had a score distribution of 35.000 (33.0, 36.0) (where the median is 35.000, the 25th percentile is 33.0, and the 75th percentile is 36.0; this notation is used consistently for subsequent data), which was significantly higher than the score of 16.000 (13.3, 19.0) obtained from traditional multimedia teaching methods. For the sensory fidelity factor, the SRD method scored 17.000 (15.0, 19.0), which was significantly higher than the traditional multimedia teaching method’s score of 6.000 (4.0, 7.0). For adaptation/immersion, the SRD method scored 28.000 (26.0, 30.0), which was still higher than the score of 18.500 (14.3, 21.0) obtained from traditional video methods. For all three factors, the SRD teaching method significantly outperformed traditional multimedia teaching methods.

When analyzing interface quality, we used the t-test, and the results showed that t = −0.387, p = 0.700 > 0.05, indicating no significant difference between the two methods.

5.4.2. The Intrinsic Motivation Inventory

The t-test was conducted on interest/enjoyment, perceived competence, and pressure/tension, with detailed results shown in Table 11.

It can be observed that there were significant differences among different experimental teaching methods in terms of interest/enjoyment and perceived competence. For interest/enjoyment, the SRD method had a score distribution of 6.15 ± 0.82 (mean = 6.15, standard deviation = 0.82; the same notation applies to subsequent data), which was significantly higher than the traditional method’s score of 3.55 ± 0.97. In terms of perceived competence, the SRD method scored 5.83 ± 0.97, which was significantly higher than the traditional method’s score of 4.51 ± 1.36. The differences and comparisons are shown in Figure 13. However, no significant difference was observed between the SRD method and traditional multimedia methods in terms of pressure/tension.

For the effort/importance and value/usefulness factors in the IMI, the Mann–Whitney U test was used, and the results are shown in Table 12.

Both factors exhibited significant differences. For the effort/importance factor, the SRD method’s score distribution of 6.167 (5.0, 7.0) was significantly higher than the traditional method’s score distribution of 5.000 (3.8, 6.3). For the value/usefulness factor, the SRD method’s score distribution of 6.667 (6.1, 7.0) was significantly higher than the traditional method’s score distribution of 4.667 (4.3, 5.7). The boxplot distributions are shown in Figure 14.

5.4.3. System Usability Scale

We used the t-test on the total SUS score obtained, and the results showed no significant difference across different experimental teaching methods (t = 1.385, p = 0.171). The data showed that the SRD method had a score distribution of 74.77 ± 12.43, while the traditional teaching method had a score distribution of 69.61 ± 16.99. The SRD method was rated as “B” on the SUS scale, with an adjective describing its usability as “good”. In contrast, the traditional teaching method was rated closer to “C+”, with its usability adjective falling between “good” and “OK”.

6. Discussion

After the experimental section was completed, we conducted interviews with each participant, lasting 5–10 min. The interviews were guided by an outline, focusing primarily on the topic of “the differences between the two experimental teaching methods and their respective advantages and disadvantages”. In the following sections, we discuss the strengths and weaknesses of the SRD method and the traditional BST method by integrating the results of the data analysis and the content of the participant interviews. We also summarize the typical views of the participants regarding the two experimental methods, along with the original statements supporting these views, with the participants’ numbers indicated. For each viewpoint, we offer some speculation and explanations regarding its formation.

6.1. Advantages of the SRD Method

The advantages of the SRD method are mainly reflected in three aspects. First, the immersive experience provides users with a more realistic experimental experience, effectively enhancing memory and understanding, and increasing motivation and interest in learning. Some participants described this aspect as follows: “In SRD, I feel like I’m actually conducting the experiment, able to see the details of chemical reactions and even operate the experimental equipment” (P1, P4, P7, P20, P21. P stands for participant, with P1 referring to Participant No. 1. The same notation applies to the following cases). “When I perform the operations myself, I can remember each step, rather than just watching videos and forgetting” (P2, P4, P7, P13, P14). “SRD feels like playing an interesting game, making me more interested in learning and exploring” (P6, P12, P18, P21, P25). Based on these descriptions, the greatest advantage of SRD is that it provides a virtual experimental environment close to reality. Students can operate and observe as if they were in a real laboratory, enhancing their sense of realism and practicality. Through SRD, students can repeatedly perform experimental steps and experience the chemical reaction process firsthand. Combined with previous research data, this active participation helps deepen their memory and understanding. Moreover, the use of novel virtual reality and interactive technologies makes the learning process more engaging, thereby increasing students’ interest and proactivity in learning.

Second, the interaction and feedback mechanism offer a rich interactive experience, allowing users to enjoy immediate feedback and flexible operability. Participants described this during the interviews as follows: “I can use gestures to operate test tubes and instruments, and this interaction makes me feel very involved” (P4, P8, P10, P17). “Whenever I make a mistake in the steps, the system immediately provides feedback, letting me know where I went wrong” (P12, P17, P30). “I can freely control the pace of the experiment, pausing and trying different steps at any time” (P12, P13, P20). Thanks to the innovative gesture-based interaction, this high level of interactivity enhances the richness and engagement of the learning experience. The virtual environment provides an immediate feedback mechanism, helping students correct their mistakes promptly during operations, thus facilitating faster learning and mastery of experimental skills. Additionally, the SRD teaching method allows students to control the pace of the experiment autonomously. They can repeat and adjust their operations according to their learning needs, increasing opportunities for self-directed learning.

Third was safety and cost-effectiveness. Some participants mentioned: “In SRD, I don’t have to worry about the dangers of chemical reagents and can conduct experiments with peace of mind” (P7, P24, P31). “Through virtual experiments, we save a lot of costs associated with purchasing experimental materials and equipment” (P1, P3, P26). During the interviews, we also found that participants recognized one of the original intentions behind the design of the SRD experiment, which was to eliminate the safety hazards of real chemical experiments. This allows students to conduct potentially dangerous experiments in a risk-free environment. Moreover, using SRD for experimental teaching reduces the need for actual experimental materials and equipment, thereby lowering the overall cost of experimental instruction.

6.2. Disadvantages of the SRD Method

Despite its many advantages, the SRD method still has limitations in certain aspects, which are mainly reflected in three areas:

Firstly, the SRD method is limited by technological constraints, resulting in less satisfactory experiences in terms of gesture recognition and sensory aspects beyond vision. For example, several participants reported that “sometimes the gesture recognition is not sensitive enough, leading to unsmooth operations and affecting the experimental experience” (P4, P8, P17, P18), or “although the SRD is very realistic, I still feel there is a gap compared with real experiments, especially in the sense of touch” (P1, P8, P28). In experiments and interviews, we found a relatively serious issue: despite the advanced interaction technology provided by SRD, the accuracy of gesture recognition remains a problem, which may affect students’ operational experience and the smooth progress of experiments. Although the SRD offers a highly realistic virtual environment, some students still believe that it cannot fully replace the physical realism of real experiments, especially due to the lack of tactile feedback. This is because participants can only perform grasping gestures in the air during operations and cannot actually hold physical objects. To improve the accuracy of gesture recognition in the SRD method, several strategies can be adopted. (i) Integrating multiple sensors (such as cameras and infrared sensors) can enhance the accuracy and robustness of gesture recognition. For example, combining visual signals can more comprehensively capture gesture movements. (ii) Utilizing deep learning algorithms, particularly convolutional neural networks (CNN) and long short-term memory networks (LSTM), can automatically extract gesture features and improve recognition accuracy. (iii) Through environmental light adjustment and background separation techniques, the impact of lighting conditions and complex backgrounds on recognition can be minimized.

Secondly, the learning and economic costs associated with the use of new technologies are significant. “To use SRD, specific equipment is required, which is a bit difficult for some students” (P1, P6, P22), and “using SRD at the beginning is a bit complicated, not as convenient and intuitive as watching videos” (P15, P20, P26). During interviews, some participants also expressed concerns about the costs. The hardware for SRD requires high-performance support, which may increase the burden on students and schools, especially in situations with limited resources. The complexity of SRD also results in a steep learning curve for first-time users, meaning that students and educators may need to spend more time and effort to become familiar with and master the usage methods.

Thirdly, the interaction and feedback provided during the use of SRD need to be further enhanced; otherwise, they may exacerbate users’ feelings of frustration when mistakes occur. This point was also confirmed by some participants: “During the experiment, I hope there are more prompts and guidance to prevent me from making mistakes” (P7, P14, P17, P30), and “when I repeatedly make mistakes, the system’s feedback is not effective enough, which makes me feel frustrated” (P2, P17, P19). Although SRD provides interactive and feedback functions, some students hope to receive more real-time guidance during operations to reduce errors caused by unfamiliarity. If students frequently make mistakes during experiments and do not receive effective help and feedback, it may lead to frustration and negatively impact their motivation to learn.

6.3. Skill Transfer from Virtual to Real

In this study, to evaluate the skill transfer effect from virtual environments to the real world between the SRD method and traditional methods, a simulated experimental operation segment was designed. In this segment, participants’ behavior and accuracy in simulating an aluminum thermite reaction experiment were recorded and analyzed to compare the effectiveness differences between the two methods. For the SRD method intended for large-scale applications, some supplements should be made to the skill transfer evaluation method and experimental design used in this study.

Regarding experimental design, environmental differences, perceptual differences, and task complexity are the primary factors influencing the transfer effect. To enhance the skill transfer effect from virtual environments to the real world, the following strategies can be adopted: First, by employing more precise environmental modeling and simulation techniques, the differences between virtual and real environments can be reduced. Second, by integrating various feedback methods such as visual, auditory, and tactile feedback, learners’ perceptual experiences in virtual environments can be enhanced to make them closer to real-world operations. Third, complex tasks can be decomposed into multiple subtasks and gradually trained in the virtual environment. Through this approach, learners can progressively master the skills of each subtask and eventually integrate them into a complete skill set.

Regarding evaluation methods, skill transfer effects can still be evaluated based on real-world task performance. However, a comprehensive and standardized evaluation scheme needs to be developed, including four stages: pre-testing, virtual environment learning, real-world task testing, and data analysis. The pre-testing should be related to the real-world task testing to serve as baseline data. The evaluation process should combine quantitative and qualitative assessments, analyze differences in metrics such as completion time and accuracy, record operators’ behaviors during the learning and testing processes, and analyze learning styles and psychological changes through interview results.

6.4. Promotion of the SRD Method

This study holds practical significance for the large-scale promotion of the SRD method in terms of theoretical basis, practical reference, and model optimization:

1.: Initial validation of technical potential. Although the sample size is small, the study results can preliminarily validate the potential of the SRD method in behavioral skills training. Compared with traditional video training methods, the SRD method demonstrates stronger immersion, interactivity, and better skill transfer capabilities, providing a theoretical basis and practical reference for subsequent large-scale studies.
2.: Promoting technological application and innovation. The study results can attract more attention from the industry and researchers to the application of the SRD method. The positive outcomes from small-sample studies can inspire further exploration of innovative applications and promote the implementation of BST applications based on the SRD method in more fields.
3.: Optimizing training models. Small-sample studies can provide direction for optimizing the training models of the SRD method. For example, the study can reveal which design elements (such as immersion and interactivity) in the SRD method have a greater impact on learning outcomes, thereby providing references for training design when promoting on a large scale. Additionally, by analyzing small-sample data, potential areas for improvement can be identified to further enhance the effectiveness of the SRD method.

To promote the application of the SRD method in different scenarios, several methods need to be considered. First, integrating various interactive methods such as gesture recognition, speech recognition, eye tracking, etc., to form a multimodal interactive system. This allows users to choose the most natural interaction method according to their needs in complex scenarios. For example, prioritizing gesture recognition in noisy environments and combining eye tracking for high-precision operations. Second, utilizing machine learning algorithms to enable the system to automatically adjust the gesture recognition model based on user habits. By recording commonly used gesture patterns and optimizing the recognition algorithm, individual user recognition accuracy and interaction efficiency can be improved. Third, adopting a modular architecture to design various systems of the SRD method to facilitate the rapid integration or replacement of functional modules according to the requirements of different scenarios. For instance, quickly integrating virtual experiment modules in educational scenarios or high-precision virtual assembly modules in industrial scenarios.

7. Conclusions

This study proposes a new method of behavioral skills training (BST) based on spatial reality display (SRD), aiming to overcome the limitations of traditional BST methods and existing applications of human–computer interaction (HCI) and extended reality (XR) technologies. By introducing autostereoscopic technology and natural gesture interaction, the SRD method provides users with an immersive experience without the need for head-mounted devices, effectively avoiding the discomfort associated with HMDs. Combined with the design philosophy of serious games (SGs), this method integrates knowledge and skills into specific contexts and engaging tasks, significantly enhancing users’ psychological acceptance and learning efficiency.

In the method evaluation section, we developed a virtual experiment application using Unity3D, taking the thermite reaction experiment as an example. Users can operate within a three-dimensional virtual environment, experiencing and completing the experiment firsthand. Through simulated operations, written examinations, and subjective experience evaluations based on the Presence Questionnaire (PQ), Intrinsic Motivation Inventory (IMI), and System Usability Scale (SUS), we compared the effectiveness and user experience of the SRD method with that of traditional BST methods. The results indicate that both the SRD immersive teaching method and the traditional video teaching method have their respective strengths and weaknesses. SRD provides a more realistic and interactive learning experience, suitable for hands-on and in-depth learning, and has significant advantages in enhancing users’ behavioral skills and intrinsic motivation. However, it faces challenges in terms of equipment requirements and usage costs. Traditional BST methods, on the other hand, are suitable for large-scale applications due to their convenience and low cost, but lack interactivity and a sense of operation, making it easy for users to passively receive knowledge. By combining the strengths of these two methods, a hybrid teaching model can be explored, leveraging the immersion and interactivity of SRD, combined with the convenience and low cost of video, to provide users requiring BST with a richer and more efficient experience.

In the future, we anticipate that SRD-related technologies, such as display technology, smart interactive devices, AI technology, communication technology, etc., will further develop. In terms of display devices, high-resolution, larger field of view, and lower latency display technologies will become mainstream. The combination of gesture recognition and speech recognition will provide users with more natural and convenient ways of interaction. AI technology will enable SRD applications to achieve more intelligent and precise interactions. AI-driven real-time rendering and content generation technologies will significantly lower the barrier to content creation for SRD methods, driving the prosperity of the content ecosystem. The 5G networks and future 6G network technologies will provide SRD applications with faster, low-latency data transmission services, supporting smoother cloud rendering and real-time interactions.

In addition to chemical experiment training, the SRD method also has a high probability of development and large-scale application in training and education in different fields. Firstly, in medical training, SRD is expected to play an important role in surgical simulation, rehabilitation training, and psychological therapy. It can be used for remote operation and surgical guidance of medical equipment, enhancing medical efficiency and quality. Secondly, in industrial manufacturing training, virtualization of product design, production process simulation, and equipment maintenance will be realized, improving training and production efficiency. Furthermore, based on cloud technology, there is the potential to achieve resource sharing for training, while also reducing the hardware requirements on end-user devices, thereby lowering hardware costs. Combining shared hardware or leasing options can increase the economic feasibility of expanding the SRD method to other scenarios. This enables remote collaborative training to be conducted anytime and anywhere.

Supplementary Materials

The aluminum thermal reaction teaching video used in the experiment can be downloaded at the following URL: https://www.bilibili.com/video/BV1M54y1v7HN (accessed on 13 June 2024).

Author Contributions

Conceptualization, L.L. and H.W.; methodology, L.L.; software, H.W.; validation, L.L. and H.W.; formal analysis, L.L. and T.H.; investigation, L.L. and H.W.; resources, L.L.; data curation, T.H.; writing—original draft preparation, L.L., H.W. and T.H.; writing—review and editing, L.L., T.H. and H.W.; visualization, L.L., H.W. and T.H.; supervision, W.H.; project administration, W.H.; funding acquisition, W.H. and L.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Informed Consent Statement

Informed consent has been obtained from all participants involved in this study.

Data Availability Statement

Restrictions apply to the availability of these data.

Conflicts of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

HCI	Human–Computer Interaction
XR	Extended Reality
BST	Behavioral Skills Training
SG	Serious Game
HMD	Head-mounted Display
SRD	Spatial Reality Display
PQ	Presence Questionnaire
IMI	Intrinsic Motivation Inventory
SUS	System Usability Scale
CNN	Convolutional Neural Networks
LSTM	Long Short-Term Memory Networks

References

Kirkpatrick, M.; Akers, J.; Rivera, G. Use of behavioral skills training with teachers: A systematic review. J. Behav. Educ. 2019, 28, 344–361. [Google Scholar] [CrossRef]
Slane, M.; Lieberman-Betz, R.G. Using behavioral skills training to teach implementation of behavioral interventions to teachers and other professionals: A systematic review. Behav. Interv. 2021, 36, 984–1002. [Google Scholar] [CrossRef]
Miles, N.I.; Wilder, D.A. The effects of behavioral skills training on caregiver implementation of guided compliance. J. Appl. Behav. Anal. 2009, 42, 405–410. [Google Scholar] [PubMed]
Johnson, B.M.; Miltenberger, R.G.; Egemo-Helm, K.; Jostad, C.M.; Flessner, C.; Gatheridge, B. Evaluation of Behavioral Skills Training for Teaching Abduction-Prevention Skills to Young Children. J. Appl. Behav. Anal. 2005, 38, 67–78. [Google Scholar] [CrossRef] [PubMed]
Sarokoff, R.A.; Sturmey, P. The effects of behavioral skills training on staff implementation of discrete-trial teaching. J. Appl. Behav. Anal. 2004, 37, 535–538. [Google Scholar] [CrossRef] [PubMed]
Hogan, A.; Knez, N.; Kahng, S. Evaluating the use of behavioral skills training to improve school staffs’ implementation of behavior intervention plans. J. Behav. Educ. 2015, 24, 242–254. [Google Scholar]
Dogan, R.K.; King, M.L.; Fischetti, A.T.; Lake, C.M.; Mathews, T.L.; Warzak, W.J. Parent-implemented behavioral skills training of social skills. J. Appl. Behav. Anal. 2017, 50, 805–818. [Google Scholar] [CrossRef]
Campanaro, A.M.; Vladescu, J.C.; DeBar, R.M.; Deshais, M.A.; Manente, C.J. Using computer-based instruction to teach implementation of behavioral skills training. J. Appl. Behav. Anal. 2023, 56, 241–257. [Google Scholar]
Çakiroğlu, Ü.; Gökoğlu, S. A design model for using virtual reality in behavioral skills training. J. Educ. Comput. Res. 2019, 57, 1723–1744. [Google Scholar] [CrossRef]
Radhakrishnan, U.; Koumaditis, K.; Chinello, F. A systematic review of immersive virtual reality for industrial skills training. Behav. Inf. Technol. 2021, 40, 1310–1339. [Google Scholar]
Lin, F.; Ye, L.; Duffy, V.G.; Su, C.J. Developing virtual environments for industrial training. Inf. Sci. 2002, 140, 153–170. [Google Scholar]
Chang, E.; Kim, H.T.; Yoo, B. Virtual reality sickness: A review of causes and measurements. Int. J. Hum. Comput. Interact. 2020, 36, 1658–1682. [Google Scholar]
Hirzle, T.; Fischbach, F.; Karlbauer, J.; Jansen, P.; Gugenheimer, J.; Rukzio, E.; Bulling, A. Understanding, Addressing, and Analysing Digital Eye Strain in Virtual Reality Head-Mounted Displays. ACM Trans. Comput. Hum. Interact. (TOCHI) 2022, 29, 33. [Google Scholar] [CrossRef]
Wann, J.P.; Mon-Williams, M. Health issues with virtual reality displays: What we do know and what we don’t. ACM SIGGRAPH Comput. Graph. 1997, 31, 53–57. [Google Scholar] [CrossRef]
Costello, P.J. Health and Safety Issues Associated with Virtual Reality: A Review of Current Literature. 1997. Available online: http://www.agocg.ac.uk/reports/virtual/37/report37.htm (accessed on 19 March 2025).
Kaimara, P.; Oikonomou, A.; Deliyannis, I. Could virtual reality applications pose real risks to children and adolescents? A systematic review of ethical issues and concerns. Virtual Real. 2022, 26, 697–735. [Google Scholar]
Koulieris, G.A.; Bui, B.; Banks, M.S.; Drettakis, G. Accommodation and comfort in head-mounted displays. ACM Trans. Graph. 2017, 36, 87. [Google Scholar] [CrossRef]
Adler, M.V.; Madsen, J.; Hedberg, J.; Steinberg, R.; Parra, L.C. Effect of explanation videos on learning: The role of attention and academic performance. Educ. Inf. Technol. 2025, 1–29. [Google Scholar] [CrossRef]
Chen, Y.C.; Lu, Y.L.; Lien, C.J. Learning environments with different levels of technological engagement: A comparison of game-based, video-based, and traditional instruction on students’ learning. Interact. Learn. Environ. 2021, 29, 1363–1379. [Google Scholar]
Training, S. Preventing Unintentional Firearm Injury in Children. Educ. Treat. Child. 2004, 27, 161–177. [Google Scholar]
Stewart, K.K.; Carr, J.E.; LeBlanc, L.A. Evaluation of family-implemented behavioral skills training for teaching social skills to a child with Asperger’s disorder. Clin. Case Stud. 2007, 6, 252–262. [Google Scholar]
Liu, Z.; Jin, Y.; Ma, M.; Li, J. A comparison of immersive and non-immersive VR for the education of filmmaking. Int. J. Hum. Comput. Interact. 2023, 39, 2478–2491. [Google Scholar] [CrossRef]
Van Dam, A.; Forsberg, A.S.; Laidlaw, D.H.; LaViola, J.J.; Simpson, R.M. Immersive VR for scientific visualization: A progress report. IEEE Comput. Graph. Appl. 2000, 20, 26–52. [Google Scholar]
Boas, Y. Overview of virtual reality technologies. In Proceedings of the Interactive Multimedia Conference, Barcelona, Spain, 22 October 2013; Volume 2013, pp. 1–6. Available online: https://www.semanticscholar.org/paper/Overview-of-Virtual-Reality-Technologies-Boas/4214cb09e29795f5363e5e3b545750dce027b668 (accessed on 19 March 2025).
Li, S.; Huang, Y.; Tri, V.S.; Elvek, J.; Wan, S.; Kjallstrom, J.; Andersson, N.; Johansson, M.; Lejerskar, D. Interactive theater-sized dome design for edutainment and immersive training. In Proceedings of the 2014 Virtual Reality International Conference, Lecce, Italy, 17–20 September 2014; pp. 1–5. [Google Scholar]
Raskar, R.; Van Baar, J.; Willwacher, T.; Rao, S. Quadric transfer for immersive curved screen displays. Comput. Graph. Forum 2004, 23, 451–460. [Google Scholar] [CrossRef]
Manjrekar, S.; Sandilya, S.; Bhosale, D.; Kanchi, S.; Pitkar, A.; Gondhalekar, M. CAVE: An emerging immersive technology—A review. In Proceedings of the 2014 UKSim-AMSS 16th International Conference on Computer Modelling and Simulation, Cambridge, UK, 26–28 March 2014; pp. 131–136. [Google Scholar]
Theodoropoulos, A.; Stavropoulou, D.; Papadopoulos, P.; Platis, N.; Lepouras, G. Developing an interactive VR CAVE for immersive shared gaming experiences. Virtual Worlds 2023, 2, 162–181. [Google Scholar] [CrossRef]
Geng, J. Three-dimensional display technologies. Adv. Opt. Photonics 2013, 5, 456–535. [Google Scholar] [CrossRef]
Lewis, J.D.; Verber, C.M.; McGhee, R.B. A true three-dimensional display. IEEE Trans. Electron Devices 1971, 18, 724–732. [Google Scholar] [CrossRef]
Langhans, K.; Guill, C.; Rieper, E.; Oltmann, K.; Bahr, D. Solid Felix: A static volume 3D-laser display. In Proceedings of the Stereoscopic Displays and Virtual Reality Systems X (Proceeding of SPIE), Santa Clara, CA, USA, 21–24 January 2003; SPIE: Bellingham, DC, USA, 2003; Volume 5006, pp. 161–174. [Google Scholar]
Nayar, S.K.; Anand, V.N. 3D display using passive optical scatterers. Computer 2007, 40, 54–63. [Google Scholar] [CrossRef]
Van Berkel, C.; Parker, D.W.; Franklin, A.R. Multiview 3D LCD. In Proceedings of the Stereoscopic Displays and Virtual Reality Systems III SPIE, Santa Clara, CA, USA, 30 January–2 February 1996; Volume 2653, pp. 32–39. [Google Scholar]
Okoshi, T. Three-Dimensional Imaging Techniques; Elsevier: Amsterdam, The Netherlands, 2012. [Google Scholar]
Klug, M.; Burnett, T.; Fancello, A.; Heath, A.; Gardner, K.; O’Connell, S.; Newswanger, C. A Scalable, Collaborative, Interactive Light-field Display System. SID Symp. Dig. Tech. Pap. 2013, 44, 412–415. [Google Scholar] [CrossRef]
Wu, G.; Masia, B.; Jarabo, A.; Zhang, Y.; Wang, L.; Dai, Q.; Chai, T.; Liu, Y. Light field image processing: An overview. IEEE J. Sel. Top. Signal Process. 2017, 11, 926–954. [Google Scholar] [CrossRef]
Zhao, J.; Ding, Y.; Dai, Z.; Gong, J.; Tong, G.; Zhang, Y. Viewing Zone Expansion of Autostereoscopic Display with Composite Lenticular Lens Array and Saddle Lens Array. IEEE Photonics J. 2023, 15, 3900506. [Google Scholar] [CrossRef]
Lyu, L.; Yang, B.; Hou, W.; Yu, W.; Bai, B. Research on 3D Visual Perception Quality Metric Based on the Principle of Light Field Image Display. In Image and Graphics Technologies and Applications. IGTA 2023. Communications in Computer and Information Science; Yongtian, W., Lifang, W., Eds.; Springer: Singapore, 2023; Volume 1910. [Google Scholar] [CrossRef]
Peterka, T.; Kooima, R.L.; Sandin, D.J.; Johnson, A.; Leigh, J.; DeFanti, T.A. Advances in the dynallax solid-state dynamic parallax barrier autostereoscopic visualization display system. IEEE Trans. Vis. Comput. Graph. 2008, 14, 487–499. [Google Scholar] [CrossRef] [PubMed]
Svatunek, D. “Holographic” Autostereoscopic Displays: A Perspective on Their Technology and Potential Impact in Chemistry. Chem. Eur. J. 2023, 29, e202301746. [Google Scholar] [CrossRef]
Pietroszek, K.; SRD and American University—Sony Pro. Sony Pro. Available online: https://pro.sony/ue_US/products/professional-displays/srd-american-university (accessed on 18 March 2024).
Marín-Vega, H.; Alor-Hernández, G.; Bustos-López, M.; López-Martínez, I.; Hernández-Chaparro, N.L. Extended Reality (XR) Engines for Developing Gamified Apps and Serious Games: A Scoping Review. Future Internet 2023, 15, 379. [Google Scholar] [CrossRef]
Novacek, T.; Jirina, M. Overview of controllers of user interface for virtual reality. PRESENCE Virtual Augment. Real. 2020, 29, 37–90. [Google Scholar] [CrossRef]
Hock, P.; Benedikter, S.; Gugenheimer, J.; Rukzio, E. Carvr: Enabling in-car virtual reality entertainment. In Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, Denver, CO, USA, 6–11 May 2017; pp. 4034–4044. [Google Scholar]
Lo, J.Y.; Huang, D.Y.; Sun, C.K.; Hou, C.E.; Chen, B.Y. RollingStone: Using single slip taxel for enhancing active finger exploration with a virtual reality controller. In Proceedings of the 31st Annual ACM Symposium on User Interface Software and Technology, Berlin, Germany, 14–17 October 2018; pp. 839–851. [Google Scholar]
Pavlovic, V.I.; Sharma, R.; Huang, T.S. Visual interpretation of hand gestures for human-computer interaction: A review. IEEE Trans. Pattern Anal. Mach. Intell. 1997, 19, 677–695. [Google Scholar] [CrossRef]
Buckingham, G. Hand tracking for immersive virtual reality: Opportunities and challenges. Front. Virtual Real. 2021, 2, 728461. [Google Scholar] [CrossRef]
Mazo, A.D. Using Behavioral Skills Training to Teach Medication Safety Skills to Children. Master’s Thesis, Saint Louis University, St. Louis, MO, USA, 2014. [Google Scholar]
Houvouras IV, A.J.; Harvey, M.T. Establishing fire safety skills using behavioral skills training. J. Appl. Behav. Anal. 2014, 47, 420–424. [Google Scholar] [CrossRef]
Rosales, R.; Stone, K.; Rehfeldt, R.A. The effects of behavioral skills training on implementation of the picture exchange communication system. J. Appl. Behav. Anal. 2009, 42, 541–549. [Google Scholar] [CrossRef]
Vanselow, N.R.; Hanley, G.P. An evaluation of computerized behavioral skills training to teach safety skills to young children. J. Appl. Behav. Anal. 2014, 47, 51–69. [Google Scholar] [CrossRef]
Park, K.M.; Ku, J.; Choi, S.H.; Jang, H.J.; Park, J.Y.; Kim, S.I.; Kim, J.J. A virtual reality application in role-plays of social skills training for schizophrenia: A randomized, controlled trial. Psychiatry Res. 2011, 189, 166–172. [Google Scholar] [CrossRef]
Seckinger-Bancroft, K.E. Examining the Effectiveness and Efficiency of Two Delivery Models to Teach Children Abduction Prevention Skills. Ph.D. Thesis, Western Michigan University, Kalamazoo, MI, USA, 2010. [Google Scholar]
Çakiroğlu, Ü.; Gökoğlu, S. Development of fire safety behavioral skills via virtual reality. Comput. Educ. 2019, 133, 56–68. [Google Scholar] [CrossRef]
Smith, S.G.; Mattson, S.L.; Aguilar, J.; Pyle, N.; Higbee, T.S. Behavioral Skills Training with Adult Interventionists: A Systematic Review. Rev. J. Autism Dev. Disord. 2024, 11, 296–319. [Google Scholar] [CrossRef]
Belisle, J.; Rowsey, K.E.; Dixon, M.R. The Use of In Situ Behavioral Skills Training to Improve Staff Implementation of the PEAK Relational Training System. J. Organiz. Behav. Manag. 2016, 36, 71–79. [Google Scholar] [CrossRef]
Chovet Santa Cruz, H.A.; Miltenberger, R.G.; Baruni, R.R. Evaluating Remote Behavioral Skills Training of Online Gaming Safety Skills. Behavior Anal. Pract. 2024, 17, 246–256. [Google Scholar] [CrossRef]
Fu, Y.; Li, Q. A Virtual Reality–Based Serious Game for Fire Safety Behavioral Skills Training. Int. J. Hum. Comput. Interact 2023, 40, 5980–5996. [Google Scholar] [CrossRef]
Michael, D.R.; Chen, S.L. Serious Games: Games That Educate, Train, and Inform; Muska & Lipman/Premier-Trade: Boston, MA, USA, 2005. [Google Scholar]
Whyte, E.M.; Smyth, J.M.; Scherf, K.S. Designing serious game interventions for individuals with autism. J. Autism Dev. Disord. 2015, 45, 3820–3831. [Google Scholar] [CrossRef] [PubMed]
Oliva, D.; Somerkoski, B.; Tarkkanen, K.; Lehto, A.; Luimula, M. Virtual reality as a communication tool for fire safety-Experiences from the VirPa project. In Proceedings of the GamiFIN, Levi, Finland, 8–10 April 2019; pp. 241–252. [Google Scholar]
Corti, K. Games-Based Learning; A Serious Business Application; PIXE Learning Limited: Coventry, UK, 2006. [Google Scholar]
Plomp, T. Educational design research: An introduction. In Educational Design Research, 1st ed.; Plomp, T., Nieveen, N., Eds.; Stichting Leerplan Ontwikkeling: Enschede, The Netherlands, 2013; Volume 1, pp. 10–51. [Google Scholar]
Mautone, T.; Spiker, V.; Karp, D. Using serious game technology to improve aircrew training. In Proceedings of the Interservice/Industry Training, Simulation & Education Conference (I/ITSEC), Orlando, FL, USA, 1–5 December 2008. [Google Scholar]
Damaševičius, R.; Damaševičius, R.; Lethin, C.; Paulauskas, A.; Esposito, A.; Catena, M.; Aschettino, V. Serious game iDO: Towards better education in dementia care. Information 2019, 10, 355. [Google Scholar] [CrossRef]
Maskeliunas, R.; Damasevicius, R.; Paulauskas, A.; Ceravolo, M.G.; Charalambous, M.; Kambanaros, M.; Pampoulou, E.; Barbabella, F.; Poli, A.; Carvalho, C.V. Deep reinforcement learning-based iTrain serious game for caregivers dealing with post-stroke patients. Information 2022, 13, 564. [Google Scholar] [CrossRef]
Xian, Y.; Wang, M. Introspective education of the thermite reaction experiment. Educ. Chem. 2019, 12, 49–54. [Google Scholar]
Tran, T.Q.; Langlotz, T.; Young, J.; Schubert, T.W.; Regenbrecht, H. Classifying presence scores: Insights and analysis from two decades of the igroup presence questionnaire (ipq). ACM Trans. Comput. Hum. Interact. 2024, 31, 1–26. [Google Scholar]

Figure 1. The behavioral skills training (BST) model.

Figure 2. The experimental procedure.

Figure 3. The usage of gesture interaction in this application. (a) Using virtual hands to indicate users’ gestures; (b) Determining the grasping of an object by recognizing the user’s pinching gesture; (c) Maintaining the pinching gesture and moving allows the grasped object to be moved.

Figure 4. The corresponding information guides users. (a) The information prompts on the left side include the reaction name, chemical equation, experimental steps, necessary laboratory equipment, and the start button; (b) It guides the user to quickly spread their hands apart to view more information prompts; (c) After the user performs the gesture shown in (b), they can view the instructions for using each piece of equipment.

Figure 9. Average accuracy rates of pre-test and post-test.

Figure 10. Average operation accuracy and difference between the two methods. The difference is obtained by subtracting the accuracy of the traditional method from the accuracy achieved using the SRD method.

Figure 11. Scatter plot of the time and accuracy for each experimental participant in the SRD method group and the traditional method group for completing simulated operations, with time on the horizontal axis and accuracy on the vertical axis.

Figure 12. Comparison of the three PQ indicators regarding type data.

Figure 13. Comparison of the three IMI indicators regarding type data.

Figure 14. Comparison of the two IMI indicators regarding type data.

Table 1. Factors affecting stereoscopic effect of display content.

Factors	Situations Where the Stereoscopic Effect Is Better
Scene Brightness	Dark scenes
Background Depth	Dark background
Object Material	Rough textures and low transparency
Reference Objects	Reference objects within the depth of field
Object Color	High color grayscale
Object Proximity	Moderate spacing and depth of objects
Object Dynamics	Movable objects
Aspect Ratio	Scene aspect ratio matched with the device

Table 2. Normality test results for pre-test and post-test accuracy data.

Group	Stage	$n$	$\bar{X}$	Skewness	Kurtosis	Shapiro–Wilk Test
Group	Stage	$n$	$\bar{X}$	Skewness	Kurtosis	W	p
SRD	Pre-test	7	56.25%	−1.16	0.02	0.8179	0.0612
SRD	Post-test	15	87.50%	−1.05	0.15	0.8533	0.0194 *
Traditional	Pre-test	7	61.61%	−0.92	−0.2	0.905	0.3621
Traditional	Post-test	15	82.08%	−0.69	−0.99	0.8484	0.0165 *

* p < 0.05, ** p < 0.01.

Table 3. The t-test analysis results for pre-test data of two BST methods.

Group	Stage	$n$	The Results of the Homogeneity of Variance Analysis			The Results of the t-Test Analysis
Group	Stage	$n$	$σ$	F	p	$\bar{X} \pm σ$	t	p
SRD	Pre-test	7	0.2577	0.1238	0.7311	0.5625 ± 0.2577	−0.3625	0.7233
Traditional	Pre-test	7	0.2941	0.1238	0.7311	0.6161 ± 0.2941	−0.3625	0.7233

* p < 0.05, ** p < 0.01.

Table 4. The analysis results of the Mann–Whitney U test for each method.

Group	Stage	$n$	Mdn (P₂₅, P₇₅)	U	z	p
SRD	Pre-test	7	0.6875 (0.5, 0.6875)	9.5	−3.0311	0.0025 **
SRD	Post-test	15	0.9375 (0.8125, 0.96875)	9.5	−3.0311	0.0025 **
Traditional	Pre-test	7	0.75 (0.5, 0.78125)	26.5	−1.8328	0.0669
Traditional	Post-test	15	0.9375 (0.6875, 0.9375)	26.5	−1.8328	0.0669

* p < 0.05, ** p < 0.01.

Table 5. Normality test results for simulation operation accuracy data.

Group	Stage	$n$	$\bar{X}$	Skewness	Kurtosis	Shapiro–Wilk Test
Group	Stage	$n$	$\bar{X}$	Skewness	Kurtosis	W	p
SRD	Simulation Operation	11	86.93%	−0.55	−0.85	0.8972	0.1706
Traditional	Simulation Operation	11	63.07%	−0.42	−0.37	0.9454	0.5863

* p < 0.05, ** p < 0.01.

Table 6. The t-test analysis results for simulation operation data of two BST methods.

Group	Stage	$n$	The Results of the Homogeneity of Variance Analysis			The Results of the t-Test Analysis
Group	Stage	$n$	$σ$	F	p	$\bar{X} \pm σ$	t	p
SRD	Simulation Operation	11	0.1099	3.5683	0.0735	0.8693 ± 0.1099	2.8189	0.0106 *
Traditional	Simulation Operation	11	0.2584	3.5683	0.0735	0.6307 ± 0.2584	2.8189	0.0106 *

* p < 0.05, ** p < 0.01.

Table 7. Normality test results for subjective indicators related to the SRD method.

Indicators	$n$	$\bar{X}$	Skewness	Kurtosis	Shapiro–Wilk Test
Indicators	$n$	$\bar{X}$	Skewness	Kurtosis	W	p
Involvement	32	34.063	−1.428	3.228	0.891	0.004 **
Sensory fidelity	32	16.938	−0.094	−0.622	0.966	0.408
Adaptation/Immersion	32	28.125	−0.169	−0.503	0.973	0.596
Interface quality	32	9.875	0.379	−0.693	0.956	0.208
Interest/Enjoyment	32	6.146	−0.681	−0.689	0.886	0.003 **
Perceived competence	32	5.833	−0.994	0.391	0.896	0.005 **
Effort/Importance	32	6.01	−0.099	−1.666	0.84	0.000 **
Pressure/Tension	32	2.646	0.221	−1.253	0.919	0.020 *
Value/Usefulness	32	6.521	−1.303	1.578	0.803	0.000 **
SUS	32	74.766	−0.365	−0.139	0.978	0.732

* p < 0.05, ** p < 0.01.

Table 8. Normality test results for subjective indicators related to the traditional method.

Indicators	$n$	$\bar{X}$	Skewness	Kurtosis	Shapiro–Wilk Test
Indicators	$n$	$\bar{X}$	Skewness	Kurtosis	W	p
Involvement	32	16.375	0.266	0.526	0.965	0.371
Sensory fidelity	32	6.375	2.506	8.791	0.759	0.000 **
Adaptation/Immersion	32	18.406	0.322	−0.037	0.979	0.773
Interface quality	32	10.25	−0.09	−1.091	0.952	0.159
Interest/Enjoyment	32	3.552	0.289	1.153	0.953	0.181
Perceived competence	32	4.51	−0.494	0.976	0.949	0.138
Effort/Importance	32	4.969	−0.088	−0.961	0.957	0.22
Pressure/Tension	32	3.073	0.193	−1.008	0.937	0.063
Value/Usefulness	32	4.896	0.264	−0.183	0.957	0.231
SUS	32	69.609	−0.552	−0.274	0.954	0.183

* p < 0.05, ** p < 0.01.

Table 9. Results of the homogeneity of variance analysis for subjective indicators.

Indicators	Type (σ)		F	p
Indicators	SRD (n = 32)	Traditional (n = 32)	F	p
Involvement	3.53	4.24	1.059	0.308
Sensory fidelity	2.27	3.05	0.256	0.615
Adaptation/Immersion	2.86	4.87	5.891	0.018 *
Interface quality	3.51	4.22	1.753	0.19
Interest/Enjoyment	0.82	0.97	0.071	0.791
Perceived competence	0.97	1.36	1.711	0.196
Effort/Importance	0.85	1.35	6.019	0.017 *
Pressure/Tension	1.21	1.54	2.782	0.1
Value/Usefulness	0.55	1.11	10.918	0.002 **
SUS	12.43	16.99	3.114	0.083

* p < 0.05, ** p < 0.01.

Table 10. The analysis results of non-parametric tests for PQ indicators.

Indicators	Type Mdn (P₂₅, P₇₅)		U	z	p
Indicators	SRD (n = 32)	Traditional (n = 32)	U	z	p
Involvement	35.000 (33.0, 36.0)	16.000 (13.3, 19.0)	3.5	−6.841	<0.01 **
Sensory fidelity	17.000 (15.0, 19.0)	6.000 (4.0, 7.0)	25.5	−6.554	<0.01 **
Adaptation/Immersion	28.000 (26.0, 30.0)	18.500 (14.3, 21.0)	51.5	−6.195	<0.01 **

* p < 0.05, ** p < 0.01.

Table 11. The t-test analysis results for IMI indicators.

Indicators	Type ( $\bar{X} \pm σ$ )		t	p
Indicators	SRD (n = 32)	Traditional (n = 32)	t	p
Interest/Enjoyment	6.15 ± 0.82	3.55 ± 0.97	11.539	<0.01 **
Perceived competence	5.83 ± 0.97	4.51 ± 1.36	4.493	<0.01 **
Pressure/Tension	2.65 ± 1.21	3.07 ± 1.54	−1.235	0.221

* p < 0.05, ** p < 0.01.

Table 12. The analysis results of non-parametric tests for IMI indicators.

Indicators	Mdn (P₂₅, P₇₅)		U	z	p
Indicators	SRD (n = 32)	Traditional (n = 32)	U	z	p
Effort/Importance	6.167 (5.0, 7.0)	5.000 (3.8, 6.3)	277.5	−3.179	0.001 **
Value/Usefulness	6.667 (6.1, 7.0)	4.667 (4.3, 5.7)	115.5	−5.382	0.000 **

* p < 0.05, ** p < 0.01.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Lyu, L.; Hu, T.; Wang, H.; Hou, W. SRD Method: Integrating Autostereoscopy and Gesture Interaction for Immersive Serious Game-Based Behavioral Skills Training. Electronics 2025, 14, 1337. https://doi.org/10.3390/electronics14071337

AMA Style

Lyu L, Hu T, Wang H, Hou W. SRD Method: Integrating Autostereoscopy and Gesture Interaction for Immersive Serious Game-Based Behavioral Skills Training. Electronics. 2025; 14(7):1337. https://doi.org/10.3390/electronics14071337

Chicago/Turabian Style

Lyu, Linkai, Tianrui Hu, Hongrun Wang, and Wenjun Hou. 2025. "SRD Method: Integrating Autostereoscopy and Gesture Interaction for Immersive Serious Game-Based Behavioral Skills Training" Electronics 14, no. 7: 1337. https://doi.org/10.3390/electronics14071337

APA Style

Lyu, L., Hu, T., Wang, H., & Hou, W. (2025). SRD Method: Integrating Autostereoscopy and Gesture Interaction for Immersive Serious Game-Based Behavioral Skills Training. Electronics, 14(7), 1337. https://doi.org/10.3390/electronics14071337

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

SRD Method: Integrating Autostereoscopy and Gesture Interaction for Immersive Serious Game-Based Behavioral Skills Training

Abstract

1. Introduction

2. Related Works

2.1. Immersive XR Technology

2.2. Common Interaction Methods in Immersive XR

2.3. Behavioral Skills Training

2.4. Serious Games

3. Method

3.1. Hardware

3.2. Interaction Methods

3.3. Application

3.4. Content Optimization

4. User Study

4.1. Design of User Study

4.2. Procedure

4.3. Evaluation Indicators

4.4. Data Collection and Analysis

5. Results

5.1. Analysis of Pre-Test and Post-Test

5.2. Analysis of Simulation of the Operation

5.3. Analysis of Subjective Indicators

5.4. Analysis of Subjective Scale

5.4.1. Presence Questionnaire

5.4.2. The Intrinsic Motivation Inventory

5.4.3. System Usability Scale

6. Discussion

6.1. Advantages of the SRD Method

6.2. Disadvantages of the SRD Method

6.3. Skill Transfer from Virtual to Real

6.4. Promotion of the SRD Method

7. Conclusions

Supplementary Materials

Author Contributions

Funding

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI