1. Introduction
Radiation therapy (RT) is a cornerstone of cancer treatment, used for over half of all cancer patients worldwide, either as a standalone modality or in combination with surgery and chemotherapy [1]. The primary objective of RT is to maximize tumor control while minimizing radiation-induced damage to surrounding healthy tissues and organs at risk [2]. Achieving this balance relies heavily on accurate and reproducible patient setup in every treatment session. Even minor setup deviations can result in significant dosimetric errors, reducing tumor control probability or increasing the risk of toxicity [3]. To ensure setup accuracy, various immobilization devices, including thermoplastic masks and vacuum cushions, are widely used in clinical settings [4]. Advanced techniques such as Image-Guided Radiation Therapy (IGRT) and Augmented Reality (AR)-assisted systems have also been introduced to improve setup precision [5,6].
These technological advances have enabled highly accurate mechanical patient positioning. However, if the patient's posture is twisted between treatment planning and treatment delivery, the resulting misalignment cannot be corrected by simple couch translations or rotations, and manual patient setup involving direct contact with the patient is required. Maintaining consistent setup accuracy remains difficult, especially for complex treatment regions such as the chest and pelvis, where variability in patient anatomy, patient movement, and operator skill contributes to setup errors. Conventional setup training relies on lectures, static images, and limited hands-on practice; these methods do not adequately replicate clinical scenarios or develop spatial understanding [7,8]. Consequently, there is a growing demand for intuitive, immersive, and interactive training tools that bridge the gap between theoretical knowledge and clinical practice.
In recent years, AR technology has emerged as a powerful educational tool in healthcare. AR overlays digital 3D models and clinical data onto the physical environment, providing learners with real-time, spatially accurate, and interactive experiences [9,10]. Systematic reviews have shown that AR-based training can improve procedural performance, anatomical understanding, and learner engagement in disciplines such as surgery, anatomy, and nursing [7,11]. In the context of radiation oncology, AR offers promising applications for simulating patient setup and treatment workflows in a safe, repeatable setting [5,12]. For example, Microsoft HoloLens 2 enables users to visualize and manipulate anatomical models in real-world space, facilitating experiential learning without the risks associated with real patients [12,13].
However, research on AR-based radiation therapy training remains limited, particularly in evaluating its spatial accuracy, clinical integration, and training effectiveness. While some prototype systems have demonstrated feasibility, further validation is needed to establish their utility in real-world educational and clinical environments.
This study aims to develop and evaluate an AR-based training system for patient setup in radiation therapy using Microsoft HoloLens 2. High-resolution 3D anatomical models were generated from CT images, and surface models were acquired using photogrammetry-based methods, then fused for enhanced realism. The system integrates spatial anchors and QR code markers to ensure precise alignment between virtual models and physical phantoms.
2. Materials and Methods
2.1. Research Procedure
Figure 1 illustrates the overall workflow for developing an AR-based training application for patient setup in radiation therapy. In the model generation phase, two types of 3D models were created: CT-based anatomical models using 3D Slicer and surface models acquired through 3D scanning with Luma AI. Each method has distinct strengths and limitations: 3D Slicer provides high-resolution internal anatomy with smooth surfaces but lacks complete body contours, especially in the extremities, whereas Luma AI offers rapid full-body capture but with lower geometric fidelity and surface roughness. To address these limitations, the two models were integrated, combining anatomical accuracy and external realism to create a more complete training model.
In the AR development phase, the fused model was imported into Unity to build an application for HoloLens 2. The system enables users to view and interact with 3D patient models in mixed reality, offering an immersive, radiation-free training experience that enhances spatial awareness and setup skills.
2.2. 3D Model Generation
This study employed a multi-source 3D modeling strategy to develop an AR-based training system for radiation therapy. CT data processed with 3D Slicer provided accurate internal anatomy and smooth anterior surfaces. However, it lacked complete contours (e.g., incomplete arms) and did not capture surface textures, as X-ray attenuation reflects internal density rather than external micro-relief. In contrast, Luma AI rapidly reconstructs complete external contours from RGB videos, preserving gross body shape and appearance, though with lower geometric fidelity and occasional surface roughness or backside holes. By fusing CT (internal accuracy) with Luma AI (external completeness), the resulting model combines anatomical precision with external realism, thereby enhancing its pedagogical value for AR training.
2.2.1. AR Model Construction Using Luma AI and 3D Slicer
Figure 2 shows the front and back views of a 3D model generated using Luma AI, an AI-based platform for rapid 3D reconstruction. In this study, Luma AI was used to create visual models for the AR-based radiation therapy training system. We used a smartphone rear camera (60 fps) to record multi-angle videos of the phantom. The recording followed a circular path at a distance of 0.6–1.2 m and a height range of 0.7–1.6 m; each capture lasted 90–120 s without flash, and background clutter was cleared to improve reconstruction quality. AI algorithms then reconstructed and completed the geometry. The resulting model had some limitations, such as surface roughness and missing detail on the back. Even so, the external contours were sufficient for AR development in Unity, though further refinement is needed.
Figure 3 shows the high-precision 3D anatomical model generated from CT images using 3D Slicer. The model was developed to support the AR-based training system for radiation therapy. CT data were acquired from an anthropomorphic phantom. The acquisition parameters were: slice thickness 1 mm, resolution 512 × 512 pixels, and field-of-view (FOV) diameter 40 cm. The images were imported into 3D Slicer. Body surface segmentation was performed to reconstruct a smooth model with clear anatomical structures. Because of the limited CT scanning range, some limb structures, such as the arms, were incomplete.
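The body-surface segmentation itself was performed interactively in 3D Slicer. Purely as an illustration of the underlying idea, the sketch below shows a comparable threshold-and-marching-cubes pipeline in Python using SimpleITK, scikit-image, and trimesh; the HU threshold, file names, and choice of libraries are illustrative assumptions rather than the procedure actually used in 3D Slicer.

```python
# Hypothetical sketch: extract an external body surface from a CT volume,
# analogous to the body-surface segmentation performed in 3D Slicer.
# The threshold value, file paths, and libraries are illustrative assumptions.
import numpy as np
import SimpleITK as sitk
from skimage import measure
import trimesh

# Load the CT volume (e.g., a DICOM series previously converted to NRRD/NIfTI).
ct = sitk.ReadImage("phantom_ct.nrrd")
hu = sitk.GetArrayFromImage(ct)      # array shape (slices, rows, cols), values in HU
spacing = ct.GetSpacing()[::-1]      # reorder (x, y, z) -> (z, y, x) to match the array

# Simple body mask: everything denser than air (about -300 HU is a common cutoff).
body = hu > -300

# Marching cubes on the binary mask yields a triangulated body surface in mm.
verts, faces, _, _ = measure.marching_cubes(body.astype(np.uint8), level=0.5,
                                            spacing=spacing)

# Export as STL so the surface can be imported into Blender or Unity.
trimesh.Trimesh(vertices=verts, faces=faces).export("body_surface.stl")
```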
This study focused on the thoracic region, as chest setup is technically challenging and involves critical organs such as the lungs, heart, and major blood vessels. These structures are highly susceptible to displacement caused by respiration or posture changes, where even minor setup errors may result in dose distribution inaccuracies and increased risk.
To enhance spatial understanding of internal anatomy during training, we performed organ segmentation in 3D Slicer using the AutoSeg plugin (version d748fd3, 2024-10-24). AutoSeg is a deep learning-based tool that generates 3D anatomical models deterministically and without manual parameter adjustment, so the output is reproducible for a given CT input [14]. In this study, the segmented organs included the lungs, trachea, heart, sternum, spine, and ribs, which were imported into the AR system for visualization (Figure 4).
2.2.2. Blender-Based Model Fusion and Model Visualization in AR
Comparative analysis revealed that the CT-derived model had a smooth surface and clearly defined anatomical structures. However, due to its limited scanning range, it lacked peripheral regions such as the limbs. In contrast, the Luma AI model rapidly captured the complete external morphology, including the arms and legs, providing realistic posture cues for training, albeit with lower anatomical fidelity. Combining these models ensures that even when the thoracic region is the focus, students can still perceive posture–anatomy correspondence within a full-body context.
To overcome these limitations, this study used Blender to fuse the two models. In Blender, the Transform tools (Move, Rotate, Scale) and the Snap function were used for manual registration. The main torso and internal structures from the CT model were preserved, and the arm regions from the Luma AI model were aligned with the CT torso using anatomical landmarks such as the nose tip and shoulder joints, assisted by the 3D Cursor and Origin tools. After alignment, the models were merged with the Boolean Modifier (Union) and smoothed using the Voxel Remesher (voxel size 2.0 mm) and Subdivision Surface (Level 1), yielding the final integrated model shown in Figure 5. A scripted version of these fusion steps is sketched below.
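The minimal Blender Python (bpy) sketch below reproduces the merging and smoothing steps described above (Boolean union, 2.0 mm voxel remesh, level-1 Subdivision Surface). The object names and metre-scale units are illustrative assumptions, and it presumes the arm regions have already been cropped and registered; the actual fusion in this study was performed interactively in the Blender GUI.

```python
# Hypothetical Blender (bpy) sketch of the fusion steps described above:
# Boolean union of the CT torso and the Luma AI arms, a 2.0 mm voxel remesh,
# and a level-1 Subdivision Surface. Object names are illustrative.
import bpy

ct_model = bpy.data.objects["CT_Torso"]    # CT-derived torso with internal anatomy
arms = bpy.data.objects["LumaAI_Arms"]     # arm regions cropped from the Luma AI scan

# Boolean union: merge the Luma AI arms into the CT torso.
union = ct_model.modifiers.new(name="FuseArms", type='BOOLEAN')
union.operation = 'UNION'
union.object = arms
bpy.context.view_layer.objects.active = ct_model
bpy.ops.object.modifier_apply(modifier=union.name)

# Voxel remesh at 2.0 mm to obtain a watertight, evenly sampled surface
# (scene units assumed to be metres).
ct_model.data.remesh_voxel_size = 0.002
bpy.ops.object.voxel_remesh()

# Level-1 Subdivision Surface for final smoothing.
subdiv = ct_model.modifiers.new(name="Smooth", type='SUBSURF')
subdiv.levels = 1
subdiv.render_levels = 1
```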
2.3. AR Simulation Process for Radiotherapy Setup
The radiation therapy room is used for clinical purposes during the day, so students typically cannot access it. To enable setup training at any time, a general radiography couch with three-axis movement, normally used for X-ray imaging practice, was used as a simulated radiation therapy couch.
In this study, a central QR code (C-QRcode, 125 × 125 mm²) was used as the spatial reference for the AR-based radiotherapy patient setup simulation. Because the isocenter lies in mid-air, a QR code could not be attached at the isocenter itself; it was therefore placed on the floor directly below the isocenter. The QR code was printed with crosshairs on A4 paper, and a laser positioning device was used to align its center with the point on the floor directly below the isocenter of the treatment system, ensuring accurate correspondence between the virtual 3D model and the clinical setup.
During the simulation (Figure 6), the built-in camera of the HoloLens 2 detected the C-QRcode in real time, calculated its center coordinates relative to the AR world origin, and displayed the 3D patient model and the Varian TrueBeam model at the corresponding location. The procedure consisted of the following steps (a sketch of the underlying placement calculation is given after the list):
QR code positioning—Place the C-QRcode on the floor directly below the treatment system isocenter and verify its position using the laser positioning system.
Virtual model display—Wear the HoloLens 2, which detects the QR code and renders the 3D patient and Varian TrueBeam linear accelerator models at the corresponding position.
Physical alignment—Adjust the physical phantom on the treatment couch until it is spatially aligned with the virtual model displayed in the HoloLens 2.
Completion of simulation—Once the virtual and physical models are fully aligned, the radiotherapy patient setup simulation is completed.
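The core of step 2 is a simple placement calculation: because the C-QRcode lies on the floor directly below the isocenter, the virtual isocenter is obtained by shifting the detected QR center upward along the room vertical by the floor-to-isocenter height, and the patient model is translated so that its planned isocenter coincides with that point. The sketch below illustrates this idea with numpy; the Y-up axis convention (as in Unity) and the 1.3 m height are illustrative assumptions, and the actual application performs this placement inside the Unity/HoloLens 2 runtime.

```python
# Hypothetical sketch of the placement logic: the detected C-QRcode lies on the
# floor directly below the isocenter, so the virtual isocenter is the QR centre
# shifted up by the floor-to-isocenter height. Axis convention (Y up) and the
# height value are illustrative assumptions.
import numpy as np

def isocenter_from_qr(qr_center_m: np.ndarray,
                      up_axis: np.ndarray,
                      floor_to_isocenter_m: float) -> np.ndarray:
    """Return the virtual isocenter position in AR world coordinates (metres)."""
    up = up_axis / np.linalg.norm(up_axis)
    return qr_center_m + floor_to_isocenter_m * up

def place_model(model_vertices_m: np.ndarray,
                model_isocenter_m: np.ndarray,
                world_isocenter_m: np.ndarray) -> np.ndarray:
    """Translate the patient model so its planned isocenter lands on the world isocenter."""
    return model_vertices_m + (world_isocenter_m - model_isocenter_m)

# Example: QR code detected on the floor, 2.1 m in front of the AR world origin.
qr_center = np.array([0.0, 0.0, 2.1])
iso_world = isocenter_from_qr(qr_center, up_axis=np.array([0.0, 1.0, 0.0]),
                              floor_to_isocenter_m=1.3)   # assumed height
print("Virtual isocenter (m):", iso_world)
```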
2.4. System Accuracy Evaluation: QR Code Setup Stability
To evaluate the setup stability of the QR code-based tracking system in the AR training environment, three experimental assessments were performed (Figure 7).
First, time stability was evaluated by placing the QR code at a fixed distance of 1.0 m from the HoloLens 2 camera and recording its coordinates continuously for 10 s to detect possible temporal drift (Figure 7b). The standard deviation of these measurements ($\sigma_{\mathrm{time}}$) was used as the stability indicator.
Second, distance sensitivity was assessed by varying the distance between the HoloLens 2 and the QR code to 0.5 m, 1.0 m, and 1.5 m and recording the positional coordinates at each setting (Figure 7c). The variability was quantified using the standard deviation ($\sigma_{\mathrm{distance}}$).
Third, angle sensitivity was investigated by fixing the distance at 1.0 m and changing the viewing angle to 0°, 30°, 45°, and 60°, then recording the coordinate changes (Figure 7d). The standard deviation ($\sigma_{\mathrm{angle}}$) represented the angular sensitivity.
As illustrated in Figure 7, s* represents the designated reference point on the QR code used for coordinate measurements across all evaluations.
Temporal stability, distance sensitivity, and angular sensitivity were each evaluated using the standard deviation (S.D.) of repeated measurements as the stability indicator. To obtain a comprehensive measure of system reliability, these three variabilities were combined in quadrature to calculate the expanded uncertainty (U), following the ISO GUM methodology, yielding a single measure of temporal stability, distance sensitivity, and angular sensitivity within a unified framework.
Finally, the expanded uncertainty (U) was calculated as
$$U = k\sqrt{\sigma_{\mathrm{time}}^{2} + \sigma_{\mathrm{distance}}^{2} + \sigma_{\mathrm{angle}}^{2}},$$
where k = 2 corresponds to an approximately 95% confidence level. This provided a combined measure of setup uncertainty incorporating temporal, distance-related, and angular effects, offering a comprehensive evaluation of tracking reliability for AR-based radiotherapy training.
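As a worked illustration of this combination, the sketch below computes a positional standard deviation for each of the three conditions and combines them in quadrature with k = 2. The measurement arrays are randomly generated placeholders, and the particular scatter metric (3D root-mean-square deviation about the mean) is one reasonable choice rather than the exact definition used in the study.

```python
# Worked example of the ISO GUM-style quadrature combination described above.
# The coordinate arrays are placeholders for repeated QR-centre positions (mm)
# recorded by the HoloLens 2 under each condition.
import numpy as np

def positional_sd(samples_mm: np.ndarray) -> float:
    """Root-mean-square 3D scatter of repeated positions about their mean (mm)."""
    d = samples_mm - samples_mm.mean(axis=0)
    return float(np.sqrt((d ** 2).sum(axis=1).mean()))

rng = np.random.default_rng(0)
sigma_time     = positional_sd(rng.normal(0.0, 0.5, size=(100, 3)))  # 10 s at 1.0 m
sigma_distance = positional_sd(rng.normal(0.0, 0.8, size=(30, 3)))   # 0.5, 1.0, 1.5 m
sigma_angle    = positional_sd(rng.normal(0.0, 1.0, size=(40, 3)))   # 0-60 deg angles

k = 2  # coverage factor, approximately 95% confidence level
U = k * np.sqrt(sigma_time**2 + sigma_distance**2 + sigma_angle**2)
print(f"Expanded uncertainty U = +/-{U:.2f} mm")
```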
2.5. Model Overlap Evaluation: Coordinate Acquisition
To evaluate the spatial alignment between the virtual and physical models, corresponding feature point coordinates were obtained in both environments.
To assess system performance, repeated setup simulations were conducted using key anatomical landmarks, and the resulting spatial deviations were quantitatively analyzed. In the virtual environment (Figure 8a), three anatomical feature points were selected on the 3D model: the nose tip, left hip, and right hip. Their real-time 3D coordinates were obtained using a Unity C# script and designated as Virtual Points 1–3.
In the physical environment (Figure 8b), QR codes were placed at the corresponding anatomical locations on the anthropomorphic phantom. The center coordinates of each QR code were recorded using the HoloLens 2 camera and designated as Real Points 1–3.
The center of the C-QRcode was used as the common reference point for both environments, enabling direct comparison of corresponding feature points. These coordinates formed the basis for spatial alignment assessment and deviation analysis.
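A minimal sketch of the resulting deviation analysis is shown below, assuming both point sets have already been expressed relative to the C-QRcode center; the coordinate values are placeholders, not measured data.

```python
# Minimal sketch of the overlap evaluation: per-axis and Euclidean deviations
# between corresponding virtual and real feature points, both expressed
# relative to the C-QRcode centre. Coordinates (mm) are placeholders.
import numpy as np

labels = ["nose tip", "left hip", "right hip"]
virtual_mm = np.array([[   0.0, 210.0, 950.0],
                       [-150.0,  40.0, 180.0],
                       [ 150.0,  40.0, 180.0]])
real_mm    = np.array([[   8.0, 225.0, 962.0],
                       [-139.0,  52.0, 171.0],
                       [ 162.0,  55.0, 170.0]])

per_axis_mm = real_mm - virtual_mm               # signed X/Y/Z deviation per landmark
euclid_mm = np.linalg.norm(per_axis_mm, axis=1)  # 3D deviation per landmark
for name, d, e in zip(labels, per_axis_mm, euclid_mm):
    print(f"{name:>9}: dX={d[0]:+7.1f}  dY={d[1]:+7.1f}  dZ={d[2]:+7.1f}  |d|={e:6.1f} mm")
```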
2.6. Example of a Patient Setup Training Session for Students
Using this system, a medical physics student with no prior clinical experience in radiation therapy conducted setup training, which was repeated five times. The setup was evaluated by measuring the positional deviations, along the X, Y, and Z axes, of three QR codes placed on the nose tip and pelvis (left and right hips) of the patient phantom.
Three non-collinear anatomical landmarks (nose tip, left hip, right hip) were selected to define spatial orientation and position. This preliminary evaluation involved one student (N = 5 trials) and was sufficient for a feasibility demonstration. However, future studies should include more landmarks and a larger cohort for robust evaluation.
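For the repeated-trial analysis, the per-axis deviations of each landmark can be summarized as mean ± sample standard deviation over the five trials, as sketched below with placeholder values.

```python
# Sketch of the per-axis summary over repeated setup trials (placeholder data):
# rows are trials, columns are X/Y/Z deviations of one landmark in mm.
import numpy as np

nose_tip_dev_mm = np.array([[ 12.0,  -8.0, 21.0],
                            [  9.5, -11.0, 18.0],
                            [ 15.0,  -6.5, 25.0],
                            [ 11.0,  -9.0, 19.5],
                            [ 13.5,  -7.0, 22.0]])

mean_mm = nose_tip_dev_mm.mean(axis=0)
sd_mm = nose_tip_dev_mm.std(axis=0, ddof=1)   # sample SD across the N = 5 trials
for axis, m, s in zip("XYZ", mean_mm, sd_mm):
    print(f"{axis}: {m:+5.1f} +/- {s:4.1f} mm")
```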
4. Discussion
We developed an AR-based patient setup simulation system using HoloLens 2 and evaluated its spatial accuracy. QR code tracking demonstrated good stability with an expanded uncertainty of ±2.74 mm (millimeter-level variation). However, setup simulations produced centimeter-level deviations along the X, Y, and Z axes. The primary causes of this discrepancy are likely the students' limited patient setup skills, the manual placement of the QR markers, and the absence of real-time alignment verification. In addition, several practical factors may have amplified these deviations. First, visual acuity and familiarity with AR interfaces differ among novice students, which can affect their ability to perceive subtle misalignments. Second, the anthropomorphic phantom has curved surfaces, making it appear different from various viewing angles and complicating alignment. Third, the phantom is relatively heavy, so fine adjustments are physically difficult and often lead to overshooting. These factors explain why deviations increased to the centimeter level, even though QR code tracking itself maintained millimeter-level stability. Future work will incorporate error propagation analysis and confidence intervals to more systematically separate user-related variability from system-related limitations.
The system used a single QR code as an anchor for simplicity, low cost, and classroom feasibility. However, this approach amplifies slight misplacements when the headset changes perspective, particularly along the Y (vertical) and Z (depth) axes, and provides no dynamic feedback mechanism to correct drift. Future work will investigate multi-marker templates and hybrid tracking to improve accuracy.
The images from the HoloLens 2 view in Figure 8 and Figure 9 were of relatively low resolution, limiting visualization and spatial analysis. This was primarily due to network fluctuations during transmission. Future studies could use wired or high-bandwidth connections, or local high-definition caching, to improve image quality.
Prior studies using marker-based AR exhibited smaller errors: Tarutani et al. [15] achieved sub-millimeter accuracy (0.5–0.8 mm), but at the expense of greatly increased setup time, often exceeding 10 min per session. In contrast, our system showed centimeter-level deviations (up to 33 mm), yet a complete trial could be finished within minutes. This trade-off indicates that while Tarutani's method is clinically precise, our approach prioritizes efficiency and accessibility, which are more suitable for classroom training. Johnson et al. [16] reported 3.0 ± 1.5 mm accuracy using VSLAM with HoloLens 2. While more precise than our results, VSLAM depends heavily on environmental features and is prone to drift in texture-poor areas. By contrast, our QR code approach provides deterministic anchoring with lower computational demand, yielding millimeter-level stability (expanded uncertainty ±2.74 mm) despite user variability, which may be more robust and cost-effective for training contexts. Compared with Tarutani et al. [15] and Johnson et al. [16], our results demonstrated a novel approach: by applying the ISO GUM methodology, we combined temporal stability, distance sensitivity, and angular sensitivity into a single expanded uncertainty (U). This integrated evaluation highlights the novelty of our study, as previous research often reported these factors separately.
More robust alternatives have emerged in the recent literature. Zhai et al. [17] combined AR with point-cloud ICP registration, achieving 0.6 ± 0.2 mm accuracy. This precision, while clinically impressive, requires depth-equipped hardware and intensive computation, limiting feasibility for widespread educational deployment. Zhang et al. [18] reported 1.6 ± 0.9 mm errors using structured-light surface imaging with AR overlay, while also reducing setup time compared with CBCT workflows. Their approach highlights a clinically viable balance of accuracy and efficiency. Our system, although less precise, achieved comparable training benefits with far simpler hardware, underscoring that meaningful outcomes can be realized in education even without clinical-grade precision. Future work could explore integrating elements of structured light or point-cloud tracking to narrow the accuracy gap while retaining the feasibility advantages demonstrated here.
In contrast, surface-guided radiation therapy (SGRT) has already become widely adopted in clinical practice, covering more than 40% of treatment fractions in the U.S. Rudat et al. demonstrated inter-fraction setup errors of 3.6 mm (SGRT) versus 4.5 mm (laser/tattoos) in thorax, abdomen, and pelvis setups (p = 0.001) [19]. Oliver et al. showed that augmenting SGRT with real-time holographic outlines (Postural Video™, VisionRT, London, UK) reduced setup time by 28% and minimized repeat imaging by 63% [20]. These findings highlight that continuous surface feedback is critical for achieving millimeter-level accuracy, something entirely absent in our current AR workflow. Compared with prior methods such as SGRT combined with CBCT, our AR-based approach achieved shorter setup times, which highlights its educational practicality despite its lower accuracy.
Hardware platforms also matter. Frisk et al. [21] reported spine phantom placement errors of 1–2 mm using Magic Leap 2 with dual RGB cameras and an active depth sensor, which is far more precise than our HoloLens 2 system, though less readily integrated into radiotherapy workflows.
Despite these limitations, participants in pilot training sessions described the fused anatomical visualization as "intuitively linking posture with anatomy." This educational benefit mirrors Wang et al.'s findings that AR improves patient understanding and comfort [22], and Zhang et al.'s workflow that reduced training time by over 30% [18]. However, the analysis of the student setup training results appeared relatively subjective and showed a larger standard deviation than the other measurements. Future studies should include more simulation trials with a larger student cohort to reduce variability and improve reliability.
The limitations of this study are summarized as follows:
The observed deviations may partly reflect the student’s limited setup skills. Future validation with experienced clinicians will help distinguish skill-related effects from methodological limitations.
Moreover, typical thoracic tumors measure approximately 20–40 mm. In our study, setup deviations reached up to 33 mm, which would compromise tumor targeting in clinical settings.
In addition, this study did not include an evaluation of the gross tumor volume (GTV). Since the primary aim was setup training, tumor-related assessment was outside the study scope. We therefore acknowledge both the relatively large alignment errors and the absence of GTV evaluation as important limitations.
Because this study is primarily a methodological proposal, the training evaluation was limited to one novice participant. Future studies with multiple students and larger sample sizes will be required to validate the statistical significance of the results.
Finally, the system provides a real-time display of the virtual models but lacks automatic feedback to confirm alignment accuracy, which remains a key limitation of the current implementation.
Future improvements will focus on:
Passive–active hybrid markers: rigid, multi-marker placement templates to reduce manual variability.
Real-time depth fusion: structured light or LiDAR-based point-cloud ICP at 10 Hz, akin to Zhai et al.'s method [17].
Closed-loop imaging verification: automatic low-dose CBCT or surface imaging triggering if the alignment error exceeds 5 mm, as in Zhang et al.'s workflow [18].
Curriculum integration: set clear goals and assessment criteria, and pilot the system in laboratory settings before integrating it into pre-clinical modules with instructor guides and multi-learner/multi-device support.
We will also ensure privacy and cybersecurity.
In summary, while QR code tracking shows millimeter-level stability, the observed centimeter-level misalignments indicate that the current workflow is not sufficient for clinical-grade setup. In contrast, SGRT and depth-enhanced AR workflows consistently demonstrate sub-5 mm accuracy. Integrating these technologies, together with improved marker design, tracking, and feedback, could transform our system into a clinically viable AR-assisted setup solution.
5. Conclusions
This study developed an AR-based training system using HoloLens 2 to support radiation therapy setup, offering an interactive and realistic environment. Accuracy evaluation showed that QR code tracking achieved millimeter-level variation, with an expanded uncertainty of ±2.74 mm, indicating good stability. However, during student patient setup training, centimeter-level deviations were observed along the X, Y, and Z axes, mainly due to the student's limited setup skills, the manual placement of the QR codes, and the absence of real-time verification. The findings support the feasibility of the proposed AR-based training system. Further validation with larger cohorts will be necessary to confirm statistical robustness.
This patient setup training system can be implemented outside a radiation treatment room whenever a three-axis movable couch and a patient phantom are available, expanding its range of applications. The ability to repeat training without time constraints will enhance user proficiency. This study contributes to immersive educational tools for radiation oncology. AR technology shows potential to improve staff training, reduce setup errors, and enhance treatment safety.
Future research will focus on improving tracking stability, workflow efficiency, and real-time feedback, as well as expanding into other treatment areas.