1. Introduction
Patients with severe skeletal deformities cannot be treated with orthodontics alone to create a harmonious face and stable occlusion. These patients instead undergo a treatment comprising an orthodontic preparation phase followed by orthognathic surgery [
1]. Conventional orthognathic treatment planning comprises model plaster surgery and the use of cephalograms to display the relation among dentition, the facial profile, and the facial skeleton [
2]. Two-dimensional simulation of the facial appearance of the surgical outcome, however, will be insufficient because it does not address volumetric changes across all facial structures. With the use of cone beam computed tomography (CBCT) and computer-aided design/computer-aided manufacturing technology, it is possible to add a third dimension to surgical treatment planning, allowing for greater accuracy [
2]. The aim of the orthodontic preparation is to deliver optimal decompensated dental arches with correct incisor inclination in relation to the jaws to finish in a stable occlusion in a harmonious profile after the surgery [
3]. In contrast to the surgical treatment phase, orthodontic preparation is currently not simulated digitally before the start of treatment.
Digital dental models are essential components of orthodontic diagnosis and treatment planning, allowing evaluation of treatment need and outcome [
4]). These models are as accurate as traditional plaster models for orthodontic diagnostics and offer advantages that include ease of storage and retrieval, less risk of damage, and better interoffice transferability [
5,
6,
7,
8,
9]. In 1956, Kesling [
10] suggested that cutting and repositioning the teeth in duplicate plaster dental models of the malocclusion, known as the orthodontic setup, would allow simulation of the results before starting orthodontic treatment. With the digitalization of dental models, it is possible to construct digital orthodontic setups that are as accurate as manual setups [
11]. These digital versions also offer the advantage of easy fabrication and presentation of therapeutic alternatives to patients and professionals compared to manual setups [
8]. Orthodontic setups can be distinguished as either diagnostic or therapeutic. The diagnostic version is a tool to assist the orthodontist in treatment planning for complex cases, such as asymmetric extractions, congenital missing teeth, or combined orthodontic-surgical treatments. Whereas the therapeutic setup can also serve that purpose, and can be used further for the fabrication of orthodontic appliances, such as clear aligners and customized fixed appliance systems [
6].
With the introduction of a new method for integrating digital setups into CBCT scans, orthodontic setups can be integrated into the virtual orthognathic planning. Instead of being fabricated just before the surgery, all of the planning can now be mapped out ahead of orthodontic treatment initiation [
12]. A study by Falter et al. [
13] revealed that one in seven orthognathic patients (13.5%) underwent a different surgical procedure from that originally planned mainly because of facial aesthetic considerations or a negative overjet that did not change as expected in Class III patients. Other reasons for changing a treatment plan included not gaining sufficient width in the maxilla or having an open bite closure after surgically assisted rapid maxillary expansion (SARME) [
13]. Pre-treatment virtual planning of the presurgical orthodontic treatment could lead to fewer treatment plan changes and a more predictable treatment outcome. Planning ahead of the combined orthodontic-surgical treatment also enables comparison between the planned result and final outcome.
Data are limited on the reproducibility of digital setups. In a study comparing the digital setups of six clinicians, measured based on the American Board of Orthodontics Objective Grading System (ABO-OGS) scores, it was concluded that the inter- and intra-operator reliabilities were sufficient for general use of the digital orthodontic setup, although ABO-OGS scores differed significantly among the set-ups [
14]. This study lacked some crucial information, as the models were evaluated and graded only using the ABO-OGS scores, which omit absolute differences in individual three-dimensional (3D) tooth displacement. Furthermore, to predict and evaluate the accuracy of the dental and skeletal planning of the orthognathic treatment in 3D, the digital orthodontic setups should be produced in the correct anatomical position in the patient’s craniofacial skeleton. No data were published in literature concerning this feature. The goal of the present study was to evaluate the intra- and inter-operator performance of the fabrication of digital orthodontic setups integrated into the craniofacial skeleton of the patient.
2. Materials and Methods
2.1. Patients
For this retrospective study, patient data were obtained from the patient archive of the section of the Orthodontics and Craniofacial Biology, Department of Dentistry of the Radboud University Medical Centre, Nijmegen, The Netherlands. Only patients requiring a combined orthodontic–orthognathic surgical treatment were included. Exclusion criteria were presence of a developmental deformity and missing more than one tooth per quadrant. The study was conducted between January and June 2020. Because of the explorative nature of the study design, a sample size calculation is not applicable. All patient data were de-identified prior to analysis, and all participants gave informed consent. The Institutional Review Board of the Radboud University Medical Centre issued an approval for this investigation (2016-2690).
2.2. Data Acquisition
For diagnostic purposes, CBCT scans were taken prior to the treatment (field of view, 17 × 23 cm; voxel size, 0.3 mm; scanning time, 17.8 s; 891.4 mGy/cm2; KaVo 3d eXam, KaVo Dental GmbH, Biberach/Riss Germany). Data from the CBCT exams were exported into a Digital Imaging and Communications in Medicine (DICOM) format. Directly after the acquisition of the CBCT scans plaster models of the dental arches were acquired. The plaster models were digitized with a laser scanner (R500 3D Dental Laser Scanner, 3Shape®, Copenhagen, Denmark).
All digital dental models were exported to Standard Tessellation Language (STL) files. Both the DICOM files from the CBCT scans and the STL files from the dental models were imported into the OrthoAnalyzer™ software (2020-1, 3Shape
®, Copenhagen, Denmark). The dental models were fused with the CBCT data by a semi-automatic surface-based matching procedure executed by the software (
Figure 1).
2.3. Orthodontic Setup
Four operators participated in this study: two orthodontic residents, each with 3 years of clinical experience (operators 1 and 2), and two experienced orthodontists (operators 3 and 4). All operators had access to the written treatment plan and patient records including the CBCT scan and the digital dental model for each patient. The operators were not involved in the treatment and were blinded to the treatment outcome. The OrthoAnalyzer software was used to fabricate a virtual orthodontic setup on the pre-treatment dental model. The setup preparation for all dental models was done by one experienced operator (operator 4). This phase consisted of a semi-automatic determination of the upper and lower arch forms, tooth axes, points of rotation, and tooth crown segmentations. All individual tooth crowns were semi-automatically segmented after indication of the mesial and distal contact points and gingival borders. After the preparation phase, each model was copied eight times for a blinded delivery to the four operators. All operators were instructed in how to make the setup and practiced with test data before starting with the study data.
The original treatment plan, and key principles of occlusion acted as guidelines for the operators: correct molar relationship in final jaw position, correct crown angulation, correct decompensated crown inclination, no rotations, no interdental spaces when not part of the treatment plan, an appropriate occlusal plane according to the patient craniofacial skeleton on the CBCT, correct interproximal contact point, a normal overjet and overbite (<4 mm), and aligned midlines with respect to the skeletal midline. Mandibular inter-canine distances were respected and acted as a guide for obtaining the final arch width and shape.
All four operators repeated each setup twice with at least a one-month time interval, resulting in 30 setups for each operator and 120 setups in total.
2.4. Calculation of Tooth Movement
All setups and individual tooth crowns were exported as STL files and imported into the in-house-created software, 3DMedX
® (3D Lab Radboudumc, Nijmegen, The Netherlands). With 3DMedX
®, all tooth crowns were translated and rotated from the pre-treatment dental model to the orthodontic setup. At the same time, this movement was recorded automatically and saved for each individual tooth. Six parameters of movements were computed: three rotational (yaw, roll, and pitch;
Figure 2) and three translational (left to right (X), anterior to posterior (Y), and cranial to caudal (Z). Together, these parameters form the six degrees of freedom.
2.5. Statistical Analysis
All data were imported into IBM SPSS software, version 26.0 (IBM, Armonk, NY, USA). Paired
t-tests were used to compare the differences between the pre-treatment models and orthodontics setups within and between operators. Intra- and inter-operator performance of the method was expressed by Pearson’s correlation coefficients and duplicate measurement errors (DME). The differences expressing the systematic errors between operators were expressed by mean differences and confidence intervals and tested with the paired sample
t-test. The inter-operator performance was calculated on the first setup series. For an assessment of the relevance of the inter-operator differences, tolerance levels were set at 0.6 mm for translations and 2 degrees for rotations. The significance level was set at 5%. A correlation of 0.90 or higher is considered excellent, between 0.75 and 0.90 is good, between 0.50 and 0.75 is moderate, and less than 0.50 is considered poor [
15].
4. Discussion
Limited information is available in the literature concerning the performance of diagnostic orthodontic setup fabrication for orthodontic–orthognathic surgery planning. Published studies are equally scarce regarding application of the therapeutic setup in customized bracket or clear aligner systems such as Incognito™, Insignia™, or Invisalign
®. Fabels et al. evaluated the intra- and inter-operator performance of six clinicians using the ABO-OGS score to evaluate set-up fabrication using the OrthoCAD software [
14]. In comparing setups, the intra-examiner mean absolute differences in total ABO-OGS scores varied statistically significantly between 2.17 and 6.00 points. Interexaminer mean absolute differences varied statistically significantly between 4.77 and 5.56 points. Reliability showed significant good (ICC, 0.6–0.8) agreement. A study by Barreto et al. [
8] compared the reproducibility of digital setups with manual setups of 20 patients and concluded that both can be reliably reproduced. In this study, no significant differences were found between intermolar, intercanine, and mandibular arch length measurements between digital and manual dental set-ups and the final dental model after orthodontic treatment. The correlations for most measurements were outstanding (r = 0.75 to 0.93). However, in both abovementioned studies, orthognathic cases were excluded, and setups were not made in relation to the facial skeleton using CBCT, which is essential in the planning of an orthognathic case. In addition, no 3D measurements were performed to describe the deviations.
The results for the intra-operator performance in the present study revealed a moderate correlation (>0.5) for the maxillary and mandibular molars (
Figure 4A,B). Especially, the roll in the maxilla and the X tooth displacements in the mandible showed lower correlation values. This finding could be relevant because different roll and X values for molars can lead to differences in the vertical and transversal dimension of the bite and to a different presurgical outcome. In the mandible, the reproducibility of the tooth positions was low for the yaw, as indicated by the duplicate measurement errors. These differences were mainly the result of difficulties for the operators in reproducing the torque and angulation, as expressed by the pitch and roll and translational displacements (X) in the mandible in case of the molars, and the mesiodistal rotation in general. These differences might have resulted from the absence of strict norms for these parameters when producing a setup. However, it is arguable whether deviations of a few degrees in angulation or torque of the molars would affect the surgical outcome. The results for the inter-operator performance revealed differences between operators for the translations on the vertical plane and rotations on the horizontal axis, with low correlations for the Z values and pitch (
Figure 5A,B), especially for maxillary molars. As also seen for the results of the intra-operator performance, the yaw showed the highest values for the DME, indicating a relatively large random error.
For an assessment of the clinical relevance of inter-operator differences, tolerance levels must be considered. These are subjectively described in the literature. Larson et al. [
16] used 0.5 mm for translational differences and 2 degrees for rotational differences as clinically relevant difference levels. Grauer and Proffit considered average translational discrepancies of <1 mm or <4 degrees for rotational discrepancies as clinically irrelevant in their studies comparing the setup with the clinical outcome after lingual orthodontic therapy [
17]. Based on these studies, the arbitrary limits of 0.6 mm for translations and 2 degrees for rotations are chosen as tolerance levels. The results in this study were based on exceptions to these ranges of clinical acceptability. Differences exceeding the ranges were found in 9.7% (7/72) of the translational mean differences and in 18% (13/72) of the rotational mean differences in the maxilla (
Table 1 and
Table 2). For the mandible, 26.3% (18/72) of the translational mean differences and 20.8% (15/72) of the rotational mean differences exceeded the range of clinical acceptance (
Table 1 and
Table 2). Notable for exceeding this cutoff were sagittal and vertical positioning of lower incisors and canines, expressed by Y and Z (
Table 1). For rotations, incisors and canines showed relatively frequent large mean differences for yaw and pitch (
Table 2).
A possible explanation for these discrepancies is that the treatment outcome was susceptible to the subjective judgment of the clinician. Sometimes, occlusion and esthetics are in tension, leading to different decisions by different clinicians favoring one over the other [
14].
This study included only combined orthodontic–orthognathic treatment cases, which are considered the more difficult cases. This could partly explain the differences in operator performance compared to the results of previous studies [
8,
14]. Differences in clinical experience between operators may have influenced the results. Two operators were postgraduate students in their final year of graduation, and the other two operators were experienced orthodontists with possibly different insights into tooth displacement feasibility. It might be possible as well that the level of experience of the operators with the sophisticated OrthoAnalyzer™ software played a role. To reduce the influence of unequal experience with the software, only the most experienced operator performed the setup preparation for all dental models (see
Section 2.3).
The differences between and within operator can be partly explained by the influence of extractions. For all extraction patients, extractions of the lower premolars were planned. The influence of tooth extractions on the reproducibility is mainly seen for the Y and X parameters in the lower jaw (
Table 3). It seems rather difficult to virtually estimate the closing of the extraction diastema’s in a reproducible way in the sagittal and transversal plane. In addition, SARME may have played a role. Significant errors for SARME therapy were seen for molars in the lower jaw for vertical, torque, and rotational displacements. Rotational movements of incisors in the upper arch also showed significant differences in random errors (
Table 4). All statistically significant errors were associated with larger standard errors for SARME or extraction cases compared to non-extraction or non-SARME cases, respectively. It is complicated to keep track of tooth movement changes during the setup fabrication in the software, which easily introduces disorientation in complex cases, such as with extractions and SARME. The result can be more variation and less reproducibility of tooth displacement estimations. A lack of strict rules about closure of extraction diastemas in a setup also may lead to variation during setup preparation.
Another limitation of this study is the choice for the explorative design because of the limited amount of available literature and the use of a relatively new 3D method to follow tooth movements in 3D [
12]. This may complicate the judgment of the results in a clinical perspective. However, the results of this explorative study can be adequately used to optimize the set-up fabrication process and to design follow-up studies.
The inter-operator variability in this study indicates the need for a protocol for the simulation of planned tooth movements in orthodontic setups during treatment planning for combined orthodontic-surgical treatment. Software improvements indicating evidence-based biological limitations of tooth displacement during the virtual setup production could help improve the performance of orthognathic planning at the start of treatment. More research comparing virtual setups with the actual outcome by matching on the skull base could provide information about preventing such planning errors and will leave less chance for the subjective interpretation of the orthodontist, allowing for more predictable planning.