1. Introduction
The main purpose of vaccination is to induce a robust immunological memory response that can be addressed by revaccination after possible exposure to the antigen. Yet, the definition of such an adequate anamnestic immune response, or ‘boostability’, is unclear. For rabies vaccination studies, the rabies virus neutralizing antibody (RVNA) titers on 7 or 14 days after revaccination are often used as markers for boostability, as they directly correspond to the memory B-cell response to revaccination [
1,
2,
3,
4,
5]. This anamnestic antibody response is primarily responsible for the protective immunity against infection with rabies virus, and it is therefore vital to be able to establish whether a memory response is adequate for protection.
Rabies virus is a lyssavirus that causes a fatal encephalitis upon infection [
6]. Transmission through infected dog bites is the main cause (99%) of the 59,000 annual human rabies deaths, occurring mainly in Africa and Asia [
7,
8]. Fortunately, adequate prophylaxis by means of vaccination is available. In addition to thorough wound cleansing, post-exposure treatment in individuals without pre-exposure prophylaxis usually consists of local infiltration of the bite wound with anti-rabies immunoglobulins and the administration of four intramuscular rabies vaccine doses over a 3-week period [
8]. If the exposed person has previously received pre-exposure prophylaxis, revaccination with two (intradermal or intramuscular) doses of rabies vaccine is sufficient [
8].
Pre-exposure prophylaxis can be administered according to one of the two schedules as recommended by the World Health Organization (WHO): on both day 0 and 7, either (1) a single intramuscular vaccination with a full dose (1 mL) or (2) two intradermal vaccinations with a 0.1 mL fractional dose [
8]. An adequate antibody response to pre-exposure rabies prophylaxis is generally defined as an RVNA titer of ≥0.5 IU/mL 21 to 28 days after primary vaccination [
9,
10].
Conventionally, an adequate anamnestic antibody response (referred to as ‘boostability’) is defined as the proportion of people who have a certain minimum antibody titer after revaccination (seroconversion rate, or SCR). As explained above, for rabies revaccination, this threshold for seroconversion lies at 0.5 IU/mL. If the majority of participants in a revaccination study have an antibody titer already exceeding this threshold at the moment of revaccination, their boostability cannot be adequately established, as they already reached the seroconversion endpoint before revaccination. This is problematic, especially when different pre-exposure vaccination schedules are to be compared on their respective boostability using this parameter. A recent systematic review of 36 studies on boostability after rabies revaccination provides a striking illustration of the problem at hand [
11]. The seroconversion endpoint was already reached before revaccination in 67% of all intradermally and 90% of all intramuscularly vaccinated participants. The post-booster SCR was 99% or 100% for all schedules and studies, making it impossible to compare different vaccination schedules or administration routes based on this parameter alone. An option would be to increase the threshold to, for example, 10 IU/mL, but that does not solve the fundamental underlying problem of (high) pre-booster titers interfering with a fixed serological endpoint. Moreover, the clinical relevance of the threshold—the inferred minimum for protection against the antigen—disappears, and the outcome remains categorical and not continuous.
An alternative strategy that circumvents this problem is to compare the slope, or fold increase, of post-booster RVNA geometric mean titers between different vaccination schedules on day 7. A major advantage of this strategy is that it takes the pre-booster titer values into account. Furthermore, this method can be applied over a large range of antibody titers and contains more information in its assessment, as it is continuous and not categorical [
12,
13,
14]. It might prove to be a suitable parameter to measure and compare boostability, in addition to the clinically relevant, yet difficult to compare between schedules, post-booster SCR. The proposed strategy is widely applicable and is also transferrable to revaccination studies with other vaccine antigens.
In this study, we evaluated the feasibility of such a quantitative approach to define boostability. For this purpose, we re-analyzed data from a previously conducted randomized trial that demonstrated non-inferiority using the post-booster SCR as primary endpoint [
2]. We calculated the mean difference between individual log
3 pre- and post-booster titers for two different rabies pre-exposure vaccination schemes and compared these two mean differences to show non-inferiority of these two vaccination schedules regarding boostability.
2. Materials and Methods
2.1. Study Design
An anonymized dataset from a previous randomized controlled trial (NCT01388985; EudraCT 2011-001612-62; approved by the local ethics committee (protocol ITMC0211); October 2011–January 2016) was provided by the Institute for Tropical Medicine in Antwerp, Belgium. For details of the study design, we refer to the original paper [
2]. In short, participants receiving pre-exposure rabies prophylaxis were randomized to one of two study arms. The control group received the—at that time—standard three-visit schedule of intradermal vaccination of a single fractional dose of 0.1 mL human diploid cell culture vaccine (HDCV) (day 0, 7, and 28). The intervention group received a shortened two-visit schedule with two intradermal doses of 0.1 mL HDCV (day 0 and 7). After one to three years, participants were revaccinated with a single intradermal vaccination of 0.1 mL HDCV. RVNA titers were measured on day 0 and day 7 after revaccination, using the rapid fluorescent focus inhibition test (RFFIT), which uses threefold dilution steps [
15,
16,
17]. Values were reported as RVNA concentrations in IU/mL. In this paper, we will use ‘RVNA titers’ to refer to these values. The original paper demonstrated non-inferiority of the two-visit schedule to the three-visit schedule regarding boostability, using the post-booster SCR (proportion of participants having an RVNA titer ≥0.5 IU/mL on day 7 after revaccination) as primary endpoint.
2.2. Study Population
Participants, aged 18–47 years, were recruited from the Belgian Armed Forces. All subjects gave their informed consent for inclusion before they participated in the study. They were excluded if they had previously received rabies vaccinations, if they were pregnant or breastfeeding, or if they had a known or suspected immunodeficiency, chronic disease, mefloquine prophylaxis, or a known allergy to any of the vaccine components. They were also excluded if they would be in overseas deployment within 35 days after the first vaccination.
For the current study, a modified intention-to-treat analysis was performed. Participants were included if RVNA titers on both day 0 and day 7 after revaccination were documented. They were excluded if their RVNA titer on day 0 of primary vaccination was ≥0.5 IU/mL, and if they received the booster vaccination more than 3 years (>1095 days) after their first rabies vaccination.
2.3. Variables
The dataset contained anonymized data on the allocated study group, RVNA serology measurements on day 0 and day 35 after primary vaccination and on day 0 and day 7 after revaccination, and the exact days on which these titers were measured. The difference in log3 RVNA titers between day 0 and 7 after revaccination was the primary outcome variable.
2.4. Bias
As the data were generated in a randomized controlled trial, no confounding bias was expected.
2.5. Statistical Methods
RVNA titer values were transformed to log
3 [titer] values to ensure a normal distribution of values and to narrow the range [
18]. A logarithm base of 3 was chosen because of the threefold dilution steps in the RFFIT. Log transformation allows for easy and straightforward reconversion to non-log-transformed values, as the arithmetic mean of log-transformed values corresponds to the log geometric mean of the original values [
19].
The mean and standard deviation of log3-transformed pre- and post-booster RVNA titers on day 0 and day 7 after revaccination were calculated for each allocated study arm. The individual differences between the log3 pre- and post-booster titers (log3 post-booster − log3 pre-booster = log3 (post-booster/pre-booster)) correspond to the fold increase (or slope) in log3 geometric mean titer. The individual differences in log3 titer were calculated and presented in a boxplot. The mean of these individual differences was calculated as well. Individual log3-transformed RVNA titers on day 0 and day 7 after revaccination were displayed on a scatterplot. To identify possible high- and low-/non-responders, a histogram of the differences in log3 titer was plotted for each schedule.
The geometric mean and standard deviation of non-log-transformed pre- and post-booster titers were calculated for each allocated study arm, as were the geometric mean and standard deviation of the individual differences in titer. For this calculation, negative values (i.e., a decrease in titer) had to be excluded, as geometric means cannot be calculated with negative values.
We assumed that the post-booster RVNA titers are not independent of the pre-booster RVNA titers. In case of baseline imbalance in pre-booster titers among the two schedules, an analysis of covariance (ANCOVA) would be carried out to calculate the mean differences in pre- and post-booster RVNA titers, adjusted for difference in baseline pre-booster values [
20,
21].
The difference between the mean log3 titer difference of the two-visit and three-visit schedule (mean2-visit − mean3-visit) was reported with a two-sided 95% confidence interval (CI), as calculated with an independent t-test. If the lower bound of this confidence interval was not less than −0.369, the two-visit schedule was considered to be non-inferior to the three-visit vaccination schedule It should be noted that −0.369 corresponds to a difference of log3 1.5, which equals a 1.5-fold dilution step in the RFFIT. We have chosen this cut-off point because we considered a difference of less than one dilution step to be clinically irrelevant: a 1.5-fold dilution equals only half a dilution step in a 3-fold dilution step RFFIT.
To account for differences in time after primary vaccination, two additional analyses were carried out. Linear regression analysis was performed for the time after primary vaccination and log3 pre-booster titers, and for the time after primary vaccination and log3 titer increase. Results for these analyses were displayed in scatterplots with linear regression lines per allocated group.
Statistical analysis was carried out using R 3.6.3.
4. Discussion
In this study, we propose an alternative statistical method to determine non-inferiority in boostability in the case of high pre-booster antibody titers. We demonstrate the feasibility to infer non-inferiority in boostability by comparing the mean increase of log-transformed antibody titers after revaccination. The mean increase in log
3 RVNA titers after booster vaccination was non-inferior for a shortened two-visit schedule compared to the standard three-visit schedule, confirming the results from the original paper [
2].
A clear metric of the immunological memory response is a neglected topic in many studies. To assess immunological memory, studies need a long follow-up time, which is likely to lead to participants dropping out. On top of that, the memory response is hard to assess, as there is no clearly defined way of measurement. Assessing which proportion of a population has reached or exceeded a certain antibody titer is a flawed measure for the boostability of the immune system, especially in the case of high pre-booster baseline values and maximum post-booster seroconversion rates. For clinical purposes and at an individual level, the method makes perfect sense, because it is sufficient to know whether someone is protected or not. Unfortunately, in a research setting, this method is not suitable to infer non-inferiority when pre-booster proportions are already close to 100%.
We believe that the method proposed in this study has a major advantage compared to a proportional definition of boostability. A continuous variable maintains by definition more original input than a dichotomous proportional assessment. In the latter case, valuable information is lost due to a forced dichotomization [
12]. For illustrative purposes: 88.5% of the three-visit schedule participants and 96.7% of the two-visit schedule participants in our data already had a pre-booster titer of 0.5 IU/mL or higher. With such a high proportion of individuals already at or above threshold for boostability before the actual revaccination, it becomes difficult to compare the two schedules on boostability using the 0.5 IU/mL threshold, especially considering that 100% of the participants of both schedules had a titer of 0.5 IU/mL or higher 7 days after revaccination, in which case they will always be non-inferior to each other. With this novel proposed method, non-inferiority in boostability—or the lack thereof—can be assessed in a more precise way.
This study has several strengths. Data from a large number of subjects from a randomized controlled trial was included in the analysis. The cut-off for non-inferiority was quite strict, as it was a difference of not more than a 1.5-fold dilution step between the means of the two groups, using a two-sided 95% confidence interval. Another, somewhat surprising, strength lies in the data in the form of the baseline imbalance in pre-booster titers. In general, such a baseline imbalance is in favor of an increase in the group with the lowest baseline values. The reason for this is that there is no linear relation between pre-booster and post-booster values, because the post-booster value has a certain maximum [
23]. In this particular case, the three-visit schedule group had a lower mean baseline titer and therefore a greater mean increase in titer, if not adjusted. Yet, when adjustment for the baseline imbalance was performed, the mean increase for the two-visit schedule was higher than for the three-visit schedule. Unfortunately, not all assumptions for this analysis of covariance (ANCOVA) were met, so its results should be interpreted with caution. However, both the adjusted and unadjusted results clearly point to an overall non-inferiority of the two-visit schedule compared to the three-visit schedule.
From the previous paragraphs, it has become clear that demonstrating non-inferiority while using a proportional approach is hindered if pre-booster values are already high, and post-booster values approach or reach the maximum level (100% seroconversion). This is also the reason why superiority trials are generally not suitable for these kinds of research questions. A superiority trial with expected near-maximum post-booster levels would require an enormous sample size to significantly detect the very small difference in post-booster seroconversion rates (for example, a post-booster SCR of 99.7% versus 99.9%) [
24,
25,
26]. However, our proposed numerical method could also be used in superiority trials, as it does not have the disadvantage of a maximum endpoint. In this way, novel, more efficient vaccination schedules can be studied on the topic of boostability with evidence for superiority rather than non-inferiority. However, one might still wonder whether superiority in boostability is a clinically and immunologically relevant endpoint. We would like to argue that it is immunologically relevant, as the results of the neutralization assay (RFFIT) reflect both the concentration and the affinity of the antibodies. As far as clinical relevance is concerned, being able to actually determine non-inferiority or superiority in boostability might form a stronger foundation for research on and implementation of simplified vaccination schedules in the future.
It is remarkable that the two-visit schedule seems to perform better regarding boostability than the three-visit schedule in individuals with low pre-booster values, but worse in individuals with higher pre-booster values (
Figure 3). Furthermore, even though the blue line (two-visit schedule) crosses the red line (three-visit schedule), it never crosses the green line (three-visit schedule minus the non-inferiority margin of 0.369). This could be due to the formation of more memory B cells, or a longer lifespan of the germinal centers, in the two-visit schedule, during which participants received four 0.1 mL vaccine doses in one week, instead of ‘only’ three doses over three weeks [
27]. Another interesting observation is the decrease in RVNA titer after booster vaccination in two cases. This seems highly improbable. It might be possible that this is a negative feedback mechanism at work, caused by the presence of pre-booster antibodies, which cause the antibody titer levels to drop [
23]. This very same negative feedback mechanism might play a role in the observed lower increase for higher pre-booster antibody titers (
Figure 3).
An important caveat in using the method as proposed in this study is the fact that we were not able to use a pre-existing cut-off point for non-inferiority. Therefore, we chose to define this point to correspond to a difference of a 1.5-fold dilution step. This cut-off point was chosen because we considered a difference of less than a half dilution step to be clinically irrelevant. A 1.5-fold dilution equals half of one 3-fold dilution step. The same cut-off point of a 1.5-fold dilution could also be applied in RFFIT assays using 2-, 10-, or any-fold dilution steps, although it does not equal half a dilution step in those assays. However, a 1.5-fold dilution denotes the same difference for all types of RFFIT: a 1.5-fold dilution step, regardless of the RFFIT method.
In future research on boostability, a different non-inferiority margin might be used, which may—also depending on the vaccine being studied—seem less or more reasonable than the 1.5-fold dilution step that we chose as the cut-off point. For example, Soonawala et al. used a 2-fold dilution step as the non-inferiority margin for the evaluation of the immunogenicity of a poliomyelitis vaccine [
28].
Another choice for future research concerns the confidence interval. In this study, we opted for a two-sided 95% confidence interval, corresponding to a one-sided 97.5% confidence interval. However, one could also make a case for a two-sided 90% confidence interval (a one-sided 95% confidence interval), which increases the range over which non-inferiority can be inferred [
26]. Still, the smaller the cut-off value and the broader the confidence interval, the stronger the evidence for non-inferiority. This emphasizes the importance of the documentation of a clearly defined and clinically relevant non-inferiority margin in the study protocol prior to data analysis [
29].
For future studies, we propose using the more informative method of comparing the mean increase of log-transformed pre-booster antibody titers after revaccination next to the clinically relevant measurement of proportions. This method is easy to use, holds more (numerical) information than the proportional assessment, is applicable in superiority trials, and takes pre-booster values into account. Therefore, we advise researchers to consider using this alternative method as an additional outcome for measuring the boostability of the anamnestic antibody response.