Recurrent events modeling based on a reflected Brownian motion with application to hypoglycemia

Results of parameter estimation under correct specifications for 3 models with sample size |$n\in\{200,400\}$|⁠.

Model	Para	True	Bias	SD	ESD	CR	Bias	SD	ESD	CR
			\|$n\,=\,200$\|				\|$n\,=\,400$\|
CF	\|$\alpha_{0}$\|	2.90	0.090	0.166	0.162	0.95	0.029	0.104	0.106	0.96
	\|$\alpha_{1}$\|	0.20	0.045	0.151	0.148	0.94	0.018	0.096	0.088	0.97
	\|$\alpha_{2}$\|	−0.10	−0.002	0.112	0.099	0.97	0.007	0.071	0.070	0.97
	\|$\gamma$\|	−0.55	−0.029	0.298	0.277	0.96	0.025	0.194	0.201	0.94
	\|$\beta_{0}$\|	0.90	−0.011	0.047	0.046	0.95	−0.005	0.033	0.033	0.94
	\|$\beta_{1}$\|	−0.20	−0.004	0.051	0.052	0.95	−0.002	0.035	0.032	0.97
	\|$\beta_{2}$\|	−0.10	0.000	0.048	0.048	0.94	0.000	0.033	0.030	0.95
	\|$\theta_{1}$\|	0.20	0.029	0.038	0.035	0.92	0.014	0.025	0.024	0.94
	\|$\theta_{2}^{\prime}$\|	0.30	0.118	0.137	0.104	0.98	0.057	0.085	0.081	0.95
IF	\|$\alpha_{0}$\|	2.90	0.085	0.133	0.136	0.93	0.035	0.084	0.087	0.95
	\|$\alpha_{1}$\|	0.20	0.045	0.153	0.152	0.97	0.019	0.099	0.103	0.96
	\|$\alpha_{2}$\|	−0.10	−0.001	0.109	0.103	0.97	−0.000	0.071	0.067	0.97
	\|$\beta_{0}$\|	0.90	−0.010	0.047	0.043	0.96	−0.006	0.033	0.033	0.95
	\|$\beta_{1}$\|	−0.20	−0.007	0.052	0.051	0.94	−0.004	0.036	0.035	0.93
	\|$\beta_{2}$\|	−0.10	0.001	0.048	0.050	0.94	0.001	0.033	0.031	0.95
	\|$\theta_{1}$\|	0.20	0.028	0.037	0.035	0.93	0.012	0.025	0.022	0.97
	\|$\theta_{2}^{\prime}$\|	0.30	0.113	0.131	0.096	0.98	0.057	0.082	0.071	0.94
SF	\|$\alpha_{0}$\|	2.90	0.047	0.127	0.130	0.94	0.025	0.084	0.077	0.96
	\|$\alpha_{1}$\|	0.20	0.035	0.105	0.100	0.97	0.007	0.069	0.068	0.98
	\|$\alpha_{2}$\|	−0.10	−0.002	0.079	0.083	0.92	0.007	0.053	0.052	0.96
	\|$\gamma$\|	−1.00	−0.035	0.200	0.203	0.94	−0.030	0.137	0.139	0.96
	\|$\beta_{0}$\|	0.90	−0.014	0.046	0.044	0.96	−0.003	0.032	0.031	0.95
	\|$\beta_{1}$\|	−0.20	−0.005	0.050	0.052	0.94	0.000	0.034	0.031	0.97
	\|$\beta_{2}$\|	−0.10	−0.001	0.046	0.045	0.95	0.000	0.032	0.029	0.96
	\|$\theta_{1}$\|	0.20	0.028	0.036	0.032	0.94	0.011	0.024	0.022	0.97

Model	Para	True	Bias	SD	ESD	CR	Bias	SD	ESD	CR
			\|$n\,=\,200$\|				\|$n\,=\,400$\|
CF	\|$\alpha_{0}$\|	2.90	0.090	0.166	0.162	0.95	0.029	0.104	0.106	0.96
	\|$\alpha_{1}$\|	0.20	0.045	0.151	0.148	0.94	0.018	0.096	0.088	0.97
	\|$\alpha_{2}$\|	−0.10	−0.002	0.112	0.099	0.97	0.007	0.071	0.070	0.97
	\|$\gamma$\|	−0.55	−0.029	0.298	0.277	0.96	0.025	0.194	0.201	0.94
	\|$\beta_{0}$\|	0.90	−0.011	0.047	0.046	0.95	−0.005	0.033	0.033	0.94
	\|$\beta_{1}$\|	−0.20	−0.004	0.051	0.052	0.95	−0.002	0.035	0.032	0.97
	\|$\beta_{2}$\|	−0.10	0.000	0.048	0.048	0.94	0.000	0.033	0.030	0.95
	\|$\theta_{1}$\|	0.20	0.029	0.038	0.035	0.92	0.014	0.025	0.024	0.94
	\|$\theta_{2}^{\prime}$\|	0.30	0.118	0.137	0.104	0.98	0.057	0.085	0.081	0.95
IF	\|$\alpha_{0}$\|	2.90	0.085	0.133	0.136	0.93	0.035	0.084	0.087	0.95
	\|$\alpha_{1}$\|	0.20	0.045	0.153	0.152	0.97	0.019	0.099	0.103	0.96
	\|$\alpha_{2}$\|	−0.10	−0.001	0.109	0.103	0.97	−0.000	0.071	0.067	0.97
	\|$\beta_{0}$\|	0.90	−0.010	0.047	0.043	0.96	−0.006	0.033	0.033	0.95
	\|$\beta_{1}$\|	−0.20	−0.007	0.052	0.051	0.94	−0.004	0.036	0.035	0.93
	\|$\beta_{2}$\|	−0.10	0.001	0.048	0.050	0.94	0.001	0.033	0.031	0.95
	\|$\theta_{1}$\|	0.20	0.028	0.037	0.035	0.93	0.012	0.025	0.022	0.97
	\|$\theta_{2}^{\prime}$\|	0.30	0.113	0.131	0.096	0.98	0.057	0.082	0.071	0.94
SF	\|$\alpha_{0}$\|	2.90	0.047	0.127	0.130	0.94	0.025	0.084	0.077	0.96
	\|$\alpha_{1}$\|	0.20	0.035	0.105	0.100	0.97	0.007	0.069	0.068	0.98
	\|$\alpha_{2}$\|	−0.10	−0.002	0.079	0.083	0.92	0.007	0.053	0.052	0.96
	\|$\gamma$\|	−1.00	−0.035	0.200	0.203	0.94	−0.030	0.137	0.139	0.96
	\|$\beta_{0}$\|	0.90	−0.014	0.046	0.044	0.96	−0.003	0.032	0.031	0.95
	\|$\beta_{1}$\|	−0.20	−0.005	0.050	0.052	0.94	0.000	0.034	0.031	0.97
	\|$\beta_{2}$\|	−0.10	−0.001	0.046	0.045	0.95	0.000	0.032	0.029	0.96
	\|$\theta_{1}$\|	0.20	0.028	0.036	0.032	0.94	0.011	0.024	0.022	0.97

SD, posterior standard deviation; ESD, empirical standard deviation; CR, coverage rate of |$95\%$| HPD credible intervals.

Table 1.

Results of parameter estimation under correct specifications for 3 models with sample size |$n\in\{200,400\}$|⁠.

Model	Para	True	Bias	SD	ESD	CR	Bias	SD	ESD	CR
			\|$n\,=\,200$\|				\|$n\,=\,400$\|
CF	\|$\alpha_{0}$\|	2.90	0.090	0.166	0.162	0.95	0.029	0.104	0.106	0.96
	\|$\alpha_{1}$\|	0.20	0.045	0.151	0.148	0.94	0.018	0.096	0.088	0.97
	\|$\alpha_{2}$\|	−0.10	−0.002	0.112	0.099	0.97	0.007	0.071	0.070	0.97
	\|$\gamma$\|	−0.55	−0.029	0.298	0.277	0.96	0.025	0.194	0.201	0.94
	\|$\beta_{0}$\|	0.90	−0.011	0.047	0.046	0.95	−0.005	0.033	0.033	0.94
	\|$\beta_{1}$\|	−0.20	−0.004	0.051	0.052	0.95	−0.002	0.035	0.032	0.97
	\|$\beta_{2}$\|	−0.10	0.000	0.048	0.048	0.94	0.000	0.033	0.030	0.95
	\|$\theta_{1}$\|	0.20	0.029	0.038	0.035	0.92	0.014	0.025	0.024	0.94
	\|$\theta_{2}^{\prime}$\|	0.30	0.118	0.137	0.104	0.98	0.057	0.085	0.081	0.95
IF	\|$\alpha_{0}$\|	2.90	0.085	0.133	0.136	0.93	0.035	0.084	0.087	0.95
	\|$\alpha_{1}$\|	0.20	0.045	0.153	0.152	0.97	0.019	0.099	0.103	0.96
	\|$\alpha_{2}$\|	−0.10	−0.001	0.109	0.103	0.97	−0.000	0.071	0.067	0.97
	\|$\beta_{0}$\|	0.90	−0.010	0.047	0.043	0.96	−0.006	0.033	0.033	0.95
	\|$\beta_{1}$\|	−0.20	−0.007	0.052	0.051	0.94	−0.004	0.036	0.035	0.93
	\|$\beta_{2}$\|	−0.10	0.001	0.048	0.050	0.94	0.001	0.033	0.031	0.95
	\|$\theta_{1}$\|	0.20	0.028	0.037	0.035	0.93	0.012	0.025	0.022	0.97
	\|$\theta_{2}^{\prime}$\|	0.30	0.113	0.131	0.096	0.98	0.057	0.082	0.071	0.94
SF	\|$\alpha_{0}$\|	2.90	0.047	0.127	0.130	0.94	0.025	0.084	0.077	0.96
	\|$\alpha_{1}$\|	0.20	0.035	0.105	0.100	0.97	0.007	0.069	0.068	0.98
	\|$\alpha_{2}$\|	−0.10	−0.002	0.079	0.083	0.92	0.007	0.053	0.052	0.96
	\|$\gamma$\|	−1.00	−0.035	0.200	0.203	0.94	−0.030	0.137	0.139	0.96
	\|$\beta_{0}$\|	0.90	−0.014	0.046	0.044	0.96	−0.003	0.032	0.031	0.95
	\|$\beta_{1}$\|	−0.20	−0.005	0.050	0.052	0.94	0.000	0.034	0.031	0.97
	\|$\beta_{2}$\|	−0.10	−0.001	0.046	0.045	0.95	0.000	0.032	0.029	0.96
	\|$\theta_{1}$\|	0.20	0.028	0.036	0.032	0.94	0.011	0.024	0.022	0.97

Model	Para	True	Bias	SD	ESD	CR	Bias	SD	ESD	CR
			\|$n\,=\,200$\|				\|$n\,=\,400$\|
CF	\|$\alpha_{0}$\|	2.90	0.090	0.166	0.162	0.95	0.029	0.104	0.106	0.96
	\|$\alpha_{1}$\|	0.20	0.045	0.151	0.148	0.94	0.018	0.096	0.088	0.97
	\|$\alpha_{2}$\|	−0.10	−0.002	0.112	0.099	0.97	0.007	0.071	0.070	0.97
	\|$\gamma$\|	−0.55	−0.029	0.298	0.277	0.96	0.025	0.194	0.201	0.94
	\|$\beta_{0}$\|	0.90	−0.011	0.047	0.046	0.95	−0.005	0.033	0.033	0.94
	\|$\beta_{1}$\|	−0.20	−0.004	0.051	0.052	0.95	−0.002	0.035	0.032	0.97
	\|$\beta_{2}$\|	−0.10	0.000	0.048	0.048	0.94	0.000	0.033	0.030	0.95
	\|$\theta_{1}$\|	0.20	0.029	0.038	0.035	0.92	0.014	0.025	0.024	0.94
	\|$\theta_{2}^{\prime}$\|	0.30	0.118	0.137	0.104	0.98	0.057	0.085	0.081	0.95
IF	\|$\alpha_{0}$\|	2.90	0.085	0.133	0.136	0.93	0.035	0.084	0.087	0.95
	\|$\alpha_{1}$\|	0.20	0.045	0.153	0.152	0.97	0.019	0.099	0.103	0.96
	\|$\alpha_{2}$\|	−0.10	−0.001	0.109	0.103	0.97	−0.000	0.071	0.067	0.97
	\|$\beta_{0}$\|	0.90	−0.010	0.047	0.043	0.96	−0.006	0.033	0.033	0.95
	\|$\beta_{1}$\|	−0.20	−0.007	0.052	0.051	0.94	−0.004	0.036	0.035	0.93
	\|$\beta_{2}$\|	−0.10	0.001	0.048	0.050	0.94	0.001	0.033	0.031	0.95
	\|$\theta_{1}$\|	0.20	0.028	0.037	0.035	0.93	0.012	0.025	0.022	0.97
	\|$\theta_{2}^{\prime}$\|	0.30	0.113	0.131	0.096	0.98	0.057	0.082	0.071	0.94
SF	\|$\alpha_{0}$\|	2.90	0.047	0.127	0.130	0.94	0.025	0.084	0.077	0.96
	\|$\alpha_{1}$\|	0.20	0.035	0.105	0.100	0.97	0.007	0.069	0.068	0.98
	\|$\alpha_{2}$\|	−0.10	−0.002	0.079	0.083	0.92	0.007	0.053	0.052	0.96
	\|$\gamma$\|	−1.00	−0.035	0.200	0.203	0.94	−0.030	0.137	0.139	0.96
	\|$\beta_{0}$\|	0.90	−0.014	0.046	0.044	0.96	−0.003	0.032	0.031	0.95
	\|$\beta_{1}$\|	−0.20	−0.005	0.050	0.052	0.94	0.000	0.034	0.031	0.97
	\|$\beta_{2}$\|	−0.10	−0.001	0.046	0.045	0.95	0.000	0.032	0.029	0.96
	\|$\theta_{1}$\|	0.20	0.028	0.036	0.032	0.94	0.011	0.024	0.022	0.97

SD, posterior standard deviation; ESD, empirical standard deviation; CR, coverage rate of |$95\%$| HPD credible intervals.

To evaluate the performance of model comparison criteria, we fitted 3 models to each dataset. DIC and LPML were used to select among the 3 fitted models. The Monte Carlo sample size used for approximating the integrals was set to be |$M\,=\,500$|⁠. Table 2 presents the selection frequencies for candidate models under each scenario, based on either DIC or LPML, for sample size |$n\in\{200,400,800,1,600\}$|⁠. On average, the correctly specified models have the smallest DIC and highest LPML. The results suggest good performance of both DIC and LPML in selecting the right models. For example, when the CF model is the data-generating model, the proportion of correctly identifying the true model increases for both DIC and LPML as the sample size increases, reaching |$95\%$| and |$90\%$|⁠, respectively, with DIC slightly outperforming LPML. When the 2 reduced models are the data generating models, both DIC and LPML still select them with the highest frequency, but the tendency to choose the full model also increases when the sample size increases. Such observations echo the limitations of DIC and LPML in distinguishing the true model and an overfitted model (Maity et al. 2021). In our application, both criteria select either the correct model or the full model, which still provides valuable information for practitioners.

Table 2.

Model comparison result with DIC and LPML with sample size |$n\in\{200,400,800,{\rm and},1,600\}$|⁠.

True Model	Criterion	\|$n$\|	Freq	Mean	Freq	Mean	Freq	Mean
			CF		IF		SF
CF	DIC	200	60	11,921.3	24	11,930.1	16	11,932.1
		400	83	23,813.6	11	23,830.8	6	23,846.4
		800	94	47,282.8	3	47,323.8	3	47,354.0
		1,600	96	94,406.0	1	94,492.0	3	94,548.7
	LPML	200	48	-5,973.9	28	-5,978.2	24	-5,978.2
		400	69	-11,923.5	16	-11,931.5	15	-11,937.6
		800	86	-23,660.9	7	-23,682.1	7	-23,693.6
		1,600	91	-47,229.2	5	-47,270.8	4	-47,298.7
IF	DIC	200	20	10,907.8	72	10,906.5	8	10,924.2
		400	23	21,693.7	75	21,692.1	2	21,739.2
		800	35	43,348.3	65	43,347.7	0	43,443.7
		1,600	27	86,503.8	73	86,502.0	0	86,699.6
	LPML	200	16	-5,459.6	75	-5,457.7	9	-5,468.2
		400	21	-10,853.3	76	-10,851.4	3	-10,876.7
		800	27	-21,682.3	73	-21,680.7	0	-21,730.9
		1,600	21	-43,260.7	79	-43,258.6	0	-43,359.3
SF	DIC	200	8	12,445.5	0	12,476.0	92	12,435.4
		400	10	24,774.4	0	24,845.9	90	24,759.9
		800	10	49,268.7	0	49,419.6	90	49,249.6
		1,600	25	98,715.6	0	99,038.6	75	98,697.8
	LPML	200	15	-6,237.2	1	-6,255.2	84	-6,230.3
		400	19	-12,410.9	1	-12,449.2	80	-12,402.0
		800	23	-24,665.0	0	-24,743.7	77	-24,654.2
		1,600	33	-49,393.2	0	-49,558.4	67	-49,384.0

True Model	Criterion	\|$n$\|	Freq	Mean	Freq	Mean	Freq	Mean
			CF		IF		SF
CF	DIC	200	60	11,921.3	24	11,930.1	16	11,932.1
		400	83	23,813.6	11	23,830.8	6	23,846.4
		800	94	47,282.8	3	47,323.8	3	47,354.0
		1,600	96	94,406.0	1	94,492.0	3	94,548.7
	LPML	200	48	-5,973.9	28	-5,978.2	24	-5,978.2
		400	69	-11,923.5	16	-11,931.5	15	-11,937.6
		800	86	-23,660.9	7	-23,682.1	7	-23,693.6
		1,600	91	-47,229.2	5	-47,270.8	4	-47,298.7
IF	DIC	200	20	10,907.8	72	10,906.5	8	10,924.2
		400	23	21,693.7	75	21,692.1	2	21,739.2
		800	35	43,348.3	65	43,347.7	0	43,443.7
		1,600	27	86,503.8	73	86,502.0	0	86,699.6
	LPML	200	16	-5,459.6	75	-5,457.7	9	-5,468.2
		400	21	-10,853.3	76	-10,851.4	3	-10,876.7
		800	27	-21,682.3	73	-21,680.7	0	-21,730.9
		1,600	21	-43,260.7	79	-43,258.6	0	-43,359.3
SF	DIC	200	8	12,445.5	0	12,476.0	92	12,435.4
		400	10	24,774.4	0	24,845.9	90	24,759.9
		800	10	49,268.7	0	49,419.6	90	49,249.6
		1,600	25	98,715.6	0	99,038.6	75	98,697.8
	LPML	200	15	-6,237.2	1	-6,255.2	84	-6,230.3
		400	19	-12,410.9	1	-12,449.2	80	-12,402.0
		800	23	-24,665.0	0	-24,743.7	77	-24,654.2
		1,600	33	-49,393.2	0	-49,558.4	67	-49,384.0

Freq (%): frequency of the correct model being selected; Mean: average of the DIC or LPML.

Table 2.

Model comparison result with DIC and LPML with sample size |$n\in\{200,400,800,{\rm and},1,600\}$|⁠.

True Model	Criterion	\|$n$\|	Freq	Mean	Freq	Mean	Freq	Mean
			CF		IF		SF
CF	DIC	200	60	11,921.3	24	11,930.1	16	11,932.1
		400	83	23,813.6	11	23,830.8	6	23,846.4
		800	94	47,282.8	3	47,323.8	3	47,354.0
		1,600	96	94,406.0	1	94,492.0	3	94,548.7
	LPML	200	48	-5,973.9	28	-5,978.2	24	-5,978.2
		400	69	-11,923.5	16	-11,931.5	15	-11,937.6
		800	86	-23,660.9	7	-23,682.1	7	-23,693.6
		1,600	91	-47,229.2	5	-47,270.8	4	-47,298.7
IF	DIC	200	20	10,907.8	72	10,906.5	8	10,924.2
		400	23	21,693.7	75	21,692.1	2	21,739.2
		800	35	43,348.3	65	43,347.7	0	43,443.7
		1,600	27	86,503.8	73	86,502.0	0	86,699.6
	LPML	200	16	-5,459.6	75	-5,457.7	9	-5,468.2
		400	21	-10,853.3	76	-10,851.4	3	-10,876.7
		800	27	-21,682.3	73	-21,680.7	0	-21,730.9
		1,600	21	-43,260.7	79	-43,258.6	0	-43,359.3
SF	DIC	200	8	12,445.5	0	12,476.0	92	12,435.4
		400	10	24,774.4	0	24,845.9	90	24,759.9
		800	10	49,268.7	0	49,419.6	90	49,249.6
		1,600	25	98,715.6	0	99,038.6	75	98,697.8
	LPML	200	15	-6,237.2	1	-6,255.2	84	-6,230.3
		400	19	-12,410.9	1	-12,449.2	80	-12,402.0
		800	23	-24,665.0	0	-24,743.7	77	-24,654.2
		1,600	33	-49,393.2	0	-49,558.4	67	-49,384.0

True Model	Criterion	\|$n$\|	Freq	Mean	Freq	Mean	Freq	Mean
			CF		IF		SF
CF	DIC	200	60	11,921.3	24	11,930.1	16	11,932.1
		400	83	23,813.6	11	23,830.8	6	23,846.4
		800	94	47,282.8	3	47,323.8	3	47,354.0
		1,600	96	94,406.0	1	94,492.0	3	94,548.7
	LPML	200	48	-5,973.9	28	-5,978.2	24	-5,978.2
		400	69	-11,923.5	16	-11,931.5	15	-11,937.6
		800	86	-23,660.9	7	-23,682.1	7	-23,693.6
		1,600	91	-47,229.2	5	-47,270.8	4	-47,298.7
IF	DIC	200	20	10,907.8	72	10,906.5	8	10,924.2
		400	23	21,693.7	75	21,692.1	2	21,739.2
		800	35	43,348.3	65	43,347.7	0	43,443.7
		1,600	27	86,503.8	73	86,502.0	0	86,699.6
	LPML	200	16	-5,459.6	75	-5,457.7	9	-5,468.2
		400	21	-10,853.3	76	-10,851.4	3	-10,876.7
		800	27	-21,682.3	73	-21,680.7	0	-21,730.9
		1,600	21	-43,260.7	79	-43,258.6	0	-43,359.3
SF	DIC	200	8	12,445.5	0	12,476.0	92	12,435.4
		400	10	24,774.4	0	24,845.9	90	24,759.9
		800	10	49,268.7	0	49,419.6	90	49,249.6
		1,600	25	98,715.6	0	99,038.6	75	98,697.8
	LPML	200	15	-6,237.2	1	-6,255.2	84	-6,230.3
		400	19	-12,410.9	1	-12,449.2	80	-12,402.0
		800	23	-24,665.0	0	-24,743.7	77	-24,654.2
		1,600	33	-49,393.2	0	-49,558.4	67	-49,384.0

Freq (%): frequency of the correct model being selected; Mean: average of the DIC or LPML.

5. HYPOGLYCEMIC EVENT TIME ANALYSIS

The proposed models were applied to analyze the hypoglycemic event times from the DURABLE trial (Buse et al. 2009). Between 2005 and 2007, 2,187 patients with type 2 diabetes from 11 countries were enrolled in the study. The dataset contains the possibly censored times of hypoglycemic events of the patients during their follow-up periods. Also, available are a collection of baseline covariates, which allows assessments of risk factors for hypoglycemia among the patients. The median follow-up time of the patients is 168\,days. Continuous baseline covariates include fasting blood glucose, fasting insulin, adiponectin, weight, height, BMI, systolic blood pressure, diastolic blood pressure, heart rate, and duration of diabetes. Summaries of the continuous covariates are presented in Table 3. Three important variables, fasting glucose level, adiponectin level, and fasting insulin level, have extremely high values, which call for prudence in data analysis. Two categorical variables are available. The first one is starter insulin regimens with 2 levels, twice-daily lispro mix (LM) |$75/25$|⁠, |$75\%$| lispro protamine suspension and |$25\%$| lispro (referred to as LM 75/25 hereafter), versus once-daily insulin glargine. The second one is the usage of oral antihyperglycemic drugs with 3 levels: thiazolidinedione, sulfonylurea, and both. All the available covariates are subject-level and time-independent.

Table 3.

Summary of the covariates from the DURABLE trial.

Variable	Minimum	Median	Maximum	Mean	SD
Fasting glucose (mmol/l)	0.23	10.45	25.96	10.78	3.72
Adiponectin (g/ml)	0.01	5.57	49.01	6.99	5.52
Fasting insulin (mIU/l)	0.18	7.91	142.68	10.40	9.81
Height (cm)	124.25	166.44	198.09	166.47	10.71
BMI (kg/\|${\rm m}^{2}$\|⁠)	15.88	31.28	62.62	31.71	6.18
Diastolic BP (mmHg)	45.01	78.70	116.30	78.23	9.46
Systolic BP (mmHg)	47.26	130.02	196.67	131.53	16.11
Heart rate (beats per minute)	43.86	76.61	121.05	76.76	9.82
Duration diabetes (years)	0.03	8.57	39.48	9.75	6.17

Variable	Minimum	Median	Maximum	Mean	SD
Fasting glucose (mmol/l)	0.23	10.45	25.96	10.78	3.72
Adiponectin (g/ml)	0.01	5.57	49.01	6.99	5.52
Fasting insulin (mIU/l)	0.18	7.91	142.68	10.40	9.81
Height (cm)	124.25	166.44	198.09	166.47	10.71
BMI (kg/\|${\rm m}^{2}$\|⁠)	15.88	31.28	62.62	31.71	6.18
Diastolic BP (mmHg)	45.01	78.70	116.30	78.23	9.46
Systolic BP (mmHg)	47.26	130.02	196.67	131.53	16.11
Heart rate (beats per minute)	43.86	76.61	121.05	76.76	9.82
Duration diabetes (years)	0.03	8.57	39.48	9.75	6.17

BMI, body mass index; BP, blood pressure; SD, standard deviation.

Table 3.

Summary of the covariates from the DURABLE trial.

Variable	Minimum	Median	Maximum	Mean	SD
Fasting glucose (mmol/l)	0.23	10.45	25.96	10.78	3.72
Adiponectin (g/ml)	0.01	5.57	49.01	6.99	5.52
Fasting insulin (mIU/l)	0.18	7.91	142.68	10.40	9.81
Height (cm)	124.25	166.44	198.09	166.47	10.71
BMI (kg/\|${\rm m}^{2}$\|⁠)	15.88	31.28	62.62	31.71	6.18
Diastolic BP (mmHg)	45.01	78.70	116.30	78.23	9.46
Systolic BP (mmHg)	47.26	130.02	196.67	131.53	16.11
Heart rate (beats per minute)	43.86	76.61	121.05	76.76	9.82
Duration diabetes (years)	0.03	8.57	39.48	9.75	6.17

Variable	Minimum	Median	Maximum	Mean	SD
Fasting glucose (mmol/l)	0.23	10.45	25.96	10.78	3.72
Adiponectin (g/ml)	0.01	5.57	49.01	6.99	5.52
Fasting insulin (mIU/l)	0.18	7.91	142.68	10.40	9.81
Height (cm)	124.25	166.44	198.09	166.47	10.71
BMI (kg/\|${\rm m}^{2}$\|⁠)	15.88	31.28	62.62	31.71	6.18
Diastolic BP (mmHg)	45.01	78.70	116.30	78.23	9.46
Systolic BP (mmHg)	47.26	130.02	196.67	131.53	16.11
Heart rate (beats per minute)	43.86	76.61	121.05	76.76	9.82
Duration diabetes (years)	0.03	8.57	39.48	9.75	6.17

BMI, body mass index; BP, blood pressure; SD, standard deviation.

After excluding the subjects with missingness in covariates or outside reference range, the dataset contains |$n\,=\,1,943$| patients. Prior to model fitting, all the continuous covariates were standardized. Log transformation was applied to 2 right-skewed covariates, baseline adiponectin and baseline fasting insulin, before standardization. Among the 1,943 patients, 570 (29%) received both oral antihyperglycemic drugs, 1,207 (62%) only received sulfonylurea, and 166 (9%) only received thiazolidinedione. For ease of discussion, the group that received both drugs was used as the reference group; 2 dummy variables, |$\mathsf{sulf{-}Only}$|⁠, which is 1 if only received sulfonylurea, and |$\mathsf{tzd{-}only}$|⁠, which is 1 if only received thiazolidinedione, were included. Define an indicator variable for the insulin regime |$\mathsf{LM}$|⁠, which equals 1 for the 959 (49%) patients who received LM |$75/25$| and 0 for the 984 (51%) patients who received glargine. Some patients had multiple hypoglycemic events within a single calendar date. In this case, the gap times between successive hypoglycemic events were recorded as zero. This was handled by treating the gap times in days as interval-censored, using the likelihood constructed with (3.4) in Section 3.1. The daily hypoglycemic event rates of the patients have a wide range from 0 to 0.77 with mean 0.07. These descriptive statistics indicate the existence of severe heterogeneity risk of hypoglycemia among subjects.

The 3 models along with their priors investigated in Section 4 were fitted to the DURABLE data. The lower boundary was set to 3.9\,mmol/l (70\,mg/dl), which is the clinical standard for hypoglycemic events (Seaquist et al. 2013). The starting point |$x_{0}$| of the Brownian motion after each hypoglycemic event was set to 10, which is the rounded integer of the median baseline fasting glucose level of all patients. Sensitivity analyses of alternative choices for the starting point |$x_{0}$| and priors were conducted and will be discussed later in this section. For each model, an MCMC was run for 110,000 iterations and thinned by 10 after discarding the first 10,000 iterations as burn-in. The choice of burn-in and convergence of the MCMC chains were monitored by traceplots that are provided in Section S2. The resulting 10,000 posterior samples were used for inference, and we further thinned the posterior samples for model comparison criteria calculation to reduce computational cost. The results of DIC and LPML for the 3 models are presented in Table 4. Both criteria suggest that the CF model and the IF model are similar, both of which are preferred to the SF model. Given that the CF and the IF have close model fit, we chose the IF model as it is more parsimonious. That is, the 2 frailties in the upper reflection barrier and the volatility can be treated as independent.

Table 4.

Model comparison results for the 3 frailty models fitted to DURABLE data.

	CF	IF	SF
DIC	131,398.7	131,395.9	131,947.6
LPML	65,706.2	65,706.7	65,989.0

Table 4.

Model comparison results for the 3 frailty models fitted to DURABLE data.

	CF	IF	SF
DIC	131,398.7	131,395.9	131,947.6
LPML	65,706.2	65,706.7	65,989.0

Table 5.

Estimated parameters of the IF model.

Covariates	Mean	SD	95% CI	Mean	SD	95% CI	Mean	SD	95% CI
	IF model						Proportional hazards gap time
	Volatility			Upper reflection barrier
Intercept	0.906	0.038	[0.835, 0.984]	2.859	0.064	[2.738, 2.984]
Fasting glucose	−0.055	0.019	[0.093, 0.018]	0.108	0.031	[0.049, 0.171]	−0.128	0.025	[0.177, 0.078]
Adiponectin	0.047	0.020	[0.008, 0.086]	0.031	0.032	[0.03, 0.095]	0.040	0.026	[0.011, 0.091]
Fasting insulin	−0.106	0.021	[0.146, 0.063]	0.170	0.033	[0.107, 0.236]	−0.181	0.028	[0.236, 0.126]
Height	−0.045	0.018	[0.082, 0.011]	0.004	0.031	[0.057, 0.063]	−0.075	0.026	[0.126, 0.025]
BMI	−0.091	0.019	[0.127, 0.052]	−0.115	0.033	[0.176, 0.049]	−0.062	0.027	[0.115, 0.009]
Diastolic BP	−0.068	0.022	[0.111, 0.025]	0.057	0.035	[0.012, 0.127]	−0.079	0.030	[0.138, 0.02]
Systolic BP	0.025	0.021	[0.016, 0.064]	−0.029	0.033	[0.093, 0.035]	0.042	0.028	[0.013, 0.097]
Heart rate	0.008	0.017	[0.028, 0.041]	0.046	0.029	[0.01, 0.103]	−0.004	0.025	[0.053, 0.044]
Duration diabetes	0.079	0.017	[0.047, 0.113]	−0.047	0.027	[0.101, 0.006]	0.101	0.025	[0.052, 0.15]
LM	0.169	0.036	[0.099, 0.242]	−0.097	0.058	[0.212, 0.015]	0.230	0.048	[0.136, 0.325]
tzd-only	−0.483	0.077	[0.63, 0.332]	0.329	0.155	[0.019, 0.626]	−0.603	0.099	[0.797, 0.409]
sulf-only	−0.016	0.040	[0.091, 0.062]	0.067	0.068	[0.065, 0.201]	−0.085	0.060	[0.203, 0.032]
Frailty Variance	0.407	0.026	[0.355, 0.458]	0.534	0.044	[0.45, 0.622]	0.972

Covariates	Mean	SD	95% CI	Mean	SD	95% CI	Mean	SD	95% CI
	IF model						Proportional hazards gap time
	Volatility			Upper reflection barrier
Intercept	0.906	0.038	[0.835, 0.984]	2.859	0.064	[2.738, 2.984]
Fasting glucose	−0.055	0.019	[0.093, 0.018]	0.108	0.031	[0.049, 0.171]	−0.128	0.025	[0.177, 0.078]
Adiponectin	0.047	0.020	[0.008, 0.086]	0.031	0.032	[0.03, 0.095]	0.040	0.026	[0.011, 0.091]
Fasting insulin	−0.106	0.021	[0.146, 0.063]	0.170	0.033	[0.107, 0.236]	−0.181	0.028	[0.236, 0.126]
Height	−0.045	0.018	[0.082, 0.011]	0.004	0.031	[0.057, 0.063]	−0.075	0.026	[0.126, 0.025]
BMI	−0.091	0.019	[0.127, 0.052]	−0.115	0.033	[0.176, 0.049]	−0.062	0.027	[0.115, 0.009]
Diastolic BP	−0.068	0.022	[0.111, 0.025]	0.057	0.035	[0.012, 0.127]	−0.079	0.030	[0.138, 0.02]
Systolic BP	0.025	0.021	[0.016, 0.064]	−0.029	0.033	[0.093, 0.035]	0.042	0.028	[0.013, 0.097]
Heart rate	0.008	0.017	[0.028, 0.041]	0.046	0.029	[0.01, 0.103]	−0.004	0.025	[0.053, 0.044]
Duration diabetes	0.079	0.017	[0.047, 0.113]	−0.047	0.027	[0.101, 0.006]	0.101	0.025	[0.052, 0.15]
LM	0.169	0.036	[0.099, 0.242]	−0.097	0.058	[0.212, 0.015]	0.230	0.048	[0.136, 0.325]
tzd-only	−0.483	0.077	[0.63, 0.332]	0.329	0.155	[0.019, 0.626]	−0.603	0.099	[0.797, 0.409]
sulf-only	−0.016	0.040	[0.091, 0.062]	0.067	0.068	[0.065, 0.201]	−0.085	0.060	[0.203, 0.032]
Frailty Variance	0.407	0.026	[0.355, 0.458]	0.534	0.044	[0.45, 0.622]	0.972

BMI, body mass index; BP, blood pressure; SD, standard deviation; CI, 95% HPD credible interval or 95% confident interval. Significant covariates are shown in bold.

Table 5.

Open in new tab Download slide

Estimated parameters of the IF model.

Covariates	Mean	SD	95% CI	Mean	SD	95% CI	Mean	SD	95% CI
	IF model						Proportional hazards gap time
	Volatility			Upper reflection barrier
Intercept	0.906	0.038	[0.835, 0.984]	2.859	0.064	[2.738, 2.984]
Fasting glucose	−0.055	0.019	[0.093, 0.018]	0.108	0.031	[0.049, 0.171]	−0.128	0.025	[0.177, 0.078]
Adiponectin	0.047	0.020	[0.008, 0.086]	0.031	0.032	[0.03, 0.095]	0.040	0.026	[0.011, 0.091]
Fasting insulin	−0.106	0.021	[0.146, 0.063]	0.170	0.033	[0.107, 0.236]	−0.181	0.028	[0.236, 0.126]
Height	−0.045	0.018	[0.082, 0.011]	0.004	0.031	[0.057, 0.063]	−0.075	0.026	[0.126, 0.025]
BMI	−0.091	0.019	[0.127, 0.052]	−0.115	0.033	[0.176, 0.049]	−0.062	0.027	[0.115, 0.009]
Diastolic BP	−0.068	0.022	[0.111, 0.025]	0.057	0.035	[0.012, 0.127]	−0.079	0.030	[0.138, 0.02]
Systolic BP	0.025	0.021	[0.016, 0.064]	−0.029	0.033	[0.093, 0.035]	0.042	0.028	[0.013, 0.097]
Heart rate	0.008	0.017	[0.028, 0.041]	0.046	0.029	[0.01, 0.103]	−0.004	0.025	[0.053, 0.044]
Duration diabetes	0.079	0.017	[0.047, 0.113]	−0.047	0.027	[0.101, 0.006]	0.101	0.025	[0.052, 0.15]
LM	0.169	0.036	[0.099, 0.242]	−0.097	0.058	[0.212, 0.015]	0.230	0.048	[0.136, 0.325]
tzd-only	−0.483	0.077	[0.63, 0.332]	0.329	0.155	[0.019, 0.626]	−0.603	0.099	[0.797, 0.409]
sulf-only	−0.016	0.040	[0.091, 0.062]	0.067	0.068	[0.065, 0.201]	−0.085	0.060	[0.203, 0.032]
Frailty Variance	0.407	0.026	[0.355, 0.458]	0.534	0.044	[0.45, 0.622]	0.972

Covariates	Mean	SD	95% CI	Mean	SD	95% CI	Mean	SD	95% CI
	IF model						Proportional hazards gap time
	Volatility			Upper reflection barrier
Intercept	0.906	0.038	[0.835, 0.984]	2.859	0.064	[2.738, 2.984]
Fasting glucose	−0.055	0.019	[0.093, 0.018]	0.108	0.031	[0.049, 0.171]	−0.128	0.025	[0.177, 0.078]
Adiponectin	0.047	0.020	[0.008, 0.086]	0.031	0.032	[0.03, 0.095]	0.040	0.026	[0.011, 0.091]
Fasting insulin	−0.106	0.021	[0.146, 0.063]	0.170	0.033	[0.107, 0.236]	−0.181	0.028	[0.236, 0.126]
Height	−0.045	0.018	[0.082, 0.011]	0.004	0.031	[0.057, 0.063]	−0.075	0.026	[0.126, 0.025]
BMI	−0.091	0.019	[0.127, 0.052]	−0.115	0.033	[0.176, 0.049]	−0.062	0.027	[0.115, 0.009]
Diastolic BP	−0.068	0.022	[0.111, 0.025]	0.057	0.035	[0.012, 0.127]	−0.079	0.030	[0.138, 0.02]
Systolic BP	0.025	0.021	[0.016, 0.064]	−0.029	0.033	[0.093, 0.035]	0.042	0.028	[0.013, 0.097]
Heart rate	0.008	0.017	[0.028, 0.041]	0.046	0.029	[0.01, 0.103]	−0.004	0.025	[0.053, 0.044]
Duration diabetes	0.079	0.017	[0.047, 0.113]	−0.047	0.027	[0.101, 0.006]	0.101	0.025	[0.052, 0.15]
LM	0.169	0.036	[0.099, 0.242]	−0.097	0.058	[0.212, 0.015]	0.230	0.048	[0.136, 0.325]
tzd-only	−0.483	0.077	[0.63, 0.332]	0.329	0.155	[0.019, 0.626]	−0.603	0.099	[0.797, 0.409]
sulf-only	−0.016	0.040	[0.091, 0.062]	0.067	0.068	[0.065, 0.201]	−0.085	0.060	[0.203, 0.032]
Frailty Variance	0.407	0.026	[0.355, 0.458]	0.534	0.044	[0.45, 0.622]	0.972

BMI, body mass index; BP, blood pressure; SD, standard deviation; CI, 95% HPD credible interval or 95% confident interval. Significant covariates are shown in bold.

Table 5 summarizes the estimated model parameters, their standard errors, and 95% HPD credible confidence intervals from the fitted IF model. The results from the volatility model suggest that patients with higher baseline fasting blood glucose level, lower adiponectin, higher fasting insulin, higher height, higher BMI, higher diastolic blood pressure, and lower duration of diabetes are significantly associated with lower volatility and, hence, lower risk of hypoglycemia. Patients who received LM |$75/25$| appear to have higher volatility or higher risk of hypoglycemia compared to those who received glargine. For the oral antihyperglycemic drugs, patients who received only thiazolidinedione appear to have lower volatility or lower risk of hypoglycemia compared to those who received both thiazolidinedione and sulfonylurea; patients who received only sulfonylurea are not significantly different from those who received both.

In the upper reflection barrier model, fewer covariates are significant and they are a subset of those that are significant in the volatility model. Patients with higher baseline fasting blood glucose level, higher fasting insulin, and lower BMI are associated higher reflection barrier and, hence, lower risk of hypoglycemia. For the oral antihyperglycemic drugs, patients who received only thiazolidinedione appear to have higher reflection barrier and, hence, lower risk of hypoglycemia compared to those who received both thiazolidinedione and sulfonylurea. Interestingly, the effect of baseline fasting blood glucose level, fasting insulin, and received only thiazolidinedione, the coefficients have the same direction on the risk of hypoglycemia in the models for volatility and upper reflection barrier (with opposite coefficient signs). In contrast, BMI is significant in affecting both the volatility and the upper reflecting barrier, but with opposite directions (with the same coefficient signs). That is, the overall effect of BMI on the risk of hypoglycemia is complicated by lowering the volatility (or lowering the risk) while decreasing the upper reflection barrier (or increasing the risk). This discovery has not been reported in the quantile regression analysis of Ma et al. (2021). But the overall effects of baseline BMI are worth further investigating. Figure 2 gives a set of the FHT distributions with the fitted parameters of the IF model using different levels of baseline BMI and frailties. From Fig. 2, while individual differences due to large frailty have a greater impact than BMI, smaller BMI is consistently associated with a higher risk of hypoglycemic events within the same frailty level.

$First hitting time distribution with fitted parameters of independent-frailty model. Distribution functions are derived from 6 combinations of 3 levels of BMI, small, median, and large, which represent the $25\%$, $50\%$, and $75\%$ quantile of the standardized BMI, respectively; and 2 levels of frailties, small and large, which represent the $25\%$ and $75\%$ quantile of the frailty distributions both in volatility $\sigma$ and upper reflection barrier $\kappa$. Other covariates remain the same at their median level after being standardized.$

Fig. 2.

First hitting time distribution with fitted parameters of independent-frailty model. Distribution functions are derived from 6 combinations of 3 levels of BMI, small, median, and large, which represent the |$25\%$|⁠, |$50\%$|⁠, and |$75\%$| quantile of the standardized BMI, respectively; and 2 levels of frailties, small and large, which represent the |$25\%$| and |$75\%$| quantile of the frailty distributions both in volatility |$\sigma$| and upper reflection barrier |$\kappa$|⁠. Other covariates remain the same at their median level after being standardized.

The proportional hazards model of gap time between recurrent events (Huang and Chen 2003) is considered as a comparison method. Before applying the model to the DURABLE data, the censored gap for patients with more than one observation has been excluded. The estimates of the comparative model are also given in Table 5. With the exception of the covariate Adiponectin, the other covariates exhibit similar levels of significance between proportional hazards model and volatility in the proposed model. For Adiponectin, the estimates in the volatility and upper reflection barrier model are |$0.046[0.011,0.085]$| and |$0.030[-0.035,0.088]$|⁠, respectively, in the proposed model. This suggests that Adiponectin contributes in different directions on volatility and upper reflection barrier model for the recurrent risk. Given that the volatility component exerts a stronger influence on recurrence, our estimate indicates that the higher value of Adiponectin is associated with an increased risk of recurrence. In proportional hazards model, the estimate of Adiponectin is |$0.042[-0.008,0.093]$|⁠, aligning with the trend observed in the proposed model. For the other significant covariates, the signs of estimates between the volatility model and proportional hazards model are consistent. This observation suggests that, for this dataset, the proposed model offers a more detailed characterization of the concealed glucose levels linked to recurrent events.

The 2 IFs in the volatility and the upper reflection barrier capture much of the heterogeneity among the patients beyond the covariates. The variances of the 2 frailties are estimated to be far away from zero. From the fitted model, the range of volatility spans from 0.65 to 9.52 with median 2.75 and mean 3.03; the range of upper reflection barrier spans from 12.69 to 83.47 with median 29.57 and mean 29.88. The model without the frailties fits much poorer in terms of DIC and LPML (not reported).

The sensitivity analyses on the DURABLE data, using alternative starting points |$x_{0}$| and priors, including both noninformative and informative priors, were conducted. The results indicate that the significance of the covariate effects is stable when using different noninformative priors and starting points. However, the significance of certain covariate effects is sensitive to certain informative priors, as detailed in Section S3.

6. DISCUSSION

The risk of hypoglycemia is an important concern in diabetes management. It is natural to model the underlying blood glucose level as hypoglycemia occurs when it hits a lower boundary and hyperglycemia occurs when it hits an upper boundary. Because of the unique setting where hyperglycemia cannot be reliably observed in self-reported data, it is challenging to model the blood glucose level as a stochastic process. The proposed Brownian motion model with an upper reflection barrier allows bypassing the need for observing hyperglycemic event times. Only hypoglycemic times are needed for the model fitting. This model fitting is made possible by the FHT density and distribution (Hu et al. 2012). The recurrence of the hypoglycemic events is captured by a sequence of stochastic processes reaching the lower boundary. The upper reflection barrier and volatility of the reflected Brownian motion are linked to patient-level covariates and frailties. Due to the unobserved frailties, we resorted to Bayesian inference for the parameters with MCMC implemented with NIMBLE (de Valpine et al. 2017). The computation of our work relies on an accurate implementation of the FHT density/distribution functions as well as the rejection sampling algorithm with the 3-piece proposal kernel. Another computation challenge is the complexity brought by unobserved frailties in calculating model selection criteria DIC and LPML. We applied Monte Carlo integration for an approximation, with which the 2 criteria were shown to be reasonably effective in selecting the correct frailty model by simulation studies.

It is worthwhile to revisit the key model assumptions. In our model, the imposed upper reflection barrier reflects the expectation that the blood sugar level is bounded and will not reach infinity. This modeling choice aids in illustrating that glucose levels will ultimately be controlled (dropping down) in the range. We believe this assumption is reasonable, especially for participants who are consistently under regular blood sugar maintenance. On the other hand, we acknowledge that the reflecting nature of the barrier may be debatable, as blood sugar levels might remain elevated for a period rather than immediately reflecting back. Nonetheless, this reflecting approach offers a straightforward modeling strategy for this analysis. Another key assumption is that the glucose level returns to normal immediately following the occurrence of the hypoglycemia event. In reality, it certainly takes time to consume food and bring back the glucose level, but the recovery is usually in minutes and, thus, quick enough so we neglect the time used. We also assumed that, after a hypoglycemia event, the glucose level restarts at a fixed level |$x_{0}$|⁠. A sensitivity analysis with different values of |$x_{0}$| showed little difference in the resulting regression coefficient estimates.

Under the model framework, a subject with larger volatility and lower upper reflection barrier is associated with higher risk of the hypoglycemia event. In our experience, the covariate with the same direction on the risk of hypoglycemia (with opposite coefficient signs in volatility and upper reflection barrier) is usually consistent with that of the classic gap time models. On the other hand, if a covariate contributes a positive (negative) effect to both the volatility and upper reflection barrier, its overall impact is less straightforward. In this case, it is possible that the covariate shows no significant impact from the classic gap time models, while play an import role for volatility or upper reflection barrier. An example of this is the covariate Adiponectin in our data application. Therefore, despite its complexity, the proposed model provides additional insights for a better understanding of the data. To further investigate the covariate overall effect of proposed model, we recommend plotting the FHT distribution to provide an overall characterization of the impact of this covariate.

Several directions are worth further investigation. In the broad sense, the proposed model can be applied to scenarios where an event occurs when an underlying health level process hits a boundary on one side. Therefore, many of the examples based on Wiener process reviewed in Lee and Whitmore (2006) are potentially applicable. The Wiener process approach ensures finiteness of the FHT by a nonzero drift. Our process does so with a reflecting boundary on the other side for a driftless Wiener process. When the reflecting boundary is removed, the FHT has a positive probability of being infinity, which makes it applicable when a cure rate is needed (Lee and Whitmore 2006, Section 5). Furthermore, when a large number of unobserved frailties are included in the model, the MCMC chains for some parameters exhibit high autocorrelation, resulting in a relatively small effective sample size. This suggests that there is potential for improving the algorithm. Finally, the DURABLE dataset has additional longitudinally observed blood glucose levels. To combine these longitudinal observations with the hypoglycemic events into a joint modeling framework, the transition density of a reflected Brownian motion would be needed. Incorporating this density into our framework would be interesting but not trivial.

SUPPLEMENTARY MATERIAL

Supplementary material is available at Biostatistics Journal online.

FUNDING

None declared.

CONFLICT OF INTEREST

None declared.

APPENDIX

A. TAIL OF FHT DENSITY

Here we show that the right tail of the FHT is bounded by an exponential rate. Note that rates |$\lambda_{n}$| monotonically increase to |$\infty$| as |$n\to\infty$| (so, |$\lambda_{1}$| is the slowest rate), |$c_{1} \gt 0$|⁠, and |$|c_{n}| \lt 1$| for |$n\geq 2$|⁠. It can be shown that density |$f(t)$| is asymptotically equivalent to |$c_{1}\lambda_{1}e^{-\lambda_{1}t}$| as |$t\to\infty$|⁠.

First, let us rewrite the FHT density as follows:

$$\begin{align*}\begin{array}{ll}f(t)=\sum\limits_{n=1}^{\infty}c_{n}\lambda_{n}e^{-\lambda_{n}t}=c _{1}\lambda_{1}e^{-\lambda_{1}t}\left(1+\frac{1}{c_{1}\lambda_{1}}\sum\limits_{n=2}^{\infty}c_{n}\lambda_{n}e^{-(\lambda_{n}-\lambda_{1})t}\right),\end{array}\end{align*}$$

where |$\lambda_{n}=b(2n-1)^{2}$| and |$b=\frac{\sigma^{2}\pi^{2}}{8(\kappa-\nu)^{2}}$|⁠.

Now, consider |$h(t)=\sum_{n\,=\,2}^{\infty}c_{n}\lambda_{n}e^{-(\lambda_{n}-\lambda_{1})t}$| and observe that

$$\begin{align*}\begin{array}{ll}|h(t)|&=\left|\sum\limits_{n=2}^{\infty}c_{n}\lambda_{n}e^{-(\lambda_{n}-\lambda_{1})t}\right|=e^{-bt}\left|\sum\limits_{n=2}^{\infty}c_{n}\lambda _{n}e^{-(\lambda_{n}-\lambda_{1}-b)t}\right|\\ &\leq e^{-bt}\sum\limits_{n=2}^{\infty}|c_{n}|\lambda_{n}e^{-(\lambda_{n}-\lambda_{1}- b)t}\leq e^{-bt}\sum\limits_{n=2}^{\infty}\lambda_{n}e^{-(\lambda_{n}-\lambda_{1}-b)t}.\end{array}\end{align*}$$

Therefore, if |$\sum_{n\,=\,2}^{\infty}\lambda_{n}e^{-(\lambda_{n}-\lambda_{1}-b)t}$| is bounded for all sufficiently large |$t$|⁠, then |$|h(t)|\to 0$| as |$t\to\infty$|⁠. Indeed, since |$\lambda_{n}-\lambda_{1}-b\,=\,b(2n-1)^{2}-2b \gt 0$| for |$n\geq 2$|⁠, for |$t\,\gt\,1$| we have

$$\begin{align*}\begin{array}{ll}\sum\limits_{n=2}^{\infty}\lambda_{n}e^{-(\lambda_{n}-\lambda_{1}- b)t}\leq\sum\limits_{n=2}^{\infty}\lambda_{n}e^{-(\lambda_{n}-\lambda_{1}-b)*1}=e^{\lambda_{1}+b}\sum\limits_{n=2}^{\infty}\lambda_{n}e^{-\lambda_{n}} \lt\infty,\end{array}\end{align*}$$

because |$\sum_{n\,=\,2}^{\infty}\lambda_{n}e^{-\lambda_{n}}$| is obviously a convergent series. Thus, hitting time density |$f(t)\sim c_{1}\lambda_{1}e^{-\lambda_{1}t}$| as |$t\to\infty$|⁠.

This result allows the use of an exponential distribution, with proper scaling, on the right tail to bound the FHT density as detailed next.

B. REJECTION SAMPLING ALGORITHM

$Actual density of the first hitting time distribution of the reflected Brownian motion with $x_{0}=10$, $\kappa\,=\,20$, $\nu\,=\,3.9$, $\sigma\,=\,2$ is given as the curve. The 3-piece envelope resulting from the multiplication of corresponding constants with the kernel are given as the solid line. In this plot, $q\,=\,0.5$ is considered.$

Fig. B1.

Actual density of the first hitting time distribution of the reflected Brownian motion with |$x_{0}=10$|⁠, |$\kappa\,=\,20$|⁠, |$\nu\,=\,3.9$|⁠, |$\sigma\,=\,2$| is given as the curve. The 3-piece envelope resulting from the multiplication of corresponding constants with the kernel are given as the solid line. In this plot, |$q\,=\,0.5$| is considered.

Open in new tab Download slide

Algorithm 1

Rejection sampling algorithm for drawing one observation from the FHT density.

Input:

|$f$| and |$F$|⁠: the FHT density and distribution functions

|$q$|⁠: a user-defined percentile

|$q_{t_{m}}=F(t_{m})$|

|$g_{1}$|⁠, |$g_{2}$|⁠, |$g_{3}$|⁠: 3 component proposals

|$M_{1}$|⁠, |$M_{2}$|⁠, |$M_{3}$|⁠: bounding constants for the 3 components

Output:|$Y$|⁠: a draw from the target density |$f$|

Begin Algorithm:

Draw |$U\sim{\rm Uniform}(0,1)$|

|$U\leq q_{t_{m}}$|

Draw candidate |$Y\sim g_{1}$|

Draw |$U^{\prime}\sim{\rm Uniform}(0,1)$|

|$U^{\prime}\leq f(Y)/[M_{1}g_{1}(Y)]$|

|$q_{t_{m}} \lt U\leq q$|

Draw candidate |$Y\sim g_{2}$|

|$U^{\prime}\sim{\rm Uniform}(0,1)$|

|$U^{\prime}\leq f(Y)/[M_{2}g_{2}(Y)]$|

Draw candidate |$Y\sim g_{3}$|

Draw |$U^{\prime}\sim{\rm Uniform}(0,1)$|

|$U^{\prime}\leq f(Y)/[M_{3}g_{3}(Y)]$|

Return |$Y$|

To sample from the FHT density |$f$|⁠, we handle the left tail, body, and right tail separately. Define |$g_{1}(t)$|⁠, |$g_{2}(t)$|⁠, and |$g_{3}(t)$| as the proposal density for the left, body, and right components, respectively,

$$\begin{align*} g_{1}(t)&\propto k_{1}t,& t\leq t_{m},\\g_{2}(t)&\propto k_{2}t+f(t_{m})-k_{2}t_{m},& t_{m} \lt t\leq t_{q},\\g_{3}(t)&\propto\exp(-\lambda_{1}t),& t \gt t_{q},\end{align*}$$

where |$k_{1}=f(t_{m})/t_{m}$| and |$k_{2}=[f(t_{m})-f(t_{q})]/(t_{m}-t_{q})$|⁠. For illustration, Fig. B1 shows the actual (target) density of first hitting distribution in red, the kennels of the 3 proposal densities in grey, and the envelopes derived by multiplying corresponding constants to the kernels in black. Given the shape properties of |$f$|⁠, the |$i$|th component of |$f$| can be bounded by |$M_{i}g_{i}(t)$|⁠, where |$M_{i}$| can be identified by maximizing |$f(t)/g_{i}(t)$| over the domain of |$g_{i}$|⁠, |$i\,=\,1,2,3$|⁠.

The rejection sampling algorithm for generating one observation from |$f$| is given in Algorithm 1.

References

Andersen

PK

,

Gill

RD.

1982

.

Cox’s regression model for counting processes: a large sample study

.

Ann Statist

.

10

:

1100

–

1120

.

Box-Steffensmeier

JM

,

De Boef

S.

2006

.

Repeated events survival models: the conditional frailty model

.

Stat Med

.

25

:

3518

–

3533

.

Buse

JB

,

Wolffenbuttel

BH

,

Herman

WH

,

Shemonsky

NK

,

Jiang

HH

,

Fahrbach

JL

,

Scism-Bacon

JL

,

Martin

SA.

2009

.

Durability of basal versus lispro mix 75/25 insulin efficacy (durable) trial 24-week results: safety and efficacy of insulin lispro mix 75/25 versus insulin glargine added to oral antihyperglycemic drugs in patients with type 2 diabetes

.

Diabetes Care

.

32

:

1007

–

1013

.

Celeux

G

,

Forbes

F

,

Robert

CP

,

Titterington

DM.

2006

.

Deviance information criteria for missing data models

.

Bayesian Anal

.

1

:

651

–

673

.

Centers for Disease Control and Prevention

.

2022

. National Diabetes Statistics Report website. https://www.cdc.gov/diabetes/php/data-research/index.html, Accessed 2022 Oct 28.

Chang

S-H.

2004

.

Estimating marginal effects in accelerated failure time models for serial sojourn times among repeated events

.

Lifetime Data Anal

.

10

:

175

–

190

.

Charles-Nelson

A

,

Katsahian

S

,

Schramm

C.

2019

.

How to analyze and interpret recurrent events data in the presence of a terminal event: an application on readmission after colorectal cancer surgery

.

Stat Med

.

38

:

3476

–

3502

.

Cook

RJ

,

Lawless

JF.

2007

.

The statistical analysis of recurrent events

.

New York

:

Springer

.

Google Preview

Cryer

PE

,

Axelrod

L

,

Grossman

AB

,

Heller

SR

,

Montori

VM

,

Seaquist

ER

;

Endocrine Society

.

2009

.

Evaluation and management of adult hypoglycemic disorders: an endocrine society clinical practice guideline

.

J Clin Endocrinol Metab

.

94

:

709

–

728

.

Cryer

PE

,

Davis

SN

,

Shamoon

H.

2003

.

Hypoglycemia in diabetes

.

Diabetes Care

.

26

:

1902

–

1912

.

de Valpine

P

,

Turek

D

,

Paciorek

C

,

Anderson-Bergman

C

,

Temple Lang

D

,

Bodik

R.

2017

.

Programming with models: writing statistical algorithms for general model structures with NIMBLE

.

J Comput Graph Stat

.

26

:

403

–

413

.

DeRosa

MA

,

Cryer

PE.

2004

.

Hypoglycemia and the sympathoadrenal system: neurogenic symptoms are largely the result of sympathetic neural, rather than adrenomedullary, activation

.

Am J Physiol Endocrinol Metab

.

287

:

E32

–

E41

.

Dey

DK

,

Chen

M-H

,

Chang

H.

1997

.

Bayesian approach for nonlinear random effects models

.

Biometrics

.

53

:

1239

–

1252

.

Doubleday

K

,

Zhou

J

,

Zhou

H

,

Fu

H.

2022

.

Risk controlled decision trees and random forests for precision

.

Stat Med

.

41

:

719

–

735

.

Duchateau

L

,

Janssen

P

,

Kezic

I

,

Fortpied

C.

2003

.

Evolution of recurrent asthma event rate over time in frailty models

.

J R Stat Soc Ser C Appl Stat

.

52

:

355

–

363

.

Economou

P

,

Malefaki

S

,

Caroni

C.

2015

.

Bayesian threshold regression model with random effects for recurrent events

.

Methodol Comput Appl Probab

.

17

:

871

–

898

.

Folks

JL

,

Chhikara

RS.

1978

.

The inverse gaussian distribution and its statistical application—a review

.

J R Stat Soc Ser B Methodol

.

40

:

263

–

275

.

Fu

H

,

Luo

J

,

Qu

Y.

2016

.

Hypoglycemic events analysis via recurrent time-to-event (heart) models

.

J Biopharm Stat

.

26

:

280

–

298

.

Geisser

S

,

Eddy

WF.

1979

.

A predictive approach to model selection

.

J Am Stat Assoc

.

74

:

153

–

160

.

Gelfand

AE

,

Dey

DK.

1994

.

Bayesian model choice: asymptotics and exact calculations

.

J R Stat Soc Ser B Methodol

.

56

:

501

–

514

.

Gelman

A.

2006

.

Prior distributions for variance parameters in hierarchical models (comment on article by Browne and Draper)

.

Bayesian Anal

.

1

:

515

–

534

.

Heidelberger

P

,

Welch

PD.

1983

.

Simulation run length control in the presence of an initial transient

.

Oper Res

.

31

:

1109

–

1144

.

Hofert

M

,

Kojadinovic

I

,

Mächler

M

,

Yan

J.

2018

.

Elements of copula modeling with R.

Cham:

Springer

.

Hu

Q

,

Wang

Y

,

Yang

X.

2012

.

The hitting time density for a reflected brownian motion

.

Comput Econ

.

40

:

1

–

18

.

Huang

Y

,

Chen

YQ.

2003

.

Marginal regression of gaps between recurrent events

.

Lifetime Data Anal

.

9

:

293

–

303

.

Klein

JP.

1992

.

Semiparametric estimation of random effects using the Cox model based on the EM algorithm

.

Biometrics

.

48

:

795

–

806

.

Lawless

JF

,

Nadeau

C.

1995

.

Some simple robust methods for the analysis of recurrent events

.

Technometrics

.

37

:

158

–

168

.

Lee

EW

,

Wei

LJ

,

Amato

DA

,

Leurgans

S.

1992

. Cox-type regression analysis for large numbers of small groups of correlated failure time observations. In:

Klein

JP

,

Goel

PK

, editors.

Survival analysis: state of the art

. Dordrecht:

Springer

. p.

237

–

247

.

Lee

M-LT.

2019

.

A survey of threshold regression for time-to-event analysis and applications

.

Taiwanese J Math

.

23

:

293

–

305

.

Lee

M-LT

,

Whitmore

GA.

2006

.

Threshold regression for survival analysis: modeling event times by a stochastic process reaching a boundary

.

Statist Sci

.

21

:

501

–

513

.

Lin

DY

,

Wei

L-J

,

Yang

I

,

Ying

Z.

2000

.

Semiparametric regression for the mean and rate functions of recurrent events

.

J R Stat Soc Ser B Stat Methodol

.

62

:

711

–

730

.

Luo

X

,

Huang

C-Y

,

Wang

L.

2013

.

Quantile regression for recurrent gap time data

.

Biometrics

.

69

:

375

–

385

.

Ma

H

,

Peng

L

,

Huang

C-Y

,

Fu

H.

2021

.

Heterogeneous individual risk modelling of recurrent events

.

Biometrika

.

108

:

183

–

198

.

Maity

AK

,

Basu

S

,

Ghosh

S.

2021

.

Bayesian criterion-based variable selection

.

J R Stat Soc Ser C Appl Stat

.

70

:

835

–

857

.

Malefaki

S

,

Economou

P

,

Caroni

C.

2015

. Modelling times between events with a cured fraction using a first hitting time regression model with individual random effects. In:

Kitsos

CP

,

Oliveira

TA

,

Rigas

A

,

Gulati

S

, editors.

Theory and practice of risk assessment

.

Springer Proceedings in Mathematics & Statistics, vol 136. Cham

: Springer. p.

45

–

65

.

Manda

SO

,

Meyer

R.

2005

.

Bayesian inference for recurrent events data using time-dependent frailty

.

Stat Med

.

24

:

1263

–

1274

.

Pennell

ML

,

Whitmore

GA

,

Ting Lee

M-L.

2010

.

Bayesian random-effects threshold regression with application to survival data with nonproportional hazards

.

Biostatistics

.

11

:

111

–

126

.

Plummer

M

,

Best

N

,

Cowles

K

,

Vines

K.

2006

.

CODA: convergence diagnosis and output analysis for MCMC

.

R News

.

6

:

7

–

11

.

Prentice

RL

,

Williams

BJ

,

Peterson

AV.

1981

.

On the regression analysis of multivariate failure time data

.

Biometrika

.

68

:

373

–

379

.

Schaubel

DE

,

Cai

J.

2004

.

Regression methods for gap time hazard functions of sequentially ordered multivariate failure time data

.

Biometrika

.

91

:

291

–

303

.

Schrödinger

E.

1915

.

Zur theorie der fall-und steigversuche an teilchen mit brownscher bewegung

.

Phys Zeitschrift

.

16

:

289

–

295

.

Seaquist

ER

,

Anderson

J

,

Childs

B

,

Cryer

P

,

Dagogo-Jack

S

,

Fish

L

,

Heller

SR

,

Rodriguez

H

,

Rosenzweig

J

,

Vigersky

R.

2013

.

Hypoglycemia and diabetes: a report of a workgroup of the American Diabetes Association and the Endocrine Society

.

Diabetes Care

.

36

:

1384

–

1395

.

Spiegelhalter

DJ

,

Best

NG

,

Carlin

BP

,

Van Der Linde

A.

2002

.

Bayesian measures of model complexity and fit

.

J R Stat Soc Ser B Stat Methodol

.

64

:

583

–

639

.

Sun

L

,

Park

D-H

,

Sun

J.

2006

.

The additive hazards model for recurrent gap times

.

Stat Sin

.

16

:

919

–

932

.

Towler

DA

,

Havlin

CE

,

Craft

S

,

Cryer

P.

1993

.

Mechanism of awareness of hypoglycemia: perception of neurogenic (predominantly cholinergic) rather than neuroglycopenic symptoms

.

Diabetes

.

42

:

1791

–

1798

.

Wei

L-J

,

Lin

DY

,

Weissfeld

L.

1989

.

Regression analysis of multivariate incomplete failure time data by modeling marginal distributions

.

J Am Stat Assoc

.

84

:

1065

–

1073

.

Whitmore

GA

,

Ramsay

T

,

Aaron

SD.

2012

.

Recurrent first hitting times in wiener diffusion under several observation schemes

.

Lifetime Data Anal

.

18

:

157

–

176

.

Wild

D

,

von Maltzahn

R

,

Brohan

E

,

Christensen

T

,

Clauson

P

,

Gonder-Frederick

L.

2007

.

A critical review of the literature on fear of hypoglycemia in diabetes: implications for diabetes management and patient education

.

Patient Educ Couns

.

68

:

10

–

15

.

Xu

G

,

Chiou

SH

,

Yan

J

,

Marr

K

,

Huang

C-Y.

2020

.

Generalized scale-change models for recurrent event processes under informative censoring

.

Stat Sin

.

30

:

1773

–

1795

.

PubMed