Empirical Evaluation of Competing High-Frequency Estimators of Quadratic Variation

Abstract

We propose methods for testing hypotheses about differences in bias, differences in error variance, and differences in the mean squared errors of competing estimators of quadratic variation computed using intradaily data. Our approach works under reasonably mild assumptions for members of a class of estimators that may be written as a quadratic form. We prove bootstrap limit theorems that facilitate the use of our tests with multiple hypothesis testing methodologies and investigate finite-sample properties under a range of situations using simulations. We apply our approach to a comparison of competing volatility estimators for a large cross-section of the most liquid stocks traded on the New York Stock Exchange and find that noise-robust volatility estimators generate lower mean-squared errors than 5-min realized volatility for many stocks.

Those who wish to estimate the daily volatility of asset returns using intradaily data are spoiled for choice. The last three decades have seen many applicable estimators proposed, including the standard realized volatility (RV) estimator (Andersen et al. 2001; Barndorff-Nielsen and Shephard 2002; Bandi and Russell 2008), the first-order-autocorrelation adjusted RV estimator (French, Schwert, and Stambaugh 1987; Zhou 1996; Hansen and Lunde 2006), the two-scale (TSRV), and multi-scale RV (MSRV) estimators (Zhang, Mykland, and Ait-Sahalia 2005a; Zhang 2006), the Realized Kernel (RK) estimator (Barndorff-Nielsen et al. 2008), the quasi-maximum likelihood estimator (QMLE; Ait-Sahalia, Mykland, and Zhang 2005; Xiu 2010) and the preaveraged RV (PARV) estimator (Jacod et al. 2009), to name but a few of the most well-established methods. These estimators may be implemented using data on quote or trade prices, measured in calendar-time or tick-time, at a range of different frequencies. Many of these methods also require choices to be made about bandwidths, window widths, kernel functions, etc. This broad menu of estimators presents empirical analysts with a conundrum: which estimator should be used for a particular application of interest?

In a theoretical setting with no microstructure noise, there exists wide agreement that the best choice of estimator is generally the simple RV estimator computed using the highest frequency data available (Andersen et al. 2001). However, as is well known, in the presence of microstructure noise, the RV estimator becomes severely biased at high frequencies. Two broad approaches exist to circumvent this problem. The first is to implement the simple RV estimator using an intermediate data frequency—high enough for the variance to be reasonably small, but not so high as to create severe bias. Five-minute RV is often chosen as a trade-off between these two concerns, although this choice of frequency is often arbitrary. At best, the frequency might be chosen based on a visual inspection of a volatility signature plot (e.g. Awartani 2008; Degiannakis and Floros 2016; Shen, Urquhart, and Wang 2020; Bandi and Russell 2008). The second approach is to implement one of the many noise-robust estimators that have been proposed in the literature. In theory, these estimators are asymptotically unbiased, eliminating the need to choose a sampling frequency that trades off bias for variance. However, the extent to which unbiasedness is achieved by these estimators in empirical applications is unclear. Despite the volume of literature proposing and applying high-frequency volatility estimators, relatively little work has been done on the empirical evaluation of competing estimators.

Ait-Sahalia and Xiu (2019) introduce a Hausman test of the presence of microstructure noise in high-frequency data. This is constructed as a test of the difference between the RV estimator and a maximum likelihood estimator (MLE). It should be noted that the Ait-Sahalia and Xiu (2019) test detects bias only. If the aim is to trade bias for variance, then it provides only part of the necessary information. Patton (2011) proposes a method for testing the equality of values of a class of loss function for two high-frequency volatility estimators. The mean squared error (MSE) is a member of the class, so the method facilitates the empirical evaluation of competing estimators in terms of a particular bias-variance trade-off. However, it does not provide separate evaluations of the bias and variance. Liu, Patton, and Sheppard (2015) have used Patton’s method with a QLIKE loss function in a comprehensive analysis of over 400 different implementations of 8 different types of RV estimator applied to 31 different financial assets. They consider quote and transaction prices with tick- and calendar-time observations, with sampling frequencies ranging from 1 s to 15 min. They find little evidence that any of the estimators considered is superior to the 5-min RV estimator. This result has been widely cited in the literature,¹ usually as a justification of the use of the 5-min RV estimator instead of a noise-robust estimator in applied work. It should be noted that rankings of estimators based on the QLIKE loss function do not necessarily correspond to rankings based on the MSE. However, in Supplementary Appendix, Liu, Patton, and Sheppard (2015) report results computed using the MSE instead of QLIKE, and these also fail to find evidence of other estimators being superior to 5-min RV, a result that they attribute to a lack of power. 
Gatheral and Oomen (2010) adopt a different approach, comparing the bias and MSE of a range of RV estimators (that includes 5-min RV) using data generated by a simulated order book market. In contrast to the empirical work of Liu, Patton, and Sheppard (2015), they find that the simple RV estimator is consistently one of the worst-performing estimators, irrespective of the sampling frequency used. Their overall recommendation is that practitioners should use TSRV, MSRV, or RK computed with an ad-hoc choice of tuning parameter and the highest available frequency of data. Nonetheless, the fact that their data are simulated raises the question of whether their results would hold with data generated by real markets.

In this article, we propose tests of equality for the bias, error variance and MSE for pairs of estimators of quadratic variation estimated using intradaily returns data. We prove stationary bootstrap limit theorems that allow our tests to be implemented with multiple hypothesis testing methodologies including White’s (2000) reality check, Hansen’s (2005) superior predictive ability (SPA) test, the STEP-M and generalized STEP-M procedures of Romano and Wolf (2005) and Romano and Wolf (2007), and the Model Confidence Set of Hansen, Lunde, and Nason (2011).

Like the Ait-Sahalia and Xiu (2019) test, our test of bias may be applied to simple RV estimators of different frequencies to determine whether they are unduly impacted by microstructure noise. However, our test may also be applied to any other high-frequency volatility estimator under only very mild assumptions about the estimation errors. Also, while the Ait-Sahalia and Xiu (2019) test considers the null hypothesis of equal bias on a single trading day, our test considers the average bias over a large number of trading days. We are unaware of any previously published tests of the equality of variance of the errors of high-frequency volatility estimators. Our test of equal MSE differs from Patton’s in two main ways. First, Patton’s approach is based on the assumption that the latent interdaily volatility process is a simple random walk. In contrast, our approach assumes a standard diffusion process for the intradaily asset price, makes some mild assumptions about microstructure noise, and allows the first difference of the interdaily volatility to be a member of a fairly general class of near-epoch dependent (NED) processes. Second, the simulation experiment that we report in Section 2 suggests that our test of equal MSE has a considerable power advantage over the equivalent test proposed by Patton. We pay two prices for these advantages. First, since our method exploits particular properties of the MSE, it cannot be generalized to other Bregman-type loss functions such as QLIKE. Since the MSE loss function is easily interpreted and widely used, we don’t regard this as a significant drawback. Second, while our test of equal bias applies to all volatility estimators, our tests of equal variance and equal MSE exploit a property that is a feature of a particular class of volatility estimators. We show that this class includes the RV estimator at all frequencies, and the RK, TSRV,² MSRV, PARV,³ QMLE, and FOAC estimators. 
However, the applicability to estimators outside this class needs to be established on a case-by-case basis.

The remainder of this article is arranged as follows. In Section 1, we state and explain our assumptions, present our test statistics for bias, variance and MSE, and state some theoretical properties that justify their use. The proofs are presented in the Appendix. In Section 2, we present the results of simulation studies that examine one of our key assumptions and investigate the size and power of our three test statistics assuming a range of different models for intradaily asset prices and their volatilities. We also compare the performance of our test for equal MSE with the corresponding test proposed by Patton (2011). In Section 3, we present an empirical study of the comparative bias, error variance and MSE of the RV, TSRV, MSRV, RK, PARV, and QMLE estimators applied to fifty of the most liquid stocks traded on the NYSE using a range of bandwidths, window lengths and subsamples. We also consider optimal parameter selection methods. In contrast to Liu, Patton, and Sheppard (2015), we find considerable evidence that there exist estimators that beat 5-min RV for many stocks in terms of the MSE, and we are able to explain the relative performance in terms of the comparative biases and variances. In Section 4, we draw some conclusions.

1 Main Results

Let $t = 1, …, T$ index a sequence of trading days and let $θ_{t}$ denote the quadratic variation of a variable on day t. Let x_kt, $k \in {i, j}$ denote a pair of estimators of $θ_{t}$, such that $x_{k t} = θ_{t} + u_{k t}$, with u_kt denoting the estimation error. Let ${\tilde{θ}}_{t}$ denote a proxy for $θ_{t}$, and ${\tilde{u}}_{t}$ denote the corresponding proxy error ${\tilde{u}}_{t} = {\tilde{θ}}_{t} - θ_{t}$.

If $θ_{t}$ were observable, then the estimation of the difference between the MSEs of the two estimators would be straightforward. Since it is not, and since the pairwise covariances of u_it, u_jt, and ${\tilde{u}}_{t}$ are likely to be non-zero, an identification problem exists. Patton’s (2011) solution to this problem is to assume that $E (Δ θ_{t} | F_{t - 1}) = 0$ and $E ({\tilde{u}}_{t} | F_{t - 1}, θ_{t}) = 0$. Under these conditions, $MSE (x_{k t}) = E {(x_{k t} - θ_{t})}^{2} = E {(x_{k t} - {\tilde{θ}}_{t + 1})}^{2}$, so MSEs may be estimated using the observed estimators and the future observed values of the proxy. This motivates Patton’s test statistic

${\bar{γ}}_{P : MSE} = \frac{1}{T} \sum_{t = 1}^{T - 1} [{(x_{i t} - {\tilde{θ}}_{t + 1})}^{2} - {(x_{j t} - {\tilde{θ}}_{t + 1})}^{2}]$
(1)

While it provides an elegant solution to the identification problem and a simple statistic, the assumption that the latent daily volatility process $θ_{t}$ follows a simple random walk is strong, and may not be satisfied in practical applications.
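As an illustration of Equation (1), the following minimal sketch computes Patton's statistic with MSE loss from three equal-length daily series. The function name `patton_mse_stat` and its argument names are hypothetical and are not taken from Patton (2011); the sketch assumes that the arrays are aligned by trading day.

```python
import numpy as np


def patton_mse_stat(x_i, x_j, proxy):
    """Average MSE-loss differential of Equation (1).

    x_i, x_j : daily values of the two competing estimators
    proxy    : daily values of the proxy for the quadratic variation
    """
    x_i, x_j, proxy = (np.asarray(a, dtype=float) for a in (x_i, x_j, proxy))
    T = len(proxy)
    lead = proxy[1:]  # proxy observed on day t + 1, for t = 1, ..., T - 1
    d = (x_i[:-1] - lead) ** 2 - (x_j[:-1] - lead) ** 2
    return d.sum() / T  # divisor is T, as written in Equation (1)
```

A negative value indicates that the first estimator has the smaller estimated loss over the sample.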

In contrast, our approach is based on the following assumptions:

Assumption 1.

$E ({\tilde{u}}_{t}) = 0$

Assumption 2.

$cov (θ_{t}, u_{i t - 1} - u_{j t - 1}) = cov (θ_{t - 1}, u_{i t} - u_{j t}) = cov (u_{k t}, u_{k t - 1}) = 0, k \in {i, j}$

Assumption 3.

$\frac{1}{T} \sum_{t = 1}^{T} {(μ_{ukt} - {\bar{μ}}_{u k})}^{2} = o (T^{- \frac{1}{2}})$ for $k \in {i, j}$, where $μ_{ukt} = E (u_{k t})$ and ${\bar{μ}}_{u k} = \frac{1}{T} \sum_{t = 1}^{T} μ_{ukt}$.

Assumption 4.

$cov (θ_{t}, u_{i t} - u_{j t}) = 0$

Assumption 1 may be satisfied by using, for example, a low-frequency RV estimator as the proxy⁴. Assumption 2 is motivated by our belief that the current values of the estimation errors u_kt do not provide useful information for predicting the future estimation errors or quadratic variation. The fact that all the popular high-frequency estimators of $θ_{t}$ are constructed using data from only day t suggests that our belief is widespread. Note that, while there exists evidence of intradaily autocorrelation of microstructure noise that spans several ticks (e.g. Li and Linton 2022; Li, Laeven, and Vellekoop 2020; Li et al. 2022), this implies at worst a negligible degree of dependence between the estimation errors on successive days and, since the estimation error is likely to be related to the sum of the squared microstructure noise terms, does not imply autocorrelation of the daily estimation error. Note also that Li et al. (2022) find evidence that the variance of the microstructure error has predictive power for the quadratic variation on future days—which may result in some correlation between the daily estimation error and the quadratic variation on the following day. However, they find that, while statistically significantly different from zero, the reduction in the out-of-sample root mean squared prediction error from including the microstructure noise in a heterogeneous autoregressive (HAR) model is only 0.054%, so any impact on the veracity of Assumption 2 is likely to be negligible (see Li et al. 2022, Table 6, Panel A, Row 2). In Supplementary Appendix to this article (Supplementary Appendix B), we estimate the values of $cov (θ_{t + 1}, u_{t}) / E (u_{t}^{2}), cov (θ_{t}, u_{t + 1}) / E (u_{t}^{2})$ and $cov (u_{t + 1}, u_{t}) / E (u_{t}^{2})$ for a range of estimators using values simulated with the estimated HAR model of Li et al. (2022), including lagged values of the variance of microstructure noise, and find that they are inconsequentially small.

Table 6

Number of rejections of null of equal or greater loss than RV-5 min using ${\bar{γ}}_{P : MSE}$ (Patton’s statistic with MSE loss) with confidence level of 0.1, and FDP of 0.1

Bandwidth	5	20	60	120	300	600	Opt
RV	1	1	0	0		0	0
TSRV	1	1	4	5	8	0	1
MSRV	1	1	2	5	7	0	1
RK	1	1	1	1	3	0	3
PARV	1	1	1	4	13	4	1
QMLE	1	0	1	0	0	0

Assumption 3 limits the variability of the daily bias of the estimation errors. If the estimation errors are mean-stationary, then Assumption 3 holds trivially. Assumption 4 requires a more detailed justification. It follows from standard regression theory that if $cov (θ_{t}, u_{k t}) \neq 0$ then there exists a recentered, rescaled version of x_kt that has a lower MSE. Consequently, $cov (θ_{t}, u_{k t}) = 0$ might be regarded as a property that a “good” estimator should possess. However, this observation does not guarantee that the estimators in which we are interested will have this property. Below, we argue that, under fairly general conditions, for comparisons of estimators from a particular class, the relevant covariances will cancel out so that Assumption 4 holds.

For each trading day $t \in {1, …, T}$, for $τ \in (0, N_{t} - 1)$, we model the evolution of the logged asset price by the jump diffusion equation

$d p_{t} (τ) = μ_{t} (τ) d τ + σ_{t} (τ) d W_{t} (τ) + ω_{t} (τ) d J_{t} (τ)$
(2)

where N_t is the number of intraday price observations on day t, $μ_{t} (τ)$ is a deterministic locally bounded drift, and $σ_{t} (τ)$ is a non-negative stochastic process adapted to the filtration of the Wiener process $W_{t} (τ)$. We assume that increments in $σ_{t} (τ)$ are independent of increments in $W_{t} (τ)$ at all leads and lags during Day t (the no-leverage condition). $J_{t} (τ)$ is a Poisson process and $ω_{t} (τ)$ is the jump size, both of which are assumed to be independent of $W_{t} (τ)$ and $σ_{t} (τ)$.
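To make Equation (2) concrete, the sketch below simulates one trading day of the efficient log-price with an Euler scheme. It is a simplified illustration rather than the simulation design used in this article: volatility is held constant within the day, jump sizes are assumed to be mean-zero normal, and the day is rescaled to the unit interval. The function name `simulate_log_price` and all parameter values are hypothetical.

```python
import numpy as np


def simulate_log_price(n_steps=23400, mu=0.0, sigma=0.01, jump_rate=1.0,
                       jump_sd=0.002, p0=0.0, seed=0):
    """Euler scheme for dp = mu dτ + sigma dW + omega dJ over one day of length 1.

    Returns the simulated log-price path (n_steps + 1 points) and the
    quadratic variation of the simulated path.
    """
    rng = np.random.default_rng(seed)
    dt = 1.0 / n_steps
    dW = rng.normal(0.0, np.sqrt(dt), n_steps)       # Wiener increments
    n_jumps = rng.poisson(jump_rate * dt, n_steps)   # Poisson increments dJ
    # The sum of n iid N(0, jump_sd^2) jump sizes is N(0, n * jump_sd^2).
    jumps = rng.normal(0.0, 1.0, n_steps) * jump_sd * np.sqrt(n_jumps)
    dp = mu * dt + sigma * dW + jumps
    p = p0 + np.concatenate(([0.0], np.cumsum(dp)))
    # Integrated variance (constant sigma) plus squared jumps. Two jumps in
    # one step are lumped together, which is negligible for small dt.
    qv = sigma ** 2 + float(np.sum(jumps ** 2))
    return p, qv
```

With `jump_rate=0.0` the quadratic variation reduces to `sigma ** 2`, and the sum of squared increments of the returned path should be close to that value when `n_steps` is large.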

Barndorff-Nielsen and Shephard (2002) show that in the absence of the jump component, if $μ_{t} (τ) = 0$, $p_{t} (τ)$ is directly observable, no leverage effect exists, and x_kt is the simple RV estimator computed at any frequency, then $cov (θ_{t}, u_{k t}) = 0$, so Assumption 4 is satisfied under these conditions for the RV estimator. Meddahi (2002) allows for the leverage effect to exist⁵ and for a non-zero drift, and finds that $cov (θ_{t}, u_{k t}) \neq 0$ under these conditions. However, he also shows that, for a broad class of stochastic volatility models, $corr (θ_{t}, u_{k t}) = O (N_{t}^{- \frac{3}{2}})$. Consequently, while non-zero, the relevant covariance would be expected to converge to zero relatively quickly as the number of intraday observations grows. Meddahi (2002) also shows that, for the models estimated by Andersen, Benzoni, and Lund (2002), the empirical magnitude of $corr (θ_{t}, u_{k t})$ is negligible. For example, for 1-h RV, he finds values of the order of $10^{- 5}$ or smaller for this correlation (Table III in Meddahi 2002). For higher-frequency RVs, the magnitude is even smaller. In Supplementary Appendix A, we simulate some popular models of asset prices that exhibit the leverage effect and show that the impact of leverage on our results is inconsequential. For this reason, and to avoid unnecessarily complicating the analysis, we will assume an absence of leverage in our subsequent analysis. Nonetheless, the model of Barndorff-Nielsen and Shephard (2002) and Meddahi (2002) is still too restrictive for our purposes. Accordingly, we generalize it in three ways.

Table 3

Number of rejections of null of equal or greater loss than RV-5 min using ${\bar{γ}}_{M}$ (our MSE statistic)

Bandwidth	5	20	60	120	300	600	900	Opt
RV	3	1	1	0		0	0	0
TSRV	4	2	5	8	7	1	1	4
MSRV	4	2	5	7	5	0	0	4
RK	4	2	2	2	2	0	0	2
PARV	4	2	5	8	15	2	1	5
QMLE	2	0	0	0	0	0	0

Note: The table elements state the number of securities for which the null of equal or worse MSE than 5-min RV is rejected (max = 50).

First, we allow the jump component to exist. Many authors (e.g. Huang and Tauchen 2005; Andersen, Bollerslev, and Diebold 2007) document evidence that jumps represent an important component of asset price volatility. Consequently, the addition of jumps is a worthwhile generalization of the model. Secondly, we assume that the log-price in Equation (2) is not directly observable and instead is observed subject to microstructure noise. Using equities listed on the NYSE and NASDAQ, Hansen and Lunde (2006) find evidence that microstructure noise is correlated with the efficient price, is serially dependent, and has properties that change substantially over time. Jacod, Li, and Zheng (2017), Li, Laeven, and Vellekoop (2020), Li et al. (2022), and Li and Linton (2022) also report evidence of intradaily autocorrelation of microstructure noise. Diebold and Strasser (2013) provide a theoretical model of market trading which explains the correlation of microstructure noise with the efficient price. Accordingly, we specify a model for the observed price that allows the microstructure noise to be serially dependent, heteroscedastic, and contemporaneously correlated with the underlying efficient price. This model is presented as Equation (3).

$\begin{aligned} {\tilde{p}}_{t} (n) &= p_{t} (n) + η_{t} (n) \\ η_{t} (n) &= β_{t} (p_{t} (n) - E (p_{t} (n))) + ε_{t} (n) \\ & ε_{t} (n) \perp\!\!\!\perp σ_{t} (m), W_{t} (m), ω_{t} (m), J_{t} (m), β_{t}; \quad E (ε_{t} (n)) = 0 \\ & β_{t} \perp\!\!\!\perp σ_{t} (m), W_{t} (m); \quad E (β_{t}^{2}) < \infty \\ & \text{for all } n \in (0, …, N_{t} - 1), m \in (0, …, N_{t} - 1) \end{aligned}$

(3)

where ${\tilde{p}}_{t} (n)$ is the observable log-price, $η_{t} (n)$ is the microstructure noise, $β_{t}$ is a stochastic parameter that is fixed within any trading day, but may vary over trading days, and $ε_{t} (n)$ is a stochastic error term that is assumed to be independent of the other random variables in the model. Note that our model is quite general. If we set $β_{t} = 0$, then the microstructure noise $η_{t} (n)$ is independent of the underlying price $p_{t} (n)$. However, this restriction is not needed for our subsequent arguments, so dependence between the microstructure noise and the underlying price is accommodated. Furthermore, since $β_{t}$ is a (daily) random variable, the dependence between the microstructure noise and the underlying asset price may change across days. Similarly, the properties of $ε_{t} (n)$ may be different on different days. Also, we do not need to restrict the intradaily autocorrelation structure of $ε_{t} (n)$. Consequently, the intradaily autocorrelation structure of the microstructure noise $η_{t} (n)$ is not restricted. Similarly, we do not need to impose homoscedasticity on $ε_{t} (n)$ or the microstructure noise $η_{t} (n)$. Finally, apart from non-negativity, the no-leverage assumption, and the independence from $β_{t}$ and $ε_{t} (n)$, we do not need to restrict the volatility process $σ_{t} (m)$. Thus, for example, the volatility could include a jump process without it affecting our arguments below.
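The observation equation (3) can be illustrated with a short sketch that contaminates an efficient log-price path with noise. Several choices here are assumptions made only for illustration: $ε_{t} (n)$ is generated as a Gaussian AR(1) process with autocorrelation `rho` (one of many serially dependent specifications that the model permits), and $E (p_{t} (n))$ is replaced by the opening price, which is appropriate only under zero drift and mean-zero jumps. The function name `add_microstructure_noise` is hypothetical.

```python
import numpy as np


def add_microstructure_noise(p, beta=0.0, eps_sd=5e-4, rho=0.5, seed=0):
    """Return (observed log-price, noise) following the structure of Equation (3).

    p      : efficient log-price path for one day
    beta   : day-specific loading of the noise on the demeaned efficient price
    eps_sd : unconditional standard deviation of the AR(1) term epsilon
    rho    : first-order autocorrelation of epsilon (illustrative assumption)
    """
    rng = np.random.default_rng(seed)
    p = np.asarray(p, dtype=float)
    n = len(p)
    shocks = rng.normal(0.0, eps_sd * np.sqrt(1.0 - rho ** 2), n)
    eps = np.empty(n)
    eps[0] = rng.normal(0.0, eps_sd)       # draw from the stationary distribution
    for k in range(1, n):
        eps[k] = rho * eps[k - 1] + shocks[k]
    # Assumption: E(p_t(n)) = p_t(0), i.e. zero drift and mean-zero jumps.
    eta = beta * (p - p[0]) + eps
    return p + eta, eta
```

Setting `beta=0.0` gives noise that is independent of the efficient price, while a non-zero `beta` induces the contemporaneous correlation discussed above.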

Our third generalization of the model is that, instead of restricting attention to the simple RV estimator, we consider a class of estimators that may be written as a quadratic form

$x_{k t} = {\tilde{r}}_{t}^{'} A_{k} {\tilde{r}}_{t} = \sum_{n = 1}^{N_{t}} \sum_{m = 1}^{N_{t}} a_{knm} {\tilde{r}}_{n t} {\tilde{r}}_{m t}, k \in {i, j}$
(4)

where ${\tilde{r}}_{t} = ({\tilde{r}}_{1 t} … {\tilde{r}}_{N_{t} t})'$, ${\tilde{r}}_{n t} = {\tilde{p}}_{t} (n) - {\tilde{p}}_{t} (n - 1)$, $n = 1, …, N_{t}$, and A_k is a symmetric matrix with ones on the diagonal (i.e. a_knn = 1 for all $n = 1, …, N_{t}$). Trivially, the simple RV estimator computed using all available returns may be written in this form (a_knm = 0 for $m \neq n$). Andersen, Bollerslev, and Meddahi (2011) show that the RV estimator computed at lower frequencies is also a member of this class, as are the MSRV, RK, and FOAC estimators. TSRV is a member of this class when it includes the small-sample adjustment given by Equation (64) in Zhang, Mykland, and Ait-Sahalia (2005b). In the absence of the small-sample adjustment, the result holds asymptotically as the number of grids used for subsampling grows. Xiu (2010) shows that the QMLE estimator may be written in the form given by Equation (4). In Lemma 5.1 in the Appendix, we show that the PARV estimator becomes a member of this class as the width of the preaveraging window grows.

The rationale for Assumption 4 is now presented as Proposition 1.1, which is proved in the Appendix.

Proposition 1.1.

For the model given by Equations (2) and (3) and the class of estimators defined by Equation (4), Assumption 4 holds.

For estimators that cannot be written as a quadratic form as in Equation (4), Assumption 4 would need to be investigated on a case-by-case basis. In cases where this is mathematically challenging, an alternative approach is to use simulation to estimate $cov (θ_{t}, u_{i t} - u_{j t})$ for particular data-generating processes (DGPs) and estimators.⁶ In cases where $cov (θ_{t}, u_{i t} - u_{j t})$ is found to be negligibly small for a range of DGPs, our test might be considered useful, even in the absence of a formal proof of Assumption 4.

Note that $var (u_{i t}) - var (u_{j t}) = var (x_{i t}) - var (x_{j t}) - 2 cov (θ_{t}, u_{i t} - u_{j t})$, so Assumption 4 is sufficient for the identification of the difference in the error variances of two estimators. More trivially, $E (u_{i t}) - E (u_{j t}) = E (x_{i t}) - E (x_{j t})$ so the difference in the bias of the two estimators is identified. These results, combined with the unbiased proxy given by Assumption 1 are sufficient for the identification of the MSE. Thus, with the addition of some assumptions about weak dependence and the finiteness of moments, we are able to construct statistics for testing the equality of the bias, error variance, and MSE of two estimators x_it and x_jt. We now turn our attention to this task.
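For convenience, the two identities used in the preceding paragraph can be spelled out from the decomposition $x_{k t} = θ_{t} + u_{k t}$. The steps below involve only standard variance algebra and add no assumptions beyond the existence of second moments.

```latex
\begin{aligned}
var(x_{kt}) &= var(θ_t) + var(u_{kt}) + 2\,cov(θ_t, u_{kt}), \qquad k \in \{i, j\}, \\
var(x_{it}) - var(x_{jt}) &= var(u_{it}) - var(u_{jt}) + 2\,cov(θ_t, u_{it} - u_{jt}), \\
E(x_{it}) - E(x_{jt}) &= \bigl(E(θ_t) + E(u_{it})\bigr) - \bigl(E(θ_t) + E(u_{jt})\bigr) = E(u_{it}) - E(u_{jt}).
\end{aligned}
```

Rearranging the second line gives the expression in the text, and setting the covariance term to zero (Assumption 4) leaves the difference in error variances equal to the difference in the variances of the observable estimators.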

Firstly, we define the statistics that we will use to measure the differences in bias, variance, and MSE respectively:

1. ${\bar{γ}}_{B} = \frac{1}{T} \sum_{t = 1}^{T} γ_{B t}$ where $γ_{B t} = x_{i t} - x_{j t}$
(5)
2. ${\bar{γ}}_{V} = \frac{1}{T} \sum_{t = 1}^{T} γ_{V t}$ where $γ_{V t} = \frac{1}{2} {(Δ x_{i t} - {\bar{Δ x}}_{i})}^{2} - \frac{1}{2} {(Δ x_{j t} - {\bar{Δ x}}_{j})}^{2}$
(6)
3. ${\bar{γ}}_{M} = \frac{1}{T} \sum_{t = 1}^{T} γ_{M t}$ where $γ_{M t} = \frac{1}{2} {(Δ x_{i t} - {\bar{Δ x}}_{i})}^{2} - \frac{1}{2} {(Δ x_{j t} - {\bar{Δ x}}_{j})}^{2} + (x_{i t} - {\tilde{θ}}_{t}) ({\bar{x}}_{i} - \bar{\tilde{θ}}) - (x_{j t} - {\tilde{θ}}_{t}) ({\bar{x}}_{j} - \bar{\tilde{θ}})$
(7)
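The three statistics in Equations (5) to (7) are simple to compute from daily series. The sketch below is one possible implementation under a stated simplification: because $Δ x_{k t}$ is unavailable for the first day in the sample, the variance and MSE statistics are averaged over the remaining T - 1 days, whereas the sample means of the levels use all T days. The function name `comparison_statistics` is hypothetical.

```python
import numpy as np


def comparison_statistics(x_i, x_j, proxy):
    """Return (gamma_bar_B, gamma_bar_V, gamma_bar_M) for two estimators.

    x_i, x_j : daily values of the two competing estimators
    proxy    : daily values of the unbiased proxy (Assumption 1)
    """
    x_i, x_j, proxy = (np.asarray(a, dtype=float) for a in (x_i, x_j, proxy))
    gamma_B = x_i - x_j                                   # Equation (5)
    dx_i, dx_j = np.diff(x_i), np.diff(x_j)               # first differences
    gamma_V = (0.5 * (dx_i - dx_i.mean()) ** 2
               - 0.5 * (dx_j - dx_j.mean()) ** 2)         # Equation (6)
    bias_i = x_i.mean() - proxy.mean()                    # estimated bias of x_i
    bias_j = x_j.mean() - proxy.mean()                    # estimated bias of x_j
    cross = (x_i - proxy) * bias_i - (x_j - proxy) * bias_j
    gamma_M = gamma_V + cross[1:]                         # Equation (7), days 2..T
    return gamma_B.mean(), gamma_V.mean(), gamma_M.mean()
```

The cross-product term in the MSE statistic averages to an estimate of the difference in squared biases, so the third statistic combines the variance difference with the squared-bias difference.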

The null hypotheses of interest are $H_{B} : E ({\bar{γ}}_{B}) = 0$, $H_{V} : E ({\bar{γ}}_{V}) = 0$, and $H_{M} : E ({\bar{γ}}_{M}) = 0$, which are, respectively, the hypotheses that the difference in biases is zero, the difference in variances is zero, and the difference in MSEs is zero under Assumptions 1, 2, and 4. In subsequent theorems, we will make use of the following assumptions:

Assumption 5.

$\exists r > 2$ and $c < \infty$ such that:

1) a) $\max_{1 ⩽ t ⩽ T} \| Δ θ_{t} \|_{2 r} < c$.
b) For $k \in {i, j}$, $\max_{1 ⩽ t ⩽ T} \| u_{k t} \|_{2 r} < c$.
c) $\max_{1 ⩽ t ⩽ T} \| {\tilde{u}}_{t} \|_{2 r} < c$.
2) Z_t is a strong mixing process of size $- \frac{r}{r - 2}$ and
a) $Δ θ_{t}$ is L₄-NED of size $- \frac{1}{2}$ on Z_t.
b) For $k \in {i, j}$, u_kt is L₄-NED of size $- \frac{1}{2}$ on Z_t.
c) ${\tilde{u}}_{t}$ is L₄-NED of size $- \frac{1}{2}$ on Z_t.

It should be noted that most published papers that introduce new high-frequency estimators of quadratic variation include limit theorems that usually prove that, when suitably centered and rescaled, the estimation error (u_kt) converges to a mixed normal distribution as the number of intradaily observations grows. This, and the fact that each daily estimator is computed using a different dataset, suggests that the assumptions made above about the properties of ${\tilde{u}}_{t}$ and u_kt for $k \in {i, j}$ are quite mild.

Note also that we make mild assumptions about the dynamic behavior of the daily quadratic variation ($θ_{t}$). Specifically, our approach allows the first difference of the quadratic variation to be any member of a broad class of NED processes.

The following results state the relationship between the statistics and the objects of interest, and are proved in the Appendix:

Proposition 1.2.

1. Under Assumptions 5(1)(b) and 5(2)(b), ${\bar{γ}}_{B} - \frac{1}{T} \sum_{t = 1}^{T} (E (u_{i t}) - E (u_{j t})) \overset{p}{\to} 0$,
2. Under Assumptions 2, 3, 4, 5(1)(a), 5(1)(b), 5(2)(a), and 5(2)(b), ${\bar{γ}}_{V} - \frac{1}{T} \sum_{t = 1}^{T} (var (u_{i t}) - var (u_{j t})) \overset{p}{\to} 0$, and
3. Under Assumptions 1, 2, 3, 4, and 5, ${\bar{γ}}_{M} - \frac{1}{T} \sum_{t = 1}^{T} (MSE (u_{i t}) - MSE (u_{j t})) \overset{p}{\to} 0$.
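A heuristic sketch may help to show why first differences identify the variance difference in the second result. It is not the proof given in the Appendix: it treats the sample-mean centering as if it were the population mean, assumes that Assumption 4 holds on both day t and day t - 1, and introduces the shorthand $d_{t} = u_{i t} - u_{j t}$, which is not notation from the article.

```latex
\begin{aligned}
\tfrac{1}{2}\,var(Δ x_{kt}) &= \tfrac{1}{2}\,var(Δ θ_t) + \tfrac{1}{2}\,var(Δ u_{kt}) + cov(Δ θ_t, Δ u_{kt}), \\
var(Δ u_{kt}) &= var(u_{kt}) + var(u_{k t-1}) - 2\,cov(u_{kt}, u_{k t-1}) = var(u_{kt}) + var(u_{k t-1}) \quad \text{(Assumption 2)}, \\
cov(Δ θ_t, Δ d_t) &= cov(θ_t, d_t) - cov(θ_t, d_{t-1}) - cov(θ_{t-1}, d_t) + cov(θ_{t-1}, d_{t-1}) = 0 \quad \text{(Assumptions 2 and 4)}, \\
\tfrac{1}{2}\,var(Δ x_{it}) - \tfrac{1}{2}\,var(Δ x_{jt}) &= \tfrac{1}{2}\bigl[var(u_{it}) - var(u_{jt})\bigr] + \tfrac{1}{2}\bigl[var(u_{i t-1}) - var(u_{j t-1})\bigr].
\end{aligned}
```

The term $\frac{1}{2} var (Δ θ_{t})$ is common to both estimators and cancels in the difference, so no model for the level of $θ_{t}$ is needed. Averaging the last line over t gives the average difference in error variances, up to end effects.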

Our objective is to be able to test multiple hypotheses of equality of bias, error variance, and MSE for large sets of estimators. For example, we might be interested in comparisons of RV estimators computed at many different frequencies, or RK estimators with different kernels and/or bandwidths, or a comparison of RV versus RK versus TSRV versus MSRV, etc. This requires a convergence result for a suitable bootstrap algorithm in order to justify the use of techniques such as White’s (2000) reality check, Hansen’s (2005) SPA test, the STEP-M and generalized STEP-M procedures of Romano and Wolf (2005) and Romano and Wolf (2007), and the model confidence set of Hansen, Lunde, and Nason (2011). For this purpose, we implement the stationary bootstrap of Politis and Romano (1994), and we refer the reader to that paper for details of the procedure. This requires another assumption:

Assumption 6.

For the stationary bootstrap with geometrically distributed block lengths with success probability k_T, $T k_{T} \to \infty$ as $k_{T} \to 0$ and $T \to \infty$ ⁠.
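For readers unfamiliar with the procedure, the following sketch generates one set of resampling indices in the manner usually described for the stationary bootstrap: each resampled observation starts a new block at a uniformly chosen position with probability `k`, and otherwise continues the current block, wrapping around the end of the sample. The function name `stationary_bootstrap_indices` is hypothetical, and the reader should consult Politis and Romano (1994) for the definitive description.

```python
import numpy as np


def stationary_bootstrap_indices(T, k, rng):
    """One stationary-bootstrap resample of the indices 0, ..., T - 1.

    T   : sample size
    k   : probability of starting a new block (expected block length 1 / k)
    rng : numpy.random.Generator
    """
    idx = np.empty(T, dtype=int)
    idx[0] = rng.integers(T)                 # random start of the first block
    new_block = rng.random(T) < k            # True where a new block begins
    starts = rng.integers(T, size=T)         # candidate block starting points
    for t in range(1, T):
        if new_block[t]:
            idx[t] = starts[t]
        else:
            idx[t] = (idx[t - 1] + 1) % T    # continue the block circularly
    return idx
```

Under Assumption 6 the expected block length, 1 / k, grows with the sample size, but more slowly than T itself.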

In the Appendix, we prove the following results:

Proposition 1.3.

1. Under Assumptions 5(1)(b), 5(2)(b), and 6
$\sup_{x \in R} | P^{*} [\sqrt{T} ({\bar{γ}}_{B}^{*} - E^{*} ({\bar{γ}}_{B}^{*})) ⩽ x] - P [\sqrt{T} ({\bar{γ}}_{B} - E ({\bar{γ}}_{B})) ⩽ x] | \overset{p}{\to} 0$
2. Under Assumptions 3, 5(1)(a), 5(1)(b), 5(2)(a), 5(2)(b), and 6
$\sup_{x \in R} | P^{*} [\sqrt{T} ({\bar{γ}}_{V}^{*} - E^{*} ({\bar{γ}}_{V}^{*})) ⩽ x] - P [\sqrt{T} ({\bar{γ}}_{V} - E ({\bar{γ}}_{V})) ⩽ x] | \overset{p}{\to} 0$
3. Under Assumptions 3, 5, and 6
$\sup_{x \in R} | P^{*} [\sqrt{T} ({\bar{γ}}_{M}^{*} - E^{*} ({\bar{γ}}_{M}^{*})) ⩽ x] - P [\sqrt{T} ({\bar{γ}}_{M} - E ({\bar{γ}}_{M})) ⩽ x] | \overset{p}{\to} 0$

where ${\bar{γ}}_{B}^{*}$, ${\bar{γ}}_{V}^{*}$ and ${\bar{γ}}_{M}^{*}$ are the stationary bootstrap counterparts of ${\bar{γ}}_{B}$, ${\bar{γ}}_{V}$ and ${\bar{γ}}_{M}$, $P^{*}$ denotes the probability measure induced by the stationary bootstrap, and $E^{*}$ is the expected value with respect to this probability measure.
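To illustrate how such a result might be used for a single pairwise comparison, the sketch below approximates a two-sided p-value for the null of a zero mean by comparing the observed statistic with the centered bootstrap distribution. It is a simplified illustration, not the multiple-testing procedures cited above. It applies directly to a series such as $γ_{B t}$; for the variance and MSE statistics, the sample means inside $γ_{V t}$ and $γ_{M t}$ would need to be recomputed within each resample. The function name and default values are hypothetical.

```python
import numpy as np


def stationary_bootstrap_pvalue(gamma, k=0.1, n_boot=999, seed=0):
    """Two-sided bootstrap p-value for H0: E(mean of gamma) = 0."""
    rng = np.random.default_rng(seed)
    gamma = np.asarray(gamma, dtype=float)
    T = len(gamma)
    stat = gamma.mean()
    boot_means = np.empty(n_boot)
    for b in range(n_boot):
        idx = np.empty(T, dtype=int)
        idx[0] = rng.integers(T)
        restart = rng.random(T) < k          # new block with probability k
        starts = rng.integers(T, size=T)
        for t in range(1, T):
            idx[t] = starts[t] if restart[t] else (idx[t - 1] + 1) % T
        boot_means[b] = gamma[idx].mean()
    # With circular blocks, the bootstrap expectation of the resampled mean
    # equals the sample mean, so centering at `stat` matches E*(.).
    centred = np.sqrt(T) * (boot_means - stat)
    return float(np.mean(np.abs(centred) >= np.sqrt(T) * abs(stat)))
```

Small p-values indicate that the observed average difference is large relative to the sampling variability implied by the bootstrap distribution.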

Notice that we are able to test hypotheses about equality of bias by assuming only mild moment and mixing conditions for the estimation errors. In particular, we require no assumptions about the intradaily or interdaily behavior of the efficient price or microstructure noise. Nor do we need to assume that $cov (θ_{t}, u_{i t - 1} - u_{j t - 1}) = cov (θ_{t - 1}, u_{i t} - u_{j t}) = cov (u_{k t}, u_{k t - 1}) = 0, k \in {i, j}$ (Assumption 2), or $cov (θ_{t}, u_{i t} - u_{j t}) = 0$ (Assumption 4). Consequently, this test may be applied to any pair of high-frequency estimators of quadratic variation. Furthermore, when comparing the bias of two estimators, one of the estimators could be the unbiased proxy from Assumption 1, in which case the test becomes a test of absolute, rather than comparative, bias. Thus, for example, RV estimators computed using a range of frequencies could be tested to determine a set of frequencies at which there is no evidence of bias, providing an alternative approach to the day-specific Hausman test proposed by Ait-Sahalia and Xiu (2019) that requires only very mild assumptions. To test the equality of variances of two estimators, we also require moment and weak dependence assumptions for the daily change in the quadratic variation $Δ θ_{t}$ (Assumptions 5(1)(a) and 5(2)(a)), we need to assume that $cov (θ_{t}, u_{i t - 1} - u_{j t - 1}) = cov (θ_{t - 1}, u_{i t} - u_{j t}) = cov (u_{k t}, u_{k t - 1}) = 0, k \in {i, j}$ (Assumption 2), and we need to assume that $cov (θ_{t}, u_{i t} - u_{j t}) = 0$ (Assumption 4), which restricts the range of estimators to which the test may be applied (see the discussion preceding Proposition 1.1). Finally, in addition to the assumptions required for the bias and variance tests, the unbiased proxy provided by Assumption 1 is also required for our test of equal MSE.

2 Monte Carlo Simulations

In this section, we perform simulation experiments to investigate the finite-sample performance of our proposed tests for equality of bias, error variance, and MSE. In particular, we consider three matters of interest. First, we wish to confirm that the asymptotic results in Section 1 provide good approximations to the finite-sample size of each of the statistics proposed. Second, we want to compare the finite-sample power of the statistics that we propose to that of the statistics proposed by Patton (2011). Third, since both our MSE statistic and the statistics proposed by Patton require the use of an unbiased proxy that may have a large variance, we wish to investigate the impact of changes in the variance of the proxy on the size and power of the statistics considered.

We model the latent daily quadratic variation, $θ_{t}$, using the following DGPs:

Exponential martingale (EM): $θ_{t} = e^{σ W_{t} - \frac{σ^{2} t}{2}}$ where $σ = 0.05$ and $W_{t}$ is a standard Brownian motion.
HAR-RV: This DGP was used by Corsi (2008). Let $R V_{D, t}$ denote RV (the square root of realized variance) on Day t, and let $R V_{W, t} = (1 / 5) \sum_{s = t - 4}^{t} R V_{D, s}$ ⁠, and $R V_{M, t} = (1 / 22) \sum_{s = t - 21}^{t} R V_{D, s}$ ⁠. Then $θ_{t} = c + β_{D} R V_{D, t - 1} + β_{W} R V_{W, t - 1} + β_{M} R V_{M, t - 1} + ε_{t}$ ⁠, where c = 0.781, $β_{D} = 0.372, β_{W} = 0.343, β_{M} = 0.224$ ⁠, and $ε_{t} \sim T N (0, σ)$ ⁠, with $σ = 0.5$ ⁠, where TN denotes a Truncated Normal distribution with a lower bound of $- (c + β_{D} R V_{D, t - 1} + β_{W} R V_{W, t - 1} + β_{M} R V_{M, t - 1})$ and an infinite upper bound. Corsi (2008) suggested the truncation of the left-tail of $ε_{t}$ to ensure the positivity of $θ_{t}$ ⁠. The parameter choices are those obtained by Corsi (2008) when estimating the model using S&P500 data. Let $n = 1, …, N$ index intraday returns, with N = 78 corresponding to 5-min increments over a 6.5-h trading day. Intraday returns are simulated via $r_{t, n} = \sqrt{θ_{t}} Z_{n}$ where $Z_{n} \sim N (0, 1)$ ⁠. For each t, these are the returns used to construct $R V_{D, t}$ ⁠. We initialize the model with $θ_{0} = c / (1 - β_{D} - β_{W} - β_{M})$ ⁠, and use 100 days as a burn-in period.
Two-factor Diffusion (TF): This two-factor diffusion model was used in Andersen, Bollerslev, and Meddahi (2005) and Bollerslev and Zhou (2002). Let $θ_{t} = \int_{t - 1}^{t} σ_{t} (τ) d τ$ where $σ_{t} {(τ)}^{2} = σ_{1 t} {(τ)}^{2} + σ_{2 t} {(τ)}^{2}, d σ_{1 t} {(τ)}^{2} = 0.5708 (0.3257 - σ_{1 t} {(τ)}^{2}) d t + 0.2286 σ_{1 t} (τ) d W_{1, t} (τ)$ ⁠, and $d σ_{2 t} {(τ)}^{2} = 0.0757 (0.1786 - σ_{2 t} {(τ)}^{2}) d t + 0.1096 σ_{2 t} (τ) d W_{2, t} (τ)$ ⁠. $W_{1, t} (τ)$ and $W_{2, t} (τ)$ are independent standard Brownian motions. We set $σ_{1 t} {(0)}^{2} = 0.3$ and $σ_{2 t} {(0)}^{2} = 0.2$ ⁠.
Jump Diffusion (JD): This jump diffusion model was used by Eraker, Johannes, and Polson (2003). Let $θ_{t} = \int_{t - 1}^{t} σ_{t} (τ) d τ$ with $d σ_{t} {(τ)}^{2} = 0.025 (0.5585 - σ_{t} {(τ)}^{2}) d t + σ_{t} (τ) ((- 0.504) (0.0896) d W_{1, t} (τ) + \sqrt{1 - {0.504}^{2}} (0.0896) d W_{2, t} (τ)) + χ_{v} d J_{v, t}$ ⁠, where $W_{1, t} (τ)$ and $W_{2, t} (τ)$ are independent standard Brownian motions, $J_{v, t}$ is a Poisson process with an intensity of 0.0055 and $χ_{v} \sim Exp (1.798)$ ⁠, where Exp denotes the Exponential distribution. We set $σ_{t} (0) = 0.5$ ⁠.
Rough Fractional Stochastic Volatility (RFSV): This DGP was used by Gatheral, Jaisson, and Rosenbaum (2018). $θ_{t} = \int_{t - 1}^{t} σ_{t} (τ) d τ$ where $d \log σ_{t} {(τ)}^{2} = - 0.0005 (\log σ_{t} {(τ)}^{2} - (- 5)) d t + 0.3 d W_{H, t} (τ)$ ⁠, and $W_{H, t} (τ)$ is fractional Brownian motion with a Hurst index of 0.14. We set $σ_{t} (0) = exp (- 5)$ ⁠.

We consider six different estimators of daily volatility computed on T = 500 days, which we denote as $x_{k t}$, $k = 0, …, 5$, $t = 1, …, 500$. In our simulations, $x_{0 t}$ will always be the “base case” estimator. We also generate an unbiased proxy ${\tilde{θ}}_{t}$. The proxy and the estimators are simulated by ${\tilde{θ}}_{t} = θ_{t} + {\tilde{u}}_{t}$ and $x_{k t} = θ_{t} + u_{k t}$ respectively, where the proxy error is simulated as ${\tilde{u}}_{t} = σ {\tilde{Z}}_{t}$ and the estimator errors are:

$u_{k t} = b_{k} σ + ζ_{k} 0.5 σ ({\tilde{Z}}_{t} \sqrt{w} + Z_{k, t} \sqrt{1 - w})$ (8)

where $({\tilde{Z}}_{t}, Z_{k, t}) \overset{iid}{\sim} N (0, I_{2})$, $σ = \sqrt{var (Δ θ_{t})}$, $t = 1, …, 500$ and $k = 0, …, 5$. Bias is introduced by setting $b_{k} \neq 0$ for $k = 1, …, 5$, while $ζ_{0} = 1$ yields the base-case variance. Excess variance over the base-case is obtained by setting $ζ_{k} > 1$ for $k = 1, …, 5$. The correlation coefficient between the proxy error and the estimator error is fixed at 0.5, which obtains by setting $\sqrt{w} = 0.5$.

This simulation setup is very similar in style to that of Patton (2011). In particular, the properties of the estimator and the proxy error are parameterized to provide a close agreement with the equivalent quantities in those simulations. It is worth noting that we also duplicated the simulation methodology of Patton (2011) and found the results to be qualitatively very similar to those reported here. In the interests of saving space, we do not report these results here, preferring instead the simulation design described above since it covers a much wider range of dependence structures.
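The error design in Equation (8) can be sketched in a few lines of Python. This is an illustration under stated assumptions rather than the authors' code: it uses the EM DGP for $θ_{t}$, takes the sample variance of $Δ θ_{t}$ for $σ$ (the text does not say whether a population or sample quantity is used), and the function names and the `xi` argument (a scale on the proxy error, equal to 1 in the base design) are hypothetical. Since the shared shock ${\tilde{Z}}_{t}$ enters $u_{k t}$ with weight $\sqrt{w}$ and the idiosyncratic shock with weight $\sqrt{1 - w}$, the correlation between ${\tilde{u}}_{t}$ and $u_{k t}$ equals $\sqrt{w}$, whatever the values of $b_{k}$ and $ζ_{k}$.

```python
import numpy as np


def simulate_em_theta(T, sigma_w=0.05, rng=None):
    """EM DGP: theta_t = exp(sigma_w * W_t - sigma_w**2 * t / 2), t = 1..T."""
    rng = np.random.default_rng() if rng is None else rng
    t = np.arange(1, T + 1)
    W = np.cumsum(rng.standard_normal(T))  # Brownian motion at integer times
    return np.exp(sigma_w * W - 0.5 * sigma_w**2 * t)


def simulate_proxy_and_estimators(theta, b, zeta, sqrt_w=0.5, xi=1.0, rng=None):
    """Draw the proxy and K estimators around theta following Equation (8).

    b, zeta : length-K sequences of bias and variance multipliers.
    xi      : hypothetical scale on the proxy error (1.0 gives u_tilde = sigma * Z_tilde).
    """
    rng = np.random.default_rng() if rng is None else rng
    theta = np.asarray(theta, dtype=float)
    b = np.asarray(b, dtype=float)
    zeta = np.asarray(zeta, dtype=float)
    T, K = theta.shape[0], b.shape[0]

    # Assumption: sigma is estimated by the sample std of the daily change in theta.
    sigma = np.sqrt(np.var(np.diff(theta)))

    z_tilde = rng.standard_normal(T)   # shock shared by proxy and estimators
    z = rng.standard_normal((T, K))    # estimator-specific shocks
    w = sqrt_w**2

    u_tilde = xi * sigma * z_tilde
    u = b * sigma + zeta * 0.5 * sigma * (
        z_tilde[:, None] * sqrt_w + z * np.sqrt(1.0 - w)
    )
    theta_tilde = theta + u_tilde
    x = theta[:, None] + u
    return theta_tilde, x, u_tilde, u
```

For the size experiments one would pass `b=[0, 0]` and `zeta=[1, 1]`; the power experiments change one entry of `b` or `zeta` at a time.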

The statistics examined are:

${\bar{γ}}_{B}$ for difference in bias (see Equation 5).
${\bar{γ}}_{V}$ for difference in error variance (see Equation 6).
${\bar{γ}}_{M}$ for difference in MSE (see Equation 7).
${\bar{γ}}_{P : MSE} = \frac{1}{T} \sum_{t = 1}^{T - 1} ({(x_{i t} - {\tilde{θ}}_{t + 1})}^{2} - {(x_{j t} - {\tilde{θ}}_{t + 1})}^{2})$ which is the statistic for the difference in MSE method due to Patton (2011).
${\bar{γ}}_{P : QLIKE} = \frac{1}{T} \sum_{t = 1}^{T - 1} (\frac{{\tilde{θ}}_{t + 1}}{x_{i t}} - \frac{{\tilde{θ}}_{t + 1}}{x_{j t}} + \ln (\frac{x_{i t}}{x_{j t}}))$ which is the statistic for the difference in QLIKE method due to Patton (2011).
${\bar{γ}}_{Inf} = \frac{1}{T} \sum_{t = 1}^{T} ({(x_{i t} - θ_{t})}^{2} - {(x_{j t} - θ_{t})}^{2})$ for the infeasible difference in MSE.
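The last three statistics in this list are fully specified by the formulas shown, so they can be written down directly. The following is a minimal sketch; the function names are hypothetical, the inputs are assumed to be equal-length arrays ordered by day, and the $1 / T$ scaling with a sum running to $T - 1$ follows the formulas exactly as printed. The first three statistics depend on Equations 5 to 7 and are not reproduced here.

```python
import numpy as np


def patton_mse_stat(x_i, x_j, theta_tilde):
    """Difference in MSE measured against the one-day-ahead proxy."""
    x_i, x_j, theta_tilde = (np.asarray(a, dtype=float) for a in (x_i, x_j, theta_tilde))
    T = x_i.shape[0]
    lead = theta_tilde[1:]  # proxy on day t + 1, paired with estimators on day t
    d = (x_i[:-1] - lead) ** 2 - (x_j[:-1] - lead) ** 2
    return d.sum() / T


def patton_qlike_stat(x_i, x_j, theta_tilde):
    """Difference in QLIKE measured against the one-day-ahead proxy.

    Requires strictly positive estimator values.
    """
    x_i, x_j, theta_tilde = (np.asarray(a, dtype=float) for a in (x_i, x_j, theta_tilde))
    T = x_i.shape[0]
    lead = theta_tilde[1:]
    d = lead / x_i[:-1] - lead / x_j[:-1] + np.log(x_i[:-1] / x_j[:-1])
    return d.sum() / T


def infeasible_mse_stat(x_i, x_j, theta):
    """Difference in MSE against the latent theta (available only in simulation)."""
    x_i, x_j, theta = (np.asarray(a, dtype=float) for a in (x_i, x_j, theta))
    return np.mean((x_i - theta) ** 2 - (x_j - theta) ** 2)
```

A negative value indicates a smaller average loss for estimator $i$ than for estimator $j$.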

In our simulations, we test null hypotheses that each of these statistics has an expected value of zero. The first three of these statistics are those that we propose in Section 1. The fourth and fifth are the statistics proposed by Patton (2011) for testing the difference in MSE and QLIKE for pairs of volatility estimators, assuming that the underlying latent daily volatility follows a simple random walk. The sixth statistic is the statistic that we would use if the daily volatility were actually observable. It represents an upper limit on the possible performance of the other statistics. We simulate 5000 samples of size T = 500 and, for each sample, we compute the daily value of the base case volatility estimator $x_{0 t}$ and each of the estimators $x_{1 t}, …, x_{5 t}$ ⁠. We then calculate each of the six statistics listed above using $x_{0 t}$ and, in turn, each of $x_{1 t}, …, x_{5 t}$ ⁷, and conduct t-tests using the stationary bootstrap of Politis and Romano (1994) with the corrected block length selection procedure of Politis and White (2004) and Patton, Politis, and White (2009), implemented with the bandwidth selection procedure of Politis (2003). Using each statistic, we record rejection rates computed using standard critical values for a 5% significance level, which we use to estimate the size of each statistic and its power to detect a range of departures from the null hypothesis.
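The testing step can be illustrated with a simplified stationary bootstrap. The sketch below is one reading of the procedure, not the authors' implementation: it takes a series `d` of daily contributions to a statistic, resamples it in blocks of geometrically distributed length with circular wrapping, uses the bootstrap standard deviation of the resampled means as the standard error, and compares the resulting t-ratio with a standard normal critical value. The mean block length is a user-supplied, hypothetical parameter here, whereas the text selects it automatically; all function names are illustrative.

```python
import numpy as np


def stationary_bootstrap_indices(T, mean_block, rng):
    """Index sequence for one stationary bootstrap resample of length T."""
    p = 1.0 / mean_block               # probability of starting a new block
    new_block = rng.random(T) < p
    starts = rng.integers(T, size=T)   # candidate block starting points
    idx = np.empty(T, dtype=int)
    idx[0] = starts[0]
    for t in range(1, T):
        # Either jump to a fresh random start or continue the block, wrapping at T.
        idx[t] = starts[t] if new_block[t] else (idx[t - 1] + 1) % T
    return idx


def bootstrap_t_test(d, mean_block=10.0, n_boot=999, crit=1.96, seed=0):
    """Two-sided test of E[d_t] = 0 with a stationary-bootstrap standard error."""
    d = np.asarray(d, dtype=float)
    T = d.shape[0]
    rng = np.random.default_rng(seed)
    boot_means = np.empty(n_boot)
    for b in range(n_boot):
        boot_means[b] = d[stationary_bootstrap_indices(T, mean_block, rng)].mean()
    se = boot_means.std(ddof=1)        # bootstrap standard error of the sample mean
    t_stat = float(d.mean() / se)
    return t_stat, bool(abs(t_stat) > crit)
```

Repeating `bootstrap_t_test` over many simulated samples and averaging the rejection indicator gives a rejection frequency of the kind reported in Table 1.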

In order to estimate the size of each statistic under the different DGPs for daily volatility, we compute each statistic using $x_{0 t}$ and $x_{1 t}$ using the parameter values $b_{1} = 0$ and $ζ_{1} = 1$ since this corresponds to the case where $x_{0 t}$ and $x_{1 t}$ have the same bias, error variance, and MSE. We estimate the power of each test statistic to detect two departures from this null hypothesis: difference in the biases, and difference in the variances. In order to estimate the power to detect difference in the biases, we compute the estimators $x_{2 t}, …, x_{5 t}$ using the parameter values $ζ_{2} = ζ_{3} = ζ_{4} = ζ_{5} = 1$ and $b_{2} = 1.5, b_{3} = 3, b_{4} = 4.5$ and $b_{5} = 6$ ⁠. To estimate the power to detect difference in the variances, we compute the estimators $x_{2 t}, …, x_{5 t}$ using the parameter values $ζ_{2} = 1.125, ζ_{3} = 1.25, ζ_{4} = 1.375$ and $ζ_{5} = 1.5$ and $b_{2} = b_{3} = b_{4} = b_{5} = 0$ ⁠. We report the rejection rates in Table 1.

Table 1

Rejection frequencies

$b_{k}$:	0	1.5	3	4.5	6	0	0	0	0
$ζ_{k}$:	1	1	1	1	1	1.125	1.25	1.375	1.5
EM

${\bar{γ}}_{B}$	0.05	0.64	0.92	0.98	1.00	0.05	0.05	0.05	0.05
${\bar{γ}}_{V}$	0.06	0.05	0.06	0.06	0.06	0.27	0.74	0.96	1.00
${\bar{γ}}_{Inf}$	0.06	0.10	0.28	0.47	0.65	0.77	1.00	1.00	1.00
${\bar{γ}}_{M}$	0.06	0.08	0.18	0.31	0.45	0.27	0.74	0.96	1.00
${\bar{γ}}_{P : MSE}$	0.06	0.06	0.10	0.17	0.27	0.12	0.33	0.62	0.83
${\bar{γ}}_{P : QLIKE}$	0.04	0.05	0.06	0.10	0.17	0.10	0.29	0.54	0.74

HAR-RV

${\bar{γ}}_{B}$	0.05	0.74	1.00	1.00	1.00	0.05	0.05	0.06	0.06
${\bar{γ}}_{V}$	0.06	0.06	0.07	0.07	0.06	0.25	0.67	0.93	0.99
${\bar{γ}}_{Inf}$	0.05	0.06	0.16	0.49	0.90	0.77	1.00	1.00	1.00
${\bar{γ}}_{M}$	0.07	0.07	0.09	0.17	0.38	0.25	0.67	0.93	0.99
${\bar{γ}}_{P : MSE}$	0.05	0.06	0.05	0.07	0.15	0.12	0.34	0.61	0.84
${\bar{γ}}_{P : QLIKE}$	0.05	0.05	0.05	0.04	0.06	0.12	0.34	0.60	0.84

Two Factor Diffusion

${\bar{γ}}_{B}$	0.05	0.83	1.00	1.00	1.00	0.05	0.05	0.05	0.05
${\bar{γ}}_{V}$	0.06	0.06	0.07	0.06	0.06	0.28	0.74	0.96	1.00
${\bar{γ}}_{Inf}$	0.05	0.06	0.21	0.68	0.98	0.79	1.00	1.00	1.00
${\bar{γ}}_{M}$	0.05	0.06	0.11	0.25	0.57	0.28	0.73	0.96	1.00
${\bar{γ}}_{P : MSE}$	0.05	0.05	0.06	0.10	0.19	0.13	0.32	0.62	0.83
${\bar{γ}}_{P : QLIKE}$	0.06	0.06	0.05	0.04	0.04	0.12	0.33	0.63	0.84

JD

${\bar{γ}}_{B}$	0.05	1.00	1.00	1.00	1.00	0.05	0.05	0.05	0.05
${\bar{γ}}_{V}$	0.05	0.04	0.04	0.04	0.04	0.34	0.75	0.90	0.96
${\bar{γ}}_{Inf}$	0.05	0.92	0.98	1.00	1.00	0.77	1.00	1.00	1.00
${\bar{γ}}_{M}$	0.04	0.88	0.95	0.98	0.99	0.34	0.75	0.90	0.96
${\bar{γ}}_{P : MSE}$	0.05	0.83	0.91	0.95	0.97	0.14	0.39	0.66	0.85
${\bar{γ}}_{P : QLIKE}$	0.02	0.12	0.19	0.26	0.34	0.02	0.04	0.06	0.09

RFSV

${\bar{γ}}_{B}$	0.05	0.98	1.00	1.00	1.00	0.05	0.06	0.05	0.06
${\bar{γ}}_{V}$	0.06	0.06	0.06	0.06	0.05	0.27	0.73	0.95	1.00
${\bar{γ}}_{Inf}$	0.05	0.75	0.93	0.97	0.99	0.78	1.00	1.00	1.00
${\bar{γ}}_{M}$	0.06	0.61	0.85	0.94	0.97	0.28	0.73	0.95	1.00
${\bar{γ}}_{P : MSE}$	0.05	0.48	0.77	0.88	0.93	0.12	0.34	0.62	0.84
${\bar{γ}}_{P : QLIKE}$	0.02	0.08	0.12	0.16	0.20	0.03	0.04	0.06	0.10

Consider first the estimated sizes of the statistics given by the first column of data in Table 1 ($b_{k} = 0$, $ζ_{k} = 1$). Since the rejection rates were computed using a 5% critical value, the fact that almost all of the entries in this column are close to 0.05 indicates that the statistics have good size for the DGPs considered. The only exceptions are ${\bar{γ}}_{M}$, which is slightly oversized for the HAR-RV DGP, and ${\bar{γ}}_{P : QLIKE}$, which is slightly undersized for the JD and RFSV DGPs.

The estimates of power show considerably more variation. Consider the alternative hypotheses in which the two volatility estimators have equal variances but different biases. The relevant statistics are in columns 2–5 of Table 1 ($b_{k} \in \{1.5, 3, 4.5, 6\}$, $ζ_{k} = 1$). As might be expected, ${\bar{γ}}_{B}$ has the most power since it directly measures the mean difference in bias. Among the other statistics, with the exception of the infeasible test statistic ${\bar{γ}}_{Inf}$, ${\bar{γ}}_{M}$ has the most power when testing against the alternative hypothesis of different biases for all the DGPs. Of particular interest is the fact that ${\bar{γ}}_{M}$ exhibits considerably more power than ${\bar{γ}}_{P : MSE}$ in this context. Note that ${\bar{γ}}_{P : QLIKE}$ generally has very poor power when testing against the alternative hypothesis of different biases. Note also that the rejection rate of ${\bar{γ}}_{V}$ remains appropriately close to 0.05 in the presence of bias, indicating that this statistic does not mistake a difference in bias for a difference in variance.

Now consider the power of the statistics in the context where both volatility estimators have the same bias, but different variances. The relevant statistics are in columns 6–9 of Table 1 ($b_{k} = 0$, $ζ_{k} \in \{1.125, 1.25, 1.375, 1.5\}$). Note that ${\bar{γ}}_{M}$ and ${\bar{γ}}_{V}$ have nearly identical power. Also, both exhibit considerably more power than ${\bar{γ}}_{P : MSE}$. Furthermore, the rejection rate of ${\bar{γ}}_{B}$ remains very close to 0.05 in the context of volatility estimators with different error variances but the same bias. This, and the corresponding result for ${\bar{γ}}_{V}$ in the context of bias, confirm that these two statistics are capable of determining the extent to which differences in MSE are due to differences in bias or differences in variance. Note that ${\bar{γ}}_{P : QLIKE}$ has some power to reject the null hypothesis when the volatility estimators have different variances for the EM, HAR-RV, and two-factor diffusion models, but is clearly inferior to ${\bar{γ}}_{M}$. Also, it has comparatively very poor power for the JD and RFSV models.

In order to investigate the impact of changes in the variance of the proxy, we repeat the analysis from Table 1 for the EM DGP but model the proxy error using ${\tilde{u}}_{t} = ξ σ {\tilde{Z}}_{t}$, with $ξ \in \{0.25, 4\}$ corresponding to low and high proxy error variance, respectively. The results are in Table 2. The right-hand side of this table (different variances) shows little impact on ${\bar{γ}}_{M}$, though the left-hand side shows a mild loss of power for our statistic under the alternative of different biases but identical variances. In contrast, the high proxy error variance environment is disastrous for the power of Patton’s statistic, under both the alternatives of different biases and different variances. It is worth emphasizing that in practice, typical choices of proxy exhibit high error variance, not low, so the results in Table 2 are of practical importance.

Table 2

Rejection frequencies comparing low and high proxy error variance given the EM DGP

	$b_{k}$:	0	0.25	0.5	0.75	1	0	0	0	0
	$ζ_{k}$:	1	1	1	1	1	1.125	1.25	1.375	1.5
ξ = 0.25	${\bar{γ}}_{M}$	0.06	0.08	0.18	0.32	0.46	0.27	0.73	0.97	1.00
	${\bar{γ}}_{P : MSE}$	0.06	0.06	0.11	0.23	0.36	0.18	0.53	0.83	0.97
ξ = 4	${\bar{γ}}_{M}$	0.06	0.10	0.19	0.29	0.37	0.27	0.72	0.95	1.00
	${\bar{γ}}_{P : MSE}$	0.05	0.06	0.06	0.08	0.11	0.06	0.09	0.12	0.18

3 An Empirical Study

Previous empirical work on this topic can be found in Patton (2011), and importantly, Liu, Patton, and Sheppard (2015). The latter applies the methods proposed in Patton (2011) to a comprehensive set of intraday volatility estimators, across a wide range of financial time series, and finds that, on balance, it is difficult to beat 5-min RV given a QLIKE loss function. This result has subsequently been cited as a motivating factor for modeling choices in a range of studies (see, among many others, Bollerslev et al. 2018; Lahaye and Neely 2020; Dhaene and Wu 2020).

Because of their focus on QLIKE, the results of Liu, Patton, and Sheppard (2015) are not directly comparable with those presented in this article (which are based on the MSE). This is because QLIKE and MSE have significantly different shapes, and so in practice are likely to prefer different estimators. In particular, QLIKE is heavily asymmetric—the left tail is penalized more heavily than the right. This means that, given some fixed b > 0 and the two quantities $θ_{t} - b$ and $θ_{t} + b$ ⁠, QLIKE minimization will choose the latter, since it lies in the right tail of the loss function, while the former lies in the left. In contrast, being symmetric, the MSE loss function is indifferent between the two quantities.
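The asymmetry can be checked numerically. The sketch below assumes the QLIKE loss takes the form $θ / x + \ln x$, which is the form implied by the QLIKE statistic in Section 2; the values $θ = 1$ and $b = 0.5$ are arbitrary illustrative choices.

```python
import math


def qlike(theta, x):
    """QLIKE loss of an estimate x for a target theta (assumed form: theta/x + ln x)."""
    return theta / x + math.log(x)


def mse(theta, x):
    """Squared-error loss of an estimate x for a target theta."""
    return (x - theta) ** 2


theta, b = 1.0, 0.5
under, over = theta - b, theta + b

# QLIKE penalizes the underestimate more heavily than the overestimate ...
qlike_gap = qlike(theta, under) - qlike(theta, over)
# ... while squared error treats the two symmetrically.
mse_gap = mse(theta, under) - mse(theta, over)
```

Here `qlike_gap` is positive and `mse_gap` is zero, so a QLIKE comparison favors the estimate $θ_{t} + b$ while an MSE comparison is indifferent.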

We apply our test statistics to fifty of the most liquid securities listed on the NYSE.⁸ We obtained 1-s transaction data⁹ from Refinitiv Tick History¹⁰ for each of these securities for the period January 2010 to December 2018.¹¹ The data are pre-cleaned by Refinitiv, but we also implement the cleaning procedures described in Barndorff-Nielsen et al. (2009). Using Kevin Sheppard’s MFE Toolbox,¹² we constructed the following intraday volatility estimators across all securities and days: RV, TSRV, MSRV, RKs, and PARV. We also wrote code to compute the QMLE. For each estimator, the input data is transaction prices indexed by a 1-s partition¹³ that spans the market open to the market close, so there are 23,400 observations per day. We used sampling frequencies for RV of 5, 20, 60, 120, 300, 600, and 900 s. The “fast” scale for TSRV was 1 s, and we set the range of subsample frequencies to 5, 20, 60, 120, 300, 600, and 900 s. Similarly, we computed MSRV using 1-s data with the number of scale frequencies set to 5, 20, 60, 120, 300, 600, and 900, RK with 1-s data, the non-flat-top Parzen kernel, and bandwidths of 5, 20, 60, 120, 300, 600, and 900, and PARV with 1-s data and preaveraging window widths of 5, 20, 60, 120, 300, 600, and 900 s. QMLE is computed using sampling frequencies of 5, 20, 60, 120, 300, 600, and 900 s. For efficiency of expression, in what follows, we refer to 5, 20, 60, 120, 300, 600, and 900 collectively as “seconds,” irrespective of the property of the estimator to which they refer. For each estimator except the QMLE, we also compute the optimal configuration using the default method suggested in the MFE Toolbox.
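As a point of reference for the role of the sampling frequency, the simplest estimator in this list can be written in a few lines. This is a textbook sketch of realized variance computed from a regularly spaced 1-s log-price series, not the MFE Toolbox implementation used in the study; the function name and the `step` argument (the sampling interval in seconds) are hypothetical.

```python
import numpy as np


def realized_variance(log_prices, step):
    """Sum of squared log returns, sampling the 1-s series every `step` observations."""
    log_prices = np.asarray(log_prices, dtype=float)
    sampled = log_prices[::step]   # keep every step-th price, starting at the open
    returns = np.diff(sampled)
    return float(np.sum(returns ** 2))
```

For example, `realized_variance(p, 300)` would correspond to 5-min RV when `p` holds one trading day of 1-s log prices.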

In their Supplementary material, Liu, Patton, and Sheppard (2015) note that, using the method of Patton (2011) with the MSE loss function, they are unable to reject, for any estimator, the null hypothesis that it fails to outperform 5-min RV. They attribute this to a lack of power in Patton’s statistic given an MSE loss function.¹⁴ This therefore seems an ideal question to investigate with our more powerful MSE statistic and our ability to distinguish differences in bias from differences in error variance.

Following Liu, Patton, and Sheppard (2015), we set 5-min RV as the base-case, and use 30-min RV as the unbiased proxy. Given the 50 securities and the wide range of frequencies for RV, subsample frequencies for TSRV, number of scale frequencies for MSRV, bandwidths for RK, and preaveraging window lengths for PARV, we have a total of 2300 null hypotheses. Since classical testing procedures are likely to produce a large number of spurious rejections of the null hypotheses, we use a testing procedure that controls the false discovery proportion (FDP). Specifically, we use the generalized step-wise procedure of Hsu, Hsu, and Kuan (2014), which is a modification of Romano and Wolf’s (2007) method that incorporates the sample-dependent null distribution proposed by Hansen (2005) for the SPA test. We perform the procedure across all securities simultaneously for each estimator, although we note that the results are qualitatively the same when a test is performed on each security individually. We use a significance level of 0.05 and set the FDP at 0.1. Therefore, the test is designed so that the probability that the proportion of rejected null hypotheses that are false discoveries is more than 10% is controlled to be less than 0.05.
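The total of 2300 hypotheses is not itemized in the text. One accounting that reproduces it, offered here as an assumption rather than the authors' stated breakdown, takes the six estimators at the seven “seconds” settings, adds the default “optimal” configuration for every estimator except the QMLE, and drops 5-min RV itself because it is the base case.

```python
# Hypothetical itemization of the hypothesis count; labels are illustrative.
estimators = ["RV", "TSRV", "MSRV", "RK", "PARV", "QMLE"]
seconds = [5, 20, 60, 120, 300, 600, 900]
with_optimal = ["RV", "TSRV", "MSRV", "RK", "PARV"]  # no optimal setting for QMLE

configs = [(e, s) for e in estimators for s in seconds]
configs += [(e, "Opt") for e in with_optimal]
configs.remove(("RV", 300))  # 5-min RV is the base case, so it is not tested

n_securities = 50
n_hypotheses = len(configs) * n_securities  # 46 configurations per security
```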

When testing the null hypothesis of equal or worse MSE than 5-min RV using Patton’s statistic (⁠ ${\bar{γ}}_{P : MSE}$ ⁠), we are unable to reject the null for any of the estimators, bandwidths or stocks that we considered—a result that is broadly consistent with that of Liu, Patton, and Sheppard (2015). In contrast, the results when using our more powerful statistic (⁠ ${\bar{γ}}_{M}$ ⁠) tell a different story. As can be seen in Table 3, there are rejections for at least some stocks at some frequencies for all the estimators. In the cases of the RV estimator and the QMLE, the number of rejections is very small and might be considered inconsequential. For the other estimators, there are many rejections. The standout case is the PARV with a preaveraging window width of 300 s, for which the null hypothesis of equal or worse MSE than 5-min RV is rejected for 15 of the 50 stocks.

The rejections in Table 3 can be further understood in terms of the bias and variance of the underlying estimation errors. Table 4 contains the number of rejections of equal or greater error variance than 5-min RV. As expected, we find some evidence that, compared to 5-min RV, the RV estimator has a lower variance when computed using frequencies higher than 5 min, and no evidence that it has a lower variance when computed using a frequency lower than 5 min. Also, since we compute all the TSRV, MSRV, RK, and PARV estimators using 1-s data, it is unsurprising that we find evidence of smaller variance than 5-min RV for many stocks over a wide range of time scales, bandwidths, and preaveraging window widths. With the exception of a handful of stocks at the highest frequencies, there is no evidence of QMLE having a lower variance than 5-min RV.

Table 4

Number of rejections of null of equal or greater error variance than RV-5 min using ${\bar{γ}}_{V}$ (Our variance statistic)

Bandwidth	5	20	60	120	300	600	900	Opt
RV	9	7	2	1		0	0	0
TSRV	10	7	10	13	10	1	1	10
MSRV	10	6	8	9	7	1	1	10
RK	11	8	6	7	2	0	0	5
PARV	10	8	7	11	18	2	1	7
QMLE	7	4	0	0	0	0	0

Note: The table elements state the number of securities for which the null of equal or larger error variance than 5-min RV is rejected (max = 50).

To investigate bias, we set 30-min RV as the base-case, under the assumption that it is unbiased, and we use ${\bar{γ}}_{B}$ to test the null of zero or negative bias, and $- 1 {\bar{γ}}_{B}$ to test the null of zero or positive bias. The results are in Table 5. Theory suggests that RV should exhibit positive bias at any frequency at which microstructure noise is not eliminated, so it is not surprising that we have numerous rejections of the null of zero or negative bias at all frequencies from 5 to 900 s. TSRV is positively biased when the second time scale is small and becomes negatively biased as the second time scale increases over 300. Similarly, MSRV is positively biased for a small number of time scales and becomes negatively biased as the number of time scales increases over 300. PARV is positively biased for narrow averaging windows and becomes negatively biased as the window width increases past 300 s. Consequently, if judged purely on the basis of bias, TSRV, MSRV, and PARV with the second time scale, number of time scales, or preaveraging window width of around 300 are clearly preferred to 5-min RV since they are approximately unbiased, and 5-min RV is positively biased. The fact that we find evidence that these estimators also have a lower variance than 5-min RV for many stocks (see Table 4), strengthens the case for their use and explains their good performance in terms of the MSE. In contrast, the results in Table 5 suggest that RK is positively biased across the range of bandwidths considered, and QMLE is mostly positively biased.

Table 5

Number of rejections of null of zero or negative bias, or zero or positive bias, using ${\bar{γ}}_{B}$

Bandwidth	5	20	60	120	300	600	900	Opt
${\bar{γ}}_{B}$
RV	49	50	50	50	45	44	31	41
TSRV	49	50	43	28	7	0	0	46
MSRV	50	50	44	26	6	0	0	49
RK	48	50	50	46	42	33	26	42
PARV	46	50	47	29	7	0	0	47
QMLE	50	49	42	33	21	1	8
$- 1 {\bar{γ}}_{B}$
RV	0	0	0	0	0	0	0	0
TSRV	0	0	0	0	7	39	49	0
MSRV	0	0	0	0	7	43	49	0
RK	0	0	0	0	0	0	0	0
PARV	0	0	0	0	3	40	49	0
QMLE	0	0	0	0	0	3	1

Note: For ${\bar{γ}}_{B}$ ⁠, the table elements state the number of securities for which the null of zero or negative bias is rejected (max = 50), while for $- 1 {\bar{γ}}_{B}$ the table elements state the number of securities for which the null of zero or positive bias is rejected (max = 50).
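The one-sided logic behind these bias tests can be sketched in a few lines. The block below is only an illustration under stated assumptions, not the procedure used to produce Table 5: it treats ${\bar{γ}}_{B}$ as the sample mean of the daily difference between an estimator and the base-case estimator (which equals the mean of $u_{it} - u_{jt}$ when both estimators target the same quantity), and it obtains a single-pair p-value from a stationary bootstrap in the spirit of Politis and Romano (1994). The function names, the block-restart probability `p`, and the number of replications are hypothetical choices, and the multiple-testing step that controls the FDP across stocks is omitted.

```python
import random
from statistics import fmean


def gamma_bar_b(x_i, x_base):
    """Sample mean of the daily difference x_i[t] - x_base[t]."""
    return fmean([a - b for a, b in zip(x_i, x_base)])


def stationary_bootstrap_indices(T, p, rng):
    """Resample day indices in blocks of geometric length (mean 1/p), wrapping circularly."""
    idx = [rng.randrange(T)]
    while len(idx) < T:
        if rng.random() < p:
            idx.append(rng.randrange(T))      # start a new block at a random day
        else:
            idx.append((idx[-1] + 1) % T)     # continue the current block
    return idx


def one_sided_pvalue(x_i, x_base, p=0.1, n_boot=999, seed=0):
    """Bootstrap p-value for the null of zero or negative bias relative to x_base.

    The bootstrap statistic is recentred at the observed statistic, so the
    resampled deviations approximate the sampling error around the true mean.
    """
    rng = random.Random(seed)
    d = [a - b for a, b in zip(x_i, x_base)]
    T = len(d)
    stat = fmean(d)
    exceed = 0
    for _ in range(n_boot):
        idx = stationary_bootstrap_indices(T, p, rng)
        boot_stat = fmean([d[t] for t in idx])
        if boot_stat - stat >= stat:
            exceed += 1
    return stat, (1 + exceed) / (1 + n_boot)
```

Swapping the two arguments negates the statistic, which corresponds to testing the null of zero or positive bias with $-1 {\bar{γ}}_{B}$.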

A final point of interest: as previously mentioned, using the method of Patton (2011) with a significance level of 0.05 and an FDP of 0.1, we are unable to reject the null hypothesis of equal or worse MSE than 5-min RV for all estimators and almost all stocks under consideration. However, it is instructive to re-examine these results after raising the significance level to 0.1. As can be seen in Table 6, the more generous significance level allows Patton’s method to reject the null in some cases. Comparing the results to Table 3, we can see that the pattern of rejections is similar. That is, Patton’s method produces a couple of rejections of the null at high frequencies for the RV and QMLE, produces many more rejections for the other estimators, and produces the most rejections for the PARV with a preaveraging window width of 300 s. These similarities are reassuring given the different identification assumptions made by the two methods, and confirm that the differences in the results generated by the two statistics in Table 3 are likely to be due to the differences in power that were reported in Section 2.

4 Concluding Comments

This article considers the problem of choosing an estimator of quadratic variation in empirical applications. We have proposed tests for the equality of bias, error variance, and MSE for pairs of estimators. These tests may be used to construct model confidence sets or may be implemented in multiple hypothesis testing procedures that control the (generalized) family-wise error rate or the FDP. Amongst other things, our test of bias may be used to determine frequencies at which the RV estimator is contaminated by microstructure noise. In this setting, it may be viewed as an alternative to the Hausman test proposed by Ait-Sahalia and Xiu (2019). Our approach requires only mild moment and mixing conditions for the estimation errors, whereas the test of Ait-Sahalia and Xiu (2019) places restrictions on the intradaily price process and microstructure noise. However, the test of Ait-Sahalia and Xiu (2019) applies to a single day, whereas our approach is a test of equal average bias across a large number of days. For this reason, the tests are best viewed as complements rather than substitutes.

Our test of equal MSE has a direct competitor in the test proposed by Patton (2011). An advantage of our test is that it makes only mild assumptions about the structure of interdaily quadratic variation and some further mild assumptions about intradaily efficient prices and microstructure noise. In contrast, Patton (2011) assumes that the daily quadratic variation follows a specific process, that is, a simple random walk. An important practical difference between the two tests is that ours appears to have significantly more power. Of course, our test applies only to a particular set of estimators that satisfy Assumption 4 and applies only to the MSE, whereas Patton’s approach applies to any estimators subject to some moment and mixing conditions being satisfied, and may also be applied to the QLIKE loss function. For this reason, again, we view our MSE test as being a complement to existing work, rather than a substitute. Importantly, our ability to test for equality of bias and error variance provides some insight into why particular estimators have a lower MSE.

Empirically, we find evidence that 5-min RV is often beaten (in terms of MSE) by some noise-robust estimators, with PARV, TSRV, and MSRV showing the best performance in our application. This finding is in contrast to the widely cited article by Liu, Patton, and Sheppard (2015), who found little evidence of anything beating 5-min RV. The apparent reason for the different findings is that our test appears to be significantly more powerful given an MSE loss function. We also find that, when configured appropriately, the PARV, TSRV, and MSRV are approximately unbiased. In contrast, 5-min RV is positively biased. While these results do not invalidate the use of 5-min RV, they do suggest that the standard practice of using 5-min RV without giving serious consideration to alternatives should be reconsidered.

In combination, this article, Patton (2011), and Ait-Sahalia and Xiu (2019) provide a suite of tests to help researchers choose from the wide range of available estimators of quadratic variation. While Liu, Patton, and Sheppard (2015) is widely cited, authors typically use it to justify their use of 5-min RV in their research. We know of no replication studies of their article, and there exist few other published applications of the work of Patton (2011) and Ait-Sahalia and Xiu (2019). Consequently, there exists considerable scope for further research in this field. While we have found evidence that there exist estimators that are empirically superior to 5-min RV in some applications, many questions remain unanswered. Comparisons of results across different asset classes, different markets, and different time periods; comparisons between highly liquid assets and less liquid assets; comparisons of different methods of computing optimal parameterizations of estimators; and comparisons of estimators computed using prices in calendar time and tick time at different frequencies may reveal empirical regularities that could provide guidance to applied researchers. We hope that future research will tackle these tasks.

Footnotes

Examples include Sévi (2014), Bollerslev et al. (2018), Gong and Lin (2017), Xu et al. (2019), Wen et al. (2019), and Gkillas, Gupta, and Pierdzioch (2020), but many more may be found by searching on Google Scholar.

The result holds exactly for the TSRV with the small-sample adjustment given by Equation (64) in Zhang et al. (2005b). In the absence of the small-sample adjustment, the result holds asymptotically as the number of grids used for subsampling grows.

The result for PARV holds asymptotically. This is proved in Lemma 5.1 in the Appendix. In the subsequent remark, we argue that this is likely to provide a good approximation in applications.

Strictly speaking, this would be an approximately unbiased proxy due to the likely presence of a very small but non-zero drift. However, the impact of this is negligible. Patton (2011) and Liu, Patton, and Sheppard (2015) also use this approach.

Specifically, Meddahi (2002) allows for $W_{t} (τ)$ to be correlated with a Brownian motion that determines the stochastic behavior of $σ_{t} (τ)$ ⁠.

In Supplementary Appendix A, we provide an example of this in which we show that our method may not work well with the realized range estimator of Martens and Van Dijk (2007) and Christensen and Podolskij (2007), and the realized quantile estimator of Christensen, Oomen, and Podolskij (2010). In contrast, our method may be useful in some circumstances for comparisons of the minRV and medRV estimators of Andersen, Dobrev, and Schaumburg (2012), the Bipower Variation (BPV) estimator of Barndorff-Nielsen and Shephard (2004), and the Preaveraged BPV estimator of Podolskij and Vetter (2009). Importantly, the simulations show that if the true DGP includes a jump component then Assumption 4 will be significantly violated when comparing an estimator of quadratic variation with an estimator of integrated variance (in this case, quadratic variation and integrated variance are different quantities).

That is, we compute each of the statistics using the pairs $\{x_{0t}, x_{1t}\}, \{x_{0t}, x_{2t}\}, …, \{x_{0t}, x_{5t}\}$.

Formerly known as Thomson Reuters Tick History.

Across all stocks, we remove 12 days from the sample due to shut-downs, technical glitches, and flash crashes. The dates are 2010-05-06, 2011-08-08, 2012-08-01, 2013-04-23, 2013-08-22, 2014-10-30, 2014-11-25, 2015-07-08, 2015-07-09, 2015-08-24, 2015-08-25, and 2016-05-18.

https://github.com/bashtage/mfe-toolbox/.

This is sometimes referred to as calendar-time sampling.

See Footnote 27 of Liu, Patton, and Sheppard (2015).

ABT, AIG, APA, AXP, BAC, BMY, C, CAT, COF, COP, CVS, CVX, DE, DIS, EOG, F, FCX, GE, HAL, HD, HON, IBM, JNJ, KO, LLY, LMT, LOW, MCD, MDT, MET, MMM, MO, MRK, NEM, NKE, OXY, PFE, PG, SLB, SPG, TGT, UNH, UNP, UPS, USB, UTX, VZ, WFC, X, and XOM.

Table 4 in Liu, Patton, and Sheppard (2015) broadly recommends 1-s calendar sampled data across most estimators and securities.

Appendix

Lemma 5.1.

The PARV estimator can be expressed via the quadratic form:

$$x_{kt} = \sum_{n = 1}^{N} \sum_{m = 1}^{N} a_{kmn} {\tilde{r}}_{nt} {\tilde{r}}_{mt}, \tag{A.1}$$

where $a_{knn} \to 1$, $n = K, …, N - K$, as K increases, and tapering to 0 in the end-points.

Proof. The t subscript is not relevant and so is omitted during the proof. The preaveraged returns are:

$${\bar{r}}_{n} = \sum_{i = 1}^{k} g\left(\frac{i}{k}\right) r_{n + i - 1}, \quad n = 1, …, N - k + 1, \tag{A.2}$$

where $g : [0, 1] \to \mathbb{R}$ is a piecewise continuously differentiable function with $g(0) = g(1) = 0$ such that $g'$ is piecewise Lipschitz. The preaveraged RV estimator is then defined as:

$$PRV^{(k)} = \frac{1}{ψ_{2} k} \sum_{n = 1}^{N - k + 1} {\bar{r}}_{n}^{2} - \frac{ψ_{1}}{2 N ξ^{2} ψ_{2}} \sum_{n = 1}^{N} r_{n}^{2}, \tag{A.3}$$

where $ψ_{1} = \int_{0}^{1} {(g'(x))}^{2} dx$ and $ψ_{2} = \int_{0}^{1} {(g(x))}^{2} dx$. We set $ξ = \frac{k}{\sqrt{N}}$.

Define:

$$h(x) = \begin{cases} g(x) & \text{if } 0 ⩽ x ⩽ 1 \\ 0 & \text{otherwise} \end{cases} \tag{A.4}$$

The coefficients of the quadratic form of the estimator are:

$$a_{k(n + j)n} = a_{kn(n + j)} = \frac{1}{ψ_{2} k} \sum_{i = 1}^{\min(n, k)} h\left(\frac{i}{k}\right) h\left(\frac{i + j}{k}\right) - \mathbb{1}(j = 0) \frac{ψ_{1}}{2 N ξ^{2} ψ_{2}}, \tag{A.5}$$

for $n = 1, …, N - k + 1$, $j = 0, …, k - 1$, and

$$a_{k(n - j)n} = a_{kn(n - j)} = \frac{1}{ψ_{2} k} \sum_{i = 1}^{N - n} h\left(\frac{k - i}{k}\right) h\left(\frac{k - i - j}{k}\right) - \mathbb{1}(j = 0) \frac{ψ_{1}}{2 N ξ^{2} ψ_{2}}, \tag{A.6}$$

for $n = N - K + 2, …, N$, $j = 0, …, k - (N - n) - 1$. Since $g(1) = 0$, elements of the quadratic form that are more than k−2 places from the main diagonal have a value of zero. Note also that the first and last k−2 elements of the main diagonal are tapered. The middle $N - 2k + 4$ elements of the main diagonal may be written as:

$$\frac{1}{ψ_{2} k} \sum_{i = 1}^{k} g{\left(\frac{i}{k}\right)}^{2} - \frac{ψ_{1}}{2 N ξ^{2} ψ_{2}} \to 1, \tag{A.7}$$

as $k \to \infty$, for $n = k, …, N - k + 1$. This limiting behavior is due to the mean $\frac{1}{k} \sum_{i = 1}^{k} g{\left(\frac{i}{k}\right)}^{2}$ converging to the integral $ψ_{2} = \int_{0}^{1} {(g(x))}^{2} dx$, and $\frac{ψ_{1}}{2 k^{2} ψ_{2}}$ converging to zero, as $k \to \infty$.

Remark

A common practice in the literature (e.g. Jacod et al. 2009; Hautsch and Podolskij 2013), is to use $g (x) = \min (x, 1 - x)$ as the weighting function. It follows that $ψ_{1} = 1$ and $ψ_{2} = \frac{1}{12}$ ⁠. In their simulations Jacod et al. (2009) set k = 51. Using these values, we find $a_{k, n, n} = 0.997$ for values of n such that there is no tapering. We also find that $a_{k, n, n} ⩾ 0.99$ when $k ⩾ 27$ for values of n such that there is no tapering.
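The quadratic-form representation and the numerical values quoted in this Remark can be checked with a short script. The sketch below is illustrative only: it builds the weight matrix directly from Equations (A.2) and (A.3) rather than from the closed-form coefficients in (A.5) and (A.6), uses $g(x) = \min(x, 1 - x)$ with $ψ_{1} = 1$ and $ψ_{2} = \frac{1}{12}$, and substitutes $ξ = k / \sqrt{N}$ so that the correction term equals $ψ_{1} / (2 k^{2} ψ_{2})$. The function names, the sample length N = 120, and the simulated Gaussian returns are assumptions made for the example.

```python
import random


def g(x):
    """Weighting function g(x) = min(x, 1 - x) from the Remark."""
    return min(x, 1.0 - x)


PSI1, PSI2 = 1.0, 1.0 / 12.0  # values stated in the Remark for this g


def correction(k):
    """psi1 / (2 N xi^2 psi2) with xi = k / sqrt(N), i.e. psi1 / (2 k^2 psi2)."""
    return PSI1 / (2.0 * k ** 2 * PSI2)


def parv_direct(r, k):
    """Equation (A.3) evaluated directly from the returns r (0-based list)."""
    N = len(r)
    pre = [sum(g(i / k) * r[n + i - 1] for i in range(1, k + 1))
           for n in range(N - k + 1)]
    return sum(v * v for v in pre) / (PSI2 * k) - correction(k) * sum(v * v for v in r)


def parv_weights(N, k):
    """N x N matrix A such that r' A r reproduces parv_direct(r, k)."""
    A = [[0.0] * N for _ in range(N)]
    w = [g(i / k) for i in range(1, k + 1)]
    for n in range(N - k + 1):            # one outer product per preaveraging window
        for a in range(k):
            for b in range(k):
                A[n + a][n + b] += w[a] * w[b] / (PSI2 * k)
    for n in range(N):                    # subtract the bias correction on the diagonal
        A[n][n] -= correction(k)
    return A


def quad_form(A, r):
    """Evaluate the double sum in Equation (A.1)."""
    N = len(r)
    return sum(A[a][b] * r[a] * r[b] for a in range(N) for b in range(N))


def diag_weight(k):
    """Untapered diagonal element, the left-hand side of Equation (A.7)."""
    return sum(g(i / k) ** 2 for i in range(1, k + 1)) / (PSI2 * k) - correction(k)


rng = random.Random(0)
r = [rng.gauss(0.0, 1.0) for _ in range(120)]   # simulated returns, for illustration only
A = parv_weights(len(r), 51)
print(quad_form(A, r), parv_direct(r, 51))      # the two values should agree up to rounding
print(A[60][60], diag_weight(51))               # interior diagonal weight for k = 51
```

Building the matrix as a sum of outer products, one per preaveraging window, makes the banded structure and the tapering at the end-points visible without relying on the index conventions of (A.5) and (A.6).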

Proof of Proposition 1.1.

Let ${\tilde{μ}}_{nt} = \int_{n - 1}^{n} μ_{t}(τ) dτ$, $r_{nt} = \int_{n - 1}^{n} σ_{t}(τ) dW_{t}(τ)$, $j_{nt} = \int_{n - 1}^{n} ω_{t}(τ) dJ_{t}(τ)$, ${\tilde{ε}}_{nt} = ε_{nt} - ε_{(n - 1)t}$, and $ζ_{t} = 1 + β_{t}$. Then, by combining definitions:

$${\tilde{r}}_{nt} = ζ_{t} ({\tilde{μ}}_{nt} + r_{nt} + j_{nt}) + {\tilde{ε}}_{nt}$$

Thus,

$$\begin{aligned} \operatorname{cov}(θ_{t}, {\tilde{r}}_{nt} {\tilde{r}}_{mt}) &= \operatorname{cov}\left(θ_{t}, (ζ_{t} ({\tilde{μ}}_{nt} + r_{nt} + j_{nt}) + {\tilde{ε}}_{nt}) (ζ_{t} ({\tilde{μ}}_{mt} + r_{mt} + j_{mt}) + {\tilde{ε}}_{mt})\right) \\ &= \operatorname{cov}\left(θ_{t}, ζ_{t}^{2} ({\tilde{μ}}_{nt} + r_{nt} + j_{nt}) ({\tilde{μ}}_{mt} + r_{mt} + j_{mt})\right) \\ &= E(ζ_{t}^{2}) \operatorname{cov}\left(θ_{t}, ({\tilde{μ}}_{nt} + r_{nt} + j_{nt}) ({\tilde{μ}}_{mt} + r_{mt} + j_{mt})\right) \\ &= E(ζ_{t}^{2}) \left\{ \operatorname{cov}\left(θ_{t}, ({\tilde{μ}}_{nt} + r_{nt}) ({\tilde{μ}}_{mt} + r_{mt})\right) + E(j_{nt}) \operatorname{cov}\left(θ_{t}, ({\tilde{μ}}_{mt} + r_{mt})\right) \right. \\ &\qquad \left. + E(j_{mt}) \operatorname{cov}\left(θ_{t}, ({\tilde{μ}}_{nt} + r_{nt})\right) \right\} \\ &= E(ζ_{t}^{2}) \left\{ {\tilde{μ}}_{nt} \operatorname{cov}(θ_{t}, r_{nt}) + {\tilde{μ}}_{mt} \operatorname{cov}(θ_{t}, r_{mt}) + \operatorname{cov}(θ_{t}, r_{nt} r_{mt}) + E(j_{nt}) \operatorname{cov}(θ_{t}, r_{mt}) \right. \\ &\qquad \left. + E(j_{mt}) \operatorname{cov}(θ_{t}, r_{nt}) \right\} \\ &= E(ζ_{t}^{2}) \operatorname{cov}(θ_{t}, r_{nt} r_{mt}) \end{aligned} \tag{A.8}$$

where the second equality follows from the zero mean and the independence structure of $ε_{t}(τ)$ stated in Equation (3), the third equality is due to the independence structure of $β_{t}(τ)$ stated in Equation (3), the fourth equality is due to the independence structure of $ω_{t}(τ)$ and $J_{t}(τ)$ stated after Equation (2), the fifth equality follows from the fact that the drift term $μ_{t}(τ)$ is assumed to be deterministic in Equation (2), and the sixth equality follows from the fact that $E(r_{nt} | σ_{t}) = 0$, where σ_t denotes the trajectory of the volatility over Day t, under the “no leverage” assumption stated in Equation (2).

For $n \neq m$, it follows from the “no leverage” condition and the fact that the Wiener increments are non-overlapping that $\operatorname{cov}(θ_{t}, r_{nt} r_{mt}) = 0$. Since a_knn = 1 from Equations (4) and (A.8) it follows that

$$\operatorname{cov}(θ_{t}, x_{kt}) = \sum_{n = 1}^{N} \sum_{m = 1}^{N} a_{knm} \operatorname{cov}(θ_{t}, {\tilde{r}}_{nt} {\tilde{r}}_{mt}) = E(ζ_{t}^{2}) \sum_{n = 1}^{N} a_{knn} \operatorname{cov}(θ_{t}, {\tilde{r}}_{nt}^{2}) = E(ζ_{t}^{2}) \sum_{n = 1}^{N} \operatorname{cov}(θ_{t}, {\tilde{r}}_{nt}^{2})$$

Therefore, $\operatorname{cov}(θ_{t}, u_{it} - u_{jt}) = \operatorname{cov}(θ_{t}, x_{it}) - \operatorname{cov}(θ_{t}, x_{jt}) = 0$. □

In order to prove Propositions 1.2 and 1.3, it is convenient to write the statistics in the following ways:

$${\bar{γ}}_{B} = \frac{1}{T} \sum_{t = 1}^{T} (u_{it} - u_{jt}) \tag{A.9}$$

$${\bar{γ}}_{V} = \frac{1}{T} \sum_{t = 2}^{T} (γ_{1t} + γ_{3t}) + γ_{2} \tag{A.10}$$

$${\bar{γ}}_{M} = \frac{1}{T} \sum_{t = 2}^{T} (γ_{1t} + γ_{3t}) + γ_{2} + γ_{4} \tag{A.11}$$

where

$$γ_{1t} = \frac{1}{2} {(Δx_{it} - μ_{Δx_{it}})}^{2} + μ_{Δx_{it}} (Δx_{it} - μ_{Δx_{it}}) - {\bar{μ}}_{Δx_{i}} (Δx_{it} - μ_{Δx_{it}}) \tag{A.12}$$

$$\qquad - \frac{1}{2} {(Δx_{jt} - μ_{Δx_{jt}})}^{2} - μ_{Δx_{jt}} (Δx_{jt} - μ_{Δx_{jt}}) + {\bar{μ}}_{Δx_{j}} (Δx_{jt} - μ_{Δx_{jt}}) \tag{A.13}$$

$$γ_{2} = \frac{1}{2} {\left(\frac{1}{T} \sum_{t = 1}^{T} γ_{2jt}\right)}^{2} - \frac{1}{2} {\left(\frac{1}{T} \sum_{t = 1}^{T} γ_{2it}\right)}^{2} \tag{A.14}$$

$$γ_{3t} = \frac{1}{2} {(μ_{Δu_{it}} - {\bar{μ}}_{Δu_{i}})}^{2} - \frac{1}{2} {(μ_{Δu_{jt}} - {\bar{μ}}_{Δu_{j}})}^{2} \tag{A.15}$$

$$\qquad + (μ_{Δθ_{t}} - {\bar{μ}}_{Δθ}) \left[(μ_{Δu_{it}} - {\bar{μ}}_{Δu_{i}}) - (μ_{Δu_{jt}} - {\bar{μ}}_{Δu_{j}})\right] \tag{A.16}$$

$$γ_{4} = {\left(\frac{1}{T} \sum_{t = 1}^{T} γ_{4it}\right)}^{2} - {\left(\frac{1}{T} \sum_{t = 1}^{T} γ_{4jt}\right)}^{2} \tag{A.17}$$

with, for $k \in \{i, j\}$, $γ_{2kt} = Δx_{kt} - μ_{Δx_{kt}}$, $γ_{4kt} = x_{kt} - {\tilde{θ}}_{t}$, $μ_{Δx_{kt}} = E(Δx_{kt})$, $μ_{Δu_{kt}} = E(Δu_{kt})$, ${\bar{μ}}_{Δx_{k}} = \frac{1}{T} \sum_{t = 2}^{T} μ_{Δx_{kt}}$ and ${\bar{μ}}_{Δu_{k}} = \frac{1}{T} \sum_{t = 2}^{T} μ_{Δu_{kt}}$. Also, $μ_{Δθ_{t}} = E(Δθ_{t})$ and ${\bar{μ}}_{Δθ} = \frac{1}{T} \sum_{t = 2}^{T} μ_{Δθ_{t}}$.

We now establish some useful properties of these variables.

Lemma 5.2.

For $k \in \{i, j\}$, $γ_{2kt} = Δx_{kt} - μ_{Δx_{kt}}$ is an L₄-NED process of size $-\frac{1}{2}$ on Z_t, under Assumptions 5(2)(a) and 5(2)(b).

Proof.

From Davidson (1994) Theorem 17.8 and Assumption 5(2)(b), $Δ u_{k t}$ is L₄-NED of size $- \frac{1}{2}$ on Z_t. The result then follows from Assumption 5(2)(a) and Davidson (1994) Theorem 17.8.

Lemma 5.3.

For $k \in \{i, j\}$, $γ_{4kt} = x_{kt} - {\tilde{θ}}_{t}$ is an L₄-NED process of size $-\frac{1}{2}$ on Z_t, under Assumptions 5(2)(b) and 5(2)(c).

Proof.

$x_{k t} - {\tilde{θ}}_{t} = u_{k t} - {\tilde{u}}_{t}$ so the result follows from Assumptions 5(2)(b) and 5(2)(c) and Davidson (1994) Theorem 17.8.

Lemma 5.4.

$γ_{1 t}$ is an L₂-NED process of size $- \frac{1}{2}$ on Z_t, under Assumptions 5(2)(a) and 5(2)(b).

Proof.

From Lemma 5.2 and Corollary 5.11, ${(Δ x_{k t} - μ_{Δ xkt})}^{2}$ is L₂-NED under the assumptions. The result then follows from Lemma 5.2 and Davidson (1994) Theorem 17.8.

Lemma 5.5.

$\sqrt{T} {\bar{γ}}_{3} = o (1)$ under Assumptions 3 and 5(1)(a).

Proof. For $k \in \{i, j\}$, $\left| \sqrt{T} \frac{1}{T} \sum_{t = 2}^{T} (μ_{Δθ_{t}} - {\bar{μ}}_{Δθ}) (μ_{Δu_{kt}} - {\bar{μ}}_{Δu_{k}}) \right| ⩽ \sqrt{\frac{1}{T} \sum_{t = 2}^{T} {(μ_{Δθ_{t}} - {\bar{μ}}_{Δθ})}^{2} \sqrt{T} \frac{1}{T} \sum_{t = 2}^{T} {(μ_{Δu_{kt}} - {\bar{μ}}_{Δu_{k}})}^{2}} = \sqrt{O(1) o(1)} = o(1)$ under Assumptions 3 and 5(1)(a). The result then follows from Assumption 3 and Minkowski’s Inequality.

Lemma 5.6.

For $k \in \{i, j\}$, $\exists c_{2} < \infty ∋ \max_{1 ⩽ t ⩽ T} {‖ γ_{2kt} ‖}_{2r} = \max_{1 ⩽ t ⩽ T} {‖ Δx_{kt} - μ_{Δx_{kt}} ‖}_{2r} < c_{2}$ under Assumptions 5(1)(a) and 5(1)(b).

Proof.

The result follows from the Minkowski Inequality applied to ${‖ Δθ_{t} - μ_{Δθ_{t}} + u_{kt} - μ_{u_{kt}} - u_{k,t-1} + μ_{u_{k,t-1}} ‖}_{2r}$.

Lemma 5.7.

For $k \in \{i, j\}$, $\exists c_{3} < \infty ∋ \max_{1 ⩽ t ⩽ T} {‖ γ_{4kt} ‖}_{2r} = \max_{1 ⩽ t ⩽ T} {‖ x_{kt} - {\tilde{θ}}_{t} ‖}_{2r} < c_{3}$ under Assumptions 5(1)(b) and 5(1)(c).

Proof.

The result follows from the Minkowski Inequality applied to ${‖ u_{k t} - \tilde{u} ‖}_{2 r}$ ⁠.

Lemma 5.8.

$\exists c_{4} < \infty ∋ \max_{1 ⩽ t ⩽ T} {‖ γ_{1t} ‖}_{r} < c_{4}$ under Assumptions 5(1)(a) and 5(1)(b).

Proof.

${‖ {(Δ x_{i t} - μ_{Δ xit})}^{2} ‖}_{r} ⩽ {‖ (Δ x_{i t} - μ_{Δ xit}) ‖}_{2 r}^{2} < c_{2}^{2}$ from Lemmas 5.6 and 5.9. The result then follows from the Minkowski Inequality and Lemma 5.6.

The following are technical results that are used in the proofs.

Lemma 5.9.

For random variables A and B and constants $s ⩾ 1$ and r > 1, ${‖ A B ‖}_{s} ⩽ {‖ A ‖}_{sr} {‖ B ‖}_{\frac{sr}{r - 1}}$. Furthermore, strict equality holds when A = B and r = 2.

Proof.

For $s ⩾ 1$, ${‖ A B ‖}_{s} = {(E(| A B |^{s}))}^{\frac{1}{s}} = {(E(| A^{s} B^{s} |))}^{\frac{1}{s}} ⩽ {\left({‖ A^{s} ‖}_{r} {‖ B^{s} ‖}_{\frac{r}{r - 1}}\right)}^{\frac{1}{s}}$ for r > 1 from Hölder’s Inequality. Also ${\left({‖ A^{s} ‖}_{r} {‖ B^{s} ‖}_{\frac{r}{r - 1}}\right)}^{\frac{1}{s}} = {\left({(E(| A^{s} |^{r}))}^{\frac{1}{r}} {(E(| B^{s} |^{\frac{r}{r - 1}}))}^{\frac{r - 1}{r}}\right)}^{\frac{1}{s}} = {(E(| A |^{sr}))}^{\frac{1}{sr}} {(E(| B |^{\frac{sr}{r - 1}}))}^{\frac{r - 1}{sr}} = {‖ A ‖}_{sr} {‖ B ‖}_{\frac{sr}{r - 1}}$. Therefore ${‖ A B ‖}_{s} ⩽ {‖ A ‖}_{sr} {‖ B ‖}_{\frac{sr}{r - 1}}$.

Now let B = A and r = 2. Then ${‖ A^{2} ‖}_{s} = {(E(| A^{2} |^{s}))}^{\frac{1}{s}} = {\left({(E(| A |^{2s}))}^{\frac{1}{2s}}\right)}^{2} = {‖ A ‖}_{2s}^{2}$.

Remark.

If s = 1 then the result is Hölder’s Inequality.
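As a sanity check on Lemma 5.9, the inequality can be evaluated numerically. The following sketch replaces expectations with averages over a finite sample (for which the inequality must also hold, since the empirical distribution is itself a probability measure) and is not part of the proof; the function names and the simulated Gaussian draws are assumptions made for the illustration.

```python
import random


def lp_norm(xs, p):
    """Empirical L_p norm (E|X|^p)^(1/p), with E taken as the sample average."""
    return (sum(abs(x) ** p for x in xs) / len(xs)) ** (1.0 / p)


def holder_sides(a, b, s, r):
    """Return (||AB||_s, ||A||_{sr} * ||B||_{sr/(r-1)}) for paired samples a and b."""
    ab = [x * y for x, y in zip(a, b)]
    lhs = lp_norm(ab, s)
    rhs = lp_norm(a, s * r) * lp_norm(b, s * r / (r - 1.0))
    return lhs, rhs


rng = random.Random(0)
a = [rng.gauss(0.0, 1.0) for _ in range(1000)]
b = [rng.gauss(0.0, 1.0) for _ in range(1000)]

for s, r in [(1.0, 2.0), (2.0, 3.0), (1.5, 1.2)]:
    lhs, rhs = holder_sides(a, b, s, r)
    print(s, r, lhs <= rhs)            # the bound should hold for every (s, r) pair

# With A = B and r = 2 the two sides coincide up to floating-point rounding.
print(holder_sides(a, a, 2.0, 2.0))
```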

Lemma 5.10.

Let X_t be $L_{sr}$-NED of size $-ϕ_{X}$ on any process $σ_{t}{(τ)}^{2}$ and let Y_t be $L_{\frac{sr}{r - 1}}$-NED of size $-ϕ_{Y}$ on $σ_{t}{(τ)}^{2}$, where $r > 1$, $s \geq 1$ and $ϕ_{X}, ϕ_{Y} > 0$. Then $X_{t} Y_{t}$ is $L_{s}$-NED of size $-\min\{ϕ_{X}, ϕ_{Y}\}$ on $σ_{t}{(τ)}^{2}$.

Proof.

For conciseness, we adopt the following notation: $E_{m} (X_{t}) \equiv E (X_{t} | F_{t - m}^{t + m})$ and ${[X_{t}]}_{c} = X_{t} - E_{m} (X_{t})$ ⁠. Let $d_{t}^{X}, d_{t}^{Y}$ denote positive, finite, constants and $η_{m}^{X} = O (m^{- ϕ_{X}})$ and $η_{m}^{Y} = O (m^{- ϕ_{Y}})$ mixing coefficients such that ${‖ {[X_{t}]}_{c} ‖}_{s r} \leq d_{t}^{X} η_{m}^{X}$ and ${‖ {[Y_{t}]}_{c} ‖}_{\frac{s r}{r - 1}} \leq d_{t}^{Y} η_{m}^{Y}$ ⁠. As discussed in Davidson (1994) Theorem 17.9, and using Minkowski’s Inequality: ${‖ {[X_{t} Y_{t}]}_{c} ‖}_{s} \leq {‖ X_{t} {[Y_{t}]}_{c} ‖}_{s} + {‖ {[X_{t}]}_{c} E_{m} (Y_{t}) ‖}_{s} + {‖ E_{m} ({[X_{t}]}_{c} {[Y_{t}]}_{c}) ‖}_{s}$ ⁠. The first two norms in this decomposition are bounded using Lemma 5.9 since ${‖ X_{t} {[Y_{t}]}_{c} ‖}_{s} \leq {‖ X_{t} ‖}_{s r} {‖ {[Y_{t}]}_{c} ‖}_{\frac{s r}{r - 1}} \leq {‖ X_{t} ‖}_{s r} d_{t}^{Y} η_{m}^{Y}$ ⁠, and ${‖ {[X_{t}]}_{c} E_{m} (Y_{t}) ‖}_{s} \leq {‖ {[X_{t}]}_{c} ‖}_{s r} {‖ E_{m} (Y_{t}) ‖}_{\frac{s r}{r - 1}} \leq {‖ Y_{t} ‖}_{\frac{s r}{r - 1}} d_{t}^{X} η_{m}^{X}$ ⁠. For the third norm, using Jensen’s Inequality (for conditional expectations), and the Law of Iterated Expectations, ${‖ E_{m} ({[X_{t}]}_{c} {[Y_{t}]}_{c}) ‖}_{s} \leq {‖ {[X_{t}]}_{c} {[Y_{t}]}_{c} ‖}_{s}$ and then applying Lemma 5.9 ${‖ {[X_{t}]}_{c} {[Y_{t}]}_{c} ‖}_{s} \leq {‖ {[X_{t}]}_{c} ‖}_{s r} {‖ {[Y_{t}]}_{c} ‖}_{\frac{s r}{r - 1}} \leq d_{t}^{X} η_{m}^{X} d_{t}^{Y} η_{m}^{Y}$ ⁠.

Combining these three bounds demonstrates that ${‖ {[X_{t} Y_{t}]}_{c} ‖}_{s} \leq d_{t} η_{m}$, where $d_{t} = \max\{{‖ X_{t} ‖}_{sr} d_{t}^{Y}, {‖ Y_{t} ‖}_{\frac{sr}{r - 1}} d_{t}^{X}, d_{t}^{X} d_{t}^{Y}\}$ and $η_{m} = η_{m}^{X} + η_{m}^{Y} + η_{m}^{X} η_{m}^{Y} = O(m^{-\min\{ϕ_{X}, ϕ_{Y}\}})$.

Remark.

Let $r = 2, s = 1$ ⁠. In this special case, $X_{t} Y_{t}$ is L₁-NED and the result is Theorem 17.9 of Davidson (1994).

Corollary 5.11

Let X_t be $L_{2 s}$ -NED of size $- ϕ_{X}$ on any process $σ_{t} {(τ)}^{2}$ , where $s \geq 1$ and $ϕ_{X} > 0$ . Then $X_{t}^{2}$ is L_s-NED of size $- ϕ_{X}$ on $σ_{t} {(τ)}^{2}$ ⁠.

Proof of Corollary 5.11:

The result is proved by setting r = 2 and Y_t = X_t in Lemma 5.10. □

We now use the above results to prove the theorems.

Proof of Proposition 1.2 (a):

Under Assumption 5(2)(b), from Theorem 17.8 of Davidson (1994) and the Lyapunov Inequality, $u_{it} - u_{jt} - (μ_{u_{it}} - μ_{u_{jt}})$ is a zero-mean L₂-NED process of size $-\frac{1}{2}$ on Z_t, where $μ_{u_{kt}} = E(u_{kt})$ for $k \in \{i, j\}$. From Assumption 5(1)(b) and Minkowski’s Inequality, $\max_{1 ⩽ t ⩽ T} {‖ u_{it} - u_{jt} - (μ_{u_{it}} - μ_{u_{jt}}) ‖}_{2r} < c < \infty$. The required result then follows from Theorem 6.4.4 of Davidson (2000).

Proof of Proposition 1.2 (b):

Under Assumptions 5(2)(a) and 5(2)(b), it follows from Lemmas 5.2 and 5.4 and the Lyapunov Inequality that $γ_{1t}$ and $γ_{2kt}$, $k \in \{i, j\}$, are L₁-NED processes of size $-\frac{1}{2}$ on Z_t. Also, under Assumptions 5(1)(a) and 5(1)(b), it follows from Lemmas 5.6 and 5.8 that $\exists c_{5} < \infty ∋ \max_{1 ⩽ t ⩽ T} {‖ γ_{1t} ‖}_{r} < c_{5}$ and $\max_{1 ⩽ t ⩽ T} {‖ γ_{2kt} ‖}_{2r} < c_{5}$. Finally, from Lemma 5.5, under Assumptions 3 and 5(1)(a), ${\bar{γ}}_{3} \to 0$. The required result then follows from Equation (A.10), Assumptions 2 and 4 and Slutsky’s Theorem.

Proof of Proposition 1.2 (c):

Under Assumptions 1, 5(2)(b) and 5(2)(c), 5(1)(b), and 5(1)(c), it follows from Lemmas 5.3 and 5.7, Equation (A.17) and Davidson (2000) Theorem 6.4.4 that $({\bar{γ}}_{4} - E ({\bar{γ}}_{4})) \overset{p}{\to} 0$ ⁠. The required result follows from this and Proposition 1.2(b).

Proof of Proposition 1.3 (a):

From Assumption 5(2)(b), Theorem 17.8 of Davidson (1994) and the Lyapunov Inequality, $u_{it} - u_{jt} - (μ_{u_{it}} - μ_{u_{jt}})$ is a zero-mean L₂-NED process of size $-\frac{1}{2}$ on Z_t. From Assumption 5(1)(b), Minkowski’s Inequality and the Lyapunov Inequality, ${‖ u_{it} - u_{jt} - (μ_{u_{it}} - μ_{u_{jt}}) ‖}_{2δ} < c < \infty$. It follows from Theorem 6.4.6 of Davidson (2000) that $\exists c_{6} < \infty ∋ \sqrt{T} {‖ {\bar{γ}}_{B} - E({\bar{γ}}_{B}) ‖}_{2} \to c_{6}$. Under these conditions, and Assumptions 3 and 6, it follows from Theorem 1 of Calhoun (2018) that

$$\sup_{x \in \mathbb{R}} \left| P^{*}[\sqrt{T} ({\bar{γ}}_{B}^{*} - E^{*}({\bar{γ}}_{B}^{*})) ⩽ x] - P[\sqrt{T} ({\bar{γ}}_{B} - E({\bar{γ}}_{B})) ⩽ x] \right| \overset{p}{\to} 0$$

Proof of Proposition 1.3 (b):

Note that

$$\begin{aligned} &\sup_{x \in \mathbb{R}} \left| P^{*}[\sqrt{T} ({\bar{γ}}_{V}^{*} - E^{*}({\bar{γ}}_{V}^{*})) ⩽ x] - P[\sqrt{T} ({\bar{γ}}_{V} - E({\bar{γ}}_{V})) ⩽ x] \right| \\ &\quad = \sup_{x \in \mathbb{R}} \left| P^{*}[\sqrt{T} ({\bar{γ}}_{1}^{*} - E^{*}({\bar{γ}}_{1}^{*}) + {\bar{γ}}_{5}^{*} - E^{*}({\bar{γ}}_{5}^{*})) ⩽ x] - P[\sqrt{T} ({\bar{γ}}_{1} - E({\bar{γ}}_{1}) + {\bar{γ}}_{5} - E({\bar{γ}}_{5})) ⩽ x] \right| \end{aligned} \tag{A.18}$$

where ${\bar{γ}}_{5} = {\bar{γ}}_{3} + γ_{2}$. Under Assumptions 5(1)(a), 5(1)(b), 5(2)(a), and 5(2)(b), it follows from Lemmas 5.4 and 5.8, Theorem 2 of de Jong (1997) and Theorem 6.4.6 of Davidson (2000) that $\exists σ_{1} < \infty ∋ \sqrt{T} ({\bar{γ}}_{1} - E({\bar{γ}}_{1})) \overset{d}{\to} N(0, σ_{1})$. Also, under Assumptions 3, 5(1)(a), 5(1)(b), 5(2)(a), and 5(2)(b), it follows from Lemmas 5.2, 5.5, and 5.6, Equation (22), Davidson (2000) Theorem 6.4.4, de Jong (1997) Theorem 2 and the Slutsky Theorem that $\sqrt{T} ({\bar{γ}}_{5} - E({\bar{γ}}_{5})) \overset{p}{\to} 0$. Therefore, from the Slutsky Theorem, $\sqrt{T} ({\bar{γ}}_{1} - E({\bar{γ}}_{1}) + {\bar{γ}}_{5} - E({\bar{γ}}_{5})) \overset{d}{\to} N(0, σ_{1})$. From the continuity of the Gaussian distribution and Pólya’s Theorem (Serfling 2009, p. 18), we then have

$$\sup_{x \in \mathbb{R}} \left| P[\sqrt{T} ({\bar{γ}}_{1} - E({\bar{γ}}_{1}) + {\bar{γ}}_{5} - E({\bar{γ}}_{5})) ⩽ x] - P[\sqrt{T} ({\bar{γ}}_{1} - E({\bar{γ}}_{1})) ⩽ x] \right| \to 0 \tag{A.19}$$

Also, from the stationarity of the bootstrap sample conditional on the original sample (Proposition 1 of Politis and Romano 1994), it follows that $γ_{3}^{*} = 0$ ⁠. Therefore, $γ_{5}^{*} = γ_{2}^{*}$ ⁠.

Combining the above results yields

$$\begin{aligned} &\sup_{x \in \mathbb{R}} \left| P^{*}[\sqrt{T} ({\bar{γ}}_{V}^{*} - E^{*}({\bar{γ}}_{V}^{*})) ⩽ x] - P[\sqrt{T} ({\bar{γ}}_{V} - E({\bar{γ}}_{V})) ⩽ x] \right| \\ &\quad \to \sup_{x \in \mathbb{R}} \left| P^{*}[\sqrt{T} ({\bar{γ}}_{1}^{*} - E^{*}({\bar{γ}}_{1}^{*}) + γ_{2}^{*} - E^{*}(γ_{2}^{*})) ⩽ x] - P[\sqrt{T} ({\bar{γ}}_{1} - E({\bar{γ}}_{1})) ⩽ x] \right| \end{aligned} \tag{A.20}$$

Let $ε > 0$ be an arbitrarily chosen real number. By considering the two cases where $P^{*}[\sqrt{T} ({\bar{γ}}_{1}^{*} - E^{*}({\bar{γ}}_{1}^{*}) + γ_{2}^{*} - E^{*}(γ_{2}^{*})) ⩽ x]$ is both greater than, and less than, $P[\sqrt{T} ({\bar{γ}}_{1} - E({\bar{γ}}_{1})) ⩽ x]$, we may write the inequality

$$\begin{aligned} &\sup_{x \in \mathbb{R}} \left| P^{*}[\sqrt{T} ({\bar{γ}}_{1}^{*} - E^{*}({\bar{γ}}_{1}^{*}) + γ_{2}^{*} - E^{*}(γ_{2}^{*})) ⩽ x] - P[\sqrt{T} ({\bar{γ}}_{1} - E({\bar{γ}}_{1})) ⩽ x] \right| \\ &\quad ⩽ \max \Big\{ \sup_{x \in \mathbb{R}} \left| P^{*}[\sqrt{T} ({\bar{γ}}_{1}^{*} - E^{*}({\bar{γ}}_{1}^{*})) ⩽ x + ε] - P[\sqrt{T} ({\bar{γ}}_{1} - E({\bar{γ}}_{1})) ⩽ x] \right|, \\ &\qquad\qquad \sup_{x \in \mathbb{R}} \left| P^{*}[\sqrt{T} ({\bar{γ}}_{1}^{*} - E^{*}({\bar{γ}}_{1}^{*})) ⩽ x] - P[\sqrt{T} ({\bar{γ}}_{1} - E({\bar{γ}}_{1})) ⩽ x + ε] \right| \Big\} \\ &\qquad + P^{*}[\sqrt{T} | γ_{2}^{*} - E^{*}(γ_{2}^{*}) | > ε] \end{aligned} \tag{A.21}$$

Consider the last term in Equation (A.21). From Equation (A.14)

$$\begin{aligned} P^{*}[\sqrt{T} | γ_{2}^{*} - E^{*}(γ_{2}^{*}) | > ε] &= P^{*}\left[\sqrt{T} \left| \left({({\bar{γ}}_{2j}^{*})}^{2} - {({\bar{γ}}_{2i}^{*})}^{2}\right) - \left(E^{*}({({\bar{γ}}_{2j}^{*})}^{2}) - E^{*}({({\bar{γ}}_{2i}^{*})}^{2})\right) \right| > ε\right] \\ &< P^{*}\left[{(\sqrt{T} {\bar{γ}}_{2j}^{*})}^{2} > \frac{ε \sqrt{T}}{4}\right] + P^{*}\left[{(\sqrt{T} {\bar{γ}}_{2i}^{*})}^{2} > \frac{ε \sqrt{T}}{4}\right] \\ &\quad + P^{*}\left[E^{*}{(\sqrt{T} {\bar{γ}}_{2j}^{*})}^{2} > \frac{ε \sqrt{T}}{4}\right] + P^{*}\left[E^{*}{(\sqrt{T} {\bar{γ}}_{2i}^{*})}^{2} > \frac{ε \sqrt{T}}{4}\right] \\ &< \frac{2 Var^{*}(\sqrt{T} {\bar{γ}}_{2j}^{*})}{ε \sqrt{T}} + \frac{2 Var^{*}(\sqrt{T} {\bar{γ}}_{2i}^{*})}{ε \sqrt{T}} \overset{p}{\to} 0 \end{aligned} \tag{A.22}$$

where the final inequality is due to Markov’s Lemma, the convergence in probability is due to Calhoun (2018) Theorem 1, and the limit of zero follows from the absolute summability of covariances due to Davidson (2000) Theorem 6.4.6, which applies due to Lemmas 5.2 and 5.6 under Assumptions 5(1)(a), 5(1)(b), 5(2)(a), 5(2)(b), and 6.

Now consider the first term after the inequality in Equation (A.21). Under Assumptions 5(1)(a), 5(1)(b), 5(2)(a), 5(2)(b), and 6 it follows from Calhoun (2018) Theorem 1 that

$$\sup_{x \in \mathbb{R}} \left| P^{*}[\sqrt{T} ({\bar{γ}}_{1}^{*} - E^{*}({\bar{γ}}_{1}^{*})) ⩽ x] - P[\sqrt{T} ({\bar{γ}}_{1} - E({\bar{γ}}_{1})) ⩽ x] \right| \overset{p}{\to} 0 \tag{A.23}$$

Since ε may be chosen to be arbitrarily small, the required result follows from Equations (A.20)–(A.23), and the right-continuity of cumulative distribution functions.

Proof of Proposition 1.3 (c):

Since ${\bar{γ}}_{M} = {\bar{γ}}_{V} + γ_{4}$, the proof is largely the same as that for Proposition 1.3(b), with some additional arguments to deal with γ₄. We have

$$\begin{aligned} &\sup_{x \in \mathbb{R}} \left| P^{*}[\sqrt{T} ({\bar{γ}}_{M}^{*} - E^{*}({\bar{γ}}_{M}^{*})) ⩽ x] - P[\sqrt{T} ({\bar{γ}}_{M} - E({\bar{γ}}_{M})) ⩽ x] \right| \\ &\quad = \sup_{x \in \mathbb{R}} \left| P^{*}[\sqrt{T} ({\bar{γ}}_{1}^{*} - E^{*}({\bar{γ}}_{1}^{*}) + {\bar{γ}}_{6}^{*} - E^{*}({\bar{γ}}_{6}^{*})) ⩽ x] - P[\sqrt{T} ({\bar{γ}}_{1} - E({\bar{γ}}_{1}) + {\bar{γ}}_{6} - E({\bar{γ}}_{6})) ⩽ x] \right| \end{aligned} \tag{A.24}$$

where ${\bar{γ}}_{6} = {\bar{γ}}_{5} + γ_{4}$.

Under Assumptions 5(2)(b) and 5(2)(c), 5(1)(b), and 5(1)(c), it follows from Lemmas 5.3 and 5.7, Equation (A.17), Davidson (2000) Theorem 6.4.4, de Jong (1997) Theorem 2 and the Slutsky Theorem that $\sqrt{T} (γ_{4} - E(γ_{4})) \overset{p}{\to} 0$. It was shown in the proof of Proposition 1.3(b) that $\sqrt{T} ({\bar{γ}}_{5} - E({\bar{γ}}_{5})) \overset{p}{\to} 0$, so it follows that $\sqrt{T} ({\bar{γ}}_{6} - E({\bar{γ}}_{6})) \overset{p}{\to} 0$ and consequently, from previously stated arguments, that

$$\sup_{x \in \mathbb{R}} \left| P[\sqrt{T} ({\bar{γ}}_{1} - E({\bar{γ}}_{1}) + {\bar{γ}}_{6} - E({\bar{γ}}_{6})) ⩽ x] - P[\sqrt{T} ({\bar{γ}}_{1} - E({\bar{γ}}_{1})) ⩽ x] \right| \to 0 \tag{A.25}$$

Following a similar argument to that used in Equation (A.22),

$$P^{*}[\sqrt{T} | γ_{7}^{*} - E^{*}(γ_{7}^{*}) | > ε] \overset{p}{\to} 0 \quad \text{where } γ_{7}^{*} = γ_{2}^{*} + γ_{4}^{*} \tag{A.26}$$

The required result then follows from Equation (A.26) and Equations (A.20)–(A.23) with ${\bar{γ}}_{M}$ substituted for ${\bar{γ}}_{V}$, γ₇ substituted for γ₂, ε chosen to have an arbitrarily small value, and the right-continuity of cumulative distribution functions.

Supplemental Material

Supplemental material is available at Journal of Financial Econometrics online.

References

Ait-Sahalia, Mykland, P. A., and Zhang. 2005. How Often to Sample a Continuous-Time Process in the Presence of Market Microstructure Noise. Review of Financial Studies: 351–416.
