Estimation and forecasting of multivariate high frequency volatility models: application to a leveraged risk-parity portfolio
The purpose of the thesis was to study a selection of bivariate high frequency volatility models and compare them using Hansens model confidence set (MCS) (link). A subset of the best models found via the MCS procedure, was then used in a risk-targeted risk-parity portfolio. The idea was to construct a simplified portfolio following the same strategy of Man AHL TargetRisk. The general results for the thesis are described below.
From 10 years of data on the ETFs, SPY and TLT, I had more than 250 GB of raw TAQ data which was cleaned following the procedure described in Barndorff‐Nielsen et al. (2009). This gave us enough usable data to construct sample schemes down to second frequencies. In the empirical analysis I ended up with more than 350 volatility models across frequencies, realized measures, and volatility models. These were tested using forecast comparison methods with a direct and indirect evaluation of the forecast losses.
It needs to be noted that the portfolios was heavily affect by the stock-bond correlation, since I chose a time horizon with negative correlation, thus imposing better diversification in terms of how a bond position can hedge against an equity market sell-off.
The results from my thesis are described below.
From the analysis of the realized measures:
-
I found evidence that the literature-preferred realized covariance estimator on a 5-minute frequency, , was significantly out-performed by 4 out of 75 realized measures.
-
In line with Li et al. (2015), I concluded that appeared more often in the superior set, and that noise-robust measures on second-frequencies tend to perform well for liquid equity-indices or proxies thereof.
-
An alteration of the MCS procedure with an economic loss function, favoured jump-robust estimators under a proxy for integrated covariation. This contributes to the fact that measuring quadratic covariation induces more variability in each estimator, which in turn, might procure higher variances in the MVP framework.
-
Conclusively, we found evidence that many of the realized measures on minute-frequencies seems to provide much of the benefits of high frequency data without exposing the estimators to microstructure noise. Moreover, the empirical accuracy only increases marginally (on average) when considering noise-robust estimators on second frequencies.
In general, the analysis was done to investigate how the superior performance of the realized measures translated over to volatility models.
From the analysis of the realized volatility models:
-
It was observed that continuous (assymetrical) models accounting for leverage effect could not capture the effect for both models. This was partially due to an "inverse"-type leverage effect observed for the assets.
-
In general, the DRD-HAR(Q) type models performed best and procurred the lowest statistical losses, while being very parsimonious.
-
All of the high frequency volatility models (even the poorest ones) beat their open-to-close volatility model counterparts as noted by the forecast comparison analysis.
-
I further observed that the performance of the realized measures was generally carried over to the volatility models. However, this was independent of the sample frequency, telling us that the characteristics of each measure, matter more than the frequency.
From the portfolio analysis:
-
Transitioning from volatility models on daily returns to any DRD-HAR type model under consideration, decreased the vol of vol substantially while retaining (approximately) the same annualized expected returns. This lead to a reduction in the expected shortfall under a time-horizon with no apparent crises, that would net the risk-averse investor a 2.8 million notional difference, if they invest 1 billion in the DRD-CHAR portfolio.
-
Contrary to Bollerslev et al. (2018), we found evidence that the turnover rates of the ARQ type models increased substantially contrary to the HAR-type dynamics, which contributed to a reduction in performance (as seen by Sharpe ratios) under proportional transaction costs.
-
The most surprising result, was the universal performance of the CHAR model. Upon further investigation, we found that the DRD-CHAR model was in the top three across all cost-schemes in terms of Sharpe ratio, which could be contributed to the reduced turnover rates.
-
In general, it lead me to believe, that good out-of-sample portfolio performance is not only dictated by models within the superior set of the MCS procedure, but in some cases, rejected parsimonious models might perform consistently well across realized measures with adequately chosen frequencies.
The code for my master thesis was fractioned into small snippets corresponding to different subsections of my thesis. You cannot download the repository and expect all of the code to be runable due to lacking the original cleaned stock data. The code has been provided for illustrational purposes.
functions.R
contains all of my self-written ad-hoc functions used throughout the entire thesis.
Summarystats.R
, Summary stats for cleaning Podolskij method.R
and jumpproportion.R
contains the code for the summary statistics and analysis of the assets, provided in the empirical analysis.
DRD-HAR models.R
and Volatility models.R
contains the code for the in-sample estimation and analysis of the volatility models.
mcs_realized_measures.R
and out_of_sample_MCS_and_prelim_volmodels.R
contains the forecast comparison analysis for the realized measures and the volatility models. Morevover, the results was compared with the MCS function in Kevin Sheppards toolbox and the code sniplet can be seen in mainMCS.m
.
Out_of_sample_forecast_analysis.R
contains the code for the out-of-sample forecast analysis.
Out_of_sample_portfolio_analysis.R
contains the code for the out-of-sample portfolio analysis.
riskparity.R
contains the code that produces the graphs provided in the theoretical section of the risk-parity portfolio.
Correlation check.R
, IntradayACF.R
, semicovplot.R
, pre-averaging stability.R
and realized semicovariance graph.R
contains the code that produces preliminary graphs before the empirical analysis analysis.