
Stressed Correlations and Volatilities –
How to Fulfill Requirements of the Basel
Committee
Christoph Becker∗
Wolfgang M. Schmidt†
Preliminary version, March 24, 2011
Abstract
We propose a new approach to the definition of stress scenarios for
volatilities and correlations which fulfills the requirements of the Basel Committee on Banking Supervision for the quantification of market risk. Correlations and volatilities are functions of one and the same market factor, which
is the key to stressing them in a consistent and intuitive way. Our approach is
based on a new asset price model where correlations and volatilities depend
on the current state of the market. The state of the market captures market-wide movements in equity-prices and thereby fulfills minimum requirements
for risk factors stated by the Basel Committee. For sample portfolios we
compare correlations and volatilities in a normal market and under stress and
explore consequences for value-at-risk. Stressed value-at-risk exceeds the
standard value-at-risk by a factor of 3 to 4, confirming estimates from the
Basel Committee.
We finally compare our modeling approach with multivariate GARCH
models. For all data analyzed our model proved to be superior in capturing
the dynamics of volatilities and correlations.
Keywords: correlation, volatility, Basel III, GARCH models
JEL classification codes: C13, C32, C58, G11, G12
We thank Darrell Duffie, participants of EFA 2010 conference, Peter Ruckdeschel, Alexander
Szimayer, Uwe Kuechler, Natalie Packham, Robert Tompkins, Thomas Heidorn, Matthias Fengler,
and Marlene Mueller for helpful suggestions and discussions. This paper is part of a research project
that is funded by the Frankfurt Institute for Risk Management and Regulation (FIRM). Special thanks
to Keith Jarrett for his music.
∗ Frankfurt School of Finance & Management, Sonnemannstr. 9-11, 60314 Frankfurt am Main,
e-mail: [email protected]
† Frankfurt School of Finance & Management, Sonnemannstr. 9-11, 60314 Frankfurt am Main,
e-mail: [email protected]
1 Introduction
We suggest a model that provides a natural setting to define consistent stress scenarios for volatilities and correlations. The stress scenarios are based on historical
experience and correspond to pre-specified probabilities. In our model the vector
of volatilities σ and the correlation matrix ρ depend on one and the same market
state F. The market state F is generic; to comply with suggestions of the Basel
Committee we define the market state as the realized drift of a market index. We
estimate the dependence structure of volatilities σ (F) and correlations ρ(F) on
the market state F from daily stock prices. In other words, we propose and estimate a nonlinear one-factor model for the dynamics of volatilities and correlations.
Stressed volatilities and correlations are then defined by
σ ( fα ),
ρ( fα ),
where fα is the α-quantile of the empirical distribution of observed market states
F. The concept is transparent and, by construction, volatilities and correlations are
stressed in a consistent manner. Furthermore, the choice of the quantile probability α relates to the probability of the stress scenario. We examine consequences
of stressed volatilities and correlations on portfolio value-at-risk. We find that the
stressed value-at-risk exceeds a standard value-at-risk by a factor of 3 − 4, confirming results in BCBS [2009a].
Our approach is motivated by the requirements of the Basel Committee on
Banking Supervision (BCBS) on how to determine capital requirements for the
market risk of a stock portfolio. If a bank uses internal models to determine capital requirements, the committee demands that the “bank must calculate a ‘stressed
value-at-risk’ measure” where “the relevant market factors were experiencing a period of stress”, see BCBS [2009b] and the updated document BCBS [2011b]. One
type of stress tests suggested by the committee is to “evaluate the sensitivity of the
bank’s market risk exposure to changes in the assumptions about volatilities and
correlations”. However, in the document BCBS [2011a] the committee acknowledges that banks that wish to fulfill this requirement face difficult technical questions. For risk factors the minimum requirement stated in BCBS [2011b] is that
“there should be a risk factor that is designed to capture market-wide movements
in equity-prices (e.g. a market index)”. One question is how to use such a factor to
stress volatilities and correlations in a consistent manner. Another question is how
to stress volatilities and correlations in a way that is justified by historical experience. Furthermore, stressing correlations in a portfolio with more than two assets
is technically difficult because one needs to maintain the positive semi-definiteness
of the correlation matrix. By increasing individual correlations by a fixed quantity
the positive semi-definiteness of the correlation matrix is lost in general. Therefore
a naive approach that bases stressed correlations on estimates of bivariate models
is infeasible.
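The loss of positive semi-definiteness can be illustrated with a minimal sketch. The three-asset correlation matrix below is hypothetical and not taken from the data; the helper names are illustrative. Two pairwise correlations are raised by a fixed amount while the third is left unchanged, and the Cholesky factorization of the resulting matrix fails.

```python
import math


def cholesky(a):
    """Return the lower-triangular Cholesky factor of a symmetric matrix.

    Raises ValueError if the matrix is not positive definite.
    """
    n = len(a)
    c = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = a[i][j] - sum(c[i][k] * c[j][k] for k in range(j))
            if i == j:
                if s <= 0.0:
                    raise ValueError("matrix is not positive definite")
                c[i][j] = math.sqrt(s)
            else:
                c[i][j] = s / c[j][j]
    return c


def is_positive_definite(a):
    """Check positive definiteness by attempting a Cholesky factorization."""
    try:
        cholesky(a)
        return True
    except ValueError:
        return False


# Hypothetical correlation matrix in a normal market (illustrative numbers).
base = [[1.0, 0.5, 0.5],
        [0.5, 1.0, -0.4],
        [0.5, -0.4, 1.0]]

# Naive stress: raise the two correlations with the first asset by 0.3
# and leave the remaining correlation untouched.
naive = [row[:] for row in base]
for i, j in [(0, 1), (0, 2)]:
    naive[i][j] += 0.3
    naive[j][i] += 0.3

print(is_positive_definite(base))   # True
print(is_positive_definite(naive))  # False
```

The stressed matrix still has ones on the diagonal and entries in [−1, 1], so the defect is not visible entry by entry; it only shows up when the matrix is considered as a whole.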
Popular models like multivariate GARCH models do not seem to be able to
fulfill all the requirements posed by the Basel Committee. Our approach for modeling the dynamics of volatilities and correlations is related to existing GARCH
approaches. In comparison, out of all the time series we considered, our model
proves to capture the behavior of correlations and volatilities better than the Dynamic Conditional Correlation GARCH model by Engle [2002].
The paper is organized as follows: In Section 2 we outline our model and
present the main results. We investigate the dependence structure of volatilities
and correlation on the market state F and consider examples of stressed volatilities and correlations. Furthermore, we explore consequences for value-at-risk. In
Section 3 we describe the model in more detail. The relationship between our
model and multivariate GARCH models is discussed in Section 4. In Section 5 we
compare our model with the Dynamic Conditional Correlation GARCH model, a
model with constant volatilities and correlations, and two moving averages based
on the past 30 and 90 days, respectively. We analyze how the models capture the
dynamics of volatilities and correlations in a given time series and find that our
model outperforms.
2 Stressed volatilities and correlations and their impact on VaR
To describe the main results we sketch our model and provide details in Section
3. We develop a diffusion model for the dynamics of assets S1 , . . . , Sn , where
volatilities and correlations depend on a market state F. The market state F is
the realized drift of a market index over a fixed number of past observations. We
estimate our model for stocks from the S&P 500 using daily data for the period
Jan 1990 − Nov 2010. Here, the market state F is defined as the realized drift of
the S&P 500. We estimate the model by applying a maximum likelihood estimator.
The dependence of volatilities and correlations on the market state is first analyzed
by considering pairs of stocks. The estimates in Figure 1 show that correlations
are increased in bear markets and stable in normal and bull markets. Furthermore,
correlations and volatilities seem to be co-moving. The horizontal lines in Figure
1 are estimated constant correlations and volatilities in a corresponding model that
assumes only constant volatilities and correlations. The difference between market
state dependent volatilities as well as correlations and their constant counterparts,
respectively, indicates how strongly these quantities change in a crisis.
However, we cannot determine stressed correlations for a larger portfolio by computing stressed correlations in a bivariate manner because we may obtain a matrix that is not positive semi-definite. Therefore, for a portfolio consisting of n stocks we simultaneously estimate the vector of volatilities σ(F) and the n × n correlation matrix ρ(F).
Figure 1: Typical dependency structures of correlation ρ and volatilities σ1, σ2 of daily stock returns on the market state, that is, the realized drift of the S&P 500 over a rolling window of nF = 75 business days. The realized drift is annualized. Data from 1990 − 2010. Panels: (a) Bank of America - Citigroup, (b) Colgate - Exxon, (c) Microsoft - Halliburton, (d) Walt Disney - Pfizer, (e) Walmart - Johnson & Johnson, (f) Merck - Pfizer.
Recall that the market state is computed as a realized drift over a fixed number
of past daily observations. This number of past observations has a convenient
interpretation as the memory of the market. We estimate our model for different
market memories and find that the optimal memory is about 75 business days, see
Figure 6 in Appendix B.
As a result, the vector of volatilities and the correlation matrix are known functions σ (F), ρ(F) of the market state F. How can we define stressed volatilities and
correlations that comply with the requirements in BCBS [2009b]? We propose to
define risk scenarios by shifting the market state to predefined quantiles of its empirical distribution. That is, we shift the market state F to α-quantiles fα with, for example, α ∈ {0.1%, 1%, 5%} and compute the corresponding correlation matrices
\[ \rho(f_\alpha) = \bigl(\rho_{i,j}(f_\alpha)\bigr)_{i,j=1,\dots,n} \tag{1} \]
and vectors of volatilities
\[ \sigma(f_\alpha) = \bigl(\sigma_1(f_\alpha), \dots, \sigma_n(f_\alpha)\bigr). \tag{2} \]
Since the market state is defined as the realized drift of an appropriate index,
it captures the systemic market risk component of the stock portfolio. By design
the proposed stress scenarios for volatilities and correlations fulfill the minimum
requirements posed by BCBS [2011b]. In particular, the choice of the quantile
probability α relates to the probability of the stress scenario. Moreover, volatilities
and correlations are stressed in a consistent way because they are stressed simultaneously and based on one and the same market factor F.
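As a minimal sketch of this construction, the snippet below computes an empirical α-quantile of observed market states and evaluates a state-dependent volatility at that quantile. The quantile convention, the function `toy_sigma` and all numbers are assumptions made for illustration; the estimated functions σ(F) and ρ(F) of the model are not reproduced here.

```python
import math


def empirical_quantile(states, alpha):
    """Smallest observation whose empirical CDF value is at least alpha.

    This order-statistic convention is an assumption; other conventions
    (for example with interpolation) would serve equally well.
    """
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha must lie in (0, 1]")
    ordered = sorted(states)
    # Small tolerance guards against floating-point round-off in alpha * n.
    k = math.ceil(alpha * len(ordered) - 1e-12)
    return ordered[max(k, 1) - 1]


def toy_sigma(f):
    """Hypothetical volatility function: volatility rises in bear markets."""
    return 0.2 + 0.5 * max(-f, 0.0)


# Hypothetical annualized market states (realized drifts of an index).
market_states = [-0.60, -0.35, -0.10, 0.00, 0.05, 0.08, 0.10, 0.12, 0.15, 0.20,
                 0.22, 0.25, 0.30, 0.05, 0.07, 0.09, 0.11, 0.13, 0.18, 0.02]

f_stress = empirical_quantile(market_states, 0.05)  # stress scenario
f_median = empirical_quantile(market_states, 0.50)  # normal market

print(f_stress, toy_sigma(f_stress))
print(f_median, toy_sigma(f_median))
```

Because every volatility and every correlation is evaluated at the same value fα, a single choice of α fixes the whole stress scenario.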
We illustrate our approach by analyzing a sample portfolio consisting of six
stocks: Bank of America (BoA), Exxon, General Electric (GE), Microsoft (MS),
Pfizer and Walmart. Figures 7 and 8 in Appendix B show the estimated dependence
structure of correlations and volatilities on the market state. We emphasize that
these estimations must be performed for the portfolio as a whole and not on a
pairwise basis. Table 1 shows the correlation matrix and volatilities for the market
state F at the 5%-quantile of its empirical distribution, Table 2 for the market state
F at the median.
Vol.      MS        Walmart   GE        Pfizer    Exxon     BoA
          0.517512  0.304813  0.604834  0.345711  0.406842  1.355577

Corr.     MS        Walmart   GE        Pfizer    Exxon     BoA
MS        1.000000  0.391786  0.445221  0.538888  0.620236  0.582697
Walmart   0.391786  1.000000  0.410489  0.529755  0.438121  0.135700
GE        0.445221  0.410489  1.000000  0.588320  0.588359  0.547035
Pfizer    0.538888  0.529755  0.588320  1.000000  0.632803  0.546533
Exxon     0.620236  0.438121  0.588359  0.632803  1.000000  0.406522
BoA       0.582697  0.135700  0.547035  0.546533  0.406522  1.000000
Table 1: Volatilities and correlations for the market state at the 5%-quantile of
its empirical distribution. The market state is computed as the realized drift of
the S&P500 over the past 75 business days. The model is estimated for period
Jan 2004 − Nov 2010.
Vol.      MS        Walmart   GE        Pfizer    Exxon     BoA
          0.196424  0.168472  0.200905  0.212423  0.201778  0.285326

Corr.     MS        Walmart   GE        Pfizer    Exxon     BoA
MS        1.000000  0.350425  0.437016  0.284452  0.332468  0.238774
Walmart   0.350425  1.000000  0.389480  0.336892  0.287761  0.373636
GE        0.437016  0.389480  1.000000  0.376709  0.384305  0.707337
Pfizer    0.284452  0.336892  0.376709  1.000000  0.299811  0.384892
Exxon     0.332468  0.287761  0.384305  0.299811  1.000000  0.323369
BoA       0.238774  0.373636  0.707337  0.384892  0.323369  1.000000
Table 2: Volatilities and correlations for the market state at the median of its
empirical distribution. The market state is computed as the realized drift of
the S&P500 over the past 75 business days. The model is estimated for period
Jan 2004 − Nov 2010.
We analyze the portfolio’s value-at-risk in our proposed stress scenarios. Let
us assume that we invest V0 = 100$ in our sample portfolio. For asset weights γi
we compute a 10-day ‘stressed value-at-risk’ for level α by
\[ \mathrm{VaR}^{\text{stressed}}_{\alpha} = -V_0 \left( \exp\left( \sqrt{\frac{10}{250}}\; \Phi^{-1}(\alpha) \sqrt{\sum_{i,j=1}^{n} \gamma_i \gamma_j\, \sigma_i(f_\alpha)\, \sigma_j(f_\alpha)\, \rho_{i,j}(f_\alpha)} \right) - 1 \right), \]
where volatilities and correlations are evaluated at the α-quantile fα of the empirical distribution of the market state F. The function Φ is the cumulative distribution
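function of the standard normal distribution. Furthermore, we compute a ‘non-stressed value-at-risk’ for level α by
\[ \mathrm{VaR}^{\text{non-stressed}}_{\alpha} = -V_0 \left( \exp\left( \sqrt{\frac{10}{250}}\; \Phi^{-1}(\alpha) \sqrt{\sum_{i,j=1}^{n} \gamma_i \gamma_j\, \sigma_i^{\text{const}}\, \sigma_j^{\text{const}}\, \rho_{i,j}^{\text{const}}} \right) - 1 \right), \]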
where volatilities and correlations are estimated from a corresponding model with
constant volatilities and correlations. In Figure 2a we plot the functions
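\[ \alpha \mapsto \mathrm{VaR}^{\text{stressed}}_{\alpha}, \qquad \alpha \mapsto \mathrm{VaR}^{\text{non-stressed}}_{\alpha}, \qquad \alpha \in (0, 0.25) \tag{3} \]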
for our sample portfolio by assuming equal asset weights γi . Figure 2b shows the
ratio of stressed and non-stressed value-at-risk,
\[ \alpha \mapsto \frac{\mathrm{VaR}^{\text{stressed}}_{\alpha}}{\mathrm{VaR}^{\text{non-stressed}}_{\alpha}}, \qquad \alpha \in (0, 0.25). \]
We observe that the stressed 99%, 10-day value-at-risk $\mathrm{VaR}^{\text{stressed}}_{0.99}$ computed with market state dependent volatilities and correlations is 3 times higher than the value-at-risk $\mathrm{VaR}^{\text{non-stressed}}_{0.99}$ computed with constant volatilities and correlations. For a portfolio of 20 stocks¹ we even observe a ratio of 4, see Figure 3.
Figure 2: Estimated ten-day VaR for different probabilities α for our sample portfolio and portfolio value 100$. The model is estimated on Jan 2004 − Nov 2010. Panels: (a) Value-at-Risk, (b) Ratio of Values-at-Risk.
Our results confirm the findings of BCBS [2009a] who report that the ratio of
the stressed value-at-risk and the non-stressed value-at-risk as computed by banks
is in the range of 0.68 − 7 with median 2.6.
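The value-at-risk formula of this section can be evaluated directly from the reported tables. The sketch below does so for equal weights, V0 = 100 and α = 5%, once with the inputs of Table 1 (market state at the 5%-quantile) and once with the inputs of Table 2 (market state at the median). The second number is only an illustrative comparison: it is not the non-stressed value-at-risk of the text, which relies on constant-model estimates that are not tabulated here. Function names are illustrative.

```python
import math
from statistics import NormalDist


def portfolio_volatility(weights, vols, corr):
    """Annualized portfolio volatility sqrt(sum_ij g_i g_j s_i s_j rho_ij)."""
    n = len(weights)
    var = sum(weights[i] * weights[j] * vols[i] * vols[j] * corr[i][j]
              for i in range(n) for j in range(n))
    return math.sqrt(var)


def value_at_risk(v0, alpha, weights, vols, corr, horizon_days=10, days_per_year=250):
    """Value-at-risk for level alpha under normally distributed log-returns."""
    scale = math.sqrt(horizon_days / days_per_year)
    z = NormalDist().inv_cdf(alpha)  # standard normal quantile, negative for small alpha
    return -v0 * (math.exp(scale * z * portfolio_volatility(weights, vols, corr)) - 1.0)


# Asset order: MS, Walmart, GE, Pfizer, Exxon, BoA.
vols_q05 = [0.517512, 0.304813, 0.604834, 0.345711, 0.406842, 1.355577]  # Table 1
corr_q05 = [
    [1.000000, 0.391786, 0.445221, 0.538888, 0.620236, 0.582697],
    [0.391786, 1.000000, 0.410489, 0.529755, 0.438121, 0.135700],
    [0.445221, 0.410489, 1.000000, 0.588320, 0.588359, 0.547035],
    [0.538888, 0.529755, 0.588320, 1.000000, 0.632803, 0.546533],
    [0.620236, 0.438121, 0.588359, 0.632803, 1.000000, 0.406522],
    [0.582697, 0.135700, 0.547035, 0.546533, 0.406522, 1.000000],
]
vols_med = [0.196424, 0.168472, 0.200905, 0.212423, 0.201778, 0.285326]  # Table 2
corr_med = [
    [1.000000, 0.350425, 0.437016, 0.284452, 0.332468, 0.238774],
    [0.350425, 1.000000, 0.389480, 0.336892, 0.287761, 0.373636],
    [0.437016, 0.389480, 1.000000, 0.376709, 0.384305, 0.707337],
    [0.284452, 0.336892, 0.376709, 1.000000, 0.299811, 0.384892],
    [0.332468, 0.287761, 0.384305, 0.299811, 1.000000, 0.323369],
    [0.238774, 0.373636, 0.707337, 0.384892, 0.323369, 1.000000],
]

weights = [1.0 / 6.0] * 6  # equal asset weights
var_q05 = value_at_risk(100.0, 0.05, weights, vols_q05, corr_q05)
var_med = value_at_risk(100.0, 0.05, weights, vols_med, corr_med)
print(round(var_q05, 2), round(var_med, 2), round(var_q05 / var_med, 2))
```

With these inputs the value based on the stressed market state is roughly three times the value based on the median market state, which is the same order of magnitude as the ratios discussed above.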
To summarize the advantages of our approach we conclude that, firstly, by
defining stress scenarios for volatilities and correlations by (1)-(2), we fulfill the
minimum requirements posed by BCBS [2011b], that is, “there should be a risk
factor that is designed to capture market-wide movements in equity-prices (e.g. a
market index)”. Moreover, volatilities and correlations are stressed in a consistent
way because they are stressed simultaneously and based on one and the same market factor F. Secondly, our approach confirms findings of the impact study BCBS
[2009a] on how much a stressed value-at-risk should exceed a standard value-at-risk. Thirdly, the correlation matrix (1) and the vector of volatilities (2) can be used
as inputs for a market risk analysis in any model where daily returns are assumed
to be normally distributed, see the discussion in Section 4. Moreover, as will be
shown in Section 5, the model seems to perform better than the Dynamic Conditional Correlation GARCH model by Engle [2002] in capturing the dynamics of
correlations and volatilities within given samples.
1 Apple, American Express, AT&T, Bank of America, Boeing, Chevron, Citigroup, Coca Cola,
Exxon, Ford, General Electric, J.P. Morgan, Johnson & Johnson, McDonalds, Merck, Microsoft,
Pfizer, Procter & Gamble, Walmart, Walt Disney.
(a) Value-at-Risk
(b) Ratio of Values-at-Risk
Figure 3: Estimated ten-day VaR for different probabilities α for a portfolio of 20
stocks and portfolio value 100$. The model is estimated on Jan 2004 − Nov 2010.
3 An asset price model with state-dependent correlation in continuous time
In Section 2 we have sketched the main elements of our model. In this section we
state its complete definition.
We consider a diffusion type asset price model driven by Brownian motion.
Our assumption is that asset volatilities and asset correlation depend on the current
state of the market, which is interpreted as a common risk factor. An example of
the state of the market is the realized drift of a market index, which is determined
on a rolling window of past and current asset price realizations. The dependency
of the asset dynamics on past asset realizations leads us to stochastic differential
equations with time delay, so-called stochastic delay differential equations, see Mao
[2007] or Mohammed [1984]. At every point in time t volatilities and correlations
in our model are determined by states in the past interval [t − r,t] for some fixed
length r of the memory window.
We work on a probability space (Ω, F , P) equipped with a filtration F =
(Ft )t≥0 satisfying the usual conditions. The filtration is rich enough to carry at
least n independent Wiener processes. Before we state the dynamics of asset price
processes S = (S1, . . . , Sn ), we introduce the concept of the segment process.
Let $S = (S(t))_{t\in[-r,\infty)}$ be a continuous $\mathbb{R}^n$-valued stochastic process. For every t ≥ 0 we define the [t − r, t]-segment of the process S by
\[ S_t(u) = S(t+u), \qquad u \in [-r, 0], \]
that is, $S_t$ is a mapping that takes values in the space $C([-r,0],\mathbb{R}^n)$ of continuous functions from [−r, 0] to $\mathbb{R}^n$,
\[ S_t : \Omega \to C([-r,0],\mathbb{R}^n). \]
We call $(S_t)_{t\in[0,\infty)}$ the segment process with time delay r. For a segment $\varphi \in C([-r,0],\mathbb{R}^n)$ we define the norm
\[ \|\varphi\|_\infty = \sup_{u\in[-r,0]} \|\varphi(u)\|_2, \]
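with $\|\cdot\|_2$ the Euclidean norm on $\mathbb{R}^n$. We base our analysis on the model
\[ dS^i(t) = \mu_i(\theta, S_t)\, S^i(t)\, dt + \sigma_i\bigl(\theta, F(S_t)\bigr)\, S^i(t)\, dW^i(t), \tag{4} \]
\[ dW^i(t) \cdot dW^j(t) = \rho_{i,j}\bigl(\theta, F(S_t)\bigr)\, dt, \qquad i, j = 1, \dots, n, \tag{5} \]
\[ S_0 \in C\bigl([-r,0], \mathbb{R}^n_+\bigr), \tag{6} \]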
with Wiener processes $W^1, \dots, W^n$. The drifts µi, the volatilities σi and the instantaneous correlations ρi,j are functions,
\[ \mu_i : \Theta \times C([-r,0],\mathbb{R}^n) \to \mathbb{R}, \qquad \sigma_i : \Theta \times \mathbb{R} \to [0,\infty), \qquad \rho_{i,j} : \Theta \times \mathbb{R} \to [-1,1], \qquad i, j = 1, \dots, n, \]
that are parameterized with some parameter $\theta \in \Theta \subset \mathbb{R}^p$, $p \in \mathbb{N}$. The volatilities σi and the correlations ρi,j depend on the market state $F(S_t)$, which we describe by a market state function F of the past segment $S_t$,
\[ F : C([-r,0],\mathbb{R}^n) \to \mathbb{R}. \]
Since the market state depends on a past window of assets S1 , . . . , Sn , the model is
a system of delay equations.
We assume that the instantaneous correlation matrix
\[ \rho(\theta, x) = \bigl(\rho_{i,j}(\theta, x)\bigr)_{i,j=1,\dots,n} \]
is positive definite for all $(\theta, x) \in \Theta \times \mathbb{R}$, hence there exists a unique Cholesky decomposition. For future reference we define the Cholesky decomposition C(θ, x) of the instantaneous covariance matrix, that is,
\[ C(\theta,x)\, C(\theta,x)^T = \operatorname{diag}\bigl(\sigma(\theta,x)\bigr)\, \rho(\theta,x)\, \operatorname{diag}\bigl(\sigma(\theta,x)\bigr), \qquad (\theta,x) \in \Theta \times \mathbb{R}, \tag{7} \]
with $\operatorname{diag}(\sigma(\theta,x))$ a matrix with components $\sigma_1(\theta,x), \dots, \sigma_n(\theta,x)$ on the diagonal and zeros otherwise.
Before we define the functional form of the market state F and introduce appropriate parameterizations for the drifts µi , volatilities σi and correlations ρi, j , we
state sufficient conditions for the existence of a unique solution of our model in its
most general form (4)-(6). Furthermore, we show that the instantaneous correlation
can be interpreted as a proxy for the correlation of daily log-returns. For proofs see
Becker and Schmidt [2010].
Proposition 1. Assume that for every θ ∈ Θ and for i = 1, . . . , n
\[ \mu_i \circ \exp(\theta, \cdot) : C([-r,0],\mathbb{R}^n) \to \mathbb{R} \]
is locally Lipschitz-continuous and fulfills the linear growth condition
\[ |\mu_i \circ \exp(\theta, x)|^2 \le D\bigl(1 + \|x\|_\infty^2\bigr), \qquad x \in C([-r,0],\mathbb{R}^n), \tag{8} \]
with a constant D > 0. The concatenation µi ◦ exp is defined as
\[ \mu_i \circ \exp(f) = \mu_i\bigl(\exp \circ f^1, \dots, \exp \circ f^n\bigr), \qquad f = (f^1, \dots, f^n) \in C([-r,0],\mathbb{R}^n). \tag{9} \]
Furthermore, assume that for every θ ∈ Θ and i, j = 1, . . . , n
\[ \sigma_i(\theta, \cdot) : (\mathbb{R}, |\cdot|) \to ([0,\infty), |\cdot|), \]
\[ \rho_{i,j}(\theta, \cdot) : (\mathbb{R}, |\cdot|) \to ([-1,1], |\cdot|), \]
\[ F \circ \exp : \bigl(C([-r,0],\mathbb{R}^n), \|\cdot\|_\infty\bigr) \to (\mathbb{R}, |\cdot|) \]
are locally Lipschitz continuous, where the concatenation F ◦ exp is defined analogously to µi ◦ exp. Let the volatility functions σi(θ, ·) be bounded, and the instantaneous correlation ρ(θ, x) be positive definite for all (θ, x).
Then for every θ ∈ Θ there exist F-adapted Wiener processes $W^1, \dots, W^n$ and an F-adapted process $S = (S^1, \dots, S^n)$ with strictly positive paths that satisfy the system of delay equations (4)-(6). The distribution of S on the path space $C([-r,\infty), \mathbb{R}^n)$ is unique. If the drifts µi are bounded it holds that
\[ E\left( \sup_{t\in[-r,T]} \sum_{i=1}^{n} \bigl(S^i(t)\bigr)^2 \right) < \infty. \tag{10} \]
From now on we assume that the conditions of Proposition 1 hold and S denotes
the unique positive solution of the system (4)-(6).
The instantaneous correlation ρi, j admits a convenient interpretation as the correlation between daily log-returns of assets Si and S j given the market state F(St ),
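that is
\[ \rho_{i,j}\bigl(\theta, F(S_t)\bigr) \approx \operatorname{Corr}\left( \log\frac{S^i(t + 1\,\text{day})}{S^i(t)},\ \log\frac{S^j(t + 1\,\text{day})}{S^j(t)} \,\middle|\, \mathcal{F}_t \right). \]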
The following lemma makes this more precise.
Lemma 1. Denote the components of the Cholesky decomposition C(θ, x) introduced in (7) as
\[ C(\theta, x) = \bigl(c_{i,j}(\theta, x)\bigr)_{i,j=1,\dots,n}, \qquad \theta \in \Theta,\ x \in C([-r,0],\mathbb{R}^n). \]
For every θ ∈ Θ let the mappings
\[ x \mapsto \mu_i(\theta, \cdot) \circ \exp(x), \qquad i = 1, \dots, n, \]
and
\[ x \mapsto c_{i,j}(\theta, \cdot) \circ \exp(x), \qquad x \mapsto c_{i,j}^2(\theta, \cdot) \circ \exp(x), \qquad i, j = 1, \dots, n, \]
be uniformly Lipschitz-continuous, with the concatenation of functions defined as
in (9). Furthermore, let the volatilities σi be bounded and condition (8) hold.
Then for every t ≥ 0 and the sequence ∆m = 1/m, m ∈ N, it holds P-almost surely
\[ \lim_{m\to\infty} \operatorname{Corr}\left( \log\frac{S^i(t + \Delta_m)}{S^i(t)},\ \log\frac{S^j(t + \Delta_m)}{S^j(t)} \,\middle|\, \mathcal{F}_t \right) = \rho_{i,j}\bigl(\theta, F(S_t)\bigr). \]
The market state function F is a common risk factor for the dynamics of volatilities and correlations. The minimum requirement for such a risk factor stated in
BCBS [2011b] is that it shall “capture market-wide movements in equity-prices”.
Therefore we define the market state as an average realized drift of assets S1 , . . . , Sn ,
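\[ F(S_t) = F\bigl(S(t), S(t-\Delta t), \dots, S(t - n_F \Delta t)\bigr) = \frac{1}{n} \sum_{j=1}^{n} \left[ \frac{1}{n_F \Delta t} \sum_{k=1}^{n_F} \log\frac{S^j(t-(k-1)\Delta t)}{S^j(t-k\Delta t)} + \frac{1}{2\Delta t}\, \widehat{\sigma^2}\left( \log\frac{S^j(t-(k-1)\Delta t)}{S^j(t-k\Delta t)} : k = 1, \dots, n_F \right) \right], \tag{11} \]
with
\[ \widehat{\sigma^2}(x_1, \dots, x_{n_F}) = \frac{1}{n_F - 1} \sum_{j=1}^{n_F} \left( x_j - \frac{1}{n_F} \sum_{k=1}^{n_F} x_k \right)^2. \tag{12} \]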
In case of constant volatilities $\sigma_i \equiv \sigma_i(\theta, F(S_t))$ and constant drifts $\mu_i \equiv \mu_i(\theta, S_t)$,
the proposed function F is an unbiased estimator of $\sum_{i=1}^{n} \mu_i / n$. Alternatively, we
can define the market state function F as the realized drift of a stock market index.
It is straightforward to prove that the market state (11) fulfills the conditions in
Proposition 1.
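A minimal sketch of the market state (11)-(12), assuming daily prices with ∆t = 1/250 and a window of nF = 75 returns. The function names and the two artificial price paths are illustrative and not part of the model specification.

```python
import math
from statistics import fmean, variance


def realized_drift(prices, n_f, dt):
    """Realized drift of one asset over its last n_f log-returns, as in (11)-(12)."""
    if n_f < 2 or len(prices) < n_f + 1:
        raise ValueError("need n_f >= 2 and at least n_f + 1 prices")
    window = prices[-(n_f + 1):]
    log_returns = [math.log(window[k + 1] / window[k]) for k in range(n_f)]
    mean_term = fmean(log_returns) / dt            # (1 / (n_F dt)) * sum of log-returns
    var_term = variance(log_returns) / (2.0 * dt)  # sample variance uses n_F - 1
    return mean_term + var_term


def market_state(price_paths, n_f=75, dt=1.0 / 250.0):
    """Average realized drift over all assets."""
    return fmean([realized_drift(path, n_f, dt) for path in price_paths])


# Two artificial price paths with constant daily log-returns (illustrative only).
path_a = [100.0 * math.exp(0.001 * k) for k in range(80)]
path_b = [50.0 * math.exp(0.0002 * k) for k in range(80)]

state = market_state([path_a, path_b])
print(state)
```

For these artificial paths the sample variance term vanishes up to round-off, so the market state reduces to the average annualized log-return, (0.25 + 0.05)/2 = 0.15. For real data the variance term corrects the downward bias of the mean log-return as an estimator of the drift.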
For the functional form of volatilities and correlations in the continuous time
model (4) - (6) we want to achieve a maximum of flexibility. Parameterizations
for certain classes of correlation matrices have been proposed, for example, by
Ding and Engle [2001]. However, it is not clear whether these parameterizations
are suitable for our model and the problems we are targeting. Furthermore, parameterizing correlation matrices is technically difficult because of the required
positive definiteness, see, for example, Rebonato and Jackel [1999]. Therefore
we do not define separate parameterizations for pairwise correlations ρi, j (θ , ·) and
for volatilities σi (θ , ·). Instead, we introduce a parameterization for the Cholesky
decomposition
\[ C(\theta, x) = \bigl(c_{i,j}(\theta, x)\bigr)_{i,j=1,\dots,n} \]
of the instantaneous covariance matrix (7) via
\[ c_{i,j}(\theta, x) = \begin{cases} h_{i,j}(x, \xi, \theta), & i > j, \\ \alpha + h_{i,j}(x, \xi, \theta), & i = j, \\ 0, & i < j. \end{cases} \tag{13} \]
The functions $h_{i,j}(\cdot, \xi, \theta)$ are cubic splines through a common set of equidistant discretization points ξ for the values of the market state,
\[ \xi = (\xi_l)_{l=1,\dots,n_{Cov}}, \qquad n_{Cov} \in \mathbb{N}, \tag{14} \]
and individual sets $(\theta^{i,j}_l)_{l=1,\dots,n_{Cov}}$ of values for every entry of the Cholesky decomposition $c_{i,j}$. These individual sets are collected in one vector θ that we estimate from real market data. We choose the points $(\xi_l)_{l=1,\dots,n_{Cov}}$ such that they, for given asset realizations $(s(t_i))_{i=1,\dots,n}$, cover the range of realized market states
\[ \bigl\{ F\bigl(s(t_{i-n_F}), \dots, s(t_i)\bigr),\ i \ge 1 + n_F \bigr\}. \]
The variable α is a small, positive number that guarantees that the covariance matrix defined via (13) and hence the correlation matrix ρ(·, ·) is positive definite for all $(\theta, x) \in \Theta \times \mathbb{R}$.
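The following sketch illustrates why parameterizing the Cholesky factor as in (13) yields valid volatilities and correlations at every market state. It is a simplification under stated assumptions: piecewise-linear interpolation stands in for the cubic splines, the node values are hypothetical, and the diagonal node values are taken to be non-negative so that α + h stays positive.

```python
import math


def interpolate(x, xi, values):
    """Piecewise-linear interpolation through (xi, values), flat outside the grid.

    This is a stand-in for the cubic splines used in the text.
    """
    if x <= xi[0]:
        return values[0]
    if x >= xi[-1]:
        return values[-1]
    for l in range(len(xi) - 1):
        if xi[l] <= x <= xi[l + 1]:
            t = (x - xi[l]) / (xi[l + 1] - xi[l])
            return (1.0 - t) * values[l] + t * values[l + 1]


def cholesky_factor(x, xi, theta, alpha=1e-4):
    """Lower-triangular factor following (13); theta[i][j] holds node values, j <= i."""
    n = len(theta)
    c = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            h = interpolate(x, xi, theta[i][j])
            c[i][j] = alpha + h if i == j else h
    return c


def vols_and_correlations(c):
    """Recover volatilities and correlations from the covariance C C^T."""
    n = len(c)
    cov = [[sum(c[i][k] * c[j][k] for k in range(n)) for j in range(n)]
           for i in range(n)]
    vols = [math.sqrt(cov[i][i]) for i in range(n)]
    corr = [[cov[i][j] / (vols[i] * vols[j]) for j in range(n)] for i in range(n)]
    return vols, corr


# Hypothetical grid of market states and node values for two assets.
xi = [-0.5, 0.0, 0.5]
theta = [
    [[0.60, 0.30, 0.25]],                      # entry (1, 1)
    [[0.50, 0.10, 0.08], [0.30, 0.20, 0.20]],  # entries (2, 1) and (2, 2)
]

vols_stress, corr_stress = vols_and_correlations(cholesky_factor(-0.5, xi, theta))
vols_normal, corr_normal = vols_and_correlations(cholesky_factor(0.2, xi, theta))
print(vols_stress, corr_stress[1][0])
print(vols_normal, corr_normal[1][0])
```

Whatever values the off-diagonal nodes take, the matrix C Cᵀ is a covariance matrix, so the derived correlation matrix has unit diagonal and entries in [−1, 1] without any further constraint on the parameters.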
For the drift functions µi we use either constant values or the weighted form
\[ \mu_i(\theta, S_t) = \sum_{j=0}^{m} \beta_j \log\frac{S^i(t - j\Delta t)}{S^i(t - (j+1)\Delta t)}, \tag{15} \]
with $\sum_{j=0}^{m} \beta_j = 1$, m ∈ N, and βj ≥ 0 for all j. In our estimates for volatilities
and correlations we found that the estimation results are quite insensitive to the
particular choice of the drift function. Therefore, we assume constant drifts µi in
our estimates.
4 Relation to multivariate GARCH models
Multivariate GARCH models are standard models to describe the dynamics of
volatilities and correlations, see, for example, Silvennoinen and Teräsvirta [2009].
For a vector of returns
r(k) = (r1 (k), . . . , rn (k))
with discrete time k = 1, 2, . . . , multivariate GARCH models describe the dynamics
of r via
\[ r(k) = C(k)\, \eta(k). \tag{16} \]
The matrix C(k) is the Cholesky decomposition of the covariance matrix $H(k) \in \mathbb{R}^{n,n}$, which by assumption depends on the observations up to time k − 1. The $\mathbb{R}^n$-valued random process η has independent, identically distributed realizations η(k), k = 1, 2, . . . . The random variable η(k) has components $(\eta_1(k), \dots, \eta_n(k))$ with covariances
\[ \operatorname{Cov}\bigl(\eta_i(k), \eta_j(k)\bigr) = \begin{cases} 1, & i = j, \\ 0, & i \ne j. \end{cases} \]
The variable η(k) is assumed to be independent of the observations up to time k − 1. The assumption
\[ E\bigl(\eta_i(k)\bigr) = 0 \quad \text{for all } i, k, \]
implies then that the returns are centered. For normally distributed η(k) the dynamics (16) yields that
\[ r(k) \sim N\bigl(0, H(k)\bigr). \tag{17} \]
The core element of multivariate GARCH models are updating rules for the covariance matrix H(k) that are based on past observations of the covariance matrix
H and past returns r. One group of models defines linear updating rules, see,
for example, the VEC-model by Bollerslev et al. [1988], while a second group
introduces nonlinearities, see, for example, the Dynamic Conditional Correlation
GARCH model by Engle [2002].
To compare our approach with multivariate GARCH models we translate our
continuous time dynamics into a dynamics in discrete time. Let S follow the dynamics (4)-(6) introduced in Section 3 and define the discretized asset price process $\tilde S$ by
\[ \tilde S(k) = S(k \Delta t), \qquad k = 0, 1, \dots. \]
We investigate the discrete time dynamics of the returns
\[ r_i(k) = \log\left( \frac{\tilde S^i(k)}{\tilde S^i(k-1)} \right), \qquad i = 1, \dots, n. \tag{18} \]
For the process $\tilde S$ we denote by $p_k$ the transition density given all information up to time k − 1,
\[ p_k(y_k \mid y_1, \dots, y_{k-1}, \theta)\, dy_k = P_\theta\left( \log \tilde S(k) \in dy_k \,\middle|\, \log \tilde S(j) = y_j : j = 1, \dots, k-1 \right). \]
The transition density pk is unknown in our model. Therefore we approximate pk
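with the density of an n-dimensional normal distribution with distribution parameters implied by the Euler scheme. Neglecting the drift² we obtain a distribution for asset returns like (17),
\[ r(k) \sim N\bigl(0, H(k)\bigr), \tag{19} \]
with
\[ H(k) = \Delta t\, \operatorname{diag}\bigl(\sigma(\theta, F(\tilde S_{k-1}))\bigr)\, \rho\bigl(\theta, F(\tilde S_{k-1})\bigr)\, \operatorname{diag}\bigl(\sigma(\theta, F(\tilde S_{k-1}))\bigr), \tag{20} \]
and $\operatorname{diag}(\sigma(\theta, F(\tilde S_{k-1})))$ a matrix with diagonal
\[ \sigma_1\bigl(\theta, F(\tilde S_{k-1})\bigr), \dots, \sigma_n\bigl(\theta, F(\tilde S_{k-1})\bigr) \]
and zeros otherwise. The market state F depends on the segment $\tilde S_{k-1}$ of the discretized path,
\[ \tilde S_{k-1} = \bigl(\tilde S(k-1), \dots, \tilde S(k-1-n_F)\bigr). \]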
The dynamics of returns is similar to (16),
\[ r(k) \approx C(k)\, \eta(k), \tag{21} \]
with C(k) the Cholesky decomposition of the instantaneous covariance matrix (20), and (η(k)), k = 1, 2, . . . , independent identically distributed random vectors with n-dimensional standard normal distribution. Formula (21) explains why our approach is related to multivariate GARCH models. However, instead of updating the covariance matrix based directly on past observations of returns and covariance matrices, our model first updates the state of the market, which then determines the covariance matrix. Note that the market state function F as defined in (11) can be expressed not only as a function of price realizations $\tilde S(i-1), \dots, \tilde S(i-1-n_F)$, but also as a function of realized returns $r(i-1), \dots, r(i-n_F)$. Like in the Dynamic Conditional Correlation GARCH model, volatilities and correlations are non-linear functions of realized returns.
2 Our estimates for volatilities and correlations prove to be quite insensitive to the particular choice of the drift function.
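The two-step updating rule can be sketched for two assets as follows. In each period the market state is first recomputed from the last nF returns, then volatilities and correlation are read off as functions of that state, and finally a return vector is drawn as in (21). The function `toy_vols_and_corr` is a hypothetical stand-in for the estimated functions σ(θ, ·) and ρ(θ, ·), and the starting window is artificial.

```python
import math
import random
from statistics import fmean, variance


def market_state_from_returns(returns_window, dt):
    """Realized drift (11) expressed through past log-returns, averaged over assets."""
    drifts = []
    for asset_returns in zip(*returns_window):  # one tuple of returns per asset
        drifts.append(fmean(asset_returns) / dt + variance(asset_returns) / (2.0 * dt))
    return fmean(drifts)


def toy_vols_and_corr(f):
    """Hypothetical state dependence: higher volatilities and correlation when f < 0."""
    stress = min(max(-f, 0.0), 1.0)
    vols = (0.20 + 0.30 * stress, 0.25 + 0.40 * stress)
    rho = 0.30 + 0.50 * stress
    return vols, rho


def simulate(n_steps, n_f=75, dt=1.0 / 250.0, seed=0):
    """Simulate n_steps return vectors for two assets."""
    rng = random.Random(seed)
    history = [(0.0004, 0.0004)] * n_f  # artificial starting window of returns
    for _ in range(n_steps):
        # Step 1: update the market state from the last n_f returns.
        f = market_state_from_returns(history[-n_f:], dt)
        # Step 2: the market state determines volatilities and correlation.
        (s1, s2), rho = toy_vols_and_corr(f)
        # Step 3: r = C eta with C the Cholesky factor of
        # H = dt * diag(s) * rho-matrix * diag(s), written out for n = 2.
        eta1, eta2 = rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)
        r1 = math.sqrt(dt) * s1 * eta1
        r2 = math.sqrt(dt) * s2 * (rho * eta1 + math.sqrt(1.0 - rho ** 2) * eta2)
        history.append((r1, r2))
    return history[n_f:]


returns = simulate(250)
print(len(returns), returns[0])
```

In contrast to a GARCH recursion, no past covariance matrix enters the update: the only state carried from one period to the next is the window of past returns that defines the market state.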
5 Capturing the dynamics of volatilities and correlations: a comparison with GARCH type models
How well does our model (4)-(6) capture the real-world dynamics of volatilities
and correlations? To answer this question we follow an approach suggested by
Engle and Colacito [2006]. They compare different models by optimizing a portfolio based on volatilities and correlations that are predicted by the corresponding
model. The model that yields the smallest portfolio variance is then best able to
capture the real dynamics of volatilities and correlations.
We use the notation introduced in Section 4. For the period from time k − 1 to
k we denote the asset weights in the portfolio by
w(k) = (w1 (k), . . . , wn (k)) ∈ Rn .
If the sum $\sum_{i=1}^{n} w_i(k)$ differs from one we invest the difference
\[ 1 - \sum_{i=1}^{n} w_i(k) \]
in a riskfree asset with return $r_f$. The return of the asset portfolio is then
\begin{align}
r_{\text{portfolio}}(k) &= \sum_{i=1}^{n} w_i(k)\, r_i(k) + \left( 1 - \sum_{i=1}^{n} w_i(k) \right) r_f \tag{22} \\
&= \sum_{i=1}^{n} w_i(k) \bigl( r_i(k) - r_f \bigr) + r_f. \tag{23}
\end{align}
Denote by H(k) = (hi, j (k)) the covariance matrix of the returns ri (k), which is
predicted by the model at time k − 1. Then the predicted portfolio volatility is
\[ \sqrt{\sum_{i,j=1}^{n} w_i(k)\, w_j(k)\, h_{i,j}(k)}. \]
Observe that the second summand in (22) does not contribute to the predicted
volatility. Denote by µ = (µ1 , . . . , µn ) the vector of expected excess returns,
\[ \mu_i = E\bigl(r_i(k) - r_f\bigr), \qquad i = 1, \dots, n, \]
where we assume that these expectations are identical for all k. For the period
[k − 1, k], the asset weights w(k) minimizing the portfolio variance are the solution
of the problem
\[ \min_{w(k) \in \mathbb{R}^n,\ \text{s.t. } w^T(k)\mu = \mu_0} w^T(k)\, H(k)\, w(k), \tag{24} \]
where µ0 ∈ R is the required excess return of our portfolio. The solution to problem
(24) is
\[ w(k) = \frac{H^{-1}(k)\, \mu}{\mu^T H^{-1}(k)\, \mu}\, \mu_0. \tag{25} \]
The conditional portfolio variance for period [k − 1, k] is
\[ \sigma^2(k) = E_{k-1}\left[ \bigl( w^T(k) (r(k) - r) \bigr)^2 \right], \tag{26} \]
where r = Ek−1 r(k) is the conditional mean of returns r(k), which is assumed to
be constant. By Theorem 2 in Engle and Colacito [2006], the unknown conditional
mean r in (26) can be safely approximated by the sample mean.
The portfolio weights w(k) in (25) depend on the model predicted conditional
covariance matrix H(k) and the vector µ. The corresponding weights using the
true (however, unknown) conditional covariance matrix would lead to a different
portfolio. Comparing the variances of these two portfolios Engle and Colacito
[2006], Theorem 1, show that the latter variance is always smaller, no matter the
choice of µ. This justifies that the model generating the smallest portfolio variance
is considered to be superior.
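A minimal sketch of the weight formula (25) for both choices of µ, using a hypothetical two-asset covariance matrix (volatilities 0.2 and 0.1, correlation 0.3). The small linear solver is included only to keep the example self-contained; the function names are illustrative.

```python
def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-14:
            raise ValueError("matrix is singular")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for k in range(col, n + 1):
                m[r][k] -= factor * m[col][k]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (m[i][n] - sum(m[i][k] * x[k] for k in range(i + 1, n))) / m[i][i]
    return x


def min_variance_weights(h, mu, mu0):
    """Weights w = H^{-1} mu / (mu^T H^{-1} mu) * mu0 from (25)."""
    h_inv_mu = solve(h, mu)
    denom = sum(m_i * v_i for m_i, v_i in zip(mu, h_inv_mu))
    return [mu0 * v_i / denom for v_i in h_inv_mu]


def portfolio_variance(w, h):
    """Quadratic form w^T H w."""
    n = len(w)
    return sum(w[i] * w[j] * h[i][j] for i in range(n) for j in range(n))


# Hypothetical covariance matrix: volatilities 0.2 and 0.1, correlation 0.3.
h = [[0.04, 0.006],
     [0.006, 0.01]]

w_minvar = min_variance_weights(h, [1.0, 1.0], 1.0)  # first strategy
w_hedge = min_variance_weights(h, [1.0, 0.0], 1.0)   # second strategy

print(w_minvar, portfolio_variance(w_minvar, h))
print(w_hedge, portfolio_variance(w_hedge, h))
```

With µ = (1, . . . , 1) and µ0 = 1 the constraint forces the weights to sum to one, so the result is the minimum variance portfolio. With µ = (1, 0, . . . , 0) the first weight is fixed at µ0 and the remaining weights are chosen to hedge it.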
We analyze two specifications for the vector µ, which can be interpreted as two
portfolio strategies. In the first strategy we assume that
µ = (1, . . . , 1) ∈ Rn .
Since no asset outperforms, the portfolio can be interpreted as a minimum variance
portfolio. In the second strategy the vector of expected excess returns is chosen as
µ = (1, 0, . . . , 0) ∈ Rn ,
which corresponds to a strategy where the first asset is held for return whereas the
others are hedging positions.
          DCC       Const     Avg30     Avg90     StateDepen
MinVar    0.220075  0.223416  0.230884  0.221615  0.219187
Hedge     0.290439  0.291045  0.306560  0.294265  0.289201
Table 3: Annualized portfolio volatilities, averaged over all choices of four-asset
portfolios. Data are from 1997 − 2004.
          DCC       Const     Avg30     Avg90     StateDepen
MinVar    0.175241  0.173437  0.177969  0.173076  0.165852
Hedge     0.184381  0.182614  0.192667  0.184465  0.179317
Table 4: Annualized portfolio volatilities, averaged over all choices of four-asset
portfolios. Data are from 2004 − 2010.
As an example, in the following we consider sample portfolios that consist of
four stocks chosen from a fixed set of ten large US stocks (see footnote 3). We compare five different models:
the discretized version (21) of our state-dependent model (denoted by 'StateDepen' in the following tables); the standard mean-reverting Dynamic Conditional
Correlation GARCH model by Engle [2002] (denoted by 'DCC'); a model with
constant volatilities and correlations ('Const'); and two moving averages over the
past 30 and 90 days, respectively ('Avg30', 'Avg90').
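As a rough sketch of the two moving-average benchmarks, the following code computes a rolling sample covariance matrix from daily returns. The text does not specify how the moving averages are defined in detail (demeaning, weighting, treatment of the first dates), so the equally weighted, demeaned sample covariance over the preceding window is an assumption, as is the function name.

```python
import numpy as np

def rolling_covariance(returns, window):
    """Sample covariance of the `window` rows preceding each date.

    returns: array of shape (T, n) with daily simple returns.
    Result has shape (T, n, n); dates with fewer than `window`
    past observations are filled with NaN.
    """
    returns = np.asarray(returns, dtype=float)
    T, n = returns.shape
    out = np.full((T, n, n), np.nan)
    for k in range(window, T):
        # Only rows k-window, ..., k-1 are used, so the forecast for date k
        # does not look ahead.
        out[k] = np.cov(returns[k - window:k], rowvar=False)
    return out

# Placeholder data: 200 days of simulated returns for 4 assets.
rng = np.random.default_rng(0)
demo_returns = 0.01 * rng.standard_normal((200, 4))
cov_avg30 = rolling_covariance(demo_returns, 30)   # analogue of 'Avg30'
cov_avg90 = rolling_covariance(demo_returns, 90)   # analogue of 'Avg90'
```

A 'Const' analogue would replace the rolling window by a single covariance matrix estimated once over the whole estimation sample.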
Recall that we want to investigate how well the models capture the dynamics of
volatilities and correlations on a given set of data. To this end we estimate the models (see footnote 4) and optimize the portfolios period by period in the time frames from Jan 1997
to Jan 2004 and from Jan 2004 to Nov 2010. Tables 3 and 4 show the annualized realized portfolio return volatilities, averaged over all choices of portfolios that consist
of four stocks out of ten. The differences in portfolio return volatilities are small
but systematic. Tables 5 and 6 show the percentage of all four-asset portfolios for
which the model in the respective row yields a smaller realized portfolio volatility than the model in the respective column. We observe that the state-dependent
model (21) outperforms the other models systematically.
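The pairwise comparison behind Tables 5 and 6 can be sketched as follows. Choosing four stocks out of ten gives 210 portfolios, which is consistent with the table entries being multiples of 1/210 (for example 0.152381 ≈ 32/210). The realized volatilities below are random placeholders, not the paper's data, and the function name is hypothetical.

```python
from itertools import combinations
import numpy as np

MODELS = ["DCC", "Const", "Avg30", "Avg90", "StateDepen"]

def win_fractions(realized_vol):
    """Entry [r, c]: fraction of portfolios where model r has a strictly
    smaller realized volatility than model c.

    realized_vol: array of shape (n_portfolios, n_models).
    """
    v = np.asarray(realized_vol, dtype=float)
    wins = (v[:, :, None] < v[:, None, :]).mean(axis=0)
    np.fill_diagonal(wins, np.nan)   # a model is not compared with itself
    return wins

# All four-asset portfolios out of ten stocks.
portfolios = list(combinations(range(10), 4))

# Placeholder realized volatilities (one row per portfolio, one column per model).
rng = np.random.default_rng(0)
placeholder_vols = 0.2 + 0.01 * rng.standard_normal((len(portfolios), len(MODELS)))

W = win_fractions(placeholder_vols)
print(len(portfolios))
print(np.round(W, 6))
```

Without ties, the entries [r, c] and [c, r] sum to one, which is also the pattern visible in the published tables.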
3 AT&T, Coca Cola, Exxon, Ford, General Electric, Johnson & Johnson, Microsoft, J.P. Morgan,
Procter & Gamble, Walmart.
4 Formula (23) for the portfolio return is only justified for simple returns. Therefore the DCC-GARCH model, the moving averages and the constant model are estimated based on simple returns.
MinVar       DCC        Const      Avg30      Avg90      StateDepen
DCC          –          0.847619   1.000000   0.771429   0.328571
Const        0.152381   –          0.957143   0.342857   0.000000
Avg30        0.000000   0.042857   –          0.000000   0.014286
Avg90        0.228571   0.657143   1.000000   –          0.209524
StateDepen   0.671429   1.000000   0.985714   0.790476   –

Hedge        DCC        Const      Avg30      Avg90      StateDepen
DCC          –          0.590476   1.000000   0.990476   0.252381
Const        0.409524   –          1.000000   0.871429   0.014286
Avg30        0.000000   0.000000   –          0.000000   0.000000
Avg90        0.009524   0.128571   1.000000   –          0.019048
StateDepen   0.747619   0.985714   1.000000   0.980952   –
Table 5: Percentage of four-asset portfolios for which the model in the row yields
a smaller realized portfolio volatility than the model in the column. Data are from
1997 − 2004.
MinVar       DCC        Const      Avg30      Avg90      StateDepen
DCC          –          0.300000   0.623810   0.352381   0.052381
Const        0.700000   –          0.747619   0.528571   0.000000
Avg30        0.376190   0.252381   –          0.157143   0.000000
Avg90        0.647619   0.471429   0.842857   –          0.033333
StateDepen   0.947619   1.000000   1.000000   0.966667   –

Hedge        DCC        Const      Avg30      Avg90      StateDepen
DCC          –          0.176190   0.985714   0.519048   0.014286
Const        0.823810   –          1.000000   0.790476   0.000000
Avg30        0.014286   0.000000   –          0.000000   0.000000
Avg90        0.480952   0.209524   1.000000   –          0.000000
StateDepen   0.985714   1.000000   1.000000   1.000000   –
Table 6: Percentage of four-asset portfolios for which the model in the row yields
a smaller realized portfolio volatility than the model in the column. Data are from
2004 − 2010.
A Estimation method
We estimate the parameter θ ∈ Θ in the continuous time model (4)-(6) from discrete time market observations. Statistical methods for stochastic delay equations
are not yet well-developed. For delay equations with affine drift a first estimation
approach is developed in Küchler and Sørensen [2009a] and Küchler and Sørensen
[2009b]. It is not obvious how their approach can be generalized to the setting
of our model (see footnote 5). We propose an approximate maximum likelihood estimator that is
heuristically motivated and is shown to work well in simulation experiments. A
proof of the consistency and asymptotic distribution of this estimator is beyond the
scope of this paper and is a subject of future research.
Consider daily realizations of the process S,
\[ \big(s(t_k)\big)_{k=1,\dots,N} = \big(s^1(t_k), \dots, s^n(t_k)\big)_{k=1,\dots,N}, \]
where $t_{k+1} - t_k = \Delta t = 1/250$. It is natural to base our estimator on log-prices.
Denote by $p_k$ the conditional density
\[ P_\theta\big(\log S(t_k) \in dy_k \,\big|\, \log S(t_j) = y_j : j = 1, \dots, k-1\big) = p_k(y_k \mid y_1, \dots, y_{k-1}, \theta)\, dy_k, \]
which is unknown in our model. The log-likelihood function is then given by
\[ \sum_{k=1+n_F}^{N} \log p_k\big(\log s(t_k) \,\big|\, \log s(t_1), \dots, \log s(t_{k-1}), \theta\big). \tag{27} \]
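We approximate the unknown density $p_k$ with the density $\tilde p_k$ of an n-dimensional
normal distribution. The parameters of $\tilde p_k$ are motivated by the Euler scheme for
stochastic delay equations (see footnote 6), cf. Küchler and Platen [2000]. The density $\tilde p_k$ has
mean
\[ \Big( \log s^i(t_{k-1}) + \Big( \mu_i(\theta, s_{t_{k-1}}) - \frac{1}{2} \sum_{p=1}^{n} \tilde\sigma_{i,p}\big(\theta, F(s_{t_{k-1}})\big)^2 \Big) \Delta t \Big)_{i=1,\dots,n} \]
and covariance matrix (cf. (7))
\[ \Delta t \; \mathrm{diag}\big(\sigma(\theta, F(s_{t_{k-1}}))\big)\, \rho\big(\theta, F(s_{t_{k-1}})\big)\, \mathrm{diag}\big(\sigma(\theta, F(s_{t_{k-1}}))\big)^T, \]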
both depending on the parameter θ. Recall that $s_{t_{k-1}}$ denotes the segment of observations $s(t_{k-1}), \dots, s(t_{k-1-n_F})$. As in our setup (11) and (15), we let $\mu_i(\theta, s_{t_{k-1}})$
and $F(s_{t_{k-1}})$ depend on daily past observations $s(t_{k-1}), s(t_{k-2}), \dots$. We propose an
approximate maximum likelihood estimator
\[ \hat\theta = \arg\max_{\theta \in \Theta} L(\theta), \tag{28} \]
with likelihood function
\[ L(\theta) = \sum_{k=1+n_F}^{N} \log \tilde p_k\big(\log s(t_k) \,\big|\, \log s(t_1), \dots, \log s(t_{k-1}), \theta\big). \tag{29} \]
5 Private communication with Uwe Küchler.
6 In case of non-delay stochastic differential equations, our approach reduces to the well-known
parameter estimation technique, see, for example, Hurn et al. [2007].
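The approximate likelihood (29) can be sketched in a few lines. The code assumes that the instantaneous covariance matrix is supplied through a lower-triangular factor C with C C^T = diag(σ) ρ diag(σ), so that the sum of squared entries in row i of C plays the role of the correction term in the mean. The callables `mu_fn` and `chol_fn`, which map a segment of past log-prices to the drift vector and to C, are hypothetical placeholders for the model-specific functions (11), (13) and (15); the zero-based indexing of the segment is likewise an assumption.

```python
import numpy as np

DT = 1.0 / 250.0  # daily time step, as in the text

def euler_step_logpdf(log_s_prev, log_s_next, mu, C, dt=DT):
    """Log-density of the Gaussian one-step approximation.

    mu: drift vector (n,); C: lower-triangular factor (n, n) of the
    instantaneous covariance matrix (assumed parameterization).
    """
    log_s_prev = np.asarray(log_s_prev, dtype=float)
    log_s_next = np.asarray(log_s_next, dtype=float)
    mu = np.asarray(mu, dtype=float)
    C = np.asarray(C, dtype=float)
    inst_cov = C @ C.T
    # Mean of the log-price: drift minus half the instantaneous variance.
    mean = log_s_prev + (mu - 0.5 * np.diag(inst_cov)) * dt
    chol = np.linalg.cholesky(inst_cov * dt)
    z = np.linalg.solve(chol, log_s_next - mean)
    n = mean.size
    log_det = 2.0 * np.log(np.diag(chol)).sum()
    return -0.5 * (n * np.log(2.0 * np.pi) + log_det + z @ z)

def approx_loglik(log_prices, mu_fn, chol_fn, n_f, dt=DT):
    """Sum of one-step log-densities; each step sees the last n_f + 1 observations."""
    log_prices = np.asarray(log_prices, dtype=float)
    total = 0.0
    for k in range(n_f + 1, len(log_prices)):
        segment = log_prices[k - 1 - n_f:k]   # rows k-1-n_f, ..., k-1
        total += euler_step_logpdf(
            log_prices[k - 1], log_prices[k], mu_fn(segment), chol_fn(segment), dt
        )
    return total
```

An estimate in the spirit of (28) would then be obtained by passing the negative of `approx_loglik`, viewed as a function of θ, to a numerical optimizer.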
For n assets, the computational effort of estimator (28) grows
at the order of $n^3$, because the density $\tilde p_k$ requires us to invert an
n × n covariance matrix. This makes the application of estimator (28) to large portfolios computationally infeasible in practice. This is a common
problem also for other models like multivariate GARCH, see, for example, BCBS
[2011a]. Inspired by methods in Engle et al. [2009], we reduce the computational
effort to order $n^2$. This is achieved by replacing the likelihood function (29) by
the sum over all bivariate likelihood functions $L_{i_1,i_2}(\theta)$, each of which refers to one asset pair
$S^{i_1}, S^{i_2}$,
\[ L_{i_1,i_2}(\theta) = \sum_{k=1+n_F}^{N} \log \tilde p_k\big(\log (s^{i_1}, s^{i_2})(t_k) \,\big|\, \log s(t_1), \dots, \log s(t_{k-1}), \theta\big). \tag{30} \]
Here, $\tilde p_k$ is the density of a two-dimensional normal distribution with parameters
suggested by the Euler scheme. Observe that the covariance matrix of $S^{i_1}, S^{i_2}$ depends on the whole vector S of assets. The parameter θ is now estimated from
\[ \hat\theta = \arg\max_{\theta \in \Theta} \sum_{i_1, i_2 = 1,\; i_1 < i_2}^{n} L_{i_1,i_2}(\theta). \tag{31} \]
The computational effort of this estimator is at the order $n^2$.
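The idea of summing bivariate likelihoods can be illustrated with a simplified sketch. To keep it short, the covariance matrix is held constant over time and the residuals are assumed to be already centered, whereas in the model both mean and covariance change with the market state at every step. The function names are hypothetical.

```python
from itertools import combinations
import numpy as np

def gaussian_loglik(resid, cov):
    """Sum over rows of `resid` of the log-density of N(0, cov)."""
    resid = np.atleast_2d(np.asarray(resid, dtype=float))
    cov = np.asarray(cov, dtype=float)
    chol = np.linalg.cholesky(cov)
    z = np.linalg.solve(chol, resid.T)          # shape (d, K)
    d = cov.shape[0]
    K = resid.shape[0]
    log_det = 2.0 * np.log(np.diag(chol)).sum()
    return -0.5 * (K * (d * np.log(2.0 * np.pi) + log_det) + (z ** 2).sum())

def pairwise_composite_loglik(resid, cov):
    """Sum of bivariate log-likelihoods over all asset pairs i1 < i2."""
    resid = np.asarray(resid, dtype=float)
    cov = np.asarray(cov, dtype=float)
    n = cov.shape[0]
    total = 0.0
    for i1, i2 in combinations(range(n), 2):
        idx = [i1, i2]
        # Each term only needs a 2 x 2 block, so the cost per pair does not
        # grow with n; there are n(n-1)/2 pairs in total.
        total += gaussian_loglik(resid[:, idx], cov[np.ix_(idx, idx)])
    return total
```

For n = 2 the composite likelihood coincides with the full likelihood; for larger n it is a different objective that is cheaper to evaluate.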
Still, we face a high-dimensional optimization problem. The effort there can
be reduced by successively applying estimator (31) to sub-portfolios with m =
2, 3, 4, . . . assets. More precisely, we estimate the parameters for the sub-portfolio
with assets $S^1, \dots, S^m$ and then use these parameters as pre-estimated values in the
estimation of the sub-portfolio with assets $S^1, \dots, S^{m+1}$.
To reduce the computational effort further, we would have to depart from modeling correlations individually and would thereby lose too much flexibility.
Recall that the market state (11) depends on the market memory $n_F$, which has
to be estimated from data as well. We estimate the market memory from
\[ \hat n_F = \arg\max_{n_F \in \{2, \dots, 250\}} \; \max_{\theta \in \Theta} \sum_{i_1, i_2 = 1,\; i_1 < i_2}^{n} L_{i_1,i_2}(\theta, n_F). \tag{32} \]
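Estimator (32) amounts to a grid search over the market memory with an inner optimization over θ. A minimal sketch follows; `profile_loglik` is a hypothetical callable that returns the maximized composite likelihood for a given memory, and the quadratic toy profile only stands in for it.

```python
import numpy as np

def estimate_market_memory(profile_loglik, candidates=range(2, 251)):
    """Return the memory with the largest profile likelihood and all profile values.

    profile_loglik(n_f) is assumed to return the maximum over theta of the
    composite likelihood for the given memory n_f.
    """
    candidates = list(candidates)
    values = np.array([profile_loglik(n_f) for n_f in candidates], dtype=float)
    best = int(np.argmax(values))
    return candidates[best], values

def toy_profile(n_f):
    # Toy stand-in with a single peak at 75 days (not an estimated likelihood).
    return -((n_f - 75) ** 2)

n_f_hat, profile_values = estimate_market_memory(toy_profile)
print(n_f_hat)
```

Plotting the returned profile values against the candidate memories gives a curve of the kind shown in Figures 5 and 6.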
The approximate maximum likelihood estimator (28) and thus our estimators
(31) and (32) can be severely biased, because for large Euler discretization steps
∆t the density $\tilde p_k$ of a normal distribution may be a poor approximation for the
transition density $p_k$. For daily observations we justify the reliability of estimator
(31) by re-estimating a given parameterization in model (4)-(6) from simulated
discrete time asset realizations. We use n = 4 assets and $n_{Cov} = 8$ discretization
points (14) for every spline function of the instantaneous covariance matrix. In
the model (4)-(6) we use the market state function (11) with $n_F = 75$ days and
constant drifts $\mu_i = 0.1$ for all i = 1, . . . , 4. For the dependency of volatilities and
instantaneous correlations on the market state we describe the Cholesky decomposition
$C(F(S_t)) = \big(c_{i,j}(F(S_t))\big)_{i,j}$ of the instantaneous covariance matrix by (13),
with
\[ c_{i,j}(x) = \begin{cases} \dfrac{2}{\pi} \arctan\big(\sin(i + j + 0.9x)\big), & i > j, \\[4pt] 0.001 + \dfrac{2}{\pi} \arctan\Big(\sqrt{|2j + 0.9x|}\Big), & i = j, \\[4pt] 0, & i < j, \end{cases} \qquad i, j = 1, \dots, n. \]
We use an Euler scheme to generate a time series of daily asset realizations over
a period of 20 years. To keep the discretization bias small in the Monte Carlo
simulation we use 2000 additional equidistant discretization steps per day. Figure
4 shows that estimator (31) yields a reliable estimate of the dependency structure
of volatilities and correlation on the market state. Figure 5 shows that estimator
(32) is able to identify the market memory.
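To make the test specification concrete, the following sketch evaluates the Cholesky factor for a given market state x and converts it into volatilities and a correlation matrix. The functional form is taken from the specification of c_{i,j}(x) as far as it can be read from the source; in particular, placing the square root in the diagonal case is an assumption. The function names are hypothetical.

```python
import numpy as np

def cholesky_factor(x, n=4):
    """Lower-triangular factor C(x) of the instantaneous covariance matrix."""
    C = np.zeros((n, n))
    for i in range(1, n + 1):          # 1-based indices as in the formula
        for j in range(1, n + 1):
            if i > j:
                C[i - 1, j - 1] = 2.0 / np.pi * np.arctan(np.sin(i + j + 0.9 * x))
            elif i == j:
                # Square root placement is an assumption (see lead-in).
                C[i - 1, j - 1] = 0.001 + 2.0 / np.pi * np.arctan(np.sqrt(abs(2 * j + 0.9 * x)))
    return C

def vols_and_correlations(x, n=4):
    """Volatilities and correlation matrix implied by C(x) C(x)^T."""
    C = cholesky_factor(x, n)
    cov = C @ C.T
    vols = np.sqrt(np.diag(cov))
    corr = cov / np.outer(vols, vols)
    return vols, corr

# Evaluate the dependency structure on a small grid of market states.
for x in (-1.0, 0.0, 1.0):
    vols, corr = vols_and_correlations(x)
    print(x, np.round(vols, 4), np.round(corr[1, 0], 4))
```

Because the diagonal entries are strictly positive, C(x) C(x)^T is positive definite for every x, so each market state yields a valid covariance matrix.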
Figure 4: Re-estimation of pre-specified dependency structures of volatilities and
correlations on the market state. Black lines indicate the empirical estimate (31),
red lines indicate the true model dependencies.
Figure 5: Re-estimation of the market memory nF of the market state function F in
the model (4)-(6). The black line shows the estimated maxima of the likelihood function
in (31), the red line indicates the true market memory.
B Results of empirical estimations
(a) Colgate - S&P 500
(b) Microsoft - S&P 500
(c) Halliburton - S&P 500
(d) Walt Disney - S&P 500
(e) Pfizer - S&P 500
(f) Walmart - S&P 500
Figure 6: Plots of the estimated maximum of the likelihood function (31) versus varying market memories of the market state function in the model (4) - (6).
We observe that the market memory is about 75 business days. Estimates are for
1990 − 2010.
(a) Microsoft-Walmart
(b) Microsoft-GE
(c) GE - Walmart
(d) Pfizer-Microsoft
(e) Pfizer-Walmart
(f) Pfizer-GE
(g) Microsoft-Exxon
(h) Walmart-Exxon
(i) GE-Exxon
(j) Pfizer-Exxon
(k) Microsoft-BoA
(l) Walmart-BoA
(m) GE-BoA
(n) Pfizer-BoA
(o) Exxon-BoA
Figure 7: Dependence structure of return correlations on the market state for a
portfolio of six stocks. The market state is defined as the realized drift of the
S&P500 over a rolling window of 75 business days. Estimates are for Jan 2004 −
Nov 2010.
(a) Microsoft
(b) Walmart
(c) General Electric
(d) Pfizer
(e) Exxon
(f) Bank of America
Figure 8: Dependence of return volatilities on the market state. The market state
is defined as the realized drift of the S&P500 over a rolling window of 75 business
days. Data from Jan 2004 − Nov 2010.
References
Basel Committee on Banking Supervision. Analysis of the Trading Book Quantitative Impact Study. Technical report, Bank for International Settlements, October
2009a.
Basel Committee on Banking Supervision. Revisions to the Basel II Market Risk
Framework. Technical report, Bank for International Settlements, 2009b.
Basel Committee on Banking Supervision. Messages from the Academic Literature on Risk Measurement for the Trading Book. Technical report, Bank for
International Settlements, 2011a.
Basel Committee on Banking Supervision. Revisions to the Basel II Market Risk
Framework. Technical report, Bank for International Settlements, February
2011b.
C. Becker and W. M. Schmidt. State-dependent dependencies: a continuous-time
dynamics for correlations. 2010.
T. Bollerslev, R. Engle, and J. Wooldridge. A Capital Asset Pricing Model with
Time-Varying Covariances. Journal of Political Economy, 96:116–131, 1988.
Z. Ding and R. Engle. Large Scale Conditional Covariance Matrix Modeling, Estimation and Testing. Technical report, Leonard N. Stern School of Business,
2001. URL http://ssrn.com/abstract=1296437.
R. Engle. Dynamic Conditional Correlation: A Simple Class of Multivariate
Generalized Autoregressive Conditional Heteroskedasticity Models. Journal of
Business & Economic Statistics, 20(3):339–350, 2002.
R. Engle and R. Colacito. Testing and Valuing Dynamic Correlations for Asset
Allocation. Journal of Business and Economic Statistics, 24(2):238–253, 2006.
R. Engle, N. Shephard, and K. Sheppard. Fitting Vast Dimensional Time-varying
Covariance Models. Technical report, NYU Working Paper, 2009.
A.S. Hurn, J.I. Jeisman, and K.A. Lindsay. Seeing the Wood for the Trees: A Critical Evaluation of Methods to Estimate the Parameters of Stochastic Differential
Equations. Journal of Financial Econometrics, 5(3):390, 2007.
U. Küchler and E. Platen. Strong Discrete Time Approximation of Stochastic Differential Equations with Time Delay. Mathematics and Computers in Simulation, 54(1):189–205, 2000.
U. Küchler and M. Sørensen. A Simple Estimator for Discrete-time Samples from
Affine Stochastic Delay Differential Equations. 2009a.
U. Küchler and M. Sørensen. Statistical Inference for Discrete-time Samples from
Affine Stochastic Delay Differential Equations. 2009b.
X. Mao. Stochastic Differential Equations and Applications. Horwood Pub Ltd,
2007.
S.E.A. Mohammed. Stochastic Functional Differential Equations. Research
Notes in Mathematics, no. 99, Pitman Advanced Publishing Program, Boston-London-Melbourne, 1984.
R. Rebonato and P. Jäckel. The Most General Methodology to Create a Valid Correlation Matrix for Risk Management and Option Pricing Purposes. Quantitative
Research Centre of the NatWest Group, 1999.
A. Silvennoinen and T. Teräsvirta. Multivariate GARCH Models. Handbook of
Financial Time Series, pages 201–229, 2009.