Macroeconomics III Lecture notes∗

Macroeconomics III
Lecture notes
This version: March 29, 2015
Contents
Expectations
3
Introduction to the concept of expectations . . . . . . . . . . . . . . . . . . . . . . . . . .
3
Types of expectations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
10
Traditional models with expectations
14
The Cobweb Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
14
The Cagan Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
20
Appendix . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
28
The Lucas Imperfect Information Model with AD . . . . . . . . . . . . . . . . . . . . . . .
28
The Sticky Wage Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
33
The E¤ectiveness of monetary policy (with …xed rules)
37
A model with stabilizing monetary policy . . . . . . . . . . . . . . . . . . . . . . . . . . .
37
A model with constant growth of money . . . . . . . . . . . . . . . . . . . . . . . . . . . .
44
Discretionary monetary policy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
46
Political cycles and discretionary monetary policy . . . . . . . . . . . . . . . . . . . . . . .
48
Monetary policy under commitment and discretion . . . . . . . . . . . . . . . . . . . . . .
50
Monetary policy under commitment . . . . . . . . . . . . . . . . . . . . . . . . . . .
51
Monetary policy under discretion (without commitment) . . . . . . . . . . . . . . . .
53
Business cycles
55
Business cycles - The Carlin and Soskice (2005) model . . . . . . . . . . . . . . . . . . . .
58
Endogenous business cycles - The Goodwin (1967) model . . . . . . . . . . . . . . . . . .
63
A Real Business Cycles Model
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
67
The Basic RBC model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
70
Price Rigidities - The Calvo (1983) model . . . . . . . . . . . . . . . . . . . . . . . . . . .
77
These notes may contain typos/mistakes and are subject to changes/updates during our course. Please keep track if
there are any.
1
Expectations and …nancial markets
84
Bonds . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
84
Stocks, stock prices, and stock markets . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
89
Measures of returns and risk . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
93
Measuring portfolio return and risk to assemble a portfolio . . . . . . . . . . . . . .
94
Market price of risk and the CAPM . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 101
Black-Scholes model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 104
Appendix
107
Appendix - Reminder of Statistics 0 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 107
Mean-variance trade-o¤ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 111
2
Expectations
Introduction to the concept of expectations
Why did you start reading these notes? Would you start and/or continue reading if you expect
these notes to be useless and/or your test(s) to be very easy? I suppose the answer is no at least
for some students.2 This is an example how expectations matter for strategies, actions, and later
for performance at individual level.
Consider another example to see how expectations matter for economic performance at individual, as well as aggregate, level. Suppose we have an economy of 1000 …rms and consumers.
In this economy, …rms hire consumers’labor to produce and consumers use wages they receive to
buy …rms’ products. Suppose that one of the …rms expects low demand for its products prior to
deciding how much to produce. In order to produce according to its expectation, it would hire few
labor and pay low wage bill. Further, imagine that instead of one …rm, all …rms expect low demand
for their products. In such a case, all …rms will hire few labor and pay low wage bills. Therefore,
consumers will have low income and consume few products, which will reinforce …rms’expectations.
You continue reading this chapter, and I will bring more such examples. The main focus of our
examples, and classes in general, will be how expectations can matter macroeconomic performance
and aggregate economic ‡uctuations due to supply or demand shocks.
Prior to proceeding to these examples, lets digress (which we will do often) and discuss formally
what expectations are.
Expectations - intuitive and formal discussion
We know for sure that if we throw an apple (perhaps, not an iPhone) it will eventually hit the
ground because of gravitation. Since we know for sure, we expect the event "thrown apple hits the
ground" to happen. Can you claim with the same certainty that tomorrow it won’t rain around our
university?
In these examples we have two di¤erent (random) events. One of the events is "thrown apple hits
the ground." Whereas, the other event is "no rain tomorrow around our university." These events
happen with some probabilities. The …rst event happens with probability 1, i.e., it happens for
sure/with certainty. (Since it happens with certainty we don’t call it a random event.) The second
event, however, happens with probability less than 1. For example, let it happen with probability
0:5. Then you would say that with probability 0:5 you expect to have no rain around our university
tomorrow.
Lets now consider a bit more sophisticated examples. Suppose we have a lottery which pays 100
EUR with probability 0:01 and 0 EUR otherwise, i.e., with probability 0:99. What is the expected
pay-o¤ of this lottery? It is 1 EUR:
0:01
2
100 + 0:99
0 = 1:
These notes are not useless and I promise your tests will not be easy.
3
Therefore, if this lottery costs 5 EUR, one might think at least twice prior to buying it.
We have two possible realizations of random variable "pay-o¤" in this example: "100 EUR" and
"0 EUR." These realizations have associated probabilities 0:01 and 0:99. When we are calculating
expected value of the random variable "pay-o¤" we weight each of these possible realizations with
the corresponding probability. Intuitively, we do so in order to give larger weight to realizations
which are more likely to happen (here: 0 EUR is more likely to happen and gets 0:99 weight which
is larger than 0:01 weight of 100 EUR).
Suppose now we have a random variable x which takes (independently distributed) values from
a …nite and ordered set of real numbers fxj gN
j=1 with associated probabilities pxj
real numbers
pxj
N
j=1
fxj gN
j=1
N
.
j=1
The set of
are the possible realizations of x. In turn, the space of associated probabilities
is the probability distribution of x. To relate to previous example, let x be the pay-o¤ of
a lottery which gives xj amount of Euros with probability pxj where j = 1; :::; N . For instance, if
x1 is the event when pay-o¤ is 0 EUR and x2 is the event when pay-o¤ is 10 EUR. Then px1 and
px2 are the probabilities of those events.
What is then the expected value of x? Use E [x] to denote it. E [x] is given by
E [x] = px1 x1 + px2 x2 + ::: + pxN xN =
N
X
pxj xj ;
j=1
which clearly generalizes our previous example in a straightforward manner. If there were (countably) in…nite possible realizations of x so that we had fxj g+1
j=1 and pxj
write
E [x] =
+1
X
+1
,
j=1
then we would simply
p xj x j :
j=1
In any case E [x] is just a real number, of course assuming that E [x] < +1.
Lets continue this generalization process. Suppose now that random variable x takes values
from a continuous set of real numbers [A; B], where A < B (e.g., A = 0 and B = 100). Denote the
probability that the realization of x happens to be less than X by
F (X) = P (x < X) ;
where F is the probability distribution function of x. F function maps the set of possible realizations
[A; B] to [0; 1].
To derive the expected value of x we need to know the probability of each of the possible realizations of x. For a second suppose that x, as in previous example, took (independently distributed)
values from a discrete and ordered space fxj g+1
j=1 . In such a case the probability of observing exactly
realization x2 is the di¤erence between probability that the realization of x is less than x3 and the
probability that x is less than x2 . In other words,
p x2 = p x1 + p x2
4
p x1 :
According to our new notation this can be written then as
Denote this di¤erence by
px2 = F (x3 )
F (x2 ) :
F = F (x3 )
F (x2 ) :
F;
Lets go back to the case when x takes values from a continuous set [A; B]. Suppose, as in
previous example, x1 ; x2 ; and x3 are consecutive possible realizations of x. Since the set of possible
realizations of x is continuous the distance between these realizations is in…nitesimally small. In
such a circumstance we consider an in…nitesimally small change in F for obtaining the probability
that x takes a value of x2 . This in…nitesimally small change we denote by dF instead of
F . The
expected value of x in this case is
E [x] =
ZB
x
~dF (~
x) :3
A
In this expression, x
~ are possible realizations of x and dF (~
x) are their associated probability. We
use integral instead of sum since we are summing/integrating over a continuum of in…nitesimally
small points.
In case when F is a di¤erentiable function on [A; B] (i.e.,
dF (~
x)
d~
x
exists on that interval) we can
rewrite E [x] as
E [x] =
ZB
x
~f (~
x) d~
x;
A
where f (~
x) =
dF (~
x)
d~
x
is the probability density function of x.
In economics and in many other disciplines we call the possible realizations of a random variable
"possible states" and the space of possible realizations "space of possible states." You can easily
notice that the expected value of x depends on both the space of possible states and the probability
distribution of x. For example, suppose for simplicity that
F (~
x) =
x
~
B
B
A
(i.e., we have a uniform distribution) then
E [x] =
1
B
A
ZB
x
~d~
x
A
=
1
(B + A) :
2
Therefore, changing B and/or A, which corresponds to changing the space of possible states of x,
3
Again, we need to have E [x] < +1 so that E [x] is something well de…ned, i.e., it is just a real number.
5
changes its expected value. Consider a change in F now. Suppose,
F (x) =
1
A
B
Zx
1
x
~
d~
x
A
where
1 z
1
p e 2(
2
Zz
(Z) dZ:
(z) =
(z) =
2
) ;
1
(i.e., we have a truncated normal distributed random variable with non-truncated mean
variance .
and
and
are normal distribution and density functions, correspondingly) In such a case,
it can be shown that
E [x] =
+
A
B
B
A
:
Cooking up examples for random variables with discrete distributions is much easier. Suppose,
we have (1) : fxj gN
j=1 = f1; 2g and pxj
N
j=1
f0:5; 0:5g, and (3) : fxj gN
j=1 = f1; 2g and
= f0:5; 0:5g, (2) : fxj gN
j=1 = f2; 2g and pxj
pxj
N
j=1
N
j=1
=
= f0:8; 0:2g. Clearly, the di¤erence between
(1) and (2) is in the space of possible states. In turn, the di¤erence between (1) and (3) is in the
associated probabilities/distribution of states. Expected values in each of these cases are 1:5, 2, and
1:2, correspondingly. For more discussion see Appendix - Statistics 0.
Economic agents act according to their expectations. Often in real life, as well as in economics,
we might not exactly know the entire space of possible states of a random variable neither we
might know exactly its probability distribution function. In our examples we will see that economic
performance depends on economic agents’ beliefs of what the possible states are and what the
distribution function is.
Further examples and Keynesian-beauty contest
In order to further assert that expectations and the way they are formed matter in economics
consider a game called "p-beauty contest" and …rst run in Nagel (1995). The rules of the game are
as follows:
Each of N -players is asked to choose a number from the interval [0; 100]
The winner is the player whose choice is closest to p times the mean of the choices of all
players, where p < 1 (e.g., p = 0:5).
In this game the random variable for a player is the mean of the choices of all players. Meanwhile,
the probability distribution of this variable depends on the types and beliefs/knowledge of all players.
6
For example, it turns out that if all players are rational in the sense that everyone performs iterated
elimination of (weakly) dominated strategies and all players know about that (i.e., it is a common
knowledge that everyone is rational) then it is straightforward to guess what would be the mean of
choices of all players. Under these assumptions everyone simply chooses 0. To see this consider a
player. This player will never choose a number above 100p since it is dominated by 100p. Moreover,
given that the player believes that others are rational too, s/he will not pick a number above 100p2
since s/he will know that no one will pick above 100p. Similarly believing that everyone is rational,
s/he will not pick a number above 100p3 and so on, until all numbers but zero are eliminated.
If p > 1 then 100 can also be an equilibrium (and 0 is not a "stable" equilibrium). For p = 1
any number chosen by all players can be an equilibrium.
This game mimics the problem a seller in the stock market, for example. The seller wants to sell
his shares when the price of the share is at its peak, just before at least someone wants to sell. In
order to do that (i.e., design its actions) the seller needs to know the types and beliefs/knowledge of
other sellers. This example motivated John Maynard Keynes to propose the original setup of this
game in Chapter 12 of his work: The general theory of employment, interest and money (1936). In
that work we proposed an explanation behind ‡uctuations in prices in equity markets in terms of
changes in beliefs of sellers and buyers. Instead of picking numbers, Keynes used an analogy based
on a newspaper contest, in which players are asked to choose from a set of photographs of women
that are the "most beautiful." Those who picked the most popular face are then the winners. (This
is the reason why the name of the game is beauty contest).
Imagine that there are three players. A naive strategy in this game would be to pick the most
beautiful face one perceives. If everyone does so, one could deviate with a more sophisticated
strategy and pick up a face which s/he thinks is most likely to be chosen by the other two. If
everyone does so, one could deviate with a more sophisticated strategy and pick up a face which
s/he thinks is most likely to be expected to be chosen by the other two, etc.
Keynes wrote: "It is not a case of choosing those [faces] that, to the best of one’s judgment, are
really the prettiest, nor even those that average opinion genuinely thinks the prettiest. We have
reached the third degree where we devote our intelligences to anticipating what average opinion
expects the average opinion to be. And there are some, I believe, who practice the fourth, …fth and
higher degrees."
It turns out, however, that in reality in p-beauty contest games assumptions of rationality are
often violated and therefore, equilibrium is not at 0. To see this check the following webpage:
http://www.marietta.edu/~delemeeg/expernom/nagel.htm. The following comment is taken from
that website and summarizes the thought processes a high school class student participating in a
newspaper contest (submitted to one of the newspaper studies, the Spektrum der Wissenschaft).
The game is a p-beauty contest with p = 2=3.
"I would like to submit the proposal of a class grade 8e of the Felix-Klein-Gymnasium Goettingen
for your game: 0:0228623. How did this value come up? Johanna . . . asked in the math-class
whether we should participate in this contest. The idea was accepted with great enthusiasm and
7
lots of suggestions were made immediately. About half of the class wanted to submit their favorite
numbers. To send one number for all, maybe one could take the average of all these numbers.
A …rst concern came from Ulfert, who stated that numbers greater than 66 2=3 had no chance
to win. Sonja suggested to take 2=3 of the average. At that point it got too complicated to some
students and the …nding of the decision was postponed. In the next class Helena proposed to
multiply 33 1=3 with 2=3 and again with 2=3. However, Ulfert disagreed, because starting like
that one could multiply it again with 2=3. Others agreed with him that this process then could be
continued. They tried and realized that the numbers became smaller and smaller. A lot of students
gave up at that point, thinking that this way a solution could not be found. Other believed to have
found the path of the solution: one just has to submit a very small number.
However, one could not agree how many of the people who participated realized this process.
Johanna supposed that the people who read this newspaper are quite sophisticated. At the end of
the class 7 to 8 students heatedly continued to discuss this problem. The next day the math teacher
received the following message: We think it best to submit number 0:0228623."
Consider another game called "ultimatum game." There are two players in this game. Player 1
is entitled to a 10 EUR and players decide how to divide it. The rules of the game are as follows
Player 1 proposes a division of the sum
Player 2 can either accept or reject this proposal
– If the player 2 rejects, neither player receives anything
– If the player 2 accepts, 10 EUR is split according to the proposal
The extensive form representation of the game is:
In this …gure it assumed that player 1 either gives 2 EUR to player 2 or 5 EUR. Player 2 then
decides to accept or reject. If player 2 rejects in any of these cases both get 0 EUR. In turn, if
player 2 accepts the o¤er money is split according to the proposal.
The strategy of player 1 is the proposal coupled with the expected strategy/response of player
2, which depends on the type of player 2. Imagine that player 1 and 2 care only about money in
this game (i.e., players are expected pay-o¤ maximizers). If player 1 knows about the type of player
8
2 then s/he knows that whatever positive amount s/he proposes player 2 will accept. Therefore,
s/he can propose something very close to 0 (in fact, perhaps, exactly 0) and get as much of the
pay-o¤ as possible (something around 10 EUR). If, however, player 1 expects that player 2 has a
strict preference over equality of the split of the award (i.e., player 2 will reject a proposal if it splits
reward very unequally) then s/he might propose something higher than 0 EUR and close to 5 EUR.
This is because otherwise s/he gets nothing.
9
Types of expectations
In previous (sub-)section we de…ned and discussed expectation operators for discrete and continuous
random variables. We saw that we can compute the expected value of a random variable in cases
when we know exactly what is the space of its possible states and the probability of those states
(distribution function). We saw as well that if we have three random variables which have di¤erent
spaces of possible states and/or probabilities of those states then their expected values can be
di¤erent (i.e., expected value depends on the set of possible states and on distribution function).
Further, we saw examples when economic agents act according to their expectations and they might
not exactly know the entire space of possible states of a random variable neither its probability
distribution function.
What to do if we don’t know exactly the distribution function of a random variable? To alleviate
such a problem we use statistics. In other words, we observe realizations of the random variable and
use them in order to infer certain moments of its distribution. Expected value is the …rst moment.
In particular, suppose that we have f^
xj gN
j=1 realizations of random variable x. Further, we have no
priors about the probability of each of the observations. In such a case the mean of observations
xeN
N
1 X
=
x
^j
N
j=1
is the sample (statistical) analogue of the expected value of x. As E [x] it is just a number.
Here we have
1
N
in front of each realization since we need to treat each of the observations
equally likely (i.e., each observation has
1
N
probability to occur). A central theorem in probability
theory called Law of Large Numbers provides us with a proof that if we have in…nite realizations
of the random variable then the mean of the sample is the expected value of the random variable,
Pr
lim xeN = E [x]
N !+1
= 1:
Usually in economics and in particular in macroeconomics we observe realizations of random
variables over time. Therefore, often instead of index j and N we use t and T , where t indexes time
and T refer to its most recent value. Then we write, for example,
xeT =
T
1X
x
^t
T
t=1
where x
^1 is the value that random variable x has obtained at time 1, x
^2 is the value that random
variable x has obtained at time 2, etc. Again, this is the best (unbiased and consistent) guess of
the expected value of x when we have no priors on the likelihood of its observations. It is usually
10
the default option, therefore, and we will treat
T
1X
x
^t
T
xeT =
t=1
as the …rst type of expectation. It incorporates all observations and treats them equally.
In certain circumstances we might have priors over the likelihood of observations to reoccur.
For example, it might be that we know that older observations are less likely to happen. To put
more meat into the discussion, suppose that we observe f^
xt gTt=1 realizations of random variable x
and would like to compute the mean of x, perhaps because we will apply it for our actions in time
T + 1. We will denote xeT by xeT +1 for that purpose.
Suppose further that if we know that for any t from 1 to T if the likelihood of x
^t to reoccur
~
~
~
~
is
< 1 then the likelihood of x
^t 1 to reoccur is
1
. (Notice that
< 1 implies that
~ 1
~ < ~ . Therefore x
^t
1
is less likely to happen than x
^t .) In such a case, we would write the
sample mean as
xeT +1 ( ) =
=
x
^T + (1
T
X
(1
)x
^T
)T
t
1
+ ::: + (1
)T
1
x
^1
x
^t :
t=1
where ~ is the likelihood that we think x
^T will happen. x
^T
We use
xeT +1 (
xeT +1 ( )
that xeT +1 (
) instead of
xeT
1
happens with likelihood
(1
).
( ).
has a special name. It is called exponentially weighted average with time decay. Notice
) can be easily written in a recursive form of
xeT +1 ( ) =
x
^T + (1
) xeT ( )
= xeT ( ) + [^
xT
xeT ( )] :
According to the second line xeT +1 ( ) is a sum of old expectation xeT ( ) and a weighted di¤erence
between the realization of x at time T , x
^T , and its expectation xeT ( ). This last term adapts/corrects
the current expectation to the error: [^
xT
xeT ( )]. In this context,
is called a correction para-
meter. Economists call these types of expectations "adaptive expectations." These are the second
type of expectations. Hereafter, we will use notation xeT +1jT to denote the expectation of random
variable xT +1 conditional on information available at time T .
Macroeconomists have used extensively (and abused) adaptive expectations in their models
before 1970s. The assumption of adaptive expectations has been imposed in these models without
much justi…cation. For example, this assumption usually generates persistent errors in these models
T
X
1
(i.e., limT !+1 T
[^
xt xeT ( )] 6= 0) which seems to be odd. It implies that economic agents make
t=1
persistent errors (i.e., have no intention to correct their errors).
11
An assumption regarding expectations which is consistent with models is "rational expectations"
assumption. This assumption states that economic agents use the model to form expectations. A
model is a description of an economy. Assuming that it is the right one, rational expectations
assumption states that the agents know the economy entirely and they use that information to form
their expectations. "The agents know the economy" means that they know the structure behind
demand and supply and that there are random variables, with given distribution functions, which
a¤ect supply and demand. In such a circumstance in terms of the model, agents’ expectations
are not systematically wrong in that all errors are random/not persistent. Therefore, under this
assumption deviations from perfect foresight in the model are only random.
Denote these expectations as
xeT +1jT = ET [xT +1 ] ;
where ET [xT +1 ]
E [ xT +1 j
T]
is the expected value of x at time T + 1 given all information till
time time T + 1. Information is summarized by
T
and includes all the possible structures in the
economy and values of fundamentals. These are the third type of expectations.
Lets see why this can work better than adaptive expectations which miss some of the structure
of the model. Suppose that agents are assumed to have adaptive expectations, and the model
economy features a constantly rising in‡ation rate. In such a circumstance agents would be assumed
to always underestimate in‡ation since they are assumed to predict in‡ation by looking at in‡ation
in previous years. Under rational expectations assumption, since the trend in in‡ation is part of
the model agents would take it into account in forming their expectations and there won’t be such
a bias.
Formally, this can be represented in the following manner. Consider an economy which starts
at time 1 and where in‡ation
at any time T is given by
T
where
is a constant, T indexes time, and
=
T
+T +
T;
is a random variable with 0 mean and
2
variance.
Further, suppose we are at the end of T = 2 and the agents’in this economy need to form expectation
of in‡ation for time T + 1 = 3. If agents have adaptive expectations with correction parameter 13 ,
then
e
3
At the end of T = 2 the realizations of
1
3
2
1X
=
3
t=1
2
3
2 t
t:
are known. Suppose,
1
= 0:01 and
in‡ation then can be rewritten as
e
3
1
3
=
=
1
( + 2 + 0:02) +
3
5
44
+ :
9
30
12
2
3
( + 1 + 0:01)
2
= 0:02. Expected
In case of rational expectations,
e
3j2
= E[
3j
2]
= E[ +3+
3j
2]
=
+3+E[
3j
2]
=
+ 3:
Notice that
e
3j2
>
e
3
1
3
:
In this simple economy, trend in in‡ation can be thought to represent policy changes of a central
bank. In other words, suppose there is a central bank in this economy which constantly increases
in‡ation.
One of the in‡uential drivers behind widespread adoption of rational expectations in macroeconomics has been the famous Lucas’critique. In terms of our discussion, Lucas’critique is that in
models which do not feature agents with rational expectations, agents might not react to policy
changes. This seems not to reconcile well with reality since in such a case agents can be tricked
almost always with some policy changes. There is a famous Lincoln Quote on this issue: "You can
fool some of the people all of the time, and all of the people some of the time, but you can not fool
all of the people all of the time."
Despite these seemingly plausible properties and widespread use, rational expectations assumption has received criticism on the grounds that it assumes that agents know everything. Criticism
goes on saying that in the real world agents (consumers, …rms, etc) do not know exactly the economy.4 (If this were to happen, would Economics be a science?)
4
There is an emerging …eld in macroeconomics which deals with this issue. The models in this …eld feature agents which
continuously learn the economy and know the economy fully at the end of time horizon (i.e., they are asymptotically
rational). See for details "Evans, G., and Honkapohja, S. (2001). Learning and expectations in macroeconomics.
Princeton University Press."
13
Traditional models with expectations
This section highlights how important is the assumption on expectations in two traditional models.
The models are Cobweb Model and Cagan Model.
The Cobweb Model
The Cobweb Model Kaldor (1934) proposes an explanation why prices might be subject to periodic
‡uctuations in certain markets. It assumes that …rms must choose output before prices are observed,
demand and supply (prices) are uncertain, and that …rms’have adaptive expectations with
= 1.
These type of expectations are called static expectations since there is no correction to error. Firms’
expectations about prices at time T +1 are based on the prices that prevailed just in previous period,
at time T . In other words, using p to denote prices,
peT +1 = peT ( ) + [pT
peT ( )]
= pT :
Hereafter, we will replace T + 1 with t.
It seems that such a model can be well applicable to agricultural markets. In such markets,
producers invest in production (start the production process) much before they sell their output.
Periodic ‡uctuations then can happen for example because of supply shocks such as bad weather. For
example, according this model if producers of corn experience very bad weather and have reduced
output they would get higher prices. In the next period, they expect high prices and therefore will
produce a lot. This will dampen the prices and lead to expectation of low prices. Expecting low
prices, the producers of corn will produce few corn and in equilibrium price will rise, etc.
In graphical terms the process described above can be represented in the following manner.
The di¤erence between these two …gures is the relation between the slopes of the demand and
supply curves. In the …gure to the left, the slope of the supply curve is higher than the absolute
14
value of the slope of the demand curve. In …gure to the right, the slope of the supply curve is lower
than the absolute value of the slope of the demand curve.
Suppose an economy represented by these …gures starts at time t = 0 at the intersection of S
(supply) and D (demand) curves. In that period the economy receives a negative shock to supply
so that prices in the next period are p1 . In period t = 1 the economy receives a counterbalancing
positive supply shock which brings supply curve to its original position. In this period, however,
expected prices for period t = 1 are still p1 . Producers produce according to p1 . The demand,
however, is shorter than supply. Therefore, maximum that producers are able to charge is p2 which
is lower than p1 (p2 < p1 ). For period 2 then producers expect price p2 and produce accordingly.
In such a case, in period 2 it turns out that they have produced less than the demand is. They sell
then at a higher price p3 (p2 < p3 ). This process continues and generates periodic ‡uctuations in
prices.
In case of the …gure to the left, prices tend to stabilize after the shock. This happens because
inverse supply (or supply prices) reacts more than does inverse demand (or demand prices). Faced
with higher demand in period t = 2, for example, …rms rise their prices. They do so much that
dampens the demand in the next period. However, in case of the …gure to the right, prices do not
stabilize. This happens because inverse supply reacts less than does the inverse demand.5
Lets make this model more formal. Suppose at any time t demand and supply functions are
given by
Dt = mI
St = rI + rp pet +
where mI ; mp ; rI ; and rp are positive parameters.
demand and supply curves.
1;t
and
2;t
1;t ;
mp pt +
1
mp
2;t ;
1
rp .are
and
absolute values of the slopes of
are shocks/disturbances. Let
independently distributed (i.i.d.) and have 0 mean and
2
1;t
and
2;t
be identically and
variance. The values of these shocks are
not known at the time when price expectations are formed. At time t, supply function is designed
according to the expected value of the price. To stay in line with our story, assume that the economy
starts at p0 where D and S curves intersect. Moreover,
from p0 to p1 and makes
pe1
= p1 . Further,
2;1
=
0 and
1;t
2;0 .and
2;0
< 0 so that price shifts
0 8t > 1.
2;t
Market clearing condition requires that at each and every point in time quantity demanded is
equal to the quantity supplied,
Dt = St :
Therefore,
mI
mp pt +
pt =
mI rI
mp
and
5
1;t
= rI + rp pet +
rp e
p +
mp t
1;t
mp
2;t
2;t
:
If in this economy the absolute values of the slopes of demand and supply were equal then there would be permanent
price ‡uctuations with constant magnitude.
15
Denote
1
=
2
=
t
=
mI rI
mp
rp
mp
1;t
2;t
:
mp
And rewrite pt as
pt =
Assuming that pet = pt
1
e
2 pt
1
+
t:
we have
pt =
2 pt 1
1
+
t:
This is a very basic stochastic di¤erence equation. Its solution is the sum of a general solution of
homogenous equation
pt =
2 pt 1
and particular solution of the entire equation.
The solution of pt =
2 pt 1
is very basic
t 1
p1 ;
2)
pht = (
where p1 is the price where the economy started. Now we need to guess a particular solution of
the general equation. It turns out that for this form of equations the particular solution has the
following form
ppt
=
t 1
X
(
2)
t 1
(
1
):
+
=1
Therefore,
pt = (
t
2)
1
p1 +
t 1
X
(
2)
t 1
(
1
+
Ignore the second term in this expression and notice that if
2
) :6
=1
(
2)
t 1
> 1 then the absolute value of
is increasing over time. Therefore, pt diverges to in…nity. Parameter
mp
rp .
the ratio of slopes of supply and demand curves
value of the slope of the inverse demand curve
supply). However, if
2
is the inverse of
It is greater than 1 in case when rp > j mp j ;
which is equivalent to say that the slope of the inverse supply curve
1
mp
2
1
rp
is lower than the absolute
(so that inverse demand reacts more than inverse
< 1 then the absolute value of (
t 1
2)
declines over time. Therefore, pt
converges to a number. In this case rp < j mp j ; which means that the slope of the inverse supply
curve is higher than the absolute value of the slope of the inverse demand curve (so that inverse
6
To see that this is the solution, plug this expression into the equation above.
16
demand reacts less than inverse supply). What is the number that price converges to? The answer
to this question is quite simple. Convergence here means that price stabilizes over time (i.e., we
have a steady-state). We have assumed that
t
p=
= 0 for any t > 1. Therefore, stable prices means
2 p;
1
and price converges to
p=
1
:
1+
2
Notice that p is the level of price where our example D and S intersect. In other words, p = p0 .
To make this analysis clearer, let’s consider a numerical examples. Suppose,
p1 = 1;
mI
= mp = 1;
rI
= 0:
and consider two values of rp
rp1 = 0:5;
rp2 = 1:5:
This implies that
Assuming the value of p1 and
1;t
0 and
2;t>1
2;1
=
1
= 1;
1
2
= 0:5;
t
=
2;0
2
2
= 1:5;
2;t :
determines that values of the shock (of course given that
0).
Let’s determine the paths of p for these parameter values. First, consider the case when rp1 = 0:5
p2 = 1
0:5p1 +
2
= 0:5;
p3 = 1
0:5p2 +
3
= 0:75;
p4 = 1
0:5p3 +
4
= 0:625;
:::
p15 = 0:666687012;
:::
2
p+1 = :
3
The price converges to 23 , which is equal to
1
1+
2
.
17
Now consider the case when rp2 = 1:5
p2 = 1
1:5p1 +
2
=
0:5;
p3 = 1
1:5p2 +
3
= 1:75;
p4 = 1
1:5p3 +
4
=
1:625;
:::
p15 = 175:5575562;
:::
p+1 = 1:
The price diverges. Of course, negative price does not make sense. Therefore, one would have
stopped at p2 saying that there is something fundamentally wrong in this economy.
What happens if
< 1 and we have adaptive expectations? If
pet = pet
1
+
pt
pet
1
< 1 then
1
where
pt
1
=
e
2 pt 1
1
+
t 1:
Expectations can be written as
pet = [1
e
2 )] pt 1
(1 +
+
1
+
t 1
:
In this case we have a non-homogenous di¤erence equation in expectations. Similar to the previous discussion, expectations converge to a number if j1
(1 +
2 )j
< 1 and diverge to in…nity
otherwise. Steady expectations are given by
pe = [1
p
e
(1 +
1
=
1+
2 )] p
e
+
1;
:
2
Since we have adaptive expectations, pe is steady when prices are steady
p=
1
1+
:7
2
Interestingly, there can be situations when prices (and their expected values) are stable under
adaptive expectations with
that 1
7
(1 +
2)
< 1 but unstable under static expectations. For instance, suppose
< 0 then since by de…nition
To verify this plug pe =
1
1+ 2
into p =
1
2p
e
2
> 0 the absolute value of 1
:
18
(1 +
2)
is always
less than
2:
j1
(1 +
(1 +
This implies that if 1
(1 +
2)
2 )j
2)
2
,
2)
1
< 1:
< 0 then it is more likely that prices (and their expected values)
< 1 than under static expectations. This happens
suppresses the reaction of inverse supply curve to the observed price. For instance,
suppose that
j1
(1 +
1 <
are stable under adaptive expectations with
because
=
(1 +
2
2 )j
= 1:5 so that we are back to our example. Further, suppose
= 0:5. In such a case
= 0:25 and prices (together with their expected values) are stable under adaptive
expectations. We have seen, however, that for this example prices (and their expected values) are
not stable under static expectations.
In case, however 1
In case
that
1
1+
2
(1 +
2)
> 0 then
j1
(1 +
1
(1 +
2 )j
= 1
2)
<
2
(1 +
1
,
1+
2
< :
2
> 1 (and therefore prices are not stable under static expectations) it is always the case
< , which means that adaptive expectations are more likely to generate stable prices
2
2
(and expectations). For instance, again suppose that
j1
2)
(1 +
2 )j
= 1:5 but now
2
= 0:1. In such a case
= 0:75 and prices (together with their expected values) are stable under adaptive
expectations.
What would happen if we had rational expectations in this model? In case of rational expectations, expectations are implied by the model. In order to answer the question then we need to solve
for expected price level from demand and supply equations. Luckily, we have done most of the job.
Write Et
1 [pt ]
instead of pet in
1
e
2 pt
t:
+
pt =
2 Et 1 [pt ]
1
+
t:
Take the expected value of pt conditional on information at time t
Et
1 [pt ]
= Et
=
1 [ 1]
2 Et 1 [Et 1 [pt ]]
2 Et 1 [pt ] :
1
Therefore,
Et
1 [pt ]
=
19
1
1+
2
1.
+ Et
1 [ t]
and this relation holds for any t.8 This implies that in short-term
pt =
1
1
2
+
1+
2
t;
and shocks can be the only reason of ‡uctuations in the economy. Such an inference holds since in
this case the realizations of are not important given that it has a zero mean and is i.i.d./unpredictable.
In the long-term, however, we assume that there are no shocks. In some sense this corresponds to
assuming that everything is perfect. In such a case, Et
p=
1
1+
1 [pt ]
= Et
1 [pt ]
and
:
2
This analysis points that types of expectations matter for the level of aggregate economic activity
and prices and for ‡uctuations. In case of static or adaptive expectations in this model there can
be prolonged ‡uctuations in the economy much after the shock. However, in case of rational
expectations there are no ‡uctuations after the shock.
The Cagan Model
Cagan Model has been an in‡uential contribution to policy making and academia. Cagan in his
1956 article (Cagan, 1956) proposed a model which delivered novel explanation for extraordinarily
high in‡ation/hyperin‡ation. This model seems to do well in explaining for the behavior of in‡ation
and the demand for money even in the midst of such distress.
The common line of thought is that hyperin‡ation is because of continued supply of very large
quantities of money by the central bank. Famous examples are hyperin‡ation in Germany during
the period of 1921-1923 and Argentina in 1989. Hyperin‡ation in both countries happened because
their governments decided to fund their debt printing money.
Cagan Model stresses the destabilizing e¤ects that expectations might have on in‡ation. In
Cagan Model in‡ation can destabilize and become very large even with very small amount of money
injected into the economy. Therefore, an implication of the model is that policy makers should be
wary with …nancing a de…cit by printing money even of small quantities.
Cagan Model consists of several blocks. The …rst block of this model is money demand equation
Mt
= L (Yt ; it ) ;
Pt
where Mt is the amount of desired money holdings, Pt is aggregate price, and money is the medium
of exchange. Therefore,
Mt
Pt
is the real amount of desired money holdings. L (Yt ; it ) is money demand
function. It is assumed to depend on current output/income Yt and nominal interest rate it . More
precisely, it is assumed to increase with current output/income since increasing output implies
8
We have just found expectation from the model. In this sense we have assumed that agents which live in the economy
described by the model, know the economy and use the model to form expectations. Moreover, in this sense adaptive
expectations are a notion imposed on the model (additional condition/assumption so to say.)
20
that the same amount of money buys more goods. In turn, money demand is assumed to depend
negatively on nominal interest rate. A justi…cation for this assumption is that nominal interest rate
is paid on bonds/nominal savings but it is not paid on money holdings. Money holdings can be
freely converted into savings. Therefore, nominal interest rate is the opportunity cost of holding
money. Higher nominal interest rate increases this opportunity cost and reduces the desire to hold
money. (This equation can be thought to be a result of combination of Quantity Equation of Money
and Liquidity Preference Theory.)
Suppose that L (:; :) is di¤erentiable function in both arguments. Formally, our assumptions
about L (:; :) can be summarized as
@L (Y; i)
@Y
@L (Y; i)
@i
> 0;
< 0;
where time index t has been dropped because these inequalities are assumed to hold for any t.
The second block is the Fisher Equation. In case we have a deterministic setup Fisher Equation
is
1 + it = (1 +
t+1 ) (1
+ rt ) ;
where it is the nominal rate agreed for bonds at time t and paid at time t + 1 on bond holdings
from time t.
t+1
is in‡ation in the period from t to t + 1,
t+1
=
Pt+1 Pt
:
Pt
In turn, rt is real interest rate.
It is straightforward to obtain this equation. Suppose at time t one has acquired an asset At
at value Pt At . At time t + 1 the value of the asset has changed to Pt+1 At+1 . The percentage
change of the value of asset is the nominal interest earned on Pt At and the percentage change of
the volume/quantity of the asset is the real interest rate,
Pt+1 At+1 Pt At
Pt+1 At+1
=
1;
Pt At
P t At
Pt+1
At+1
=
1;
Pt
At
Pt+1 Pt
At+1 At
=
+1
+1
Pt
At
= (1 + t+1 ) (1 + rt ) 1:
it =
1;
In Cagan Model, however, future prices are not known. They are assumed to follow a rane . This also
dom/stochastic process. In such a case, we replace Pt+1 with its expected value Pt+1
21
implies that we need to replace in‡ation rate with its expected value, and Fisher Equation becomes
1 + it = 1 +
e
t+1
(1 + rt ) :
There is no expectation on it since, although it has to be paid at time t + 1, it is determined/agreed
at time t.9
The analysis in Cagan Model particularly focuses on periods of hyperin‡ation. In these periods
nominal values change much more rapidly than real values. Therefore, lets assume that real variables
are constant. Moreover, assume that the logarithm of the money demand function is linear in ln Yt
and ln (1 + it ). In particular,
ln L (Yt ; it ) =
0
+
1 ln Y
= :
where
0,
1,
and
ln
ln 1 +
are positive constants and
=
1+
e
t+1
0
e
t+1
(1 + r)
;
+
1 ln Y
ln (1 + r).
In such a case, combining money demand equation and Fisher Equation we have to have that
ln Mt
e
ln Pt+1
ln Pt =
ln Pt :
Denote z = ln Z and rewrite the equation above
mt
or
pt =
1+
pet+1
pt =
+
1
1+
mt +
pt ;
1+
pet+1 ;
This is a fairly interesting equation. It suggests that current price is function of current money
supply and expected price level in future. Moreover, current price increases with money supply. This
happens because, given constant output increasing money supply simply increases prices. (Prices are
the rates at which money is converted to goods). Current price also increases with expected future
price. This happens because higher expected price increases expected in‡ation. Given constant
real interest rate, this implies that nominal interest rate increases, which reduces the desire to hold
money. However, the supply of money is unchanged, i.e., Mt = f ixed. Therefore, the rates at which
money is exchanged for goods increase which is the same as to say that prices, pt , increase. In terms
of equations:
pet+1 ")
e
t+1
If we assume that
9
" and rt = const ) it ") L (Yt ; it ) # and Mt = const ) pt " :
= 0, then price (well... the logarithm of it) becomes a weighted average of
You might be used to the following form of Fisher Equation it = et+1 + rt . This is an approximation of the one stated
above. To see this assume that it ; et+1 , and rt are close to 0 and apply the following approximation x = ln (1 + x)
when x is close to zero.
22
current money supply and expected in‡ation
pt =
1
1+
mt +
pet+1 :
1+
For simplicity we will maintain this assumption and to keep the house clean will denote
=
1
1+
(0; 1) so that
) pet+1 :
pt = mt + (1
2
(1)
This equation together with assumption on expectations is the Cagan Model.
We will analyze now how di¤erent types of expectations matter for dynamics in this economy.
Moreover, we will check how changes in money supply matter for changes in prices and in‡ation,
which is the percentage change in prices.
Suppose that agents in this economy have adaptive expectations. Further, for simplicity suppose
1 coincided with its realization pet
that the expected value of price for time t
pt
1
= pt
2
and
t 1
1
= pt
1
(i.e.,
= 0). In such a case
pet+1 = (1
) pet + pt
= (1
) (1
) pet
1
+ pt
1
= (1
) [(1
) pt
1
+ pt
1]
= (1
) pt
1
+ pt
+ pt
+ pt :
Therefore, the Cagan Model can be expressed as
pt =
1
(1
)
mt +
(1
1
) (1
)
pt
(1
)
1:
The general solution of this di¤erence equation is the sum of the general solution of homogenous
equation and particular solution of this equation. It is "easy" to guess and verify that it is given by
pt =
(1
1
) (1
)
(1
)
t
p0 +
1
(1
)
t
X
(1
1
=0
) (1
)
(1
)
t
m ;
where p0 is the initial level of prices. It is a given number.
Suppose only few mt 6= 0 (or mt is a stationary function). The logarithm of the price level in
such case is stable if
(1
1
) (1
)
< 1:
(1
)
With this inequality, the logarithm of the price level converges over time to 0. Therefore, price
23
converges to 1. In‡ation, in turn, can be written as
= pt
t
pt
1
(1
1
=
+
1
(1
t 1
) (1
)
(1
)
(
1
mt
)
It is straightforward to notice that
(1
1
p0
)
(1
t 1
X
(1
1
)
(1
)(1 )
1 (1
)
=0
) (1
)
(1
)
t 1
m
)
< 1. Therefore, we have stability in terms of prices
and in‡ation in this economy if expectations are for prices. This situation is not so interesting for
our current purposes.
Consider now how things change if we have adaptive expectations not for price levels but for
in‡ation:
e
t+1
= (1
e
t
)
+
t:
Suppose again that at time t 1 prices were stable. Therefore, in‡ation was equal to 0. This implies
that
e
t+1
= (1
=
) (1
e
t 1
)
+
t 1
+
t
t
or equivalently,
pet+1
pt =
(pt
pt
1)
Therefore,
pet+1 = (1 + ) pt
pt
1:
This can drastically change the solution of the model. To see that, plug this expression back into
(1) and obtain
pt =
1
(1
) (1 + )
mt
1
(1
(1
)
pt
) (1 + )
1:
The general solution of this di¤erence equation is
pt =
1
(1
(1
)
) (1 + )
t
p0 +
1
(1
) (1 + )
t
X
=0
1
(1
(1
)
) (1 + )
t
m ;
For the current purposes we know the model su¢ ciently well. Let’s focus on the main novelty
in terms of inference which Cagan introduced. Suppose that p0 = 0 and economy is at p0 at time t
and m
0. Therefore, pt = p0 for any t. In this situation money supply is constant and is equal
to 1. Economy, price levels, and in‡ation are stable
pt = 0;
24
t
= 0:
Consider a deviation from this situation. Suppose the government in this economy wants to
raise money supply at the beginning of time t = 0 so that m0 > 0 and keep m
0 for the rest of
the periods. Perhaps, it does so in order to …nance its de…cit. In such a case
pt =
Therefore, if
(1
1 (1
1
)
)(1+ )
(1
) (1 + )
1
(1
(1
)
) (1 + )
t
m0 :
< 1 over time prices converge back to 0: However, if
(1
1 (1
)
)(1+ )
>
1 then prices diverge to in…nity. This happens even though only m0 > 0 so that there is no
continual expansion of money supply. This is the "possible" destabilizing e¤ect of (forward looking)
expectations. In‡ation in this case is given by
t
=
1
(1
Clearly, in case when
(1
(1
) (1 + ) 1
(1
1 (1
)
)(1+ )
)
+1
) (1 + )
1
(1
(1
)
) (1 + )
t 1
m0 :
> 1 in‡ation is also ever growing in absolute terms or spiralling
out of control.
The summary of the main novelty here is: In case expectations are for in‡ation, for certain
parameter values it is enough to slightly increase money supply for a very short period to have ever
increasing prices and in‡ation because of expectations over in‡ation. This is in contrast with the
common line of thought that in order to have ever increasing in‡ation and prices money supply
has to increase permanently. Moreover, it suggests that …scal authority (central government) and
monetary authority (central bank) should be independent so that government would not be able to
…nance its de…cit printing money. This independence is implemented in many countries as of now
and even is part of constitutions of some of those countries.
What would happen if we had rational expectations in this model? (AGAIN) Rational expectations assumption means that the agents use the model to form their expectations. Therefore they’ll
use the following equation for forming expectations:
) pet+1 :
pt = mt + (1
Replace pet+1 with Et [pt+1 ] and rewrite this equation
pt = mt + (1
) Et [pt+1 ] :
Take expectation over information that available at time t
Et
1 [pt ]
= Et
1 [mt ]
+ (1
) Et
1.
1 [Et [pt+1 ]] :
According to the Law of Iterated Expectations
Et
1 [Et [pt+1 ]]
= Et
25
1 [pt+1 ] :
Lets assume mt is deterministic. Therefore, we have a di¤erence equation in terms of expectations
Et
1 [pt ]
= mt + (1
) Et
1 [pt+1 ] :
Since the coe¢ cient in front of the lead variable is less than one it has to be that expected price
tends to in…nity in this model. Therefore, with rational expectations assumption we don’t have
convergence/stabilization of the economy. Prices and their expected values diverge to in…nity.
This situation is called self-ful…lling in‡ation. Agents expect to have high in‡ation. Prices rise
accordingly, and agents’expectations ful…ll.
If we impose though additional condition that
lim (1
1
)
!+1
Et [pt+ ] = 0;
then we will have stability in this model. This condition is sometimes called "no bubble" condition
in the sense that it makes sure that expected prices do not become in…nitely large.
Our main equation implies that
pt =
mt + (1
) Et [pt+1 ] :
Et [pt+1 ] =
mt+1 + (1
) Et [pt+2 ]
Et [pt+2 ] =
mt+2 + (1
) Et [pt+3 ]
:::
Plugging back Et [pt+1 ], Et [pt+2 ], etc, gives
pt = mt + (1
) f mt+1 + (1
) f mt+2 + (1
) Et [pt+3 ]gg :
Iterating till in…nity we have
pt =
+1
X
(1
) mt+ :
=0
Therefore, with "no bubble" condition prices are forward looking and can be stationary. Prices
are forward looking implies that they take into account future changes in policy/money supply.10
Basically, this is the Lucas Critique to models which do not feature rational expectations. In those
models changes in policy parameters do not a¤ect expectations and therefore current actions and
economic outcomes. However, if expectations are rational then policy changes can a¤ect actions
and economic outcomes immediately and therefore change the environment which the change of
policy/policy-makers might not have anticipated.
Suppose that m
10
0; then pt = 0. If m is constant and equal to m > 0 then by the formula of
This is one of the reasons why it might be important to announce policies much before their implementation.
26
geometric progression we get
+1
X
pt = m
(1
)
=0
= m:
This holds for any t. Notice that in this case pt is constant which is in sharp contrast to the case
of adaptive expectations assumption.
If instead, for example, mt+1 = mt+2 = m
~ and mt+k = m for any k > 2, then with the same
logic
pt+3 = m
"
pt+2 =
"
pt+1 =
m
~ +m
+1
X
(1
)
=1
m
~ + (1
)m
~ +m
= m
~ + (1
+1
X
(1
)
=2
= [ +
This implies that if m0 > 0 and m
#
(1
)] m
~ + (1
) m;
#
)2 m:
0 for the rest of the periods then p0 =
m0 and pt = 0 for
any t > 0.
Lets extend slightly the model and assume that money supply is not deterministic but follows
a random process
mt = m +
where m is constant and
is the realization of
t:
is an i.i.d. random variable with 0 mean and constant variance .
t
at time t and is known at time t. Therefore, mt at time t is not a random
variable. However, mt+1 is a random variable at time t since its realization is not known. The
assumption that mt = m+
t
corresponds to assuming that money supply is subject to unanticipated
shocks. Where could such shocks come from? A source for such shocks in the model could be that
the government/central bank does not fully announce its money supply policy and allows some
random/unpredictable changes.
Maintaining the "no bubble condition" price can be rewritten as
pt =
+1
X
(1
) Et [mt+ ] ;
=0
where of course Et [mt ] = Et [m +
t]
pt =
=m+
(m +
t
since the value of
t) + m
+1
X
(1
=1
=
(m +
t)
+ (1
27
) m:
t
)
is known and
Notice that this is similar to what we previously obtained and given that
are not predictable they
don’t matter for expectations and prices.
Appendix
If expectations are for prices then prices are stationary. What is the value they converge to when
money supply is constant? Here is the answer to that question:
If money supply is constant m > 0; p0 = 0, and
lim pt =
t!+1
1
(1
(1
)(1 )
1 (1
)
< 1 then
t
X
(1
t!+1
1
) (1
)
(1
)
m lim
)
=0
t
Rewrite this equation as
t h
X
(1
)(1 )
1 (1
)
lim pt =
t!+1
and notice that
+1 h
X
(1
)(1 )
1 (1
)
=0
t h
X
(1
)(1 )
1 (1
)
lim
t!+1
=0
h
(1
)(1 )
1 (1
)
i
i
t
i
1
(1
=0
m lim
)
t!+1
h
(1
)(1 )
1 (1
)
i
i
t
= +1. Apply L’Hôpital’s rule to …gure out the limit:
@
@t
=
=
lim
t!+1
lim
t!+1
t h
X
(1
)(1 )
1 (1
)
=0
@
@t
h
(1
)(1 )
1 (1
)
h
i
i
t
i t
(1
)(1 )
1 (1
)
h
i t h
i
(1
)(1 )
(1
)(1 )
ln
1 (1
)
1 (1
)
=
ln
h
1
1 (1
)
(1
)(1 )
i:
At the end of time horizon, price converges to
lim pt =
t!+1
1
(1
) ln
h
1
1 (1
)
(1
)(1 )
i m:
The Lucas Imperfect Information Model with AD
The Lucas Imperfect Information Model provides us with an explanation of why in short run aggregate supply curve might be upward sloping, but not vertical.11 Expectations matter for dynamics
of prices and quantities in this model too. Prior to proceeding to the model lets digress shortly and
review AD-AS Model.
11
Macro II subject slightly covers this model. You can …nd and read a slightly di¤erent version of this model in Part A
of Chapter 6 of Romer (2006) textbook.
28
The AD-AS Model consists of two equations/curves: aggregate demand (AD) and aggregate
supply (AS). Aggregate demand curve is determined from the IS-LM Model. IS-LM-model has two
equations: investments-savings and liquidity-money (money demand). These equations are
[IS] : Y = C (Y; T ) + I (r) + G;
M
[LM ] :
= L (Y; r + e ) ;
P
where C is consumption, I is investment, G and T are government spending and taxes (…scal policy
parameters), M is money supply/demand (monetary policy parameter). C increases with Y and
declines with T . I declines with r.
In a period, prices are assumed to be given. Money supply is also given. The central bank
controls it. Therefore, unknowns are Y and r in the IS-LM Model. Graphically this model can be
represented in the following manner.
The solutions of Y and r depend on price level P , …scal and monetary policy parameters G; T ,
and M , and on expected in‡ation level,
e.
The solution of Y is the aggregate demand curve,
AD = Y (P; G; T; M;
e
):
It declines with P . Negative relation stems from LM curve. This curve shifts up when prices
increase implying higher equilibrium interest rate and lower income.
The graphical representation of AD-AS model is
In this …gure, LRAS is long-run aggregate supply (AS) curve. SRAS is short-run aggregate
supply curve. The former is vertical since we assume that in long-run factor inputs are given/prices
29
are determined in the model. In turn, the latter is upward sloping and lower is its slope higher is the
e¤ect of shifts of aggregate demand on income. Shifts of aggregate demand AD = Y (P; G; T; M;
e)
in this model can stem from changes in policy parameters G; T; and M , and expected in‡ation level
e.
Lucas Imperfect Information Model provides us with a SRAS curve.
In Lucas Imperfect Information Model, there are N …rms. Each …rm i (i = 1; N ) is assumed to
face demand
d
i
where
y
N
=
y
+
N
(pi
i
is income spent on each good (y = ln Y ).
p) ;
i
is a taste parameter which has normal
distribution with 0 mean and is i.i.d. across …rms. pi is the price of the product of …rm i. p is the
aggregate level which consumers face. Aggregate price levels is average price in this model
p=
N
1 X
pi :
N
i=1
Consumers know this price level since they are consuming all types of goods produced in the
economy.
Further each …rm i is assumed to have the following supply function
s
i
where
and
=
+
pe ) ;
(pi
are positive parameters, pi is the price of the good of …rm i, and pe is …rms’expected
aggregate level price. There is intra-temporal uncertainty here which stems from the assumption
that …rms cannot observe the prices (shocks) of their rivals. This is especially meaningful for
…rms from di¤erent industries. Supply positively depends on the gap between pi
pe which is the
(expected) relative price of the …rm. In this regard, for example pi > pe can happen when …rm i
has received positive shock to demand relative to others.
Market clearing requires that in equilibrium
+
(pi
pe ) =
s
i
d,
i
=
y
+
N
(pi
i
p) :
Therefore, solving for the price of …rm i we get
pi =
1
+
y
+
N
+
i
1
+
( pe + p) :
(2)
The aggregate supply is then
y = N [ + ( + ) pi
( pe + p)
i] :
Take the average of this expression and assume that N is high. According to the Law of Large
30
Numbers then
1
N
N
X
i
= 0 and
i=1
y=N[ +
(p
pe )] :
If there is no uncertainty, which means that pe = p, then Y is vertical
y = N:
This is long run aggregate supply curve (LRAS).
The expected price is conditional on observation of pi ,
pe = E [ pj pi ]
(3)
so that …rms can update their beliefs within a period. Since
i
is driving the uncertainty in this
model and it has normal distribution, p has a normal distribution too. It can be shown that in
such a case the conditional expectation of p is its expected value plus an error term which has zero
expected value,
E [ pj pi ] = E [p] + (pi
= (1
where
> 0, and the error of prediction is pi
E [p])
) E [p] + pi :
(4)
E [p].
Take expected value of (2) conditional on pi
pi =
1
+
y
+
N
+ E [ pj pi ] :
i
(5)
Plug (4) into (5) and take the average of pi ,
p=
1
( + ) (1
y
) N
Compute aggregate output denoting ( + ) (1
+ E [p] :
) N = b,
y = y + b (p
E [p]) :
(6)
This is short run aggregate supply curve. Clearly it is upward sloping. In case p > E [p] clearly
in short run y > y . Such a relation holds since if p > E [p] then on average …rms have received a
positive shock (their prices have turned out to be higher than expected aggregate price) and have
increased output.
Combining aggregate supply curve with aggregate demand curve gives equilibrium levels of price
and quantity. To keep the discussion as simple as possible suppose that aggregate demand is given
by the Quantity Equation of Money
P Y = V M:
31
Suppose further that the velocity of money is equal to 1. The logarithm of the aggregate demand
therefore is
y=m
p:
Then prices can be determined from
y + b (p
E [p]) = m
p;
(7)
which is common market clearing condition.
Under rational expectations the agents use (7) to derive their expectations. To …nd it, take the
expected value of (7),
E [p] = E [m]
y:
Just as in Cagan Model, therefore, agents’expectations of prices depend on expectation of nominal
money stock. Plugging this back into (7) and solving for p gives equilibrium price levels
p=
b
1
E [m] +
m
1+b
1+b
y:
Aggregate price level in equilibrium is a weighted average of expected money supply E [m] and
actual money supply m. Plugging p from this expression into the aggregate demand equation gives
the equilibrium level of output
y=y+
b
(m
1+b
E [m]) :
This implies that under rational expectation assumption short run deviations of output from
long run level are possible only through unanticipated changes in money supply. If money supply
rule is known then simply m
E [m] = 0 and y = y. Therefore, output ‡uctuations in this model
with rational expectations assumption are purely driven by unanticipated changes in money supply.
Prices, though, respond to unanticipated changes to money supply.
We have a version of the AD-AS Model where aggregate demand curve is
y=m
p;
which means that aggregate demand is assumed not to depend on …scal policy. This is the reason
why inference does not include policy parameters other than money supply.
Lucas Imperfect Information Model with AD implies a very well known and very debated relationship in macroeconomics which is called Phillips Curve. Phillips Curve illustrates the short-run
trade-o¤ between in‡ation and output. To derive it assume that long-run output is …xed and put
time index in (6) to obtain
yt = y + b (pt
Et
= y + b [(pt
pt
1)
= y + b(
Et
1 [ t ]) :
t
1 [pt ])
(Et
32
1 [pt ]
Et
1 [pt 1 ])]
In short we have an expectation augmented Phillips Curve
yt = y + b (
Et
t
1 [ t ]) :
In case when in‡ation is higher than expected, short-run output is higher than the long-run output.
In this sense, if policy makers can manage to full the agents they can increase short-run output.
Usually Phillips Curve is written, however, as a trade-o¤ between in‡ation and unemployment.
Assuming that output is inversely proportional to unemployment and using u to denote the long-run
level of unemployment, Phillips Curve can be rewritten as
ut = u
~b (
Et
t
1 [ t ]) ;
where ~b > 0 is a constant. Increasing then unanticipated in‡ation reduces unemployment.
The Sticky Wage Model
In Macroeconomics II you have also seen a version of the Sticky Wage Model. We will now consider
a version of that model adding microstructure which hasn’t been discussed in detail in the previous
course. We will show again that expectations matter for macroeconomic outcomes.
Suppose that there is a continuum of mass one of identical and in…nitely lived consumers. Each
consumer is endowed with L units of time. The consumers can supply their time to …rms at
wage rate w or use it for leisure. The consumers derive utility from consumption of N di¤erent
types of goods fxj gN
j=1 and have disutility from supplying labor. They have constant elasticity of
substitution utility function of the form
U (x1 ; :::; xN ; L
l) =
N
X
xj +
ln (L
l) :
j=1
where
2 (0; 1) and
1
1
is the elasticity of substitution between di¤erent pairs of goods x.12 l is
the amount of labor force that the representative consumer supplies to …rms and L
time.
l is its leisure
is a positive parameter.
The consumers maximize their utility subject budget constraint taking prices pxj
N
j=1
of goods
x and wage rate as given. Therefore, the representative consumer solves the following problem in
x
12
The elasticity of substitution between any i and k (i 6= k) pair of x is
33
d ln x i
k
du(xk )=dxk
d ln
du(xi )=dxi
=
1
1
.
each period of time
8
N
<X
max
:
fxj gN
j=1 ;l
j=1
s:t:
N
X
0 = wl
xj +
ln (L
l)
9
=
;
pxj xj
j=1
To solve this problem we use Lagrangian
L = max
Normalize
8
N
<X
:
fxj gN
j=1 ;l
j=1
xj +
2
l) + 4wl
ln (L
N
X
j=1
39
=
pxj xj 5
;
= 1. The optimal rules are given by …rst order conditions
[xj ] : pxj = xj 1 ;
1
;
[l] : w =
L l
where the …rst one is the inverse demand function of good xj and the second one is the inverse
supply function of labor. Notice that there are actually N inverse demand functions because there
are N goods x. This means that we have N + 1 optimal rules.
Assume that e¤ective wage rate is given by
w = pe f (u; z) ;
where pe is the expected aggregate price level (pe = E
h
1
N
i
p
j=1 xj ), f is a function which decreases
PN
in the …rst argument and increases in the second argument. u is the level of unemployment in the
economy, and z are institutional features in the economy such as unemployment bene…ts, minimum
wages, etc.
Suppose that there are N …rms. Each …rm produces a type of x good. Firms’ input for
production is labor and a unit of labor produces a unit of output in all …rms. Firms are price
setters. Therefore, the problem of the …rm which produces good xj is
max pxj xj
wxj
xj
s:t:
pxj = xj
1
:
This implies that the inverse supply function is
p xj =
34
1
w:
Inverse supply function implies that pxj
p and from the inverse demand function it follows that
in equilibrium all x goods are produced at the same quantities xj
x.
Notice that in this framework since …rms are price setters price is higher than marginal cost w.
As
tends to 1 goods become more similar/substitutable since
1
1
tends to in…nity and px tends
to the marginal cost.
Given that in equilibrium all …rms produce the same amount of output they hire the same
amount of labor. Since there is a continuum of mass one of consumers, the total amount of labor
that …rms hire is l. This implies that unemployment rate is
u=
L
l
L
l 13
:
L
=1
Moreover, the production of a unit of good requires unit of labor. Unemployment rate then can be
rewritten as
Y
L
u=1
where Y is total output, Y =
The intersection of
PN
j=1 xj .
w
p
w
p
=
pe
f
p
=
;
Y
;z ;
L
1
(8)
gives equilibrium level of output and prices. At the same time it gives equilibrium level of unemployment.
=
pe
f
p
1
Y
;z :
L
Suppose that expected price relative to actual price
equation to continue to hold f 1
Y
L;z
pe
p
(9)
increases. In such a case, in order this
should decline. That will happen if unemployment in-
creases/output declines.
13
Although we treat this as unemployment rate, this is actually the non-participation rate. Unemployment rate would
be the di¤erence between participation rate in a non-distorted economy and the participation rate in this distorted
economy.
35
Equation (9) holds in each period of time. Add time subscripts to it and write
pt =
1
f
1
Yt
; z pet :
L
This is the aggregate supply curve. In long-run pt = pet so that long-run output and unemployment
are given by
=f
1
Y
;z :
L
In short-run price level pt is an increasing function of Yt and expected price level pet . Higher Yt in
this setup increases the wages paid according to (8). This increases the marginal costs of the …rms
and therefore the prices that they charge. Higher expected price also increases wages according to
(8) and therefore results in higher actual prices.
If we assume for example that
f (ut ; z) = 1
ut + z =
Yt
+ z;
L
then the short-run aggregate supply curve is given by
pt =
1
or
Yt =
Yt
+ z pet ;
L
pt
pet
z L:
Long-run aggregate supply curve then will be
Y =(
z) L:
36
The E¤ectiveness of monetary policy (with …xed rules)
We have seen that expectations and their types can matter for aggregate economic performance.
They can matter for both the levels of nominal and real variables and for their dynamics. We will
now turn to more rigorous discussion of how monetary policies can matter for economic performance
given the type of expectations.
A model with stabilizing monetary policy
Often policy makers perceive an agenda of stabilizing the economy. They set …scal and monetary
policy rules in order to achieve that. The reason they do so is that consumers/households are
believed to be risk averse. Therefore, they might be better-o¤ if the economy is more stable. The
intuition behind such a results is that risk averse consumers prefer steady income over volatile
income.
This (sub-)section discusses properties and e¤ects of a monetary policy rule which attempts to
stabilize the economy. The discussion follows a simpli…ed version of the AD-AS Model.
Since the focus of the section is on monetary policy, aggregate demand (in logarithms) is simpli…ed to
yt = m t
t
pt +
t:
can be thought to be the velocity of money. In such a case, this equation is the Quantity Equation
of Money.
In the reminder we will assume that
t
is a random variable which follows a simple AR(1) process
of the form
t
where
2 (0; 1) is a parameter and
t
=
t 1
+
t;
is an i.i.d. random variable with 0 mean and
the unconditional expected value of
Denote with
(10)
t,
E[
=
Denote with
2
unconditional variance of
2
t 1]
+ E [ t]
+ 0:
t,
V [ t ]. Unconditional variance of
= E[
t
E [ t ]]2
= E [ t ]2
Plug
t 1
+
t
for
t,
E [ t ]2 = E [
t 1]
2
+ 2E [(
37
variance.
E [ t ]. It is straightforward to see that
is equal to 0,
E [ t] =
2
t 1 ) ( t )]
+ E [ t ]2
t
is
Given that
t
is i.i.d. E [(
t 1 ) ( t )]
= 0. Therefore,
2
2
=
2
1
Conditional on information available at time t
Et
1 [ t]
=
Et
=
:
1 mean and variance of
1 [ t 1]
+ Et
t
are
1 [ t]
(11)
t 1;
and
Vt
1 [ t]
= Et
1[ t
= Et
1[
Et
t 1
+
2
1 [ t ]]
t 1]
t
2
=
2
:
Aggregate supply in this model is given by
yt = pt
wt ;
where wt are wages. Aggregate supply increases with prices and declines with wages, which summarize/represent the costs of the …rms.
In this model long-run level of output, prices, and money supply are normalized to 1 so that in
long-run
y = m = p = w = 0:
Agents are assumed to live for 2 periods. Their wage contracts …x wages for 2 periods. In
this sense contracts are for a long term and wages are sticky. Wages are …xed according to the
agents’ expectation of prices for 2 periods. At birth (denote by t
2) they inherit information
about previous periods of time from their parents and require wage at time t according to
wt =
1 e
p
2 tjt
1
+ petjt
2
:
(12)
Monetary authority pursues an agenda of stabilizing the economy. It tries to set money supply
so that to reduce the e¤ect of shocks
on output conditional on information it has. The shock
arrives after monetary policy has been implemented. In other words, monetary policy authority
sets money supply at the beginning of time t not knowing the realization of the shock
at time
t, but knowing its realizations in earlier periods In this sense monetary policy authority does not
have superior information.
Suppose that monetary policy rule/money supply is
mt = ht + (1
38
) ht
1;
(13)
where
and
Et
2 [0; 1] and
< 0.
1 [ t]
=
ht =
t 1;
With this policy rule monetary policy authority can reduce the expected shock
t 1
since when
increases money supply declines.
t 1
In equilibrium aggregate demand is equal to aggregate supply,
pt
wt = mt
pt +
t:
Therefore,
1
(mt + wt +
2
1
(mt wt +
2
pt =
yt =
t) ;
(14)
t) :
(15)
Plugging monetary policy rule (13) and wages (12) into output equation gives
yt =
In this expression,
t 1
1
2
t 1
and
t 2
+ (1
)
t 2
+
1 e
p
2 tjt
t
1
+ petjt
:
2
are known at time t.
The dynamic path of the model is given by
pt =
t
1
2
t 1
=
yt =
t 1
+
1
2
+ (1
)
t 2
+
t
)
t 2
+
t
+
1 e
p
2 tjt
1
+ petjt
2
;
1 e
p
2 tjt
1
+ petjt
2
:
t;
t 1
We have, therefore, unknowns yt ,
+ (1
t,
pt , and petjt
1
and petjt
2
and need equations for the latter two.
Suppose that agents have rational expectations in this model. In such a case, expected prices
need to be …gured out from the model and
petjt
1
= Et
e
1 [pt ] ; p tjt 2
= Et
2 [pt ] :
Take the expectation of price pt (14) conditional on information at time t
Et
1 [pt ]
=
1
(Et
2
1 [mt ]
+ Et
1 [wt ]
+ Et
1 [ t ]) :
According to (13) and (11) the conditional expectation of mt is
Et
1 [mt ]
=
=
Et
1 [ t 1]
+ (1
+ (1
)
t 1
39
) Et
t 2:
1 [ t 2]
1,
Et
2 [pt ]
is a given number at time t 1, which means that Et
1 [Et 2 [pt ]]
= Et
2 [pt ].
According
to (12), then, the conditional expectation of the wage rate is given by
Et
1 [wt ]
1
Et 1 [Et 1 [pt ] + Et 2 [pt ]]
2
1
(Et 1 [pt ] + Et 2 [pt ]) :
2
=
=
Therefore, the expected value of pt conditional on information from time t
Et
1 [pt ]
=
1
( +
2
)
+ (1
t 1
)
t 2
+
1
(Et
2
1 [pt ]
1 is
+ Et
2 [pt ])
2 [pt ]
:
;
or equivalently,
Et
1 [pt ]
=
2
( +
3
)
t 1
+ (1
)
t 2
1
+ Et
2
(16)
This means that expected price is a function of previous realizations of shock and expectations.
We still need an equation for Et
information at time t
2 [pt ].
Take the expectation of price pt (14) conditional on
2,
Et
2 [pt ]
=
1
(Et
2
According to (10), the expected value of
Et
2 [ t]
2 [mt ]
+ Et
2 [wt ]
+ Et
2 [ t ]) :
conditional on information at time t
t
= Et
2
=
2
2
t 2
+
t 1
+
2 is
t
t 2:
Conditional expectation of monetary policy rules is
Et
2 [mt ]
=
Et
= [1
2 [ t 1]
(1
+ (1
) ]
) Et
2 [ t 2]
t 2:
In turn, according to (12) and the Law of Iterated Expectations conditional expectation of wage
rate is
Et
2 [wt ]
1
Et 2 [Et
2
= Et 2 [pt ] :
=
1 [pt ]
+ Et
2 [pt ]]
Therefore,
Et
2 [pt ]
=
1
[1
2
(1
) ]
t 2
+ Et
2 [pt ]
+
or equivalently,
Et
2 [pt ]
= (1
) +
40
+
2
t 2;
2
t 2
;
The expected value of price conditional on time t
Et
2
( +
3
1 (16) can be rewritten as
1
3 (1
3
) +
+
2
1
1
[( + ) t 1 + (1
) t 2] + t
2
2
1
1 2
( + ) t 1+
3 (1
) +
4 3
3
1
(1
)+
+ 2 t 2:
4
+
2
1 [pt ]
=
)
t 1
+
t 2:
In turn, output (15) can be rewritten as
yt =
Group together the coe¢ cients of
t 1
and
t 2
and use the condition that
t 2
t 1
=
t 2
+
t 1
to
rewrite output as
yt =
1
( +
3
)
t 1
+
1
:
2 t
The …rst term in this expression is the e¤ect of shocks in previous periods on output. Monetary
policy, for example, can eliminate their e¤ect setting
=
and reduce the e¤ect of shocks on
output. In such a case
yt =
where
t
1
;
2 t
is shock in period t which is not observed by the monetary policy authority before it sets
and runs the policy rule. Policy rule, therefore, cannot a¤ect
t.
Such a policy rule implies that the variance of output is
V [yt ] =
Notice that if
is not selected to eliminate
V [yt ] =
which is larger than
1 2
.
4
1
4
t 1
2
1
4
2
:
then
1
( +
3
+
2
)
2
;
In this sense this policy rule stabilizes economy eliminating some of the
in‡uence of shocks.
From aggregate supply equation it follows that
pt
This implies that positive shocks
t
wt =
1
:
2 t
imply reduction of real wage rate wt
pt .
What would happen if monetary policy authority knew the realization of
t
before setting
and running its policy and had an agenda to stabilize the economy? The answer turns to be
straightforward. If it sets
mt =
41
t;
then there are no shocks in this economy and no uncertainty. Aggregate output does not ‡uctuate
and is at its long-run level, 1.
w = p;
y = p
p =
w;
y:
It seems natural to derive also monetary policy rule in case when wages are not sticky and are
set for each period. In such a case, the model can be summarized as
[AD] : yt = mt
pt +
[AS] : yt = pt
[W ages] : wt = Et
[Shock] :
t
=
t;
wt ;
1 [pt ] ;
t 1
+
(17)
t:
Therefore, in equilibrium
pt =
yt =
1
(mt + wt +
2
1
(mt wt +
2
t) ;
t) :
To …nd equilibrium level of output derive the expected level of prices and wages using (17)
wt = Et
1 [pt ]
=( +
)
t 1
+ (1
)
t 2:
Therefore,
yt =
=
1
f
2
1
:
2 t
t 1
+ (1
)
t 2
[( +
)
t 1
+ (1
)
t 2]
+
t 1
+
tg
Clearly, yt in this case does not depend on money supply, which implies that money supply
rule selected above cannot reduce variability of output. Is this a general inference for any money
supply rule? Yes. To see that take again expected price level now assuming that we have a general
(deterministic) money supply rule mt
wt = Et
1 [pt ]
= mt +
t 1:
This implies that aggregate output is
yt =
1
:
2 t
Monetary policy has no e¤ect in this general case too. Therefore, there is nothing to derive/any
42
rule suits. The reason why this happens is that there are no rigidities since wages adjust in a period.
Higher money supply simply alters prices.
Another possible extension of the model is to consider static expectations wt = pt
1.
In such a
case in equilibrium it has to be that
1
(mt + wt +
2
1
(mt wt +
=
2
= pt 1 ;
pt =
t) ;
yt
t) ;
wt
t
=
t 1
mt =
+
t 1
t;
+ (1
)
t 2:
From equations for pt and wt it follows that
1
2
pt =
In turn, from
t
=
t 1
+
t
t
1X
2
t
p0 +
1
2
=0
t
(m +
):
is follows that
t
=
t
0
+
t
X
t
:
=0
Assume that
0
=
0
= 0.
Aggregate output, therefore, is
yt =
1
2
(
[( +
t 1
11X
22
=2
1
2
1
2
)
1
2
t 1
+ (1
t 1
) ]
(
t 2
X
t
+( +
)
t 1
=0
[( +
)
+ (1
) ]
X2
k=0
1
p0 + t :
2
)
k
k
!
+( +
)
1
+
)
The exercise which computes the variance of aggregate output requires tedious algebra. Instead
of that notice that in this case too monetary policy can eliminate some of the e¤ect of previous
shocks on output setting
+
= 0 or ( +
)
+ (1
)
= 0. In this version of the setup
monetary policy matters for aggregate output since, again, wages are rigid. They are indexed to
previous levels of prices. Monetary policy alters the prices within period which alters the output of
…rms but keeps their costs (wages) …xed.
43
A model with constant growth of money
In the previous (sub-)section we considered a policy which attempted to stabilize the economy. In
this (sub-)section we consider a di¤erent policy and keep the setup of the economy unchanged. The
policy that we consider bears the name of its main proponent: Milton Friedman. Friedman Rule
(or Policy) is monetary policy which makes nominal interest rate zero. Consider Fisher Equation,
i=r+ :
Friedman Rule then sets
=
r:
A motivation behind such a policy can be derived from cash-advance models, which are slightly
more advanced to be covered in this course. In short in these models money is an inferior asset
in the sense that it does not earn interest, whereas bonds earn nominal interest i. The agents are
forced to keep money since they use it for their purchases. Intuitively, though, such a policy works
since, it manages to set nominal interest rate to 0 and eliminates the di¤erence between keeping
money and bonds. Therefore, it makes money less inferior (or not inferior at all).
Assume that output and velocity of money are constant. In such a case, from the Quantity
Equation,
v + m = p + y;
it follows that in‡ation is equal to the growth rate of money
t
= mt
mt
Suppose that real interest rate is constant and denote
1:
r = g. Therefore, in terms of this exposition
Friedman Rule is
mt = mt
1
+ g:
This is the monetary policy rule which we consider in this section. The remainder of the model is
[AD] : yt = mt
pt +
t;
[AS] : yt = pt wt ;
1
[W ages] : wt = (Et 1 [pt ] + Et
2
[Shock] : t = t 1 + t :
2 [pt ]) ;
From [AD] and [AS] it follows that
pt =
yt =
1
(mt + wt +
2
1
(mt wt +
2
44
t) ;
(18)
t) :
(19)
Assuming rational expectations and using [W ages] gives
Et
1 [pt ]
1
(Et 1 [mt ] + Et 1 [wt ] + Et 1 [ t ])
2
1
1
mt 1 + g + (Et 1 [pt ] + Et 2 [pt ]) +
2
2
1
1
mt + (Et 1 [pt ] + Et 2 [pt ]) + t 1 ;
2
2
=
=
=
(20)
t 1
;
Therefore,
Et
1 [pt ]
1
(Et 1 [mt ] + Et 1 [wt ] + Et 1 [ t ])
2
1
1
mt 1 + g + (Et 1 [pt ] + Et 2 [pt ]) +
2
2
2
1
mt + Et 2 [pt ] + t 1 :
3
2
=
=
=
Notice that Et
1 [mt ]
= mt
1
(21)
t 1
;
+ g since money supply follows a deterministic/…xed rule.
To …nd the expected price level conditional on information available at time t
Et
2 [pt ]
=
=
1
(Et 2 [mt ] + Et 2 [wt ] + Et
2
1
mt + Et 2 [pt ] + 2 t 2 :
2
2, consider
2 [ t ]) ;
Therefore,
Et
2 [pt ]
= mt +
2
t 2:
Plugging back into the expression for expected price (20) gives
Et
1 [pt ]
=
2
3
3
mt +
2
t 1
+
+
2
3
1
2
2
t 2
;
and from [W ages] equation it follows that
wt = mt +
1
3
= mt +
t 1
t 1
2
3
2
t 2
t 1:
According to (19), income level then is
yt =
1
2
t
+
1
3
t 1:
The second term in this expression highlights the persistence of shocks. This policy does not
45
eliminate
t 1.
We have relatively volatile output and expected in‡ation at rate g,
E [ t ] = E [pt
= mt
pt
mt
1]
1
= mt
mt
1
2
3
+E
t 1
+
1
3
2
E
t 2
2
3
t 2
+
1
3
2
t 3
= g:
Consider a slight extension of the discussion. Suppose that the monetary policy authority/central
bank can cheat the agents and unexpectedly increase money supply from g = g to g0 (g0 > g). Suppose for simplicity that there are no shocks in the economy, so that
t
=
t
0. This implies that
agents expect prices and prices in general are
Et
1 [pt ]
= Et
2 [pt ]
= pt = mt
1
+g
and wages are given by
wt = mt
1
+ g:
Therefore, aggregate supply is at its long-run level yt = 0. This happens because there is no
uncertainty.
Aggregate demand is given by
yt =
1
[mt
2
(mt
1
+ g)] :
Therefore, if the monetary policy authority increases the growth of money supply to g0 then it
increases output at least for a short period,
yt =
1
(g0
2
g) :
Suppose policy makers’ agenda is to boost output. In such a case it would be tempting for
the policy makers to surprise public with sudden increases of in‡ation. In other words, the policy
makers although might have announced a …xed monetary policy rule they might want to deviate
from that commitment. In such a case, would the public believe that policy makers will stick to the
announced policy? The answer to the question is "No" (most probably). We will continue exploring
this commitment problem more formally in the next sections.
Discretionary monetary policy
Suppose now that the central bank’s agenda is to maximize output and minimize in‡ation. To
formalize that assume that the central bank has a loss function,
Lt =
yt +
2
(pt
46
pt
2
1) ;
which it attempts to minimize choosing the monetary policy rule. Clearly this is equivalent to
maximizing
Lt . Therefore, the central bank’s optimal problem is
Lt = max yt
2
mt
(pt
pt
2
1)
:
Aggregate demand and supply are the same as in previous section. In equilibrium we have then
that
1
(mt + wt +
2
1
(mt wt +
2
pt =
yt =
t) ;
t) :
In turn, assume that wages and shocks are given by
wt = petjt
=
t
1;
t 1
t:
+
Suppose that the central bank sets its policy prior to observing the shock. However, it has an
important advantage in the sense that it can set its monetary policy rule conditional to the wages
that prevail in the economy. Therefore, the central bank minimizes the expected loss and solves,
Et
1 [Lt ]
(
= max Et
mt
1
"
1
(mt
2
wt +
t 1
+
1
(mt + wt +
2 2
t)
2
t 1
The solution of this problem is given by
@Et 1 [Lt ]
= 0;
@mt
which (in this setting) is equivalent to
Et
1
@Lt
= 0;
@mt
or simply
Et
1
1
2
1
(mt + wt +
2 2
t 1
+
t)
Therefore,
mt =
2
+ 2pt
1
47
wt
t 1:
pt
1
= 0:
+
t)
pt
1
#)
:
Plugging this monetary policy into the expressions for output and prices gives
pt =
yt =
1
1
+ pt
1
+ pt
1
+
1
;
2 t
wt +
1
:
2 t
The expression for prices implies that expected in‡ation is
E [ t] =
1
:
Therefore, increasing the weight on in‡ation in the loss function
reduces in‡ation. In other words,
if central bank values more lower in‡ation then it sets monetary policy rule to make sure in‡ation
is lower.
In turn, in order to …nd output assume that the agents have rational expectations and derive
wage rate from the equation for prices.
wt =
1
+ pt
1;
Wages depend on parameter . This is because in this setting monetary policy rule is contingent
on prevailing wages.
Therefore, output is given by
yt =
1
:
2 t
This monetary policy therefore does not limit the e¤ect of shocks on output but a¤ects in‡ation.
Political cycles and discretionary monetary policy
Consider a modi…cation of the previous setup. Suppose that there are two political parties A and
B in the economy and it is an election year. Currently it is period mid-t
end of period t
1 and elections are at the
1. Suppose these parties have di¤erent tastes for (weights on) in‡ation
A
and
B
and appoint monetary policy authority accordingly. Party A has lower tolerance to in‡ation than
party B,
A
>
B.
The wages for period t are set before the election contingent on prices that
prevail in the next period. Therefore, wages are set without knowing which party wins the election
and which tastes for in‡ation prevail in the economy in period t. The probability that party A wins
the elections is
A.
The reminder of the model is the same as the model in previous section.
After the elections there is one party in the economy. Let it be party i where i is either A or B.
48
Therefore, the economy is described by
1
mit + wt + t ;
2
1
mit wt + t ;
2
2
+ 2pt 1 wt
pit =
yti =
mit =
E
i
t
t 1
i
1
=
;
i
1
wt =
+ pt
1:
i
Clearly, expected in‡ation is lower in case party A won the election. Combining expressions for
price level and monetary policy gives
1
pit =
+ pt
1
1
:
2 t
+
i
This is price level conditional that party i has won the elections. With probability
at time t is
pA
t
and with probability 1
pt =
it is
A
1
A
pB
t ,
A)
+ (1
A
A
price level
therefore, unconditional price level is
1
+ pt
1
+
B
1
:
2 t
This implies that wages are given by
wt =
1
A
+ (1
1
A)
A
+ pt
1:
B
Wages are a weighted average of tolerances for in‡ation and the previous level of prices. Therefore,
even though only one party is elected at time t the preferences of both parties matter.
In turn, expected in‡ation rate is given by
E [ t] =
1
A
+ (1
1
A)
A
:
B
To determine the level of output, use the expressions for money supply and wages
yti =
1
i
1
A
+ (1
A)
A
1
+
B
1
:
2 t
Therefore, if A is elected the expected output is
E [yt ] = (1
A)
1
A
1
;
B
which is a negative number. This number increases with the probability that the party B wins
elections 1
A.
The expected output level is negative since party A tolerates in‡ation less than
49
party B. The existence of party B pushes in‡ation up. To keep it low party A reduces money
supply and output. Output declines to a negative number
If B is elected then the expected output is positive and given by
E [yt ] =
1
A
1
B
> 0:
A
Monetary policy under commitment and discretion
Previous sections discussed the properties of a monetary policy rule which reduces the variance of
output. The discussion in these sections suggested that in certain cases there might a commitment
problem with an announced policy in the sense that policy makers might want public to believe
in the implementation of a policy but they might deviate later from it. The latter situation arises
especially when the policy makers pursue two opposing policy goals at the same time. The monetary
policy authorities/central banks usually have exactly two policy goals: stabile in‡ation and output.
Consider an economy where the central bank has these two policy goals. To formalize this and
keep the matter relatively simple, assume that the central bank selects monetary policy rule to
minimize
Lt =
2
t
y)2 :
+ (yt
Monetary policy tries to minimize the deviation of output from its long-run level y and the deviation
of in‡ation
t
from its long-run level 0.
is a positive parameter. It o¤ers the importance of
deviation of output from its long-run level for the central bank and therefore for monetary policy.
In this context L is the central bank’s loss function.
There is no trend in the model which we consider here, deviation of output from the long-run
level stem from exogenous shocks, and ideally long-run in‡ation is 0. Therefore, minimizing the
2
t
y)2 amounts to minimizing their variance.
and (yt
Suppose that the reminder of the economy is given by versions of AD and AS curves with
expected wages
[AD] : yt = a (mt
[AS] : yt = b (pt
[W ages] : wt = petjt
where
d
t
and
s
t
pt ) +
wt ) +
d
t;
s
t;
1;
are uncorrected i.i.d. random variables with 0 mean and variance
2
d
and
2,
s
correspondingly. a and b are positive parameters.
Further, suppose that agents’have rational expectations, so that
petjt
1
= Et
1 [pt ] :
For the purposes of the current discussion it is more convenient to write [AS] curve with in‡ation
50
rates
[AS]
:
yt = b (pt
= b(
Et
t
Et
1 [pt ]
1 [ t ])
Et
1 [pt 1 ]
+ Et
1 [pt ])
+
s
t;
s
t
+
In this model we will give an important advantage to the central bank assuming that it sets
monetary policy after observing the shock
s
t
d.
t
and
Usually, monetary policy can react quickly,
although hardly without a lag. However, what we actually need here is that it reacts faster to
shocks than the private sector/prices and wages. This seems to be a realistic assumption and opens
a door for monetary policy to have real e¤ects. Monetary policy can have real e¤ects in this case
since it can mitigate at least within-period shocks.
We will further assume that monetary policy is given by
[P olicy] :
where ,
and
t
=
d
t
+
s
t;
+
are policy parameters which the central bank chooses. Clearly this is an indirect
formulation of monetary policy rule. One way to think about it is that the central bank sets money
supply rule so that
t
is given by equation [P olicy]. Once
t
is determined this money supply can
be determined from [AD] equation.
With this monetary policy rule, agents’expectations about in‡ation are
e
tjt 1
Therefore, whatever value of
= :
the central bank would choose, the agents know it and will adjust
their expectations accordingly. In this sense, the central bank can in‡uence the agents’expectations.
It will make use of this in its optimization problem.
Monetary policy under commitment
In this case the central bank announces its policy a period ahead (at t
commits to it. This means that the central bank selects at time t
minimize its expected loss function Et
know the possible realization of
s.
t
1 [Lt ].
At time t
1 period for period t) and
1 parameters
,
and
to
1, however, similar to agents it does not
Therefore,
Et
1 [ t]
= :
From equations [AS], [W ages], and [P olicy] and the expression for loss function it follows that
the central bank solves:
Et
1 [Lt ]
= min Et
; ;
1
+
d
t
+
s
t
2
+ Et
1
b
+
d
t
+
s
t
+
s
t
y
2
:
In order to solve this problem, open up the brackets and use the de…nition of variance for random
51
variables with 0 mean, e.g., Et
Et
+
1
d
t
+
s
t
2
1
h
d 2
t
i
= Et
1
2
+
=
2.
d
=
2
d
t
+2
2 2
d
+
2 2
s;
+
s
t
y
2
d
t
+
+2
+
d
t
s
t)
(
+(
s 2
t)
Similarly,
Et
b
1
+
d
t
s
t
+
2
= (b )2
2
d
+ (b + 1)2
2
s
+ y2:
Therefore, the optimal problems of the central bank is
Et
1 [Lt ]
= min
; ;
n
2
2 2
d
+
2 2
s
+
h
(b )2
+
2
d
+ (b + 1)2
2
s
= 0:
2
s
+ y2
io
:
The …rst order conditions of this optimal problem are
[ ] : 2 = 0;
The optimal conditions for
[ ] : 2
2
d
+ 2b2
[ ] : 2
2
s
+ 2 (b + 1) b
and
imply that
2
d
=
= 0;
= 0. However,
=
b
.
1+ b2
The policy rule,
therefore, is
t
=
b
s
t;
1 + b2
From modi…ed [AS] it follows then that output is
yt =
1
1 + b2
s
t:
The variance of in‡ation and output under this rule are
V [ t] =
V [yt ] =
b
output to vary with
2
s;
1 + b2
1
1 + b2
If the central bank does not value stability of output,
s.
t
2
2
2
s:
= 0, then it sets in‡ation to 0 and allows
If, however, it places very large weight on the stability of output,
then it allows in‡ation to vary with
s
t
= +1,
but keeps output constant so that it has 0 variance. In
general, it is easy to show that the variance of in‡ation increases with
and the variance of output
declines with it.
We don’t we have the in‡uence of
is that shock
d
t
d
t
in monetary policy. The intuition behind such a result
pushes prices and output proportionately and in the same direction. Therefore,
52
it does not create a trade-o¤ between o¤setting in‡ation and output volatilities. In turn, shock
s
t
pushes prices and output in di¤erent directions. For example, positive shock
s
t
increases output,
but reduces prices. Therefore, it creates a trade-o¤.
s
t
Although the central bank reacts to shocks to
it sets
= 0. Therefore, unconditional
expectation of in‡ation and expected in‡ation for agents’ are 0. Under this policy the expected
value of the loss function of the central bank is
Et
1 [Lt ]
1
1 + b2
=
2
s
+ y2 :
Monetary policy under discretion (without commitment)
A problem with the commitment equilibrium is that at time t the policy announced at t
1 may
not longer be the optimal rule. In other words, if policy maker is able to change the announced
policy at time t then it might achieve lower loss. This is because at that time, in‡ation expectations
have been formed and can be treated as given. The central bank then could exploit this.
Formally, assume that agents believe that the central bank will deliver Et
case there is no Et
1 [ t]
1 [ t]
= 0. In such a
in [AS]. Therefore, from [AS], [W ages], and [P olicy] and the expression
for loss function it follows that the central bank solves:
Et
1 [Lt ]
= min Et
; ;
+
1
d
t
2
s
t
+
+ Et
b
1
+
d
t
+
s
t
2
s
+( b
+
s
t
y
2
:
This can be equivalently written as
Et
1 [Lt ]
= min
; ;
n
2
+
2 2
d
+
2 2
s
+
h
(b )2
2
d
+ (b + 1)2
y)2
io
:
The …rst order conditions in this case are
[ ] : 2 + 2 b( b
[ ] : 2
2
d
+ 2 b2
[ ] : 2
2
s
+ 2 (b + 1) b
Since the central bank sets the same
and
2
d
= 0;
2
s
= 0:
it has to have lower loss setting
=
instead of
y) = 0;
b
1 + b2
y
= 0. Use [AS] - where expected in‡ation is zero - and the policy rule to compute
53
expected value of the loss function for under this policy
Et
t
=
1 [ t]
=
yt =
Clearly, expected loss with
Et
1 [Lt ]
=
b
y
1+ b2
=
b
1+
set
=
so that Et
1 [ t]
b
y
s
t;
1 + b2
b
y;
1 + b2
b2
1
y+
2
1+ b
1 + b2
1
y2 +
1 + b2
is lower than with
incentive to cheat the agents: Announce that
b
y
1+ b2
b2
s
t;
2
s
:
= 0. Therefore, the central bank has
= 0 so that agents perceive Et
1 [ t]
= 0 but later
> 0.
If the central bank announces policy but alters it after the announcement then the policy rule
is not “time consistent” and is not credible. The agents will not believe that the central bank is
committed and the commitment equilibrium falls apart.
Assume now that the central bank cannot commit to a rule and look for a policy which is
optimal at period t when expectation of in‡ation is taken as given by the central bank. Call this a
discretionary policy. Clearly, if this policy does not appear to be the same as the one above, then
there is time inconsistency problem.
With discretionary monetary policy, the central bank solves
Lt = min
t
n
2
t
+ [b (
Et
t
1 [ t ])
+
s
t
o
y]2 :
The central bank minimizes loss function but not its expected value, since it is not committed to
any rule and makes the decision after the realization of shocks
The …rst order condition then is
t
To …nd out Et
1 [ t]
+ b [b (
t
Et
1 [ t ])
+
s
t
y] = 0:
take the expected value of this expression
Et
which means that at period t
1 [ t]
= by;
1 the agents expect positive in‡ation at period t. In turn, in‡ation
and output are
t
=
yt =
b
s
+ by;
1 + b2 t
1
s
+ b2 y:
1 + b2 t
Therefore, the central bank runs higher in‡ation than in case if it could commit to a rule.
54
In terms of previous choice parameters, clearly, this situation corresponds to
=
b
1+ b2
and
= by.
The central bank’s expected value of the loss function at time t
1 then is (use Et
1 [ t]
= by
in [AS])
Et
1 [Lt ]
= Et
1
"
b
1 + b2
=
=
1
1 + b2
2
s
+
2
s
t
+ by
h
#
+ Et
1
"
1
1 + b2
2
s
t
y
#
i
( b)2 + 1 y 2
Denote the expected values of loss in case of credible commitment, cheating, and discretion as
EtCM
1
[Lt ], EtCH1 [Lt ], and EtD 1 [Lt ].
EtCM
1 [Lt ] =
EtCH1 [Lt ] =
EtD 1 [Lt ] =
1
2
+ y2 ;
1 + b2 s
1
y 2 + 2s ;
1 + b2
1
2
2
+ ( b)2 y 2 :
s +y
1 + b2
It is clear that
CH
EtD 1 [Lt ] > EtCM
1 [Lt ] > Et 1 [Lt ] :
Since the loss of the central bank is higher in case of discretion and if it fails to commit then
discretionary equilibrium prevails, then the central bank might want to use some mechanisms to
show the public/agents that it is committed. Such mechanisms are readily available in case of
repeated interactions when the public can punish the central bank (in terms of changing their
beliefs for example) if it deviates from its announced policy. Another way to align incentives is to
increase the independence of the central bank which often means reduction of . In such a case,
however, the central bank although would commit to a rule it will allow for uncontrolled changes
in output.
Business cycles
The term business cycles is used to coin the ‡uctuations in aggregate output and other activity
(e.g., unemployment, trade) over medium-term. In an economy, medium-term is usually associated
with a period of several months or years. Fluctuations can be both upward and downward. Upward
changes in output are called economic booms, while downward changes are called economic busts
or recessions. Relatively long lasting recessions are called a depression. These ‡uctuations are
measured relative to the long-term growth trend of the output and are largely unpredictable.
55
The explanation of business cycles is one of the primary issues in macroeconomics. Business
cycles have been studies starting from the time of Adam Smith and David Ricardo. Economists
tend to di¤er a lot, however, in terms of their explanations of causes of business cycles and proposed remedies. This is such a central topic that the economists even form schools of thought in
explanation of business cycles.
Some of the most highlighted shocks to aggregate output and other aggregate variables in an
economy are
Technology shocks: 100 years ago travelling from Barcelona to New-York would take much
more time than now. This is a drastic example how production functions can change over
time. New technologies like PCs and robots alter the production process and usually raise
productivity. Sometimes, production facilities break down or employees use too much facebook, so productivity falls. Such a technological change is not always smooth; it often comes
in some of form of shocks.
Weather shocks: Agriculture and tourism industries are very weather-dependent. Other
industries could also depend on weather if their employees work e¤ort depends on weather.
Fluctuations in weather then a¤ect output in these industries.
Monetary shocks: We have seen that in certain cases money supply and in‡ation a¤ect
output. This implies that random changes to monetary policy or liquidity in the economy can
lead to output ‡uctuations as well.
Political shocks: The government can in‡uence the real economy through public expenditure, and regulations. If it changes expenditures, tax laws, antitrust regulation, and expectations then that can cause ‡uctuations in aggregate output.
56
Taste shocks: All the examples above are rather about the supply of goods. There could
be also shocks to the demand for goods and services in terms of shifts in preferences. Such
shifts can cause ‡uctuations and can come for example from introduction of new products
that makes others obsolete in terms of the preference.
Usually none of these shocks can explain large shifts in output (and other aggregate variables)
such as observed in real economies. However, there are mechanisms in the economy that can
amplify these shocks. These shocks can be ampli…ed because of, for example, intra- and intertemporal substitution. If a negative shock hits the economy and output declines then consumers
might wish to work less and enjoy more leisure (intra-temporal substitution). This would reduce
output further. Moreover, if consumers like smoothing their consumption then they would save less
(inter-temporal substitution) so that capital would decline and output would be lower in future.
Price stickiness could be another amplifying mechanism. If wages are sticky, for example, then after
a negative shock to productivity …rms would like to pay lower wages but they cannot. Instead of
lowering wages and keeping rather steady-level of output they would …re labor force and reduce
output. Financial frictions, in terms of inability to lend and borrow freely, can amplify the e¤ects
of shocks too. If there are …nancial frictions then even small shocks can force …rms into bankruptcy.
This will a¤ect …nancial sector that lent money to the bankrupt …rms and reduce credit. Often
then additional …rms have to declare bankruptcy, and sometimes even banks fail. Bank failures
reduce liquidity and credit. They can a¤ect all creditors and debtors and therefore can have large
economic consequences.
The classical and neo-classical (fresh-water) school of thought tends to believe that the origins
of business cycle ‡uctuations are completely exogenous processes which a¤ect aggregate output
through changes in technological e¢ ciency (e.g., introduction of computers and facebook) and other
real variables. They hypothesize that the economy is frictionless so that there are no price rigidities
and/or …nancial frictions. In this respect, they hypothesize that the cycles which follow these
shocks are the optimal/best response of the economy. Therefore, these schools of thought believe
that policies might not be e¤ective in tackling business cycles. Neo-classical theories include the
Real Business Cycle theory by Kydland and Prescott (1982), which is built-around, in particular, the
rational expectations assumption. These theories tend to have solid micro-foundations. However,
they tend to under emphasize the importance of frictions and distortions in the real economy.
The Keynesian and neo-Keynesian (saltwater) schools of thought tend to believe that the origins
of business cycle ‡uctuations also include shocks to nominal variables. They hypothesize that the
economy involves frictions so that there are price rigidities, …nancial frictions, and other failures
in the economy. In such frameworks nominal variables a¤ect real variables. Moreover, the cycles
which follow these shocks are not the best response of the economy. Therefore, these schools of
thought tend to believe that there is a scope of policy intervention (…scal and/or monetary). A
standard example of Keynesian model is the AD-AS model (IS-LM-Phillips curve model). We
will see two "extensions" of it in the next two sections. These models, however, lack solid microfoundations. Neo-Keynesian theories tend to alleviate that problem. Prominent economists such
57
as Michael Woodford and Gregory Mankiw have contributed to the development of neo-Keynesian
theories. These theories are usually presented in a form of Dynamic Stochastic General Equilibrium
(DSGE) models, which are very far away from the scope of this course because of complexity. Such
models, however, are commonly used in the Central Banks and other (…nancial) institutions.
There are other schools of thought too. For example Monetarist school of thought (largely due
to Milton Friedman) and Austrian school of though (largely due to Friedrich Hayek).
Business cycles - The Carlin and Soskice (2005) model
This section presents a very classical Keynesian model of business cycles research by Carlin and
Soskice (2005). This model is an extension/analogue of the standard AD-AS model. The standard
AD-AS model consists of 3 equations. These are IS and LM equations, which describe the aggregate
demand (AD),
[IS] : Y = C + I (r) + G;
M
[LM ] :
= L (r + e ; Y ) :
P
and aggregate supply equation (AS). In Macroeconomics 1 and 2, AD-AS model is presented in a
static form. It is presented using levels of prices, but not changes of prices (in‡ation).
This model also consists of 3 equations. Its …rst equation is the IS curve. IS curve in this model,
however, is written in a somewhat reduced form and in terms of logarithms of variables
[IS] : y = A
where A and
ar;
are positive parameters. For example, A depends on the level of consumption
thriftiness. In turn,
measures the magnitude of the e¤ect of interest rate on income in the goods
market. This model is dynamic and incorporates time dimension. In particular it is assumed here
that income and interest rate are time dependant. Moreover, this model assumes that interest rate
(because of some imperfect adjustment mechanisms) a¤ects the level of income with time lag so
that
yt = A
art
1:
There exists interest rate level rs that leads to equilibrium level of output ye (stabilizing level of
interest rate).
Graphically, IS curve looks like this
58
The second equation in this model is the aggregate supply curve, which is written in the form
of the Phillips curve (where agents have static expectations)
t
=
t 1
+
(yt
ye ) :
Graphically, this Phillips curve (PC) in short run and in long run is presented below
Here, the upward sloping curve (line) is the short run Phillips curve and the vertical curve (line)
is the long run Phillips curve.
T
is the Central Bank’s targeted level of in‡ation.
The targeted level of in‡ation is one of the parameters of the Central Bank’s monetary policy.
In this model the Central Bank designs its monetary policy so that to minimize its loss function
ye ) 2 +
L = (yt
where
T 2
t
;
is a positive parameter which highlights the importance of in‡ation stabilization for the
Central Bank.
The Central Bank sets its monetary policy so that to solve
n
L = min (yt
t
ye )2 +
s:t:
t
=
t 1
+
(yt
59
ye ) :
t
T 2
o
;
Substituting Phillips curve
order condition
t
into L and di¤erentiating with respect to yt gives the following …rst
@
L = 0 , (yt
@yt
ye ) +
t 1
+
(yt
T
ye )
=0
Substituting Phillips curve back into this equation gives
(yt
ye ) =
T
t
:
This equation summarizes the monetary policy rule. Carlin and Soskice call it MR-AD equation.
Graphically the solution of problem can be presented in the following manner.
The Central Bank’s indi¤erence curves are ellipses (circles if
= 1) with a bliss point at ye ;
T
.
In sum, the model is given by the following 3 equations
[IS] : yt = A
[P C] :
[M R
Plugging [M R
=
AD] : (yt
t 1
1;
+
(22)
(yt
ye ) =
ye ) ;
t
(23)
T
:
(24)
AD] into [P C] gives the level of in‡ation
t
Moreover, expressing
t
from [M R
=
1
1+
t 1
2
+
2
T
:14
(25)
AD] and plugging it into [P C] gives
[Y R] : yt = ye
14
t
art
1
1
+
t 1
T
:
(26)
Denote ! = 1+1 2 and rewrite this equation as t = ! t 1 + (1 !) T . The solution of this equation is t =
P
! t 0 + (1 !) T t =0 ! t . As t increases the …rst term tends to 0. In turn, the second term tends to T .
Substituting this into [M R AD] gives the dynamics of output. From the dynamics of output and [IS] the dynamics
of interest rate can be found.
60
Call this equation Y R: output rule. In turn, from IS equation it follows that
yt
ye =
a (rt
1
rs ) :
Therefore,
[T R] : rt
1
rs =
1
a
1
1
+
t 1
T
:
(27)
This is an analogue of Taylor rule (it does not include output gap) which we call T R. This rule
directly follows from the Central Bank’s optimization rule. Therefore, it is the policy rule of the
Central Bank in terms of adjusting the interest rate. In this respect, T R suggests by how much the
interest rate should be adjusted if in‡ation deviates from its target.
Lets now turn to the analysis of ‡uctuations and policy responses. Without loss of generality, in
our example economy which we discuss below we will assume that in‡ation target is set to 2% (i.e.,
T
= 2). Moreover, time starts at t = 0 and at t = 0 the economy is in its long run (equilibrium)
level of output and (targeted) in‡ation.
Shocks causing ‡uctuations
Shock to the IS curve: Consider a positive to shock to the IS curve which raises A to A0 at
t = 0 (and remains there forever). The Central Bank can do nothing to a¤ect the increase the
output in t = 0 since its monetary policy rule a¤ects interest rate which has time lag in a¤ecting
output. However, it can a¤ect changes in the economy in the second period. In order to do so it
needs to make a forecast of Phillips in t = 1. With the forecast of Phillips curve it will identify its
constraint for designing its optimal problem, and …nd the solution of the optimal problem.
Suppose further that the shift of A to A0 has triggered (through Phillips curve) increase in the
current level of in‡ation to
0
= 4%. From Phillips and IS curves, therefore, we have that the
change in A is given by
4=2+
i.e., A0 =
2
A0
A ;
+ A.
Given that expectations are static the forecast of next period (t = 1) in‡ation is 4. For any level
of output, this implies that in‡ation is going to be higher in the next period implying that next
period’s Phillips curve has shifted up. The graphical representation of this process is as follows
61
Notice that if the Central Bank decides to lower in‡ation in t = 1 then it will cause a recession.
This is because any level of in‡ation below 4 corresponds to lower output level (lower than ye ).
Moreover, it can a¤ect in‡ation at t = 0 and move along the new Phillips curve.
According to the Taylor rule the Central Bank will calculate in‡ation at t = 0 so that it increases
interest rate at t = 0. In the …gure o¤ered above this corresponds to choosing points B and B 0 .
Clearly, therefore, the Central Bank causes a recession in t = 1. What happens next? At t = 1 since
output is lower than ye according to the Phillips curve in‡ation at t = 2 will be lower than at t = 1.
This implies that the forecast of Phillips curve for t = 2 at t = 1 is to the right of the dashed P C
in the …gure above. The Central Bank then will set slightly lower interest rate and in‡ation. This
will increase output. This adjustment process will continue till the point in time when in‡ation is
back to its targeted level of 2. Of course, it will imply that the new stabilizing level of interest rate
is higher than the old stabilizing level of interest rate, rs0 > rs .
Supply Shock:
Consider a permanent shift of the level output at t = 0. This corresponds to a
shift of the long run level of output ye to some ye0 such that ye0 > ye and can happen, for example,
because of technical progress.
Suppose that the level of shock is such that it triggers the in‡ation to decline to 0. From the
Phillips curve, therefore, the magnitude of the shock has to be
0=2+
ye
62
ye0 ;
i.e., ye0 =
2
+ ye .
Clearly, in this case given that long run Phillips curve shifts to the right (see the …gure above
for graphical representation) it has to be that in the short run Phillips curve in the long run shifts
to the right too. To have such a shift there needs to be a new M R
AD curve since it has to go
through the intersection of long run and short run Phillips curves.
Using the forecast of Phillips curve the Central Bank knows that this shock will imply lower
in‡ation at t = 1. It will design its optimal problem accordingly. The solution of its optimal problem
implies Y R equation. According to Y R, the Central Bank would like to have higher output in period
t = 1 since yt increases with ye . From T R equation and IS curve this implies that the Central Bank
will reduce in‡ation at t = 0 from 2% and the interest rate from rs . In the …gure o¤ered above it
will move to points C and C 0 . Later, it will gradually increase in‡ation and interest rate so that to
reach ye0 and
T.
Endogenous business cycles - The Goodwin (1967) model
The Goodwin (1967) model combines not so orthodox growth models (Harrod, 1939; Domar, 1946)
with Phillips curve in order to generate endogenous business cycles. Fluctuations are due to cyclical
relationship between employment and wages. This model is not well micro-founded, however, it can
serve for a nice illustration of endogenous emergence of cycles.
This model has origins in Biology. In particular, it is related to a large class of Predator-Prey
models which describe dynamic biological systems. A common outcome of these systems is a vicious
cycle of large prey population. Imagine increasing the number of predators. That would reduce
63
the sample of prey and therefore reduce the number of predators. These dynamics give a raise to
Lotka-Volterra di¤erential equation. Similar equation is derived from this model.
At any time t, the aggregate output is assumed to be given by
Yt = max
where
t
is the productivity of labor Lt , and
Kt
t Lt ;
;
pins down capital output ratio
Kt
Yt
when
Kt
t Lt .
This production function implies that labor and capital are complementary. Moreover, it is
clearly not a neo-classical production function. For example, it violates diminishing returns property
of the neo-classical production functions.
In equilibrium …rm will be reluctant to hire more than
t Lt
Kt
amount of e¢ ciency adjusted labor
since hiring more would not increase their production. Moreover, they would be reluctant to
hire more than
t Lt
amount of
Kt
. We will focus on the case when they hire so that
t Lt
=
Kt
:
This implies that
Yt =
t Lt
=
Kt
:
Further, assume that time is continuous and labor productivity grows at a rate of a. The growth
this expression is
t
t+1
rate of labor productivity in discrete time is given by
t
. The continuous time analogue of
1 d t
.
t dt
Let total population be Nt and grow at a rate of
, i.e.,
1 dNt
Nt dt
=
. Therefore, employment
rate is given by
"t =
Lt
:
Nt
Denoting growth rates by letter g, this implies that employment rate grows at a rate of
g" = gL
;
(28)
where gL is the growth rate of labor. Notice that when the (equilibrium) amount of labor and
population grow at the same rate then the rate of growth of employment g" is zero. In case,
however, when population grows at a higher (lower) rate than labor then g" is negative (positive).
The rate of growth of labor can be found from the production function. Under the assumption
that
t Lt
=
Kt
the rate of growth of labor is
gL = gY
a:
(29)
Clearly, when K is …xed the amount of labor should decline with labor productivity since the
amount of labor in e¢ ciency units should be constant.
The Phillips curve in this model depicts relationship between percentage changes of wages w
64
(price of labor) and employment (output). The Phillips curve is assumed to be given by
g w = "t
where
and
;
(30)
are positive parameters that characterize labor market conditions (e.g., existence of
minimum wage, labor unions).
Denote the share of worker compensation by
t
Apparently, 1
equilibrium
t Lt
t
=
t
wt Lt
:
Yt
=
would be compensation of capital. Now, since we focus on the case when in
Kt
the share of worker compensation is
t
=
wt
;
t
which has a growth rate of
g = "t
a:
(31)
The last thing which needs to be determined to close the model is the growth rate of output.
Clearly, since Yt =
Kt
output and capital grow at the same rate.
Assume that there are two types of agents in this economy: workers and agents who own capital
and supply no labor. Call the latter type of agents "capitalists." Workers consume their income
immediately and do not save. In turn, capitalists save a constant fraction of s of their income. This
means that total savings are given by
St = s (1
t ) Yt :
15
Capital accumulation rule, therefore, is given by
dKt
dt
= St
Kt
= s (1
s (1
=
t ) Yt
t)
Kt
Kt :
The growth rate of capital and output then is
gK = gY =
Notice that for sharp increase of
t,
s (1
t)
:
(32)
which corresponds to sharply increasing the share of worker’s
income, the growth rate of Y can become negative according to (32). This would imply that the
15
Recall that in the Solow (1956) model there is no distinction between workers and "capitalists." In this sense, Solow
(1956) assumes that both agents save the same percentage of their income.
65
growth rates of labor and employment become negative according to (29) and (28). Negative growth
rate of employment would reduce employment. Therefore, it would reduce wages according to the
Phillips curve (30) and
according to (31). This is the predator-prey mechanism in this model.
This model can be summarized by two di¤erential equations
g" =
g
s (1
=
t)
"t
a
;
a:
These equations are known as Lotka-Volterra di¤erential equations. Clearly, g" declines as
creases. This is because higher
t
t
in-
reduces investments and growth of output. Therefore, it reduces
growth of employment. Moreover, higher employment increases g . This is because increasing
employment increases output and wages. Higher wages imply higher share of worker compensation.
In the steady-state equilibrium there is no dynamics in the system. Therefore, in the steady-state
equilibrium
0 =
s (1
0 =
)
"
a
;
a:
Solving for the share of worker compensation and employment gives
SS
= 1
"SS =
s
+a
( + a + );
:
Consider the di¤erential equations in order to see the dynamic adjustment in this model. The
following …gure o¤ers the phase diagram (time evolution of the system).
66
Suppose that "t > "SS and
at "t =
"SS
t
<
SS
. This implies that the point ("t ; t ) is in quadrant I. Since
we have that g = 0 and g increases with "t , it has to be that g > 0 for the points
("t ; t ) in quadrant I. This implies that over time
t
by the arrow pointing to the right (i.e., increasing
t ).
g" = 0 and g" declines with
t,
increases in quadrant I, which is depicted
Moreover, since at
t
=
SS
we have that
it has to be that g" > 0 for the points ("t ; t ) in quadrant I. This
implies that over time "t increases in quadrant I, which is depicted by the arrow pointing to up
(i.e., increasing "t ). The intuition behind the arrows in the remaining quadrants follows a similar
logic.
Imagine the economy starts at some point in quadrant I. In such a case, over time it will
gravitate to quadrant II then to quadrant III, quadrant IV , and return back to quadrant I.
This is illustrated by the circle of arrows around the steady-state. It turns out that dynamics are
periodic ‡uctuations in this model in the sense that this process never converges to the steady state
nor diverges to
1.
An economy which is at the steady-state can appear in quadrant I for example because of a
positive shock to employment (e.g., population declines temporarily) and/or negative shock to labor
income share (e.g., labor productivity increases temporarily). After such a shock it will experience
‡uctuations along the cycle and never converge back to the its steady-state. These ‡uctuations are
endogenous business cycles. Such ‡uctuations we have encountered for prices in the Cobweb Model.
In the Cobweb Model there would be never ending ‡uctuations if
2
=1
A Real Business Cycles Model
This section mostly follows (1) Chapter 9 in Doepke et al. (1999) and King and Rebelo (1999).
67
Before Kydland and Prescott (1982) economists thought that classical models can be used for
studies of long-term phenomena such as economic growth. However, short- and medium-term ‡uctuations are not well explained with classical models. Kydland and Prescott (1982) revolutionized
business cycles theory and economics in two ways. First, they showed that realistic ‡uctuations can
emerge in classical (well micro-founded) macroeconomic models as an optimal response to exogenous shocks. By doing so they basically originated the real business cycles (RBC) theory. Second,
they o¤ered ways for evaluating the predictions of the models so that to gauge their relevance and
…t to the real world.
Usually it is impossible to analytically solve the models used in real business cycles theory.
Instead numerical/simulation methods are used. In order to resort to numerical methods the values
of model parameters should be known. Kydland and Prescott (1982) advocated the use of real
world data for calibrate (estimating) the parameters of the models.
In these models, it is also very hard to derive analytical comparative statics for understanding the
qualitative predictions of the models. Numerical comparative statics are used instead of analytical
comparative statics. The numerical comparative statics are also useful for evaluation of quantitative
predictions. In particular, researchers evaluate the response of the model variables over time to
exogenous shocks (impulse-response functions) and compare them to the patterns in real world
data. In line with Kydland and Prescott (1982), the comparison is in terms of
the direction and the shape of the response of model variables;
the magnitude of response in terms of mean and standard deviation; and
the signs and magnitude of correlations between model variables.
The table below o¤ers business cycle statistics (moments) for main macroeconomic variables
from US data. This table is taken from King and Rebelo (1999). It is used for the latter two
points. All data are quarterly and are for the period of 1947 Q1-1992 Q4. Variables representing
quantities such as output (Y), consumption (C), investment (I), and hours worked are in per capita
terms. Consumption includes consumption of non-durables. Investments include private …xed
capital formation and expenditures on durables. Wage rate is the real compensation per hour.
Interest rate is basically the interest rate paid on treasury bill minus in‡ation. A is Hicks-neutral
productivity in aggregate output. It is de…ned as Solow residual (ln A = ln Y
68
ln K (1
) ln L).
The …rst column o¤ers sample standard deviation of relevant variables. The standard deviation
of a variable X with observations from fX1 ; X2 ; :::; XT g is de…ned as
v
u
u
ST D (X)T = t
1
T
1
T
X
Xt
t=1
T
1X
Xt
T
t=1
!2
:
It is the statistical/empirical analogue of the variance (more precisely the square root of the variance). The second column o¤ers the standard deviation of the relevant variables relative to the
standard deviation of output (Y). Clearly consumption of non-durables is much less volatile than
output. Investments, which include expenditures on durables, are around three times more volatile
than output. The number of hours worked has around the same volatility as output. However,
wages (and interest rate) vary much less than output and output per hour worked.
The second column of the table shows the …rst order autocorrelations among variables. The …rst
order autocorrelation of the variable X can be found using the following regression
Xt =
where
is a constant and
t
+
1 Xt 1
+
t;
is by construction orthogonal to Xt
1.
Coe¢ cient
1
is the …rst
order autocorrelation of the variable X. It is called …rst order autocorrelation because we have the
1st order time di¤erence (i.e., Xt
1)
and the correlation of X with itself (i.e., auto-correlation).
69
Alternatively,
1
can be computed as
1
T 2
1
T
X
Xt
XT
Xt
1
XT
1
t=2
=
ST D (X)T ST D (X)T
:
1
This coe¢ cient shows how much the observations of variables are interrelated (linearly). In other
words it shows how good is the current value of the variable for predicting its future value. Now
imagine that there is a shock to X because of some exogenous reasons ( ). In such a case, if
close to 1 from below then this shock will stay in X for a long time. If
stay in X forever. If
1
1
1
is
is equal to 1 then it will
is higher than 1 then it will stay in X forever moreover it will propagate
and get larger over time.
From the second column it is clear that all quantity variables are very highly autocorrelated.
Perhaps, one of the most important autocorrelations for RBC theory is that of A. It is fairly large,
which means that previous values of A predict current values with high precision and a shock to A
will persist in A for a long time.
The last column of this table o¤ers contemporaneous correlations between variables and output.
The correlation between variables X and Z can be found using the following regression
Xt =
where
is a constant and
t
+ Zt +
t;
is by construction orthogonal to Zt . Coe¢ cient
between variables X and Z. Alternatively,
1
T 1
=
T
X
is the correlation
can be computed as
Xt
XT
Zt
ZT
t=1
ST D (X)T ST D (Z)T
:
This coe¢ cient measures the degree two variables are related (linearly). This is the reason it is
called "co"-rrelation. More precisely, it corresponds to contemporaneous correlation because it is
for the observations from the same period of time.
The third column suggests that output and other quantities are very highly correlated. This
implies that the driving processes behind them might be quite the same. However, output and real
wage rate are not well correlated.
The Basic RBC model
Consider a closed economy which is populated by a very large number of identical and in…nitelylived households of mass one. In a period, the representative household is endowed with 1 unit
of time which it can use for work/labor l and leisure 1
l. It derives instantaneous utility from
consumption c and leisure
u (ct ; 1
lt ) = ln ct + ln (1
70
lt ) :
The lifetime utility of the household at time zero is the expected discounted sum of the instantaneous
utilities
U (c; 1
l) = E0
" +1
X
t
#
u (ct ; 1
lt ) ;
t=0
where
=
1
1+
is the discount factor and
> 0 is the discount rate. The representative household
has rational expectations and E0 is the expectation operator given all the available information at
the beginning of the economy, time 0.
The assumption that households live forever can be justi…ed thinking that we have families of
altruistic households who care about the utilities of their o¤springs as they care about their utilities.
It turns out that this assumption makes calculus easier. In turn, the assumption that there are a
very large number of households implies that each households is atomistic and can be ignored in
the analysis Therefore, non of the households can dictate prices and quantities and takes them as
given.
The representative household earns market wage w for each labor unit. It owns the capital
stocks in the economy which at time t are at the level of kt . The household earns market interest
rate of r for each unit of supplies capital. At time t, the proceeds from labor and capital are equal
to wt lt + rt kt . The household uses these proceeds to fund its consumption and savings. Therefore,
its budget constraint is given by
wt lt + rt kt = ct + st :
Savings are translated into investments
st = it :
Investments create new capital according to the law of motion of capital:
kt+1 = it + (1
where
) kt ;
2 (0; 1) is the rate of depreciation of capital.
This implies that the budget constraint of the household can be rewritten as
wt lt + rt kt + (1
) kt
ct
kt+1 = 0:
The household maximizes its lifetime utility with respect to the budget constraint. Formally, it
solves the following problem.
max
fct ;lt ;kt+1 g+1
t=0
(
E0
s:t:
" +1
X
u (ct ; 1
t=0
wt lt + rt kt + (1
k0 > 0
t
) kt
given:
71
ct
#)
lt )
kt+1 = 0;
We will use Lagrangian to solve this problem. We de…ne the Lagrangian as
L = E0
" +1
X
t
u (ct ; 1
lt ) + qt [wt lt + rt kt + (1
) kt
ct
kt+1 ]
t=0
#
:
Imagine that it is now some time t so that the household knows all the values of the variables
(including random variables) for time t but does not know the t + 1 values. In such a case, take the
derivative of the Lagrangian with respect to ct and lt and set it to zero
@L
=0
@ct
@L
=0
@lt
:
:
1
= qt ;
ct
1
lt
= qt wt :
Take also the derivative of the Lagrangian with respect to kt+1 and set it to zero. Notice that kt+1
shows up in the Lagrangian at time t as kt+1 and at time t + 1 in rt+1 kt+1 + (1
@L
= 0 : qt = Et qt+1 [rt+1 + (1
@kt+1
We have Et in front of qt+1 [rt+1 + (1
) kt+1
)] :
)] because t + 1 variables are subject to random shocks and
the household uses rational expectations to predict them. Notice that given the values of kt and it
the value of kt+1 is uniquely determined.
The …rst equation tells that the marginal utility of consumption
1
c
is equal to q, which is called
the shadow value of the marginal unit of capital. The second equation is the supply of labor. The
bene…t of supplying a marginal unit of labor is wt , which is the real wage rate and measures the
number of consumption goods. In the second equation. wt is scaled by qt and qt wt is the bene…t
of supplying a marginal unit of labor in terms of utility. In equilibrium, the household should be
indi¤erent between marginally increasing its labor supply or marginally increasing its leisure time.
Therefore, bene…t in terms of more consumption because of labor should be equal to the utility loss
because of less time in leisure. The marginal disutility of labor is given by
1 lt .
Plug the …rst equation into the second equation to …nd
1
1
= Et
[rt+1 + (1
ct
ct+1
)] :
This equation is called Euler Equation. It equates the marginal utility of consumption to the
discounted value of the marginal utility of consumption times the earned interest. It states that in
equilibrium the household should be indi¤erent between consuming a marginal unit of good right
now or delaying its consumption to the next period and saving to earn a net interest of rt+1 +(1
).
In short it tells that in equilibrium the household should be indi¤erent between consumption and
saving.
For any given level of prices, the household has three variables to solve for (ct ; lt ; kt+1 ) and three
72
equations:
ct
;
wt
1
1
[consumption] :
= Et
[rt+1 + (1
ct
ct+1
[investment] : wt lt + rt kt + (1
) kt ct
[labor supply] : lt = 1
)] ;
kt+1 = 0:
Notice that labor supply increases with real wage and declines with consumption. This is because
household’s consumption re‡ects how well o¤ it is. If the household anticipates higher wealth then
it would accordignly increase consumption and reduce its current labor supply (have you ever seen
lazy millionaires?).
In the standard neoclassical model, labor supply is inelastic. Therefore, changes for example
in government expenditures have no real e¤ects on output. However, in this framework they can
have. Imagine an increase in government expenditures. The household knows that this is going
to be followed by a tax hike to cover the expenses. Therefore, it expects to have lower wealth.
The anticipation of lower wealth would lead to lower consumption and would increase current labor
supply. Higher labor supply would increase output, which we discuss below.
A very large number of identical …rms produce consumption goods. The representative …rm has
a Cobb-Douglas production technology:
yt = At kt1
where
lt ;
2 (0; 1) and At is the technology level. Higher/lower At implies higher/lower level of output
for given levels of capital and labor.
At each period of time, the representative …rm solves the following problem
t
= max fyt
kt ;lt
wt lt
rt kt g
s:t:
yt = At kt1
lt :
Therefore, its demand for capital and labor are given by
@ t
=0
@kt
@ t
=0
@lt
: rt = (1
: wt =
)
yt
;
kt
yt
:
lt
These equations characterize the production/supply side. In equilibrium, supply and demand
are equal. This corresponds to plugging these two equations (supply prices) into the three equations
73
above (equations for demand side). If we do so then we obtain the following three equations:
lt = 1
ct
"
1
lt
1
At k t
"
1
1
= Et
(1
ct
ct+1
At kt1
lt + (1
(33)
lt
1
At+1 kt+1
lt+1
)
+ (1
kt+1
) kt
ct
##
)
;
kt+1 = 0:
(34)
(35)
In these equations, the variables are c, k, l, and A. We need additional equation for A to have
4 equations and 4 variables. We will assume that
ln At = ln At
where
t
1
is an i.i.d. random variable with 0 mean and
+
2
t;
(36)
variance.
These 4 equations constitute the basic real business cycles model. The shocks originate in
equation (36) and are
t.
As discussed before, ln At is computed using capital and labor series and
the following formulae
ln At = ln Yt
The mean and variance of the shocks
and
ln Kt
(1
) ln Lt :
are obtained running a regression that has the form
of (36). One of the major points of Kydland and Prescott (1982) is that many of the real world
business cycles summarized in Table 1 above can be matched using this model and the estimated
shocks.
It is important to stress that in this model shocks propagate to other variables and persist
over time. A shock that arrives at time t persists in the economy for several periods because of
two reasons. Suppose that a positive shock has arrived, so that the values of
and A are higher
than expected. First reason that the shock persists is that the arrival of positive shock increases the
marginal product of labor for a given value of k and increases output. Some of that increased amount
of output will be consumed and the remainder will be used for investments. Higher investments
will create higher amount of capital in the next period. Therefore, in the next period the amount of
capital will be higher than if there was no positive shock. This will imply higher marginal product
of labor and higher output, leading again to higher investments than there would have been without
the positive shock. The second reason is that A has autocorrelation as measured by . Higher than
expected shock
will persist then in A. It turns out that usually this channel is quantitatively
the most important one. Therefore, the magnitude of the persistence
is very important for RBC
models.
How do we solve equations (33)-(36) for c, k, l, and A? For a fairly general set of parameters
we cannot actually derive the analytic solution of this model. To …nd solution of this model then
we run computer simulations.16 There are certain important insights, however, that we can draw
16
Usually this is done for the log-linearized versions of these equations around the steady-state of the system. For a
function f (x) this involves writing a …rst order logarithmic approximation: f (x) = f (x ) + f 0 (x ) (ln x ln x ).
74
from this system of equations without explicitly solving them.
We will say that our model economy is in a "steady
Steady state and technology shocks:
state" when all variables in equations (33)-(36) are time invariant. We will say that the economy is
in a "deterministic" steady state if there are no shocks, , in the economy.
Suppose that in the long-run there are no shocks. It is possible to show that then in the long
term all variables in equations (33)-(36) are time invariant if j j < 1. Let’s assume that j j < 1.
According to the table from King and Rebelo (1999) this is …ne at least for the US.17
Suppose that in the steady state we have a given value for A (e.g., A = 1). Using (33)-(35) it is
easy to show that in the steady state we have
l =
k =
"
h
1
(1
1
1
1
(1
)
(1
c =
h
(1
i
h
1
) +
#1
A
)
1
)
(1
i
)
(1
i;
)
l;
(1
"
)
1
(1
1
)
(1
)
A
#1
l:
Let’s consider now a permanent increase in the value of A to A0 . Such a permanent increase
would imply a new steady state where
l =
k0 =
"
1
c0 =
h
1
1
h
(1
1
(1
)
(1
)
1
(1
i
h
1
) +
#1
A0
(1
i
)
)
(1
i;
)
l;
(1
)
1
"
(1
1
)
(1
)
A0
#1
l:
Clearly, in this new steady state capital stock and consumption are higher than in the old steady
state: k 0 > k and c0 > c. Therefore, positive (negative) technology shocks increase (reduce) the
steady state levels of capital, output, and consumption.
Transition and technology shocks:
Consider a model economy which starts at a deterministic
steady state and receives positive technology shock so that A increases to A0 . We know that the
new steady state of this economy features higher levels of capital, output, and consumption. How
does the economy get to the new steady state? If there are no further shocks than it can be shown
17
If j j > 1 then we will simply need to consider a "detrended" version of the model where we subtract the growth of
A from A, Y , K, and C.
75
from (33)-(35) that the economy will gradually transit toward the new steady state. During this
transition, capital, output, and consumption will increase.
For a more general discussion, we need to consider an economy that starts at a deterministic
steady state and periodically receives technology shocks which imply di¤erent values of K, Y , and
C in future steady state. After each shock the economy starts adjusting/transiting toward the new
steady state.
Since we are not able to solve for the variables (33)-(36) we will need to simulate the model
economy on a computer in order to assess its performance in terms of generated cycles in output,
investments, consumption, and other variables. In order to simulate it we need to use values for
parameters. The usual values of the parameters (for the US) are
= 0:667;
= 0:984; = 0:025;
= 3:48; = 0:979;
= 0:0072:18
The simulations provide the simulated values of model variables. We use then the de…nitions
of the standard deviation and correlations to check the model predictions. The following table,
borrowed from King and Rebelo (1999), summarizes the results from this exercise.
The simulation results indicate that the model closely matches the volatility of output, investments, consumption, and wages. It matches also autocorrelations and the observed procyclicality
(positive correlations with output) of almost all variables.
Although this "simple" model preforms astonishingly well for matching these data moments it
fails in matching the cyclicality of labor hours and interest rate. Hansen (1985) has suggested a
way to circumvent the problem with labor supply making it an indivisible choice at the individual
level so that the at macro level the elasticity of labor supply to shocks is very high. However,
18
See Table 2 in King and Rebelo (1999).
76
that might create excess volatility in wages. To match the volatility of the interest rate and wage
rates then this simple real business cycle model is complemented with price rigidities. Adding price
rigidities, however, makes the model much more intractable and brings back Keynesian arguments.
We discuss one of the ways price rigidities are usually modelled in these frameworks in the next
section.19
Price Rigidities - The Calvo (1983) model
Classical and Keynesian schools di¤er the most in their view of how markets work. Classical (and
neo-classical) school of thought conjectures that markets are perfect (i.e., frictions and distortions
are insigni…cant). In this respect, it conjectures that prices freely adjust to equilibrate demand
and supply in all markets. Classical dichotomy holds under this conjecture, i.e., money supply
does not matter for real variables. Keynesian (and new-Keynesian) school of thought conjectures
that there are insigni…cant frictions and distortions. In particular, these imperfections create price
rigidities. Therefore, prices do not adjust (fully) to equilibrate demand and supply in all markets.
This breaks classical dichotomy. Moreover, in such a setup shifts in demand and supply can a¤ect
output through prices too.20
There are two common ways for modelling price rigidity. The easy way to do so assumes that
changes in prices depend on time (time-dependent models.) For example, Taylor (1980) assumes
that …rms change their prices each n-th period and that in each period
1
n -th
of …rms change their
prices. Calvo (1983) assumes that in each period with some probability some of the …rms can
change their prices. More precisely, Calvo (1983) assumes that at the beginning of each period a
random event decides which of the …rms can change their prices and which of the …rms cannot. A
more cumbersome, but more appealing, way for modelling price rigidity assumes that prices depend
on the state of the economy (menu cost models.) In these models …rms change prices when the
expected bene…t of changing their prices is higher than the cost of changing prices (menu cost.) The
complications arise because expected bene…t of changing prices depends on the current and future
states of the economy.
DSGE models with Calvo (1983) style price rigidities tend to be the main workhorses in the
central banks. We will now cover a version of the Calvo (1983) model. It will provide us with an
upward sloping supply curve/Phillips curve.
Time is discrete. The economy is populated by a continuum of mass one of identical and in…nitely
lived households. The representative household derives instantaneous utility from consumption of
a basket of goods. The lifetime utility of the household is given by
U=
+1
X
t
Ct ;
t=0
19
Another criticism that applies to the RBC models is their reliance on Solow residual which is usually not very well
estimated.
20
The assumption that prices are very rigid …nds limited support in microeconomic data.
77
where C is a constant elasticity of substitution basket of goods i;
2 1
31
Z
C = 4 C~i di5
(37)
0
where
it by .
2 (0; 1). The elasticity of substitution between any pair C~i and C~j (i 6= j) is
1
1
.21 Denote
The households spend their entire income on purchases of C. Therefore, the representative
household’s budget constraint it
PC C =
Z1
Pi C~i di;
(38)
0
where PC is the price of C and Pi is the price of good i.
The household chooses its demand for di¤erent goods to maximize its lifetime utility. Since it
has no dynamic decisions (and therefore no inter-temporal trade-o¤s), the maximization of lifetime
utility is equivalent to the maximization of instantaneous utility streams. The problem of the
representative household can divided to two steps. In the …rst
n step
o the representative household
chooses C to maximize its utility. In the second step it chooses C~i
to maximize C. Therefore,
i2[0;1]
in the …rst step it solves
8
<
max C
C :
0
@PC C
Z1
0
19
=
Pi C~i diA ;
;
for any time t. The solution of this problem gives the shadow value of marginally relaxing the
budget constraint
1 = PC :
In the second step the household solves the following problem
8 2
31
>
Z1
<
max
PC 4 C~i di5
>
~
C
f i gi2[0;1] :
0
Z1
0
9
>
=
~
Pi Ci di :
>
;
The solution of this problem is given by …rst order conditions for all goods i. Treating the integrals
as sums, these …rst order conditions are
21
"ci ;cj =
~i =C
~j )
d ln(C
d ln
@C
~
@C
j
= @C
~
@ Ci
=
~i =C
~j )
d ln(C
1 ~
1)
~
d ln(C
=Ci
j
h i
C~ 1
C~i : Pi = PC C i :
C
=
1
1
:
78
(39)
where C =
Z1
C~i di. This expression together with (37) and budget constraint (38) implies that
0
Pi
1
PC C
C
=
1
C~i :
Therefore, it implies that
2 1
Z
4 P
0
1
i
31
di5
Finally,
1
PC C
C
=
1
2 1
31
Z
4 C~i di5
0
1
1
= PC
2 1
Z
4
PC =
Pi
C
1
0
1
C
3
1
1
1
= PC
1
:
1
di5
;
(40)
which means that the aggregate level of price is a basket of prices of goods i.
Moreover, the (inverse) demand function implies that
2 1
31
Z
Pi = PC 4 C~i di5
C~i
1
0
Therefore,
C~i =
PC
Pi
=
PC
Pi
1
1
C
C:
Each …rm produces an i good. Firms have monopoly in their product and set prices. Firms choose
prices to maximize their real pro…ts.
In case when there are no price rigidities …rms solve
max
Pi
PC
Pi ~
Ci
PC
'i C~i ;
s:t:
C~i =
PC
Pi
1
1
C:
where 'i is the marginal cost of producing C~i amount of good i. Plugging the demand function
79
Pi
PC
into pro…t and taking the …rst order condition with respect to
Pi
PC
:
gives
Pi
'
= i;
PC
This expression is the (inverse) supply of good i. Under perfect competition the relative price is
equal to marginal cost. Here, because we have monopolists, the price is equal to
than M C C~i since 2 (0; 1).
'i
which is greater
This relation should hold for any time t. Assuming that …rms are symmetric, in logarithms, the
expression o¤ered above can be written as
pt = ln 't
where ln
< 0 since
ln ;
2 (0; 1).
Suppose now that there are price rigidities. At any time t any …rm i with a probability of
a sticky price and cannot change its price. With probability 1
has
it does not have a sticky price
and can change its price. In this case …rms set their prices not knowing when they’ll be able to
reset them. Therefore, expectations of future prices will matter for their current decisions. A …rm
which does not have a sticky price knows that if it sets price then with probability
the next period (exactly one period). Moreover, with probability
2 (1
2 periods and exactly for 3 periods with probability
(1
it will last for
) it will last exactly for
). The expected length of time until
price reset is given by
+1
X
t
1
=
1
t=0
:
Firms in this case maximize their present discounted value of pro…t streams whenever they can
adjust their prices. Assume that at time t …rm i is able to adjust its prices then it solves its problem
taking into account that for certain period of time (indexed by k below) it will not be able to reset
the price. Therefore, it solves at time t
Vt = max
fPi;t g
(
Et
s:t:
C~i;t+k =
" +1
X
(
)
k=0
PC;t+k
Pi;t
Pi;t
k
PC;t+k
C~i;t+k
'i;t+k C~i;t+k
#)
;
(41)
Ct+k :
To solve this problem plug C~i;t+k into Vt and take the …rst order condition with respect to Pi;t . This
exercise gives
[Pi;t ] : 0 = Et
" +1
X
k=0
(
)
k
(1
)
Pi;t
PC;t+k
+ 'i;t+k
Pi;t
PC;t+k
1
1
PC;t+k
Ct+k
#
Given that Pi;t does not depend on k it can be taken out of the sum and this expression can be
80
rewritten as
0 = Et
" +1
X
)k (1
(
)
k=0
Pi;t
PC;t+k
#
+ 'i;t+k PC;t+k Ct+k ;
Therefore, the price of …rm i at time t relative to aggregate price PC;t is given by
Pi;t
=
PC;t
1
+1
X
k=0
+1
X
PC;t+k
PC;t
)k Et 'i;t+k
(
Ct+k
:
k
(
) Et
1
PC;t+k
PC;t
Ct+k
k=0
Let all the …rms that are able to set price at time t be symmetric, i.e., 'i;t+k
't+k . Moreover,
let also all the …rms which are not able to set prices at time t be symmetric. Therefore, denoting
the price of …rms which are able to adjust it at time t by Pta gives
Pta
=
PC;t
1
+1
X
k=0
+1
X
PC;t+k
PC;t
)k Et 't+k
(
Ct+k
:
)k Et
(
1
PC;t+k
PC;t
(42)
Ct+k
k=0
The aggregate price which prevails is the weighted average of prices of …rms which adjust their price
and …rms which do not adjust their price. It is given by (40):
1
Pt
1
= Pt
1
) (Pta )
+ (1
1
;
(43)
where we have dropped subscript C.
Equations (42) and (43) constitute new-Keynesian Phillips curve. To see this, …rst divide (43)
by Pt . Next, consider the …rst order log-linear approximations of these equations around the steadystate, where all …rms are able to set prices. To do so denote
Pt
1
f (Pt
1 ; Qt )
=
g (Pt
1 ; Qt )
= f ePt 1 ; eQt =
1
+ (1
) Qt
The …rst order approximation of g around ln P^t
g (Pt
1 ; Qt )
=
^
ln ePt
^
+ ePt
+ (1
1
1
1
1
^
^
) eQt
ln ePt
1
1
^
1 ; ln Qt
1
P^t
1
1
1
^
eQt
81
1
1
1
1
) ln eQt
+ (1
1
1:
point is
) ln eQt
1
1
1;
^
+ (1
ePt
1
1
1
^
ln ePt
^
Q
t
1
1
1
Pt
1
^
1
ln eQt
ln P^t
Qt
1
^t
ln Q
At the steady-state P^t
1
^ t = 1. Therefore, around the steady-state it has to be that
=Q
g (Pt
Since f (Pt
1 ; Qt )
= g (ln Pt
1 ; Qt )
1 ; ln Qt )
=
1
and f (Pt
1
1
+ (1
1 ; Qt )
0 = pt
Notice that pt
Pt
1
Qt :
0 we have that around the steady-state
+ (1
1
)
) qt :
= ln PPt t 1 which is approximately minus 1 times in‡ation rate. Therefore,
t
1
=
qt :
Now consider log-linear approximation of (42) around the steady-state assuming for simplicity
that in the steady-state C = 1. To do so rewrite it as
Qt
+1
X
)k Et
(
k=0
"
#
1
PC;t+k
PC;t
Ct+k =
+1
X
1
"
PC;t+k
PC;t
)k Et 't+k
(
k=0
#
Ct+k :
The approximation of the left had side around the steady-state point is as follows. Denote
g~ (Pt
1 ; Qt )
= ln eQt
+1
X
)k Et
(
k=0
The …rst order log-linear approximation of g~ (Pt
PC;t+k
g~ Qt ; Ct+k ;
PC;t
+ Qt
^t
ln Q
+1
X
k=0
^
+ ln eQt
+1
X
(
k=0
^
+ ln eQt
+1
X
k=0
(
^
ln eQt
2
(
2
)k Et 4(
!
^
^
P
t+k
^
P
t
!
1) ln eCt+k
82
3
^
ln eCt+k 5
Ct+k
ln e
^
P
t+k
^
P
t
1
1
#
1
Pt+k
Pt
ln eCt+k :
^ t ; ln C^t+k ; ln
around ln Q
)k Et 4 ln e
k=0
^
P
t+k
^
P
t
1 ; Qt )
ln e
2
(
)k Et 4 ln e
)k Et 4 ln e
2
+1
X
"
!
1
^
!
1
e
point is
3
ln eCt+k 5
3
ln C^t+k 5
^
P
t+k
^
P
t
P^t+k
P^t
^
P
t+k
^
P
t
e
^
P
t+k
^
P
t
Pt+k
Pt
!3
^
Pt+k 5
ln
P^t
^ 0 this is given by
Around the the steady-state point 0; C;
PC;t+k
g~ Qt ; Ct+k ;
PC;t
C^
1
+ Qt C^
1
1) C^
+(
+1
X
1
+
1
)k Et [Ct+k ]
(
ln C^
k=0
Pt+k
:
Pt
)k Et
(
+1
X
k=0
Therefore, the approximation of the left hand side is
Qt
+1
X
)k Et
(
k=0
"
1
Pt+k
Pt
Ct+k
#
1
+
1
+(
1)
1
qt +
1
+1
X
(
)k Et [ct+k ]
k=0
+1
X
)k (Et [pt+k ]
(
pt ) :
k=0
In turn, the approximation of the right hand side is
1
+1
X
"
k
(
Pt+k
Pt
) Et 't+k
k=0
Ct+k
#
1
1
+
+
1'
Equate these expressions and notice that
qt + pt = (1
)
+1
X
+
1
1
'
1
+1
X
'
1 zt
k=0
(
)k (Et [pt+k ]
pt ) :
= 1 to get
)k Et pt+k + ln
(
't+k
:
'
= qt + pt . The above equation is the solution of the following di¤erence equation
where z0 = 0;
Et zt+1
Et
1 zt
=
(1
) pt + ln
't
:
'
Solving for qt from this equation gives
qt =
Using the relation
)k Et 't+k
k=0
k=0
Denote by Et
(
)k Et [ct+k ]
(
k=0
+1
X
+1
X
1
t
Et [qt+1 +
t+1 ]
+ (1
) ln
't
:
'
= qt gives
t
= Et [
t+1 ]
+
1
83
(1
) ln
't
:
'
This is the new-Keynesian Phillips curve. The di¤erence between it and Keynesian Phillips curve
is that it contains no backward-looking terms but contains forward-looking (expected) in‡ation
matters.
84
Expectations and …nancial markets
Bonds
Bonds are (usually) very primitive …nancial instruments. They are forms loans (or I owe you/IOU).
More precisely, a bond is a certi…cate which states that the issuer is indebted to the owner of the
certi…cate. It states the amount of the debt to be paid back (principal), the time when the debt
has to be paid (maturity date). Depending on the type of the bond, the issuer might also be
obliged to pay the owner interest (coupon). Interest is usually payable at given and …xed intervals
(e.g., monthly or annual). Quite often bonds are negotiable in the sense that the ownership can be
transferred. This makes certain types of bonds highly liquid.
Firms tend to issue bonds in order to meet their …nancial requirements. According to the Pecking
Order Theory (of …rm’s capital structure), issuing debt/bonds is …rms’second most preferred way
to raise …nance for investments. First most preferred way is to use internal …nance, and the least
preferred way is to raise …nance through issuing equity. In this respect, equity (stocks) and bonds are
quite much alike. They are both …nancial instruments for …rms. However, there is a big di¤erence
between bonds and equity/stocks. The stockholders of a …rm are investors since they have equity
stakes (ownership rights) in the …rm. In turn, the bondholders have creditor stakes in the …rm
since they have lent money to the …rm. Bondholders are creditors, therefore, they usually have
(absolute) priority over stockholders and will be repaid …rst in the event of bankruptcy. Another
and less important di¤erence is that bonds usually have a de…ned maturity (consoles or annuities
don’t have maturity but are very rarely issued). In contrast, stocks are typically for inde…nite
period. Not many …rms, however, issue bonds. Moreover, …rms which issue bonds typically are
quite large.
Almost in all countries one of the most signi…cant issuers of bonds is the government. The
government uses bonds in order to take loans. These loans, together with taxes, the government
uses in order to cover its expenditures. For example, usually during recessions tax proceeds decline.
In turn, the governments implement counter-cyclical policies and increase their expenditures. These
expenditures they …nance then using loans/bonds.
Government bonds tend to be sold on auctions, where participants are, for example, banks,
investment and hedge funds, various speculators, the central bank, etc. (You can imagine something alike a stock/derivative exchange. In fact, bond futures are massively traded on derivative
exchanges.) These markets are usually very liquid.22 One of the most liquid markets is the market
for US government bonds (US Treasury Bonds). Currently (18.03.2014), US debt issued in various
forms of bonds and IOUs amounts to $17,546,814,482,078.90 (yes! more than $17 trillion.)
There are many (and very complicated) types of bonds. For our purposes it is su¢ cient to
consider the following simplistic examples. Hereafter, in our discussion in this chapter we will
22
Very recently, because of the …nancial crisis (and, perhaps, reckless landing/borrowing) in several EU countries bond
markets dried up. The governments of these countries then faced tough …nancial constraints and asked for loans (i.e.,
for bond purchases) from the ECB, etc.
85
consider perfect competition in all (bond) markets and no lending constraints (i.e., think about
very liquid bond market where some of the participants can print money.) Bond 1 matures in 1
year and pays principal P . What is the fair market value/price of such a bond? Let yearly nominal
interest rate be i11 , where subscript stands for the compounding time (1 year) and superscript
stands for the time when compounding will happen (in 1 year). Then the price of this bond is the
discounted value of the principal
P
:
1 + i11
PB1 =
Bond 2 matures in 3 years and pays principal P . What is the fair market value/price of Bond 2?
Let the yearly interest rate be the same across years and, as in previous example, let it be equal to
i11 . The price of Bond 2 is the discounted value of the principal. It is
P
PB2 =
1 + i11
3:
Clearly, as long as i11 > 0, PB1 > PB2 since Bond 1 matures earlier than Bond 2.
Notice that bank loans seem to have somewhat di¤erent structure. Usually, when an individual
takes L (EUR) loan for a year from a bank, the loan speci…es the yearly nominal interest rate i.
~ In such a case taking a
Therefore, the individual has to repay (1 + i) L. Denote the latter by L.
~ Therefore, there is almost no di¤erence.
loan is equivalent to issuing a bond with principal L.
Bond 3 matures in T years and pays principal P . Let the yearly interest rate be i11 . The price
of Bond 3 is
P
PB3 =
1 + i11
T
:
Bond 4 matures in T years and pays principal P . Moreover it pays yearly coupons C. Let the
yearly interest rate be i11 . The price of Bond 4 is
PB4 =
P
1+
T
i11
+
T
X
t=1
C
1 + i11
t:
Bond 5 is exactly the same as Bond 4 but its coupon payments vary over time so that we have Ct
instead of C. The price of Bond 5 is
PB5 =
P
1+
T
i11
+
T
X
t=1
Ct
1 + i11
t:
Consider again Bond 2. Assume now, however, that interest rate during the …rst year is i11 , in
the second year it is i21 , and in the third year it is i31 . In such a case, the price of Bond 2 is
PB2 =
1+
i11
P
1 + i21
86
1 + i31
:
In such a case the prices of Bond 3, Bond 4, and Bond 5 would be
PB3
PB4
PB5
=
=
=
T
Y
t=1
T
Y
t=1
T
Y
t=1
P
;
1 + it1
T
t
XY C
P
+
;
1 + it1
1 + i1
P
+
1 + it1
t=1 =1
T Y
t
X
t=1 =1
Ct
:
1 + i1
Bonds 1, 2, and 3 are called zero coupon bonds since they pay no coupons. In …nance interest
rate is usually called yield since it indicates what a loan (an investment) yields. The time sequence
of (similar) interest rates is termed as yield curve or term structure of interest rate. In our examples,
the yield curve is i11 , i21 , i31 , ..., iT1 .
Imagine a market where zero coupon bonds are traded. which promise the same principal P .
Let there be bonds of all possible maturities: 1 year, 2 years, ..., and T years. It is straightforward
to determine the yield curve from the prices of these bonds. With slight abuse of previous notation,
let the prices of bonds with maturities 1 year, 2 years, ..., and T years be PB1 , PB2 , ..., and PBT ,
correspondingly. Therefore, interest rates are given by
i11 =
P
PB1
1;
P
1;
1 + i11 PB2
P
=
1;
1
1 + i1 1 + i21 PB3
:::
!
TY1
P
1
1:
=
t
1 + i1 PBT
i21 =
i31
iT1
t=1
Call this algorithm ( ).
Usually yield curves are upward sloping. The following …gure illustrates the yield curve of US
Treasury bonds as of 9th of February 2005. It has interest rate/yield on Y-axis and time to maturity
on X-axis.
87
Clearly, in this example the yield curve is upward sloping with diminishing marginal rate of
increase. One of the widely accepted explanations for upward sloping yield curves is that longer
maturities entail greater risks for creditors/lenders. Lenders then demand a risk premium and
demand more premium for longer maturities. This explanation depends on the notion that currently
creditors (as well as we) know less about the distant future than about the very near term.
Another commonly accepted explanation is related to the IS-LM Model. Imagine for example,
that investors are expecting either a gradual shift of IS to the right or a gradual shift of LM to the
left (or both). In such case, they would expect to have higher interest rates in future and trade
accordingly.
This explanation points a de…ciency in our pricing formulas since it points out that interest rates
might be highly variable/random. In such a case in all our formulas we need to replace interest
rates with their expected values. For example in the most general case we have for the price of
Bond 5
PB5 =
T
Y
t=1
P
1+
it;e
1
+
T Y
t
X
t=1
To make things simple consider the case when Ct
C
:
1 + i1 ;e
=1
0 and T = 3. Clearly, this corresponds to Bond
2. The price of the bond is then
PB2 =
P
1+
i1;e
1
1 + i2;e
1
1 + i3;e
1
:
Notice that the price of the bond depends negatively on expected interest rates. Therefore, if
creditors are expecting higher rates in future the price of the bond declines. This is because if
investors are expecting higher rates in future they would rather wait and invest in future. Borrowers
in such a case receive less for the same principal. Therefore, again, expectation matter.
88
Money supply and risk-free bonds
Central banks usually announce their monetary policy in terms of a level of nominal interest rate.
They adjust their money supply so that they meet the targeted rate. How do they do that? They
buy or sell bonds in the market. To see how this works, consider, again, a deterministic world and
zero coupon bonds. Lets take as an example Bond 1,
PB1 =
P
:
1 + i11
If the central bank buys many such bonds, PB1 would increase. Higher PB1 implies lower i11 .
Conversely, if the central bank sells many such bonds, PB1 would decline which would imply higher
i11 . When the central bank buys bonds it does so using money. Therefore, it increases money supply.
In contrast, when it sells bonds it receives money. Therefore, it reduces money supply. This implies
that if the central bank announces that it plans to reduce interest rate then it plans to increase
money supply. Of course, if it announces that it plans to increase interest rate then it plans to
reduce money supply.
The central banks are (usually) public institutions. Therefore, they try to avoid excess risk. The
central banks then usually buy and sell government bonds, which tend to be thought to be relatively
safe assets. (After all, the government can almost always avoid defaulting on bonds denominated
in national currency with printing money.)
It is commonly thought that one of the most safe government bonds are the short maturity
US Treasury Bonds. Economists usually take them as the risk-free assets/bonds. The di¤erence
between the interest paid on US Treasury Bonds and interest paid on equivalent (e.g., in terms
of maturity and coupons) bonds to certain extent represents risk premium (i.e., the premium the
investors demand to hold more risky asset.).
Yield curve and zero coupon bonds in reality
In reality there are almost no zero coupon bonds. Usually, zero coupon bonds are only those that
have paid their last coupon and are very close to maturity. This renders our algorithm ( ) almost
useless. For …guring out the yield curve a special algorithm is used. This algorithm is called
bootstrap.
The following example illustrates this algorithm. Imagine we have a bond very close to its
maturity, which has paid all its coupons. Basically we have a bond like B1 . We observe its price
and know its principal. Therefore, interest rate (in a perfect market) is
i11 =
P
PB1
1:
Further, imagine we have a bond which matures in two years and pays coupon in a year. We know
its price (PB0 2 ), principal (P ), and the value of coupon that it pays (C). Therefore, in a perfect
89
market it has to be that,
P
PB0 2 =
1 + i11
+
1 + i21
C
:
1 + i11
It is easy to …gure out i21 from this expression,
i21 =
1+
P
PB0 2
i11
C
1:
Stocks, stock prices, and stock markets
The stock of a …rm is the equity stake of its owners. It represents the residual assets of the
…rm that would be paid to stockholders after discharge of all senior claims (e.g., bonds/debt). A
stockholder/shareholder is an individual who legally owns one or more shares of stock in a …rm.
Firms (usually) maximize their value appropriately designing production and marketing. It
turns out that this is equivalent to maximizing their shareholder value.
To see this, suppose that we have a …rm which lives T periods and has real pro…t stream
t
in each period t. Denote real interest rate by r and, for simplicity, assume that it is constant over
time. At time t = 1, the present value of this …rm is given by
V1 =
=
1
+
1
1+r
2
T
X
1
(1 + r)
=1
+
1
1
(1 + r)2
3 :::
+
1
(1 + r)t
1
t :::
+
1
(1 + r)T
1
T
:
The present value of the …rm V1 is the total present value of its stocks.
Clearly, V1 can be rewritten in the following manner
V1 =
where
1
1
+
1
V2 ;
1+r
are the pro…ts in current period (t = 1) and
1
1+r V2
are the discounted future capital gains.
For simplicity, assume that this …rm has zero net debt. A stockholder/shareholder owns a
share/fraction of V1 . Let this fraction be some ! from (0; 1). Therefore, in other words, a shareholder
has an entitlement to !V1 amount of value (real money). It is clear that the maximization of V1 is
equivalent to maximization of !V1 .
In practice, shareholders usually call pro…ts
dt instead of
t.
t
dividends. In such a case it is practical to write
In this respect, the value of the …rm is simply discounted sum of dividends.
If this …rm is privately or publicly traded in a perfect market then its market value is equal to
its value. Therefore, the price of a 1 percent ownership of the stock of the …rm (or the price of 1%
share simply) is 0:01V1 . In practice, S is used to denote stock price, instead of V .
90
Stock markets and speculators
Firms are publicly traded (usually) in stock markets/exchanges. Examples of stock markets are
New York Stock Exchange and London Stock Exchange. In a stock market, speculators engage in
a trade of stocks of many …rms simultaneously.
Hereafter, we will assume that the values of …rms are exogenously given keeping in mind that
they depend on the pro…ts of the …rms. For publicly traded …rms, this is equivalent to assuming
that the stock prices of the …rms are given.
In a deterministic world, speculators invest in the stocks of those …rms which will have the
highest returns. For example, if there are two …rms, Firm 1 and Firm 2, which currently have stock
price of S and are expected to have stock price S2 < S1 , then the speculators would buy stocks of
Firm A. This is because the return on buying the stock of Firm 1 (R1 ) is higher than the return
on buying the stock of Firm 2 (R2 ),
R1 =
S1
S
S
>
S2
S
S
= R2 :
Of course, in complete markets, because of arbitrage the price of the stocks of Firm 1 would increase
so that after all R1 = R2 . We have assumed however that the stock prices are given. Therefore, S
does not change in our example.
Stochastic returns and portfolios of stocks: The world is not deterministic in the sense that
future pro…ts of …rms are usually not very predictable. Therefore, the future prices of stocks are not
very predictable. Speculators/investors then decide upon expected returns. Moreover, they usually
hold portfolios of stocks.
For example, if N …rms are traded in the stock market and there are M speculator/investor,
then speculator j holds a portfolio
Pj =
N
X
i;j Si ;
i=1
where
i;j
is the amount of stock Si in the portfolio of speculator j. If
does not have stocks of Firm i. If
stocks of Firm i. If
i;j
=
i;j
i;j
= 0 then speculator j
= 1 then speculator j holds real money equivalent to all the
1 then speculator j does not have stocks of Firm i. Moreover, speculator
j is in debt to deliver real money equivalent to all stocks of Firm i. In other words, speculator j
has short-selled stocks of Firm i.
Can
i;j
be higher than 1 or lower than
1? Yes, it can. This is because Si is the real value
of the stock of Firm i. In such a case contracts can be written promising multiples of the value of
that stock. Consider, for example, a situation when speculator j has all the stocks of Firm i as well
as a certi…cate promising to pay exactly twice the value of the stock whenever exercised. In such a
case the speculator has real value of 3Si . In other words
91
i;j
= 3.
Clearly, for any …rm it has to be that the percentage of its stocks sums to 1 across all speculators,
M
X
i;j
= 1:
j=1
In terms of portfolios, speculators/investors select the amounts of stock ownership
and focus
on the returns of their portfolios. If the price of a selected portfolio is P then the return on it is
percentage change of its price over time. Using the notation of our previous example, the return is
Rj =
Pj
P
P
:
These returns are stochastic (are random variable) because the future value of portfolio Pj is a
random variable.
Utility maximization and mean-variance trade-o¤
Speculators buy stocks to maximize utility. It turns out that generally the problem of speculators
to design an appropriate portfolio can be summarized in terms of expected value and risk trade-o¤
given that the future value of portfolio is a random variable.
The following example illustrates this point. Consider a speculator who has utility function u (:).
Let u (:) be strictly-increasing and concave in portfolio returns. Suppose the returns on portfolios
have normal distribution with E expected value and
2
variance. In such a case, the expected utility
of the speculator is
+1
Z
E [u (R)] =
u (R) f (R; E; ) dR;
1
where
1
2
e
2,
where
f (R; E; ) = p
Clearly, the expected utility depends on E and
Use Z to denote
Z=
R
E
1
2
(R
2
E 2
) :
is a measure of risk.
:
Given the properties of expectation and variance operators (see Appendix), Z is a normal random
variable with expected value 0 and variance 1. Z is usually called standardized return.
Replace R with Z in the expression for expected utility (notice that the limits of integration
don’t change):
+1
Z
E [u (E + Z)] =
u (E + Z) ' (Z) dZ;
1
92
where ' (Z) is the density function of standard normal distribution,
1
' (Z) = p e
2
1 2
Z
2
:
Next, we take the derivative of the expected utility E [u] with respect to the standard deviation
of return . Assuming that E [u] is …nite, the derivative is
dE [u]
d
+1
Z
d
u (E + Z) ' (Z) dZ
d
=
1
+1
Z
u0 (E + Z)
=
dE
+ Z ' (Z) dZ:
d
1
An indi¤erence curve is de…ned as the (locus of) points where
+1
Z
0=
u0 (E + Z)
dE[u]
d
= 0:
dE
+ Z ' (Z) dZ:
d
1
This expression can be further rewritten as
dE
=
d
+1
Z
u0 (E + Z) Z' (Z) dZ
1
+1
Z
;
u0 (E + Z) ' (Z) dZ
1
where E is the expected return of the portfolio and
is its standard deviation. E ( ) represents
indi¤erence curve in expected value E and standard deviation (variance) space . It is increasing
and convex in . In other words,
dE
d
> 0 and
d2 E
d 2
> 0 (see Appendix).
Utility increases moving the indi¤erence curve from right to left in variance and mean space.
The following …gure shows these indi¤erence curves
93
Measures of returns and risk
Expected value of a random variable is its concentration point. If the random variable has continuous
distribution function then almost never it will be equal to its expected value. However, most likely
its realization will be closer to its expected value than other points in distribution. Therefore,
expected value of a random variable tends to be the best guess available.
In our previous example, we used variance as a measure of risk. Variance of random variable
shows how dispersed its outcomes may be. Therefore, it is one of the most commonly applied risk
measures. It is especially relevant for normally distributed random variables.
Digression (Central Limit Theorem):
weights
1
N.
Consider a portfolio consisting of i.i.d. stocks with
According to the Central Limit Theorem the returns on this portfolio will be approxi-
mately normally distributed. The Central Limit Theorem is very powerful result. This result is one
of the reasons why normal distribution is used so often.
There are many other measures of risk. Examples are the range, the semi-inter-quartile range,
the semi-variance, and the mean absolute deviation. Each of these measures can have slightly
di¤erent implication (scale) for risk.
The range (RANGE) is de…ned as
h the di¤erence
i between the highest and lowest outcomes. Let
min
max
the returns on portfolio Rj be from Rj ; Rj
. Then the range is
RAN GE = Rjmax
Rjmin :
The semi-inter-quartile range (SRANGE) is usually de…ned as the di¤erence between the 75th and
25th quantiles of the random variable. In our example,
SRAN GE = Rjq75
94
Rjq25 :
The variance is a central moment in the sense that considers deviations from the mean/expected
value, i.e., V [Rj ] = E [Rj
E [Rj ]]2 . In this sense it gives equal weight to deviations from the
mean/expected value. However, risk averse investors might be more concerned about returns below
the mean (i.e., downside risk but not upside risk). The semi-variance (SEMIVAR) is a measure of
risk that relates just to that risk. It de…ned as
~j =
R
(
Rj
E [Rj ] if
Rj < E [Rj ]
0
if Rj
h i2
~j :
SEM IV AR = E R
E [Rj ]
Variance and semi-variance can be sensitive to observations distant from the mean/expected value
(i.e., outliers). The mean absolute deviation (MAD) avoids this problem. It is de…ned as
M AD = E [jRj
E [Rj ]j] :
Hereafter, we will maintain an assumption that returns on stocks/portfolios have normal distribution. Therefore, we will characterize the risk with variance (or its square root: standard deviation).
Measuring portfolio return and risk to assemble a portfolio
An expected utility maximizing speculator/investor assembles its portfolio of stocks so that to
achieve the highest possible utility. It assembles the portfolio selecting the number of each type of
stocks ( in our previous example). The expected return and risk of portfolio are directly linked to
the expected returns and risks of underlying assets/stocks.
The following very simplistic example illustrates this point. Let there be only 2 …rms and 2
stocks Firm 1 and Firm 2, and S1 and S2 . The current value of the portfolio of the speculator is
P =
where
1
is the number of S1 stocks and
2
1 S1
+
2 S2 ;
is the number of S2 stocks.
Slightly abusing the notation, let the next period prices of stocks 1 and 2 be S11 and S21 , correspondingly. Therefore, next period price of the portfolio is
P1 =
1
1 S1
+
1
2 S2 :
The return on portfolio then is
S1 + 2 S21
P
P
1
1
S1
S2 S2 S2
1 S1 S1
=
+ 2
P
S1
P
S2
= ! 1 RS1 + ! 2 RS2 ;
R =
P1
P
=
1
S11
95
S2
where RS1 is the return on stock S1 , RS2 is the return on stock S2 . ! 1 is the weight of stock S1 in
the portfolio, and ! 2 is the weight of stock S2 in the portfolio. These weights sum up to 1,
1 = !1 + !2:
In this respect this formula implies that the return on portfolio R is the weighted sum of returns
on underlying assets/stocks RS1 and RS2 . Notice that selecting the number of stocks in portfolio
1
and/or
2
the speculator selects weights of stocks ! 1 and ! 2 in the portfolio.
The expected value of the return on this portfolio is
E [R] = E [! 1 RS1 + ! 2 RS2 ]
= ! 1 E [RS1 ] + ! 2 E [RS2 ] :
In turn, the variance is
V [R] = ! 21 V [RS1 ] + ! 22 V [RS2 ] + 2! 1 ! 2 COV [RS1 ; RS2 ] ;
where COV [RS1 ; RS2 ] is the covariance of RS1 and RS2 ;
COV [RS1 ; RS2 ] = E [(RS1
E [RS1 ]) (RS2
E [RS2 ])] :
Intuitively, the portfolio P = ! 1 RS1 + ! 2 RS2 varies because so do RS1 and RS2 . However, we need
to take into account that RS1 and RS2 co-vary or vary together and that can amplify or reduce the
variance of the portfolio. The following two examples illustrate the latter point. Suppose S1 and
S2 are almost the same (may be because they are stocks of vertically or horizontally interrelated
…rms) and
RS1 = RS2 :
In such a case
V [RS1 ] = V [RS2 ] ;
COV [RS1 ; RS2 ] = V [RS1 ] ;
and
V [R] = ! 21 V [RS1 ] + ! 22 V [RS1 ] + 2! 1 ! 2 V [RS1 ]
=
! 21 + 2! 1 ! 2 + ! 22 V [RS1 ]
= (! 1 + ! 2 )2 V [RS1 ]
= V [RS1 ] :
This inference should have followed since if RS1 = RS2 then P = RS1 . This example corresponds to
96
the case when S1 and S2 co-vary perfectly (they are almost the same). Suppose now that S1 and
S2 are extremely di¤erent (may be because they are stocks of competing …rms) and when RS1 = 1
then RS2 = 0 but when RS1 = 0 then RS2 = 1. Let further RS1 be equal to 1 with probability 12 ,
which implies that E [RS1 ] = E [RS2 ] = 21 . Moreover,
V [RS1 ] = E [RS1
= E RS2 1
E [RS1 ]]2 = E RS1
RS1 +
1
2
2
1
1
= :
4
4
Apparently, the same holds for RS2 : V [RS2 ] = 41 . However, notice that
COV [RS1 ; RS2 ] = E
RS1
= E [RS1 RS2 ]
=
1
2
RS2
1
1
1
RS2
RS1 +
2
2
4
1
1
1
1
1
E [RS1 ] + = 0
E [RS2 ]
E [RS1 ] +
2
4
2
2
4
1
2
= E RS1 RS2
1
E [RS2 ]
2
1
:
4
Therefore, since the expected value of the return on portfolio is simply the weighted sum of expected
returns on stocks, we have that
E [R] = E [! 1 RS1 + ! 2 RS2 ]
= ! 1 E [RS1 ] + ! 2 E [RS2 ]
1
:
=
2
In turn, the variance of the portfolio is
1
1
V [R] = ! 21 + ! 22
4
4
= (! 1
2! 1 ! 2
1
4
1
! 2 )2 ;
4
which can be strictly less than the variance of any of the stocks in portfolio. Let ! 1 = ! 2 then
V [R] = 0 < 41 . However, if ! 1 = 1 or ! 2 = 1 then V [R] = 41 . This happens because of negative
covariance between RS1 and RS2 and is called diversi…cation of risk.
This example conveniently illustrates a very important point. In case RS1 and RS2 don’t vary
together perfectly, speculators can choose the weights of S1 and S2 in their portfolios so that to
minimize variance/risk for the same level of expected return.
The magnitude of the covariance is not easy to interpret since it depends on the possible realizations of random variables. Usually, therefore, another measure is used to describe (linear) relation
between random variables. It is the normalized version of the covariance and is called correlation
97
coe¢ cient. It is de…ned as
COV [RS1 ; RS2 ]
:
=p
V [RS1 ] V [RS2 ]
S1 ;S2
By de…nition,
V [RS1 ] =
2
S1
RS1 ;RS2
2 [ 1; 1]. Using the correlation coe¢ cient and denoting V [R] =
2
S2
and V [RS2 ] =
2
P
2,
P
we have that the variance of the portfolio returns is
= ! 21
2
S1
+ ! 22
2
S2
+ 2! 1 ! 2
S1 S2 S1 ;S2 :
A speculator chooses the weights ! 1 and ! 2 to construct its portfolio and makes its choices upon
expected value and variance of returns. Therefore, it is interesting to see how expected value and
variance of the portfolio returns depend on ! 1 and ! 2 . First of all notice that ! 2 = 1 ! 1 . Therefore,
it is enough to choose only one of the weights, e.g., ! 1 . Let for simplicity E [RS1 ] > E [RS2 ]. In
such a case apparently the expected value of portfolio returns in linear and increasing function of
!1
E [R] = E [RS2 ] + ! 1 (E [RS1 ]
E [RS2 ]) :
In turn,
2
P
= ! 21
2
S1
! 21
=
! 1 )2
+ (1
2
S1
+
2
S2
2
2
S2
+ 2! 1 (1
S1 S2 S1 ;S2
!1)
2! 1
S1 S2 S1 ;S2
S2
S2
S1 S1 ;S2
+
2
S2 :
This is an upward opening parabola since
2
S1
for any value of
+
2
S2
RS1 ;RS2 .
2
S1 S2 S1 ;S2
=(
S1
S2 )
2
+2
S1 S2
1
S1 ;S2
>0
The following …gures illustrate the relationships between E [R] and
2
P
and ! 1
In the second …gure ! mv
1 is the ! 1 which delivers the lowest variance of returns. It can be found
98
from the …rst order condition:
@ 2P
= 0 , 2! 1
@! 1
2
S1
+
2
S2
2
S1 S2 S1 ;S2
2
S2
S2
S1 S1 ;S2
= 0:
Therefore,
! mv
1 =
S2
2
S1
+
S2
2
S2
S1 S1 ;S2
2
:
S1 S2 S1 ;S2
Given that portfolio returns are linear in ! 1 , we can easily replace ! 1 in the last …gure with
E [R]. In such a case we would have
In this …gure the curve
2
P
is the minimum variance opportunity set. This is the set of variance
and expected return points which o¤ers the minimum variance for a given expected rate of return.
In turn, E [Rmv ] is the expected return of minimum variance portfolio.
Given that we have for indi¤erence curves expected returns as functions of standard deviation,
usually this …gure is transposed.
The speculators’optimal choice of portfolio/weights is given by the point of tangency of indi¤erence curves and minimum variance opportunity set. This is because indi¤erence curves are convex
99
and utility increases as these curves shift to the left. Moreover, exactly because of that no portfolio
will be selected below the horizontal line that corresponds to E [Rmv ].
This analysis easily proceeds for the case when the number of stocks/assets is greater than
2. We have not de…ned what is S2 precisely. We needed simply that it has expected value and
variance. Therefore, it can be for example a linear combination of stocks of many …rms. Even
more, it can include bonds. Hereafter, we will assume that portfolios can include many types
of assets/…nancial instruments: stocks, bonds, options, etc. Therefore, we will let speculators to
assemble their portfolios using not only stocks. Moreover, we will assume that all speculators know
the correct variances and expected returns of assets.
Lets assume now that S2 is risk free asset with returns RS2 , which for convenience we will denote
by Rf . For example it is US Treasury bond or combination of perfectly negatively correlated assets.
In turn, S1 is itself the portfolio of all risky assets with returns RS1 which have expected value and
variance E [RS1 ] and
2 .
S1
The portfolio consisting of S1 and S2 has expected return and variance
of returns
E [R] = ! 1 E [RS1 ] + ! 2 Rf
= Rf + ! 1 (E [RS1 ]
Rf ) :
and
2
P
= ! 21
2
S1 :
This implies that the expected value of this portfolio is linear function of its standard deviation:
E [R] = ! 1 E [RS1 ] + ! 2 Rf
= Rf
(E [RS1 ]
Rf )
P
:
S1
The following …gure o¤ers the minimum variance set in this case together with choices of ! 1 . The
minimum variance set is graphed using solid lines. In turn, possible choices are in dashed lines.
100
Portfolios along all these dashed lines are possible. However, only one dominates in terms of
expected return and variance. It is the portfolio at point M . With the presence of risk free asset
the speculators form their portfolios taking the tangency point of their indi¤erence curves and the
upward sloping solid line called capital market line, CM L. In this respect their portfolios are
combinations of Rf and portfolio M .
How do the speculators construct CM L? To do so they need to know Rf and portfolio M .
Rf is the return on risk free asset which might be safely thought to be US Treasury bonds. What
about M ? M is the portfolio of all risky assets (yes, it is S1 in this example, if you were wondering).
Therefore, it is at the tangency point of CM L and the convex minimum variance sets.
The market is in equilibrium when prices are such that markets clear. All assets then are held.
In other words prices adjust so that excess demand and supply of assets are zero. This market
clearing condition implies that equilibrium is attained at a single-tangency portfolio, M , which all
investors combine with risk free asset and is a portfolio where all assets are held according to their
market value weights. (At this point the weight of Rf is zero. Moreover, this portfolio is called the
market portfolio.) Let the market value of an asset i be Vi and there be N assets then the market
value weight of the asset i is
wi =
Vi
N
X
;
Vi
i=1
where
N
X
Vi is the total market value of all assets. Market equilibrium is not attained until the
i=1
tangency portfolio M is the market portfolio. Moreover, the value of the risk free rate of return
should be such that the aggregate borrowing and lending are equal.
The upward sloping solid line is called Capital Market Line. In turn, the result that in equilibrium investors hold a weighted average of risk free asset and market portfolio M is called Two-fund
Separation.
101
Market price of risk and the CAPM
The Capital Asset Pricing Model, CAPM in short, is a model which allows us to determine the
market price of risk and the appropriate measure of risk for a single asset. It rests on several very
hard assumptions (which actually we have maintained thus far). These assumptions are that
1. There are no market distortions and frictions (e.g., there are no price setters and information
is costlesly available to all)
2. Speculators/investors maximize their expected utility and are risk averse.
3. There exists an unlimited supply of risk free asset at risk free rate
4. The quantities of assets are …xed. Moreover, all assets are tradable and perfectly divisible
5. All investors have the same information about the markets
The CAPM also needs that the market portfolio is e¢ cient. This follows from utility maximization and common knowledge: the market is the sum of all individual holdings and all individual
holdings are e¢ cient.
In equilibrium, for any asset i it has to be that its weight in the market portfolio M is equal to
wi =
Vi
N
X
:
Vi
i=1
Consider a portfolio which consists of ! i percentage of asset i and 1
! i percentage of the market
portfolio M . The expected return on such a portfolio is
E [R] = ! i E [Ri ] + (1
! i ) E [Rm ] ;
where E [Ri ] and E [Rm ] are the expected returns on asset i and market portfolio M . The standard
deviation of this portfolio is
P
where
i;m
h
= ! 2i
2
i
+ (1
! i )2
2
m
+ 2! i (1
!i)
i1
2
i;m
;
is the covariance between asset i and market portfolio m.
Clearly, the opportunity set in terms of expected value and variance for various combinations
of asset i and portfolio M is given by the convex Minimum Variance Opportunity set. In turn, the
changes of the expected return and risk (here standard deviation) of this portfolio with ! i are given
102
by
@
E [R] = E [Ri ]
@! i
@
! i 2i
P =
@! i
h
! 2i
E [Rm ] ;
(1
2
i
2
m
!i)
+ (1
! i )2
+ (1
2
m
2! i )
i;m
+ 2! i (1
!i)
i;m
i
1
2
The main insight of this model is that the market portfolio already contains the asset i. Therefore, ! i is the excess demand for it. In equilibrium it should be zero and we would evaluate
and
@
@! i
P
@
@! i E
[R]
at ! i = 0. This gives
@
E [R]
@! i
@
@! i
= E [Ri ]
E [Rm ] ;
! i =0
2
m
i;m
=
P
:
m
! i =0
Therefore, the slope of the expected return and risk trade-o¤ evaluated at the market portfolio is
@
@! i E [R]
@
@! i P
! i =0
=
E [Ri ]
( i;m
E [Rm ]
:
2 )=
m
m
This expression shows how an individual asset a¤ects the return and risk of market portfolio.
The …nal insight is that this slope is equal to the slope of CM L, which is
E [Rm ]
Rf
:
m
Equating these two we have that
E [Ri ]
( i;m
E [Rm ] Rf
E [Rm ]
=
;
2 )=
m
m
m
or equivalently,
E [Ri ] = Rf + (E [Rm ]
Rf )
i;m
:
2
m
This equation is known as the Capital Asset Pricing Model. It states that the required rate of return
on any asset i is equal to the risk free rate of return plus a risk premium. The risk premium is the
price of the risk (E [Rm ]
Rf ) times the quantity of risk
=
i;m
2
m
. The quantity of risk is denoted by
i;m
:
2
m
This is the contribution of asset i to the portfolio risk. Most importantly, the variance of the asset
i a measure of risk does not matter in this context. Whatever matters is how its returns correlate
103
with market portfolio returns.
The risk free asset has
= 0 since it does not contribute to the risk of market portfolio (i.e.,
its covariance with market portfolio is zero). In turn, the market portfolio has
= 1 since it is
perfectly correlated with itself.
The following …gure illustrates the CAPM where it is depicted by the blue line.
This model has several important properties. First, in equilibrium because of arbitrage every
asset should be priced so that its risk adjusted return is exactly on the blue line. Next the total
risk can be separated to two
Total risk = systematic risk + unsystematic risk,
where systematic risk is how an asset covaries with the market portfolio and unsystematic risk is
the risk not dependent on the market/economy. The speculators/investors are willing to pay a risk
premium to avoid the systematic risk. Unsystematic risk cannot be avoided.
How to …nd
of an asset and the magnitude of unsystematic risk. The CAPM implies that one
can estimate those from the following empirical speci…cation to …nd
Ri = a +
i Rm
i
+ ";
where a is a constant and " is a random variable which does not correlate with the market. The
variance of Ri then is
2
i
where
2 2
i m
is the systematic risk and
2
"
=
2 2
i m
+
2
";
is the unsystematic risk.
A second important property of the CAPM is that the measures of risk of individual assets are
linearly additive when the assets are combined in portfolios. For instance, if an asset i has a risk
104
of
i
and asset j has a risk of
j
and they are combined with ! i proportion, then total risk of the
portfolio is
p
= !i
+ (1
i
!i)
j:
To show this use the properties of covariance:
p
=
E [(! i Ri + (1
E [Ri ]) (Rm
V [Rm ]
= ! i i + (1 ! i ) j :
= !i
E [(Ri
! i ) Rj
E [! i Ri + (1 ! i ) Rj ]) (Rm
V [Rm ]
E [(Rj
E [Rm ])]
+ (1 ! i )
E [Rm ])]
E [Rj ]) (Rm
V [Rm ]
E [Rm ])]
Black-Scholes model
The Black-Scholes model, is a model which allows us to determine the market price of European
options. This mode maintains the following assumptions
1. There are no market distortions and frictions
2. Speculators/investors maximize their expected utility
3. There exists an unlimited supply of risk free asset at risk free rate rf
4. All assets are tradable and perfectly divisible
5. Stocks pay no dividends during the life-time of the options
European options are widely used …nancial instruments.
De…nition 1 Option is a contract which gives its owner an option (the right but not obligation)
to buy or sell an underlying asset at a speci…ed price (strike price) before a speci…ed date (time to
maturity).
1. Options giving the right to sell are called Call Options
2. Options giving the right to buy are called Put Options
3. Options which can be exercised only at a speci…ed date (but not before it) are called European
Options.
To derive Black-Scholes model lets consider a stock which has random returns and a European
Call option written on it. Denote, the price of the stock at time t by St . Assume that the price of
the stock follows a geometric Brownian motion:
dSt =
t St dt
+ St dWt ;
105
where d stands for the operator of in…nitesimally small change over time.
and
is a positive constant. Both
t
and
t
is called drift coe¢ cient
will acquire meaning when Wt is presented. This
(di¤erential) equation equation states that the change of the value of the stock happens because of
deterministic shifts in the mean and because of a random process Wt .
Wt and, therefore, its in…nitesimally small change dWt , is a random variable. More precisely,
Wt is a Wiener process: It is a random variable which changes over time so that the expected
change over any time interval is 0 (e.g., E [dWt ]) and its variance over time T is equal to T (e.g.,
V [dWt ] = dt). A discrete analogue for W is a simple random walk.
The above equation can be rewritten as
dSt
=
St
where
dSt
St
t dt
+ dWt ;
is the rate of return on the stock during a very short time of dt. The properties of Wt
random variable imply that
dSt
St
dSt
V ar
St
E
=
t dt;
=
2
dt:
European option written on this stock speci…es the time when it can be exercised. Denote it by
T . It speci…es also the strike price. Denote it by K. As we will see, at time t < T the price of this
option V is a (complicated) function of S, , , t, K, T , and rf :
V = V (S; ; ; t; K; T; rf ) :
From It¯o’s lemma it follows that
dV =
S
@V
1
@V
+
+
@S
@t
2
2 2@
S
2V
@S 2
dt + S
@V
dW:
@S
Combining this di¤erential equation with the di¤erential equation for stock price gives
dV =
@V
1
dt +
@t
2
2 2@
S
2V
@S 2
dt +
@V
dS:
@S
To derive the price of this option Black-Scholes model uses a non-arbitrage argument. This
involves constructing a risk free portfolio and equating its returns to the risk free returns.
Consider a portfolio of one option on this stock and a short position of
value of the portfolio is then
Pt = Vt
Choose the
St :
so that to hedge this portfolio (reduce its variance).
106
in this stock. The
The change in the price of the portfolio is given by
dPt = dVt
dSt
@V
@V
1 2 2 @2V
=
dt +
dt +
S
dS
2
@t
2
@S
@S
dS:
The …rst two terms are deterministic. Therefore, they don’t matter for hedging strategy and to
hedge the portfolio against (volatility) randomness select
@V
@S
=
(this is called delta/dynamic
hedging). This implies that
1
@V
+
@t
2
dPt =
2 2@
S
2V
@S 2
dt:
Since this portfolio entails no risk its returns should be equal to the returns on risk-free asset
rf . In other words,
dPt
= rf dt:23
Pt
Therefore,
dPt = Pt rf dt
and
1
@V
+
@t
2
2 2@
S
2V
@S 2
dt = (Vt
St ) rf dt:
The latter implies that
rf V =
@V
1
+
@t
2
2 2@
S
2V
@S 2
+ rf S
@V
:
@S
This is a second order partial di¤erential equation.
In order to solve it one needs to have boundary conditions. First, let this be a call option. At
the time to maturity T , the price of call option is easy to derive. It is
VT = max fST
K; 0g :
This is because time T is the exercise date. The owner of this option would exercise it only if ST is
higher than the strike price and would make a net gain of ST
K. Second,
V (0; ; ; t; K; T; rf ) = 0:
Third,
lim V (S; ; ; t; K; T; rf ) = S:
S!+1
Solving this di¤erential equation is much beyond the frames and focus of this course.
23
This is because the return on portfolio is given by
Pt+1 Pt
.
Pt
1 dPt
.
Pt dt
107
If the time is discrete then this becomes
1 Pt+1 Pt
Pt t+1 t
=
Appendix
Appendix - Reminder of Statistics 0
Probability Theory
There are at least two ways to approach probability:
Classical (or a priori) Probability
– Classical Probability can be de…ned in the following way: If a random experiment can
result in n mutually exclusive and equally likely outcomes and if nA of these outcomes
have an attribute A, then the probability of A is the fraction nA =n.
Axiomatic Probability (or so-called Kolmogorov’s Axiomatics):
– Probability Space is a triple
1.
; A; Pr (:) , where
is the sample space: A collection of all possible outcomes of an experiment
Any given experiment result is an element of
.
2. A is the event space, or algebra of events.
It is a collection of subsets of
, including
.
3. Pr (:) is the probability function de…ned on A: For any event A 2 A, Pr (:) is a
quantitative (numerical) measure of the likelihood that this event A is observed once
the experiment is completed.
Basic de…nitions and theorems
The following de…nitions are needed to construct probability space:
De…nition 2 [Sample space] The sample space, denoted by
possible outcomes of a conceptual experiment. Often,
, is the collection or the totality of all
is called the sure event since it includes any
outcome that can occur.
De…nition 3 [Event and event space] An event is a subset of the sample space. The class of all
events associated with a given experiment is de…ned to be the event space.
Any event space A possesses the following properties:
1.
2A
2. If A 2 A, then the complement set A 2 A
3. If A1 ; A2 ; ::: 2 A, then [i
1 Ai
2A
108
De…nition 4 Spaces which satisfy properties 1, 2, and 3 are called
algebra.
De…nition 5 [Probability function] A probability function Pr (:) is a set function with domain A
and counterdomain of [0; 1] interval. It which satis…es the following axioms:
Pr (:)
0 for 8A 2 A
Pr ( ) = 1
If A1 ; A2 ; ::: 2 A are pairwise disjoint, then Pr ([i
De…nition 6 The triple
1 Ai )
=
X
Pr (Ai )
i 1
; A; Pr (:) is called probability space.
The following de…nitions are for conditional probability and independence of events
De…nition 7 Let A; B 2 A and Pr (B) > 0. The conditional probability of event B given the
occurrence of A is denoted and given by Pr ( Bj A) =
Pr(B\A)
Pr(A) .
Theorem 8 (The law of total probability) Let A1 ; A2 ; :::; An 2 A form a partition of
Then
Pr (B) =
n
X
. Let B 2 A.
Pr ( Bj Ai ) Pr (Ai ) :
i=1
Theorem 9 (Bayes’formula) Let A1 ; A2 ; :::; An 2 A form a partition of , for any i let Pr (Ai ) > 0.
Then for any event B 2 A with Pr (B) > 0
Pr ( Aj j B) =
Pr ( Bj Aj ) Pr (Aj )
Pr (B \ Aj )
= n
:
X
Pr (B)
Pr ( Bj Ai ) Pr (Ai )
i=1
De…nition 10 Events A1 and A2 are independent i¤
Pr (A1 \ A2 ) = Pr (A1 ) Pr (A2 ) :
Similarly, events A1 ; A2 ; :::; An 2 A are independent i¤ the probability of any intersection of any
sub-sample of fA1 ; A2 ; :::; An g is the multiplication of probabilities of the terms in the intersection.
Random variables
De…nition 11 (Intuitive de…nition) A random variable x is a real-valued function of the elements
$ of a sample space
.
For example, a random variable x is the sum of the two numbers that occur when we roll a
pair of fair dice one. The events are the numbers. The space of possible events is comprised of
109
36 elements. The value of a random variable depends on the outcome of the experiment being
observed. Each possible value X of a random variable x de…nes an event: function x ($) assigns
values X 2 R to the set of sample space outcomes $.
De…nition 12 (Technical de…nition) For a given probability space
; A; Pr (:) , a function x ($) :
! R is said to be a random variable if for 8X 2 R event that occurs the event $ on which
x ($) < X, AX = f$ : x ($) < Xg, belongs to the events space A, AX 2 A.
De…nition 13 The cumulative distribution function of a random variable x de…ned on the probability space
; A; Pr (:) is
Fx (X) = Pr (AX ) = Pr (x ($) < X) :
Notice that the cumulative distribution function would be a non-decreasing function Fx (X) :
R ! [0; 1] since higher X implies bigger space of $ events where x ($) < X. More precisely,
cumulative distribution functions satisfy the following properties.
limX!
1 Fx (X)
=0
limX!+1 Fx (X) = 1
For any X1 and X2 where X1 < X2 , Fx (X1 )
Fx (X2 )
limh!0 Fx (X + h) = Fx (X) for any X : R
De…nition 14 Discrete density function or probability mass function of a discrete random variable
x is
fx (X) =
(
p xi
x = xi ; i = 1; 2:::
0
otherwise
:
De…nition 15 Continuous density function or probability mass function of a continuous random
variable x is
fx (X) =
dFx (X)
:
dX
Density functions satisfy the following properties.
fx (X)
RX
0
1 fx (z) dz
R +1
1
= Fx (X)
fx (z) dz = 1
110
Basic properties of expectation, variance, and covariance
1. The basic properties of expectation operator are
N
N
For any random variables fXi gN
i=1 , real functions fhi gi=1 , and real numbers f i gi=1
E
"N
X
#
i hi (Xi ) =
i=1
N
X
[hi (Xi )] :
i=1
If for any hi and hj it is the case that hi (X)
E [hi (X)]
iE
hj (X) then
E [hj (X)] :
2. The basic properties of variance and covariance operators are
N
N
For any random variables fXi gN
i=1 , real functions fhi gi=1 , and real numbers f i gi=1
"N
X
V
#
i hi (Xi )
i=1
"
= COV
=
N
X
N
X
i hi (Xi ) ;
i=1
2
iV
N
X
#
i hi (Xi )
i=1
[hi (Xi )] +
i=1
X
i j COV
[hi (Xi ) ; hj (Xj )] :
i6=j
where for any hi and hj
COV [hi (Xi ) ; hj (Xj )] = E [(E [hi (Xi )]
and the second term consists of N (N
hi (Xi );hj (Xj )
hi (Xi )) (E [hj (Xj )]
hj (Xj ))] :
1) items. Correlation is de…ned as
COV [hi (Xi ) ; hj (Xj )]
=p
:
V [hi (Xi )] V [hj (Xj )]
Correlation shows linear relation. It is from
1 to 1,
hi (Xi );hj (Xj )
2 [ 1; 1].
Given its de…nition covariance operator satis…es the following properties
COV [Xi ; Xi ] = V [Xi ] ;
COV
COV
i
+
i Xi
+
i
k
+
+
i Xi ;
j
+
j Xj
=
i j COV
[Xi ; Xj ] ;
k Xk ;
j
+
j Xj
=
i j COV
[Xi ; Xj ] +
Therefore, if
Xi =
+ Xj ;
then
COV [Xi ; Xj ] = COV [ + Xj ; Xj ] = V [Xj ] :
111
k j COV
[Xi ; Xj ] :
Mean-variance trade-o¤
dE
d
It can be shown that indi¤erence curve E ( ) is increasing and convex. In other words,
d2 E
d 2
> 0 and
> 0. This follows from that utility function u (:) is increasing and concave.
To show that
+1
Z
u0 (E + Z) Z' (Z) dZ
dE
=
d
1
+1
Z
>0
u0 (E + Z) ' (Z) dZ
1
consider the numerator
+1
+1
Z
Z0
Z
0
0
u (E + Z) Z' (Z) dZ =
u (E + Z) Z' (Z) dZ +
u0 (E + Z) Z' (Z) dZ:
1
Denote by Z~ =
1
0
Z, and rewrite
+1
Z
u0 (E + Z) Z' (Z) dZ =
1
=
Z0
0
u
E
Z~
1
+1
Z
u0 E
0
=
+1
Z
Z~ '
Z~ d
Z~ +
+1
Z
u0 (E + Z) Z' (Z) dZ
0
+1
Z
~
~
~
~
Z Z' Z dZ +
u0 (E + Z) Z' (Z) dZ
0
u0 (E + Z)
u0 (E
Z) Z' (Z) dZ:
0
Clearly, since u00 < 0 it has to be that
+1
Z
u0 (E + Z) Z' (Z) dZ < 0;
1
which implies that
dE
d
> 0.
In turn, to show that
and their average
d2 E
d 2
E1 +E2
;
2
> 0, consider two points on the indi¤erence curve (E1 ;
1+ 2
2
. Notice that by construction u (E1 +
The indi¤erence curve would be convex if for any two points (E1 ;
1
u (E1 +
2
1 Z)
1
+ u (E2 +
2
2 Z)
<u
112
E1 + E2
+
2
1 Z)
1)
1
+
2
and (E2 ;
= u (E2 +
and (E2 ;
2
1)
Z :
2)
2 Z).
and Z
2 ),
This clearly holds since u is a concave function, which implies that
E u
E1 + E2
+
2
1
+
2
2
Z
> E [u (E1 +
113
1 Z)]
= E [u (E2 +
2 Z)] :
References
Cagan, P. (1956). The monetary dynamics of hyperin‡ation. In M. Friedman (Ed.), Studies in the
Quantity Theory of Money. Chicago: University of Chicago Press.
Calvo, G. A. (1983). Staggered prices in a utility-maximizing framework. Journal of Monetary
Economics 12 (3), 383–398.
Carlin, W. and D. Soskice (2005). The 3-equation New Keynesian Model— a graphical exposition.
Contributions in Macroeconomics 5 (1).
Doepke, M., A. Lehnert, and A. Sellgren (1999). Macroeconomics. Available online (last accessed
29.03.2015). http://faculty.wcas.northwestern.edu/~mdo738/book.htm.
Domar, E. D. (1946). Capital expansion, rate of growth, and employment. Econometrica 14 (2),
137–147.
Goodwin, R. (1967). A growth cycle. In C. H. Feinstein (Ed.), Socialism, Capitalism & Economic
Growth, pp. 54–58. London: Macmillan.
Hansen, G. D. (1985).
Indivisible labor and the business cycle.
Journal of Monetary Eco-
nomics 16 (3), 309–327.
Harrod, R. F. (1939). An essay in dynamic theory. Economic Journal 49 (193), 14–33.
Kaldor, N. (1934). A classi…catory note on the determinateness of equilibrium. Rreview of Economic
Studies 1 (2), 122–136.
King, R. G. and S. T. Rebelo (1999). Resuscitating real business cycles. Volume 1, Part B of
Handbook of Macroeconomics, pp. 927–1007. Elsevier.
Kydland, F. E. and E. C. Prescott (1982). Time to build and aggregate ‡uctuations. Econometrica 50 (6), 1345–1370.
Nagel, R. (1995). Unraveling in guessing games: An experimental study. American Economic
Review 85 (5), 1313–1326.
Solow, R. M. (1956). A contribution to the theory of economic growth. Quarterly Journal of
Economics 70 (1), 65–94.
Taylor, J. B. (1980). Aggregate dynamics and staggered contracts. Journal of Political Economy 88 (1), 1.
114