Macroeconomics III Lecture notes This version: March 29, 2015 Contents Expectations 3 Introduction to the concept of expectations . . . . . . . . . . . . . . . . . . . . . . . . . . 3 Types of expectations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10 Traditional models with expectations 14 The Cobweb Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14 The Cagan Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20 Appendix . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28 The Lucas Imperfect Information Model with AD . . . . . . . . . . . . . . . . . . . . . . . 28 The Sticky Wage Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33 The E¤ectiveness of monetary policy (with …xed rules) 37 A model with stabilizing monetary policy . . . . . . . . . . . . . . . . . . . . . . . . . . . 37 A model with constant growth of money . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44 Discretionary monetary policy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46 Political cycles and discretionary monetary policy . . . . . . . . . . . . . . . . . . . . . . . 48 Monetary policy under commitment and discretion . . . . . . . . . . . . . . . . . . . . . . 50 Monetary policy under commitment . . . . . . . . . . . . . . . . . . . . . . . . . . . 51 Monetary policy under discretion (without commitment) . . . . . . . . . . . . . . . . 53 Business cycles 55 Business cycles - The Carlin and Soskice (2005) model . . . . . . . . . . . . . . . . . . . . 58 Endogenous business cycles - The Goodwin (1967) model . . . . . . . . . . . . . . . . . . 63 A Real Business Cycles Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67 The Basic RBC model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 70 Price Rigidities - The Calvo (1983) model . . . . . . . . . . . . . . . . . . . . . . . . . . . 77 These notes may contain typos/mistakes and are subject to changes/updates during our course. Please keep track if there are any. 1 Expectations and …nancial markets 84 Bonds . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 84 Stocks, stock prices, and stock markets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89 Measures of returns and risk . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 93 Measuring portfolio return and risk to assemble a portfolio . . . . . . . . . . . . . . 94 Market price of risk and the CAPM . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 101 Black-Scholes model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 104 Appendix 107 Appendix - Reminder of Statistics 0 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 107 Mean-variance trade-o¤ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 111 2 Expectations Introduction to the concept of expectations Why did you start reading these notes? Would you start and/or continue reading if you expect these notes to be useless and/or your test(s) to be very easy? I suppose the answer is no at least for some students.2 This is an example how expectations matter for strategies, actions, and later for performance at individual level. Consider another example to see how expectations matter for economic performance at individual, as well as aggregate, level. Suppose we have an economy of 1000 …rms and consumers. In this economy, …rms hire consumers’labor to produce and consumers use wages they receive to buy …rms’ products. Suppose that one of the …rms expects low demand for its products prior to deciding how much to produce. In order to produce according to its expectation, it would hire few labor and pay low wage bill. Further, imagine that instead of one …rm, all …rms expect low demand for their products. In such a case, all …rms will hire few labor and pay low wage bills. Therefore, consumers will have low income and consume few products, which will reinforce …rms’expectations. You continue reading this chapter, and I will bring more such examples. The main focus of our examples, and classes in general, will be how expectations can matter macroeconomic performance and aggregate economic ‡uctuations due to supply or demand shocks. Prior to proceeding to these examples, lets digress (which we will do often) and discuss formally what expectations are. Expectations - intuitive and formal discussion We know for sure that if we throw an apple (perhaps, not an iPhone) it will eventually hit the ground because of gravitation. Since we know for sure, we expect the event "thrown apple hits the ground" to happen. Can you claim with the same certainty that tomorrow it won’t rain around our university? In these examples we have two di¤erent (random) events. One of the events is "thrown apple hits the ground." Whereas, the other event is "no rain tomorrow around our university." These events happen with some probabilities. The …rst event happens with probability 1, i.e., it happens for sure/with certainty. (Since it happens with certainty we don’t call it a random event.) The second event, however, happens with probability less than 1. For example, let it happen with probability 0:5. Then you would say that with probability 0:5 you expect to have no rain around our university tomorrow. Lets now consider a bit more sophisticated examples. Suppose we have a lottery which pays 100 EUR with probability 0:01 and 0 EUR otherwise, i.e., with probability 0:99. What is the expected pay-o¤ of this lottery? It is 1 EUR: 0:01 2 100 + 0:99 0 = 1: These notes are not useless and I promise your tests will not be easy. 3 Therefore, if this lottery costs 5 EUR, one might think at least twice prior to buying it. We have two possible realizations of random variable "pay-o¤" in this example: "100 EUR" and "0 EUR." These realizations have associated probabilities 0:01 and 0:99. When we are calculating expected value of the random variable "pay-o¤" we weight each of these possible realizations with the corresponding probability. Intuitively, we do so in order to give larger weight to realizations which are more likely to happen (here: 0 EUR is more likely to happen and gets 0:99 weight which is larger than 0:01 weight of 100 EUR). Suppose now we have a random variable x which takes (independently distributed) values from a …nite and ordered set of real numbers fxj gN j=1 with associated probabilities pxj real numbers pxj N j=1 fxj gN j=1 N . j=1 The set of are the possible realizations of x. In turn, the space of associated probabilities is the probability distribution of x. To relate to previous example, let x be the pay-o¤ of a lottery which gives xj amount of Euros with probability pxj where j = 1; :::; N . For instance, if x1 is the event when pay-o¤ is 0 EUR and x2 is the event when pay-o¤ is 10 EUR. Then px1 and px2 are the probabilities of those events. What is then the expected value of x? Use E [x] to denote it. E [x] is given by E [x] = px1 x1 + px2 x2 + ::: + pxN xN = N X pxj xj ; j=1 which clearly generalizes our previous example in a straightforward manner. If there were (countably) in…nite possible realizations of x so that we had fxj g+1 j=1 and pxj write E [x] = +1 X +1 , j=1 then we would simply p xj x j : j=1 In any case E [x] is just a real number, of course assuming that E [x] < +1. Lets continue this generalization process. Suppose now that random variable x takes values from a continuous set of real numbers [A; B], where A < B (e.g., A = 0 and B = 100). Denote the probability that the realization of x happens to be less than X by F (X) = P (x < X) ; where F is the probability distribution function of x. F function maps the set of possible realizations [A; B] to [0; 1]. To derive the expected value of x we need to know the probability of each of the possible realizations of x. For a second suppose that x, as in previous example, took (independently distributed) values from a discrete and ordered space fxj g+1 j=1 . In such a case the probability of observing exactly realization x2 is the di¤erence between probability that the realization of x is less than x3 and the probability that x is less than x2 . In other words, p x2 = p x1 + p x2 4 p x1 : According to our new notation this can be written then as Denote this di¤erence by px2 = F (x3 ) F (x2 ) : F = F (x3 ) F (x2 ) : F; Lets go back to the case when x takes values from a continuous set [A; B]. Suppose, as in previous example, x1 ; x2 ; and x3 are consecutive possible realizations of x. Since the set of possible realizations of x is continuous the distance between these realizations is in…nitesimally small. In such a circumstance we consider an in…nitesimally small change in F for obtaining the probability that x takes a value of x2 . This in…nitesimally small change we denote by dF instead of F . The expected value of x in this case is E [x] = ZB x ~dF (~ x) :3 A In this expression, x ~ are possible realizations of x and dF (~ x) are their associated probability. We use integral instead of sum since we are summing/integrating over a continuum of in…nitesimally small points. In case when F is a di¤erentiable function on [A; B] (i.e., dF (~ x) d~ x exists on that interval) we can rewrite E [x] as E [x] = ZB x ~f (~ x) d~ x; A where f (~ x) = dF (~ x) d~ x is the probability density function of x. In economics and in many other disciplines we call the possible realizations of a random variable "possible states" and the space of possible realizations "space of possible states." You can easily notice that the expected value of x depends on both the space of possible states and the probability distribution of x. For example, suppose for simplicity that F (~ x) = x ~ B B A (i.e., we have a uniform distribution) then E [x] = 1 B A ZB x ~d~ x A = 1 (B + A) : 2 Therefore, changing B and/or A, which corresponds to changing the space of possible states of x, 3 Again, we need to have E [x] < +1 so that E [x] is something well de…ned, i.e., it is just a real number. 5 changes its expected value. Consider a change in F now. Suppose, F (x) = 1 A B Zx 1 x ~ d~ x A where 1 z 1 p e 2( 2 Zz (Z) dZ: (z) = (z) = 2 ) ; 1 (i.e., we have a truncated normal distributed random variable with non-truncated mean variance . and and are normal distribution and density functions, correspondingly) In such a case, it can be shown that E [x] = + A B B A : Cooking up examples for random variables with discrete distributions is much easier. Suppose, we have (1) : fxj gN j=1 = f1; 2g and pxj N j=1 f0:5; 0:5g, and (3) : fxj gN j=1 = f1; 2g and = f0:5; 0:5g, (2) : fxj gN j=1 = f2; 2g and pxj pxj N j=1 N j=1 = = f0:8; 0:2g. Clearly, the di¤erence between (1) and (2) is in the space of possible states. In turn, the di¤erence between (1) and (3) is in the associated probabilities/distribution of states. Expected values in each of these cases are 1:5, 2, and 1:2, correspondingly. For more discussion see Appendix - Statistics 0. Economic agents act according to their expectations. Often in real life, as well as in economics, we might not exactly know the entire space of possible states of a random variable neither we might know exactly its probability distribution function. In our examples we will see that economic performance depends on economic agents’ beliefs of what the possible states are and what the distribution function is. Further examples and Keynesian-beauty contest In order to further assert that expectations and the way they are formed matter in economics consider a game called "p-beauty contest" and …rst run in Nagel (1995). The rules of the game are as follows: Each of N -players is asked to choose a number from the interval [0; 100] The winner is the player whose choice is closest to p times the mean of the choices of all players, where p < 1 (e.g., p = 0:5). In this game the random variable for a player is the mean of the choices of all players. Meanwhile, the probability distribution of this variable depends on the types and beliefs/knowledge of all players. 6 For example, it turns out that if all players are rational in the sense that everyone performs iterated elimination of (weakly) dominated strategies and all players know about that (i.e., it is a common knowledge that everyone is rational) then it is straightforward to guess what would be the mean of choices of all players. Under these assumptions everyone simply chooses 0. To see this consider a player. This player will never choose a number above 100p since it is dominated by 100p. Moreover, given that the player believes that others are rational too, s/he will not pick a number above 100p2 since s/he will know that no one will pick above 100p. Similarly believing that everyone is rational, s/he will not pick a number above 100p3 and so on, until all numbers but zero are eliminated. If p > 1 then 100 can also be an equilibrium (and 0 is not a "stable" equilibrium). For p = 1 any number chosen by all players can be an equilibrium. This game mimics the problem a seller in the stock market, for example. The seller wants to sell his shares when the price of the share is at its peak, just before at least someone wants to sell. In order to do that (i.e., design its actions) the seller needs to know the types and beliefs/knowledge of other sellers. This example motivated John Maynard Keynes to propose the original setup of this game in Chapter 12 of his work: The general theory of employment, interest and money (1936). In that work we proposed an explanation behind ‡uctuations in prices in equity markets in terms of changes in beliefs of sellers and buyers. Instead of picking numbers, Keynes used an analogy based on a newspaper contest, in which players are asked to choose from a set of photographs of women that are the "most beautiful." Those who picked the most popular face are then the winners. (This is the reason why the name of the game is beauty contest). Imagine that there are three players. A naive strategy in this game would be to pick the most beautiful face one perceives. If everyone does so, one could deviate with a more sophisticated strategy and pick up a face which s/he thinks is most likely to be chosen by the other two. If everyone does so, one could deviate with a more sophisticated strategy and pick up a face which s/he thinks is most likely to be expected to be chosen by the other two, etc. Keynes wrote: "It is not a case of choosing those [faces] that, to the best of one’s judgment, are really the prettiest, nor even those that average opinion genuinely thinks the prettiest. We have reached the third degree where we devote our intelligences to anticipating what average opinion expects the average opinion to be. And there are some, I believe, who practice the fourth, …fth and higher degrees." It turns out, however, that in reality in p-beauty contest games assumptions of rationality are often violated and therefore, equilibrium is not at 0. To see this check the following webpage: http://www.marietta.edu/~delemeeg/expernom/nagel.htm. The following comment is taken from that website and summarizes the thought processes a high school class student participating in a newspaper contest (submitted to one of the newspaper studies, the Spektrum der Wissenschaft). The game is a p-beauty contest with p = 2=3. "I would like to submit the proposal of a class grade 8e of the Felix-Klein-Gymnasium Goettingen for your game: 0:0228623. How did this value come up? Johanna . . . asked in the math-class whether we should participate in this contest. The idea was accepted with great enthusiasm and 7 lots of suggestions were made immediately. About half of the class wanted to submit their favorite numbers. To send one number for all, maybe one could take the average of all these numbers. A …rst concern came from Ulfert, who stated that numbers greater than 66 2=3 had no chance to win. Sonja suggested to take 2=3 of the average. At that point it got too complicated to some students and the …nding of the decision was postponed. In the next class Helena proposed to multiply 33 1=3 with 2=3 and again with 2=3. However, Ulfert disagreed, because starting like that one could multiply it again with 2=3. Others agreed with him that this process then could be continued. They tried and realized that the numbers became smaller and smaller. A lot of students gave up at that point, thinking that this way a solution could not be found. Other believed to have found the path of the solution: one just has to submit a very small number. However, one could not agree how many of the people who participated realized this process. Johanna supposed that the people who read this newspaper are quite sophisticated. At the end of the class 7 to 8 students heatedly continued to discuss this problem. The next day the math teacher received the following message: We think it best to submit number 0:0228623." Consider another game called "ultimatum game." There are two players in this game. Player 1 is entitled to a 10 EUR and players decide how to divide it. The rules of the game are as follows Player 1 proposes a division of the sum Player 2 can either accept or reject this proposal – If the player 2 rejects, neither player receives anything – If the player 2 accepts, 10 EUR is split according to the proposal The extensive form representation of the game is: In this …gure it assumed that player 1 either gives 2 EUR to player 2 or 5 EUR. Player 2 then decides to accept or reject. If player 2 rejects in any of these cases both get 0 EUR. In turn, if player 2 accepts the o¤er money is split according to the proposal. The strategy of player 1 is the proposal coupled with the expected strategy/response of player 2, which depends on the type of player 2. Imagine that player 1 and 2 care only about money in this game (i.e., players are expected pay-o¤ maximizers). If player 1 knows about the type of player 8 2 then s/he knows that whatever positive amount s/he proposes player 2 will accept. Therefore, s/he can propose something very close to 0 (in fact, perhaps, exactly 0) and get as much of the pay-o¤ as possible (something around 10 EUR). If, however, player 1 expects that player 2 has a strict preference over equality of the split of the award (i.e., player 2 will reject a proposal if it splits reward very unequally) then s/he might propose something higher than 0 EUR and close to 5 EUR. This is because otherwise s/he gets nothing. 9 Types of expectations In previous (sub-)section we de…ned and discussed expectation operators for discrete and continuous random variables. We saw that we can compute the expected value of a random variable in cases when we know exactly what is the space of its possible states and the probability of those states (distribution function). We saw as well that if we have three random variables which have di¤erent spaces of possible states and/or probabilities of those states then their expected values can be di¤erent (i.e., expected value depends on the set of possible states and on distribution function). Further, we saw examples when economic agents act according to their expectations and they might not exactly know the entire space of possible states of a random variable neither its probability distribution function. What to do if we don’t know exactly the distribution function of a random variable? To alleviate such a problem we use statistics. In other words, we observe realizations of the random variable and use them in order to infer certain moments of its distribution. Expected value is the …rst moment. In particular, suppose that we have f^ xj gN j=1 realizations of random variable x. Further, we have no priors about the probability of each of the observations. In such a case the mean of observations xeN N 1 X = x ^j N j=1 is the sample (statistical) analogue of the expected value of x. As E [x] it is just a number. Here we have 1 N in front of each realization since we need to treat each of the observations equally likely (i.e., each observation has 1 N probability to occur). A central theorem in probability theory called Law of Large Numbers provides us with a proof that if we have in…nite realizations of the random variable then the mean of the sample is the expected value of the random variable, Pr lim xeN = E [x] N !+1 = 1: Usually in economics and in particular in macroeconomics we observe realizations of random variables over time. Therefore, often instead of index j and N we use t and T , where t indexes time and T refer to its most recent value. Then we write, for example, xeT = T 1X x ^t T t=1 where x ^1 is the value that random variable x has obtained at time 1, x ^2 is the value that random variable x has obtained at time 2, etc. Again, this is the best (unbiased and consistent) guess of the expected value of x when we have no priors on the likelihood of its observations. It is usually 10 the default option, therefore, and we will treat T 1X x ^t T xeT = t=1 as the …rst type of expectation. It incorporates all observations and treats them equally. In certain circumstances we might have priors over the likelihood of observations to reoccur. For example, it might be that we know that older observations are less likely to happen. To put more meat into the discussion, suppose that we observe f^ xt gTt=1 realizations of random variable x and would like to compute the mean of x, perhaps because we will apply it for our actions in time T + 1. We will denote xeT by xeT +1 for that purpose. Suppose further that if we know that for any t from 1 to T if the likelihood of x ^t to reoccur ~ ~ ~ ~ is < 1 then the likelihood of x ^t 1 to reoccur is 1 . (Notice that < 1 implies that ~ 1 ~ < ~ . Therefore x ^t 1 is less likely to happen than x ^t .) In such a case, we would write the sample mean as xeT +1 ( ) = = x ^T + (1 T X (1 )x ^T )T t 1 + ::: + (1 )T 1 x ^1 x ^t : t=1 where ~ is the likelihood that we think x ^T will happen. x ^T We use xeT +1 ( xeT +1 ( ) that xeT +1 ( ) instead of xeT 1 happens with likelihood (1 ). ( ). has a special name. It is called exponentially weighted average with time decay. Notice ) can be easily written in a recursive form of xeT +1 ( ) = x ^T + (1 ) xeT ( ) = xeT ( ) + [^ xT xeT ( )] : According to the second line xeT +1 ( ) is a sum of old expectation xeT ( ) and a weighted di¤erence between the realization of x at time T , x ^T , and its expectation xeT ( ). This last term adapts/corrects the current expectation to the error: [^ xT xeT ( )]. In this context, is called a correction para- meter. Economists call these types of expectations "adaptive expectations." These are the second type of expectations. Hereafter, we will use notation xeT +1jT to denote the expectation of random variable xT +1 conditional on information available at time T . Macroeconomists have used extensively (and abused) adaptive expectations in their models before 1970s. The assumption of adaptive expectations has been imposed in these models without much justi…cation. For example, this assumption usually generates persistent errors in these models T X 1 (i.e., limT !+1 T [^ xt xeT ( )] 6= 0) which seems to be odd. It implies that economic agents make t=1 persistent errors (i.e., have no intention to correct their errors). 11 An assumption regarding expectations which is consistent with models is "rational expectations" assumption. This assumption states that economic agents use the model to form expectations. A model is a description of an economy. Assuming that it is the right one, rational expectations assumption states that the agents know the economy entirely and they use that information to form their expectations. "The agents know the economy" means that they know the structure behind demand and supply and that there are random variables, with given distribution functions, which a¤ect supply and demand. In such a circumstance in terms of the model, agents’ expectations are not systematically wrong in that all errors are random/not persistent. Therefore, under this assumption deviations from perfect foresight in the model are only random. Denote these expectations as xeT +1jT = ET [xT +1 ] ; where ET [xT +1 ] E [ xT +1 j T] is the expected value of x at time T + 1 given all information till time time T + 1. Information is summarized by T and includes all the possible structures in the economy and values of fundamentals. These are the third type of expectations. Lets see why this can work better than adaptive expectations which miss some of the structure of the model. Suppose that agents are assumed to have adaptive expectations, and the model economy features a constantly rising in‡ation rate. In such a circumstance agents would be assumed to always underestimate in‡ation since they are assumed to predict in‡ation by looking at in‡ation in previous years. Under rational expectations assumption, since the trend in in‡ation is part of the model agents would take it into account in forming their expectations and there won’t be such a bias. Formally, this can be represented in the following manner. Consider an economy which starts at time 1 and where in‡ation at any time T is given by T where is a constant, T indexes time, and = T +T + T; is a random variable with 0 mean and 2 variance. Further, suppose we are at the end of T = 2 and the agents’in this economy need to form expectation of in‡ation for time T + 1 = 3. If agents have adaptive expectations with correction parameter 13 , then e 3 At the end of T = 2 the realizations of 1 3 2 1X = 3 t=1 2 3 2 t t: are known. Suppose, 1 = 0:01 and in‡ation then can be rewritten as e 3 1 3 = = 1 ( + 2 + 0:02) + 3 5 44 + : 9 30 12 2 3 ( + 1 + 0:01) 2 = 0:02. Expected In case of rational expectations, e 3j2 = E[ 3j 2] = E[ +3+ 3j 2] = +3+E[ 3j 2] = + 3: Notice that e 3j2 > e 3 1 3 : In this simple economy, trend in in‡ation can be thought to represent policy changes of a central bank. In other words, suppose there is a central bank in this economy which constantly increases in‡ation. One of the in‡uential drivers behind widespread adoption of rational expectations in macroeconomics has been the famous Lucas’critique. In terms of our discussion, Lucas’critique is that in models which do not feature agents with rational expectations, agents might not react to policy changes. This seems not to reconcile well with reality since in such a case agents can be tricked almost always with some policy changes. There is a famous Lincoln Quote on this issue: "You can fool some of the people all of the time, and all of the people some of the time, but you can not fool all of the people all of the time." Despite these seemingly plausible properties and widespread use, rational expectations assumption has received criticism on the grounds that it assumes that agents know everything. Criticism goes on saying that in the real world agents (consumers, …rms, etc) do not know exactly the economy.4 (If this were to happen, would Economics be a science?) 4 There is an emerging …eld in macroeconomics which deals with this issue. The models in this …eld feature agents which continuously learn the economy and know the economy fully at the end of time horizon (i.e., they are asymptotically rational). See for details "Evans, G., and Honkapohja, S. (2001). Learning and expectations in macroeconomics. Princeton University Press." 13 Traditional models with expectations This section highlights how important is the assumption on expectations in two traditional models. The models are Cobweb Model and Cagan Model. The Cobweb Model The Cobweb Model Kaldor (1934) proposes an explanation why prices might be subject to periodic ‡uctuations in certain markets. It assumes that …rms must choose output before prices are observed, demand and supply (prices) are uncertain, and that …rms’have adaptive expectations with = 1. These type of expectations are called static expectations since there is no correction to error. Firms’ expectations about prices at time T +1 are based on the prices that prevailed just in previous period, at time T . In other words, using p to denote prices, peT +1 = peT ( ) + [pT peT ( )] = pT : Hereafter, we will replace T + 1 with t. It seems that such a model can be well applicable to agricultural markets. In such markets, producers invest in production (start the production process) much before they sell their output. Periodic ‡uctuations then can happen for example because of supply shocks such as bad weather. For example, according this model if producers of corn experience very bad weather and have reduced output they would get higher prices. In the next period, they expect high prices and therefore will produce a lot. This will dampen the prices and lead to expectation of low prices. Expecting low prices, the producers of corn will produce few corn and in equilibrium price will rise, etc. In graphical terms the process described above can be represented in the following manner. The di¤erence between these two …gures is the relation between the slopes of the demand and supply curves. In the …gure to the left, the slope of the supply curve is higher than the absolute 14 value of the slope of the demand curve. In …gure to the right, the slope of the supply curve is lower than the absolute value of the slope of the demand curve. Suppose an economy represented by these …gures starts at time t = 0 at the intersection of S (supply) and D (demand) curves. In that period the economy receives a negative shock to supply so that prices in the next period are p1 . In period t = 1 the economy receives a counterbalancing positive supply shock which brings supply curve to its original position. In this period, however, expected prices for period t = 1 are still p1 . Producers produce according to p1 . The demand, however, is shorter than supply. Therefore, maximum that producers are able to charge is p2 which is lower than p1 (p2 < p1 ). For period 2 then producers expect price p2 and produce accordingly. In such a case, in period 2 it turns out that they have produced less than the demand is. They sell then at a higher price p3 (p2 < p3 ). This process continues and generates periodic ‡uctuations in prices. In case of the …gure to the left, prices tend to stabilize after the shock. This happens because inverse supply (or supply prices) reacts more than does inverse demand (or demand prices). Faced with higher demand in period t = 2, for example, …rms rise their prices. They do so much that dampens the demand in the next period. However, in case of the …gure to the right, prices do not stabilize. This happens because inverse supply reacts less than does the inverse demand.5 Lets make this model more formal. Suppose at any time t demand and supply functions are given by Dt = mI St = rI + rp pet + where mI ; mp ; rI ; and rp are positive parameters. demand and supply curves. 1;t and 2;t 1;t ; mp pt + 1 mp 2;t ; 1 rp .are and absolute values of the slopes of are shocks/disturbances. Let independently distributed (i.i.d.) and have 0 mean and 2 1;t and 2;t be identically and variance. The values of these shocks are not known at the time when price expectations are formed. At time t, supply function is designed according to the expected value of the price. To stay in line with our story, assume that the economy starts at p0 where D and S curves intersect. Moreover, from p0 to p1 and makes pe1 = p1 . Further, 2;1 = 0 and 1;t 2;0 .and 2;0 < 0 so that price shifts 0 8t > 1. 2;t Market clearing condition requires that at each and every point in time quantity demanded is equal to the quantity supplied, Dt = St : Therefore, mI mp pt + pt = mI rI mp and 5 1;t = rI + rp pet + rp e p + mp t 1;t mp 2;t 2;t : If in this economy the absolute values of the slopes of demand and supply were equal then there would be permanent price ‡uctuations with constant magnitude. 15 Denote 1 = 2 = t = mI rI mp rp mp 1;t 2;t : mp And rewrite pt as pt = Assuming that pet = pt 1 e 2 pt 1 + t: we have pt = 2 pt 1 1 + t: This is a very basic stochastic di¤erence equation. Its solution is the sum of a general solution of homogenous equation pt = 2 pt 1 and particular solution of the entire equation. The solution of pt = 2 pt 1 is very basic t 1 p1 ; 2) pht = ( where p1 is the price where the economy started. Now we need to guess a particular solution of the general equation. It turns out that for this form of equations the particular solution has the following form ppt = t 1 X ( 2) t 1 ( 1 ): + =1 Therefore, pt = ( t 2) 1 p1 + t 1 X ( 2) t 1 ( 1 + Ignore the second term in this expression and notice that if 2 ) :6 =1 ( 2) t 1 > 1 then the absolute value of is increasing over time. Therefore, pt diverges to in…nity. Parameter mp rp . the ratio of slopes of supply and demand curves value of the slope of the inverse demand curve supply). However, if 2 is the inverse of It is greater than 1 in case when rp > j mp j ; which is equivalent to say that the slope of the inverse supply curve 1 mp 2 1 rp is lower than the absolute (so that inverse demand reacts more than inverse < 1 then the absolute value of ( t 1 2) declines over time. Therefore, pt converges to a number. In this case rp < j mp j ; which means that the slope of the inverse supply curve is higher than the absolute value of the slope of the inverse demand curve (so that inverse 6 To see that this is the solution, plug this expression into the equation above. 16 demand reacts less than inverse supply). What is the number that price converges to? The answer to this question is quite simple. Convergence here means that price stabilizes over time (i.e., we have a steady-state). We have assumed that t p= = 0 for any t > 1. Therefore, stable prices means 2 p; 1 and price converges to p= 1 : 1+ 2 Notice that p is the level of price where our example D and S intersect. In other words, p = p0 . To make this analysis clearer, let’s consider a numerical examples. Suppose, p1 = 1; mI = mp = 1; rI = 0: and consider two values of rp rp1 = 0:5; rp2 = 1:5: This implies that Assuming the value of p1 and 1;t 0 and 2;t>1 2;1 = 1 = 1; 1 2 = 0:5; t = 2;0 2 2 = 1:5; 2;t : determines that values of the shock (of course given that 0). Let’s determine the paths of p for these parameter values. First, consider the case when rp1 = 0:5 p2 = 1 0:5p1 + 2 = 0:5; p3 = 1 0:5p2 + 3 = 0:75; p4 = 1 0:5p3 + 4 = 0:625; ::: p15 = 0:666687012; ::: 2 p+1 = : 3 The price converges to 23 , which is equal to 1 1+ 2 . 17 Now consider the case when rp2 = 1:5 p2 = 1 1:5p1 + 2 = 0:5; p3 = 1 1:5p2 + 3 = 1:75; p4 = 1 1:5p3 + 4 = 1:625; ::: p15 = 175:5575562; ::: p+1 = 1: The price diverges. Of course, negative price does not make sense. Therefore, one would have stopped at p2 saying that there is something fundamentally wrong in this economy. What happens if < 1 and we have adaptive expectations? If pet = pet 1 + pt pet 1 < 1 then 1 where pt 1 = e 2 pt 1 1 + t 1: Expectations can be written as pet = [1 e 2 )] pt 1 (1 + + 1 + t 1 : In this case we have a non-homogenous di¤erence equation in expectations. Similar to the previous discussion, expectations converge to a number if j1 (1 + 2 )j < 1 and diverge to in…nity otherwise. Steady expectations are given by pe = [1 p e (1 + 1 = 1+ 2 )] p e + 1; : 2 Since we have adaptive expectations, pe is steady when prices are steady p= 1 1+ :7 2 Interestingly, there can be situations when prices (and their expected values) are stable under adaptive expectations with that 1 7 (1 + 2) < 1 but unstable under static expectations. For instance, suppose < 0 then since by de…nition To verify this plug pe = 1 1+ 2 into p = 1 2p e 2 > 0 the absolute value of 1 : 18 (1 + 2) is always less than 2: j1 (1 + (1 + This implies that if 1 (1 + 2) 2 )j 2) 2 , 2) 1 < 1: < 0 then it is more likely that prices (and their expected values) < 1 than under static expectations. This happens suppresses the reaction of inverse supply curve to the observed price. For instance, suppose that j1 (1 + 1 < are stable under adaptive expectations with because = (1 + 2 2 )j = 1:5 so that we are back to our example. Further, suppose = 0:5. In such a case = 0:25 and prices (together with their expected values) are stable under adaptive expectations. We have seen, however, that for this example prices (and their expected values) are not stable under static expectations. In case, however 1 In case that 1 1+ 2 (1 + 2) > 0 then j1 (1 + 1 (1 + 2 )j = 1 2) < 2 (1 + 1 , 1+ 2 < : 2 > 1 (and therefore prices are not stable under static expectations) it is always the case < , which means that adaptive expectations are more likely to generate stable prices 2 2 (and expectations). For instance, again suppose that j1 2) (1 + 2 )j = 1:5 but now 2 = 0:1. In such a case = 0:75 and prices (together with their expected values) are stable under adaptive expectations. What would happen if we had rational expectations in this model? In case of rational expectations, expectations are implied by the model. In order to answer the question then we need to solve for expected price level from demand and supply equations. Luckily, we have done most of the job. Write Et 1 [pt ] instead of pet in 1 e 2 pt t: + pt = 2 Et 1 [pt ] 1 + t: Take the expected value of pt conditional on information at time t Et 1 [pt ] = Et = 1 [ 1] 2 Et 1 [Et 1 [pt ]] 2 Et 1 [pt ] : 1 Therefore, Et 1 [pt ] = 19 1 1+ 2 1. + Et 1 [ t] and this relation holds for any t.8 This implies that in short-term pt = 1 1 2 + 1+ 2 t; and shocks can be the only reason of ‡uctuations in the economy. Such an inference holds since in this case the realizations of are not important given that it has a zero mean and is i.i.d./unpredictable. In the long-term, however, we assume that there are no shocks. In some sense this corresponds to assuming that everything is perfect. In such a case, Et p= 1 1+ 1 [pt ] = Et 1 [pt ] and : 2 This analysis points that types of expectations matter for the level of aggregate economic activity and prices and for ‡uctuations. In case of static or adaptive expectations in this model there can be prolonged ‡uctuations in the economy much after the shock. However, in case of rational expectations there are no ‡uctuations after the shock. The Cagan Model Cagan Model has been an in‡uential contribution to policy making and academia. Cagan in his 1956 article (Cagan, 1956) proposed a model which delivered novel explanation for extraordinarily high in‡ation/hyperin‡ation. This model seems to do well in explaining for the behavior of in‡ation and the demand for money even in the midst of such distress. The common line of thought is that hyperin‡ation is because of continued supply of very large quantities of money by the central bank. Famous examples are hyperin‡ation in Germany during the period of 1921-1923 and Argentina in 1989. Hyperin‡ation in both countries happened because their governments decided to fund their debt printing money. Cagan Model stresses the destabilizing e¤ects that expectations might have on in‡ation. In Cagan Model in‡ation can destabilize and become very large even with very small amount of money injected into the economy. Therefore, an implication of the model is that policy makers should be wary with …nancing a de…cit by printing money even of small quantities. Cagan Model consists of several blocks. The …rst block of this model is money demand equation Mt = L (Yt ; it ) ; Pt where Mt is the amount of desired money holdings, Pt is aggregate price, and money is the medium of exchange. Therefore, Mt Pt is the real amount of desired money holdings. L (Yt ; it ) is money demand function. It is assumed to depend on current output/income Yt and nominal interest rate it . More precisely, it is assumed to increase with current output/income since increasing output implies 8 We have just found expectation from the model. In this sense we have assumed that agents which live in the economy described by the model, know the economy and use the model to form expectations. Moreover, in this sense adaptive expectations are a notion imposed on the model (additional condition/assumption so to say.) 20 that the same amount of money buys more goods. In turn, money demand is assumed to depend negatively on nominal interest rate. A justi…cation for this assumption is that nominal interest rate is paid on bonds/nominal savings but it is not paid on money holdings. Money holdings can be freely converted into savings. Therefore, nominal interest rate is the opportunity cost of holding money. Higher nominal interest rate increases this opportunity cost and reduces the desire to hold money. (This equation can be thought to be a result of combination of Quantity Equation of Money and Liquidity Preference Theory.) Suppose that L (:; :) is di¤erentiable function in both arguments. Formally, our assumptions about L (:; :) can be summarized as @L (Y; i) @Y @L (Y; i) @i > 0; < 0; where time index t has been dropped because these inequalities are assumed to hold for any t. The second block is the Fisher Equation. In case we have a deterministic setup Fisher Equation is 1 + it = (1 + t+1 ) (1 + rt ) ; where it is the nominal rate agreed for bonds at time t and paid at time t + 1 on bond holdings from time t. t+1 is in‡ation in the period from t to t + 1, t+1 = Pt+1 Pt : Pt In turn, rt is real interest rate. It is straightforward to obtain this equation. Suppose at time t one has acquired an asset At at value Pt At . At time t + 1 the value of the asset has changed to Pt+1 At+1 . The percentage change of the value of asset is the nominal interest earned on Pt At and the percentage change of the volume/quantity of the asset is the real interest rate, Pt+1 At+1 Pt At Pt+1 At+1 = 1; Pt At P t At Pt+1 At+1 = 1; Pt At Pt+1 Pt At+1 At = +1 +1 Pt At = (1 + t+1 ) (1 + rt ) 1: it = 1; In Cagan Model, however, future prices are not known. They are assumed to follow a rane . This also dom/stochastic process. In such a case, we replace Pt+1 with its expected value Pt+1 21 implies that we need to replace in‡ation rate with its expected value, and Fisher Equation becomes 1 + it = 1 + e t+1 (1 + rt ) : There is no expectation on it since, although it has to be paid at time t + 1, it is determined/agreed at time t.9 The analysis in Cagan Model particularly focuses on periods of hyperin‡ation. In these periods nominal values change much more rapidly than real values. Therefore, lets assume that real variables are constant. Moreover, assume that the logarithm of the money demand function is linear in ln Yt and ln (1 + it ). In particular, ln L (Yt ; it ) = 0 + 1 ln Y = : where 0, 1, and ln ln 1 + are positive constants and = 1+ e t+1 0 e t+1 (1 + r) ; + 1 ln Y ln (1 + r). In such a case, combining money demand equation and Fisher Equation we have to have that ln Mt e ln Pt+1 ln Pt = ln Pt : Denote z = ln Z and rewrite the equation above mt or pt = 1+ pet+1 pt = + 1 1+ mt + pt ; 1+ pet+1 ; This is a fairly interesting equation. It suggests that current price is function of current money supply and expected price level in future. Moreover, current price increases with money supply. This happens because, given constant output increasing money supply simply increases prices. (Prices are the rates at which money is converted to goods). Current price also increases with expected future price. This happens because higher expected price increases expected in‡ation. Given constant real interest rate, this implies that nominal interest rate increases, which reduces the desire to hold money. However, the supply of money is unchanged, i.e., Mt = f ixed. Therefore, the rates at which money is exchanged for goods increase which is the same as to say that prices, pt , increase. In terms of equations: pet+1 ") e t+1 If we assume that 9 " and rt = const ) it ") L (Yt ; it ) # and Mt = const ) pt " : = 0, then price (well... the logarithm of it) becomes a weighted average of You might be used to the following form of Fisher Equation it = et+1 + rt . This is an approximation of the one stated above. To see this assume that it ; et+1 , and rt are close to 0 and apply the following approximation x = ln (1 + x) when x is close to zero. 22 current money supply and expected in‡ation pt = 1 1+ mt + pet+1 : 1+ For simplicity we will maintain this assumption and to keep the house clean will denote = 1 1+ (0; 1) so that ) pet+1 : pt = mt + (1 2 (1) This equation together with assumption on expectations is the Cagan Model. We will analyze now how di¤erent types of expectations matter for dynamics in this economy. Moreover, we will check how changes in money supply matter for changes in prices and in‡ation, which is the percentage change in prices. Suppose that agents in this economy have adaptive expectations. Further, for simplicity suppose 1 coincided with its realization pet that the expected value of price for time t pt 1 = pt 2 and t 1 1 = pt 1 (i.e., = 0). In such a case pet+1 = (1 ) pet + pt = (1 ) (1 ) pet 1 + pt 1 = (1 ) [(1 ) pt 1 + pt 1] = (1 ) pt 1 + pt + pt + pt : Therefore, the Cagan Model can be expressed as pt = 1 (1 ) mt + (1 1 ) (1 ) pt (1 ) 1: The general solution of this di¤erence equation is the sum of the general solution of homogenous equation and particular solution of this equation. It is "easy" to guess and verify that it is given by pt = (1 1 ) (1 ) (1 ) t p0 + 1 (1 ) t X (1 1 =0 ) (1 ) (1 ) t m ; where p0 is the initial level of prices. It is a given number. Suppose only few mt 6= 0 (or mt is a stationary function). The logarithm of the price level in such case is stable if (1 1 ) (1 ) < 1: (1 ) With this inequality, the logarithm of the price level converges over time to 0. Therefore, price 23 converges to 1. In‡ation, in turn, can be written as = pt t pt 1 (1 1 = + 1 (1 t 1 ) (1 ) (1 ) ( 1 mt ) It is straightforward to notice that (1 1 p0 ) (1 t 1 X (1 1 ) (1 )(1 ) 1 (1 ) =0 ) (1 ) (1 ) t 1 m ) < 1. Therefore, we have stability in terms of prices and in‡ation in this economy if expectations are for prices. This situation is not so interesting for our current purposes. Consider now how things change if we have adaptive expectations not for price levels but for in‡ation: e t+1 = (1 e t ) + t: Suppose again that at time t 1 prices were stable. Therefore, in‡ation was equal to 0. This implies that e t+1 = (1 = ) (1 e t 1 ) + t 1 + t t or equivalently, pet+1 pt = (pt pt 1) Therefore, pet+1 = (1 + ) pt pt 1: This can drastically change the solution of the model. To see that, plug this expression back into (1) and obtain pt = 1 (1 ) (1 + ) mt 1 (1 (1 ) pt ) (1 + ) 1: The general solution of this di¤erence equation is pt = 1 (1 (1 ) ) (1 + ) t p0 + 1 (1 ) (1 + ) t X =0 1 (1 (1 ) ) (1 + ) t m ; For the current purposes we know the model su¢ ciently well. Let’s focus on the main novelty in terms of inference which Cagan introduced. Suppose that p0 = 0 and economy is at p0 at time t and m 0. Therefore, pt = p0 for any t. In this situation money supply is constant and is equal to 1. Economy, price levels, and in‡ation are stable pt = 0; 24 t = 0: Consider a deviation from this situation. Suppose the government in this economy wants to raise money supply at the beginning of time t = 0 so that m0 > 0 and keep m 0 for the rest of the periods. Perhaps, it does so in order to …nance its de…cit. In such a case pt = Therefore, if (1 1 (1 1 ) )(1+ ) (1 ) (1 + ) 1 (1 (1 ) ) (1 + ) t m0 : < 1 over time prices converge back to 0: However, if (1 1 (1 ) )(1+ ) > 1 then prices diverge to in…nity. This happens even though only m0 > 0 so that there is no continual expansion of money supply. This is the "possible" destabilizing e¤ect of (forward looking) expectations. In‡ation in this case is given by t = 1 (1 Clearly, in case when (1 (1 ) (1 + ) 1 (1 1 (1 ) )(1+ ) ) +1 ) (1 + ) 1 (1 (1 ) ) (1 + ) t 1 m0 : > 1 in‡ation is also ever growing in absolute terms or spiralling out of control. The summary of the main novelty here is: In case expectations are for in‡ation, for certain parameter values it is enough to slightly increase money supply for a very short period to have ever increasing prices and in‡ation because of expectations over in‡ation. This is in contrast with the common line of thought that in order to have ever increasing in‡ation and prices money supply has to increase permanently. Moreover, it suggests that …scal authority (central government) and monetary authority (central bank) should be independent so that government would not be able to …nance its de…cit printing money. This independence is implemented in many countries as of now and even is part of constitutions of some of those countries. What would happen if we had rational expectations in this model? (AGAIN) Rational expectations assumption means that the agents use the model to form their expectations. Therefore they’ll use the following equation for forming expectations: ) pet+1 : pt = mt + (1 Replace pet+1 with Et [pt+1 ] and rewrite this equation pt = mt + (1 ) Et [pt+1 ] : Take expectation over information that available at time t Et 1 [pt ] = Et 1 [mt ] + (1 ) Et 1. 1 [Et [pt+1 ]] : According to the Law of Iterated Expectations Et 1 [Et [pt+1 ]] = Et 25 1 [pt+1 ] : Lets assume mt is deterministic. Therefore, we have a di¤erence equation in terms of expectations Et 1 [pt ] = mt + (1 ) Et 1 [pt+1 ] : Since the coe¢ cient in front of the lead variable is less than one it has to be that expected price tends to in…nity in this model. Therefore, with rational expectations assumption we don’t have convergence/stabilization of the economy. Prices and their expected values diverge to in…nity. This situation is called self-ful…lling in‡ation. Agents expect to have high in‡ation. Prices rise accordingly, and agents’expectations ful…ll. If we impose though additional condition that lim (1 1 ) !+1 Et [pt+ ] = 0; then we will have stability in this model. This condition is sometimes called "no bubble" condition in the sense that it makes sure that expected prices do not become in…nitely large. Our main equation implies that pt = mt + (1 ) Et [pt+1 ] : Et [pt+1 ] = mt+1 + (1 ) Et [pt+2 ] Et [pt+2 ] = mt+2 + (1 ) Et [pt+3 ] ::: Plugging back Et [pt+1 ], Et [pt+2 ], etc, gives pt = mt + (1 ) f mt+1 + (1 ) f mt+2 + (1 ) Et [pt+3 ]gg : Iterating till in…nity we have pt = +1 X (1 ) mt+ : =0 Therefore, with "no bubble" condition prices are forward looking and can be stationary. Prices are forward looking implies that they take into account future changes in policy/money supply.10 Basically, this is the Lucas Critique to models which do not feature rational expectations. In those models changes in policy parameters do not a¤ect expectations and therefore current actions and economic outcomes. However, if expectations are rational then policy changes can a¤ect actions and economic outcomes immediately and therefore change the environment which the change of policy/policy-makers might not have anticipated. Suppose that m 10 0; then pt = 0. If m is constant and equal to m > 0 then by the formula of This is one of the reasons why it might be important to announce policies much before their implementation. 26 geometric progression we get +1 X pt = m (1 ) =0 = m: This holds for any t. Notice that in this case pt is constant which is in sharp contrast to the case of adaptive expectations assumption. If instead, for example, mt+1 = mt+2 = m ~ and mt+k = m for any k > 2, then with the same logic pt+3 = m " pt+2 = " pt+1 = m ~ +m +1 X (1 ) =1 m ~ + (1 )m ~ +m = m ~ + (1 +1 X (1 ) =2 = [ + This implies that if m0 > 0 and m # (1 )] m ~ + (1 ) m; # )2 m: 0 for the rest of the periods then p0 = m0 and pt = 0 for any t > 0. Lets extend slightly the model and assume that money supply is not deterministic but follows a random process mt = m + where m is constant and is the realization of t: is an i.i.d. random variable with 0 mean and constant variance . t at time t and is known at time t. Therefore, mt at time t is not a random variable. However, mt+1 is a random variable at time t since its realization is not known. The assumption that mt = m+ t corresponds to assuming that money supply is subject to unanticipated shocks. Where could such shocks come from? A source for such shocks in the model could be that the government/central bank does not fully announce its money supply policy and allows some random/unpredictable changes. Maintaining the "no bubble condition" price can be rewritten as pt = +1 X (1 ) Et [mt+ ] ; =0 where of course Et [mt ] = Et [m + t] pt = =m+ (m + t since the value of t) + m +1 X (1 =1 = (m + t) + (1 27 ) m: t ) is known and Notice that this is similar to what we previously obtained and given that are not predictable they don’t matter for expectations and prices. Appendix If expectations are for prices then prices are stationary. What is the value they converge to when money supply is constant? Here is the answer to that question: If money supply is constant m > 0; p0 = 0, and lim pt = t!+1 1 (1 (1 )(1 ) 1 (1 ) < 1 then t X (1 t!+1 1 ) (1 ) (1 ) m lim ) =0 t Rewrite this equation as t h X (1 )(1 ) 1 (1 ) lim pt = t!+1 and notice that +1 h X (1 )(1 ) 1 (1 ) =0 t h X (1 )(1 ) 1 (1 ) lim t!+1 =0 h (1 )(1 ) 1 (1 ) i i t i 1 (1 =0 m lim ) t!+1 h (1 )(1 ) 1 (1 ) i i t = +1. Apply L’Hôpital’s rule to …gure out the limit: @ @t = = lim t!+1 lim t!+1 t h X (1 )(1 ) 1 (1 ) =0 @ @t h (1 )(1 ) 1 (1 ) h i i t i t (1 )(1 ) 1 (1 ) h i t h i (1 )(1 ) (1 )(1 ) ln 1 (1 ) 1 (1 ) = ln h 1 1 (1 ) (1 )(1 ) i: At the end of time horizon, price converges to lim pt = t!+1 1 (1 ) ln h 1 1 (1 ) (1 )(1 ) i m: The Lucas Imperfect Information Model with AD The Lucas Imperfect Information Model provides us with an explanation of why in short run aggregate supply curve might be upward sloping, but not vertical.11 Expectations matter for dynamics of prices and quantities in this model too. Prior to proceeding to the model lets digress shortly and review AD-AS Model. 11 Macro II subject slightly covers this model. You can …nd and read a slightly di¤erent version of this model in Part A of Chapter 6 of Romer (2006) textbook. 28 The AD-AS Model consists of two equations/curves: aggregate demand (AD) and aggregate supply (AS). Aggregate demand curve is determined from the IS-LM Model. IS-LM-model has two equations: investments-savings and liquidity-money (money demand). These equations are [IS] : Y = C (Y; T ) + I (r) + G; M [LM ] : = L (Y; r + e ) ; P where C is consumption, I is investment, G and T are government spending and taxes (…scal policy parameters), M is money supply/demand (monetary policy parameter). C increases with Y and declines with T . I declines with r. In a period, prices are assumed to be given. Money supply is also given. The central bank controls it. Therefore, unknowns are Y and r in the IS-LM Model. Graphically this model can be represented in the following manner. The solutions of Y and r depend on price level P , …scal and monetary policy parameters G; T , and M , and on expected in‡ation level, e. The solution of Y is the aggregate demand curve, AD = Y (P; G; T; M; e ): It declines with P . Negative relation stems from LM curve. This curve shifts up when prices increase implying higher equilibrium interest rate and lower income. The graphical representation of AD-AS model is In this …gure, LRAS is long-run aggregate supply (AS) curve. SRAS is short-run aggregate supply curve. The former is vertical since we assume that in long-run factor inputs are given/prices 29 are determined in the model. In turn, the latter is upward sloping and lower is its slope higher is the e¤ect of shifts of aggregate demand on income. Shifts of aggregate demand AD = Y (P; G; T; M; e) in this model can stem from changes in policy parameters G; T; and M , and expected in‡ation level e. Lucas Imperfect Information Model provides us with a SRAS curve. In Lucas Imperfect Information Model, there are N …rms. Each …rm i (i = 1; N ) is assumed to face demand d i where y N = y + N (pi i is income spent on each good (y = ln Y ). p) ; i is a taste parameter which has normal distribution with 0 mean and is i.i.d. across …rms. pi is the price of the product of …rm i. p is the aggregate level which consumers face. Aggregate price levels is average price in this model p= N 1 X pi : N i=1 Consumers know this price level since they are consuming all types of goods produced in the economy. Further each …rm i is assumed to have the following supply function s i where and = + pe ) ; (pi are positive parameters, pi is the price of the good of …rm i, and pe is …rms’expected aggregate level price. There is intra-temporal uncertainty here which stems from the assumption that …rms cannot observe the prices (shocks) of their rivals. This is especially meaningful for …rms from di¤erent industries. Supply positively depends on the gap between pi pe which is the (expected) relative price of the …rm. In this regard, for example pi > pe can happen when …rm i has received positive shock to demand relative to others. Market clearing requires that in equilibrium + (pi pe ) = s i d, i = y + N (pi i p) : Therefore, solving for the price of …rm i we get pi = 1 + y + N + i 1 + ( pe + p) : (2) The aggregate supply is then y = N [ + ( + ) pi ( pe + p) i] : Take the average of this expression and assume that N is high. According to the Law of Large 30 Numbers then 1 N N X i = 0 and i=1 y=N[ + (p pe )] : If there is no uncertainty, which means that pe = p, then Y is vertical y = N: This is long run aggregate supply curve (LRAS). The expected price is conditional on observation of pi , pe = E [ pj pi ] (3) so that …rms can update their beliefs within a period. Since i is driving the uncertainty in this model and it has normal distribution, p has a normal distribution too. It can be shown that in such a case the conditional expectation of p is its expected value plus an error term which has zero expected value, E [ pj pi ] = E [p] + (pi = (1 where > 0, and the error of prediction is pi E [p]) ) E [p] + pi : (4) E [p]. Take expected value of (2) conditional on pi pi = 1 + y + N + E [ pj pi ] : i (5) Plug (4) into (5) and take the average of pi , p= 1 ( + ) (1 y ) N Compute aggregate output denoting ( + ) (1 + E [p] : ) N = b, y = y + b (p E [p]) : (6) This is short run aggregate supply curve. Clearly it is upward sloping. In case p > E [p] clearly in short run y > y . Such a relation holds since if p > E [p] then on average …rms have received a positive shock (their prices have turned out to be higher than expected aggregate price) and have increased output. Combining aggregate supply curve with aggregate demand curve gives equilibrium levels of price and quantity. To keep the discussion as simple as possible suppose that aggregate demand is given by the Quantity Equation of Money P Y = V M: 31 Suppose further that the velocity of money is equal to 1. The logarithm of the aggregate demand therefore is y=m p: Then prices can be determined from y + b (p E [p]) = m p; (7) which is common market clearing condition. Under rational expectations the agents use (7) to derive their expectations. To …nd it, take the expected value of (7), E [p] = E [m] y: Just as in Cagan Model, therefore, agents’expectations of prices depend on expectation of nominal money stock. Plugging this back into (7) and solving for p gives equilibrium price levels p= b 1 E [m] + m 1+b 1+b y: Aggregate price level in equilibrium is a weighted average of expected money supply E [m] and actual money supply m. Plugging p from this expression into the aggregate demand equation gives the equilibrium level of output y=y+ b (m 1+b E [m]) : This implies that under rational expectation assumption short run deviations of output from long run level are possible only through unanticipated changes in money supply. If money supply rule is known then simply m E [m] = 0 and y = y. Therefore, output ‡uctuations in this model with rational expectations assumption are purely driven by unanticipated changes in money supply. Prices, though, respond to unanticipated changes to money supply. We have a version of the AD-AS Model where aggregate demand curve is y=m p; which means that aggregate demand is assumed not to depend on …scal policy. This is the reason why inference does not include policy parameters other than money supply. Lucas Imperfect Information Model with AD implies a very well known and very debated relationship in macroeconomics which is called Phillips Curve. Phillips Curve illustrates the short-run trade-o¤ between in‡ation and output. To derive it assume that long-run output is …xed and put time index in (6) to obtain yt = y + b (pt Et = y + b [(pt pt 1) = y + b( Et 1 [ t ]) : t 1 [pt ]) (Et 32 1 [pt ] Et 1 [pt 1 ])] In short we have an expectation augmented Phillips Curve yt = y + b ( Et t 1 [ t ]) : In case when in‡ation is higher than expected, short-run output is higher than the long-run output. In this sense, if policy makers can manage to full the agents they can increase short-run output. Usually Phillips Curve is written, however, as a trade-o¤ between in‡ation and unemployment. Assuming that output is inversely proportional to unemployment and using u to denote the long-run level of unemployment, Phillips Curve can be rewritten as ut = u ~b ( Et t 1 [ t ]) ; where ~b > 0 is a constant. Increasing then unanticipated in‡ation reduces unemployment. The Sticky Wage Model In Macroeconomics II you have also seen a version of the Sticky Wage Model. We will now consider a version of that model adding microstructure which hasn’t been discussed in detail in the previous course. We will show again that expectations matter for macroeconomic outcomes. Suppose that there is a continuum of mass one of identical and in…nitely lived consumers. Each consumer is endowed with L units of time. The consumers can supply their time to …rms at wage rate w or use it for leisure. The consumers derive utility from consumption of N di¤erent types of goods fxj gN j=1 and have disutility from supplying labor. They have constant elasticity of substitution utility function of the form U (x1 ; :::; xN ; L l) = N X xj + ln (L l) : j=1 where 2 (0; 1) and 1 1 is the elasticity of substitution between di¤erent pairs of goods x.12 l is the amount of labor force that the representative consumer supplies to …rms and L time. l is its leisure is a positive parameter. The consumers maximize their utility subject budget constraint taking prices pxj N j=1 of goods x and wage rate as given. Therefore, the representative consumer solves the following problem in x 12 The elasticity of substitution between any i and k (i 6= k) pair of x is 33 d ln x i k du(xk )=dxk d ln du(xi )=dxi = 1 1 . each period of time 8 N <X max : fxj gN j=1 ;l j=1 s:t: N X 0 = wl xj + ln (L l) 9 = ; pxj xj j=1 To solve this problem we use Lagrangian L = max Normalize 8 N <X : fxj gN j=1 ;l j=1 xj + 2 l) + 4wl ln (L N X j=1 39 = pxj xj 5 ; = 1. The optimal rules are given by …rst order conditions [xj ] : pxj = xj 1 ; 1 ; [l] : w = L l where the …rst one is the inverse demand function of good xj and the second one is the inverse supply function of labor. Notice that there are actually N inverse demand functions because there are N goods x. This means that we have N + 1 optimal rules. Assume that e¤ective wage rate is given by w = pe f (u; z) ; where pe is the expected aggregate price level (pe = E h 1 N i p j=1 xj ), f is a function which decreases PN in the …rst argument and increases in the second argument. u is the level of unemployment in the economy, and z are institutional features in the economy such as unemployment bene…ts, minimum wages, etc. Suppose that there are N …rms. Each …rm produces a type of x good. Firms’ input for production is labor and a unit of labor produces a unit of output in all …rms. Firms are price setters. Therefore, the problem of the …rm which produces good xj is max pxj xj wxj xj s:t: pxj = xj 1 : This implies that the inverse supply function is p xj = 34 1 w: Inverse supply function implies that pxj p and from the inverse demand function it follows that in equilibrium all x goods are produced at the same quantities xj x. Notice that in this framework since …rms are price setters price is higher than marginal cost w. As tends to 1 goods become more similar/substitutable since 1 1 tends to in…nity and px tends to the marginal cost. Given that in equilibrium all …rms produce the same amount of output they hire the same amount of labor. Since there is a continuum of mass one of consumers, the total amount of labor that …rms hire is l. This implies that unemployment rate is u= L l L l 13 : L =1 Moreover, the production of a unit of good requires unit of labor. Unemployment rate then can be rewritten as Y L u=1 where Y is total output, Y = The intersection of PN j=1 xj . w p w p = pe f p = ; Y ;z ; L 1 (8) gives equilibrium level of output and prices. At the same time it gives equilibrium level of unemployment. = pe f p 1 Y ;z : L Suppose that expected price relative to actual price equation to continue to hold f 1 Y L;z pe p (9) increases. In such a case, in order this should decline. That will happen if unemployment in- creases/output declines. 13 Although we treat this as unemployment rate, this is actually the non-participation rate. Unemployment rate would be the di¤erence between participation rate in a non-distorted economy and the participation rate in this distorted economy. 35 Equation (9) holds in each period of time. Add time subscripts to it and write pt = 1 f 1 Yt ; z pet : L This is the aggregate supply curve. In long-run pt = pet so that long-run output and unemployment are given by =f 1 Y ;z : L In short-run price level pt is an increasing function of Yt and expected price level pet . Higher Yt in this setup increases the wages paid according to (8). This increases the marginal costs of the …rms and therefore the prices that they charge. Higher expected price also increases wages according to (8) and therefore results in higher actual prices. If we assume for example that f (ut ; z) = 1 ut + z = Yt + z; L then the short-run aggregate supply curve is given by pt = 1 or Yt = Yt + z pet ; L pt pet z L: Long-run aggregate supply curve then will be Y =( z) L: 36 The E¤ectiveness of monetary policy (with …xed rules) We have seen that expectations and their types can matter for aggregate economic performance. They can matter for both the levels of nominal and real variables and for their dynamics. We will now turn to more rigorous discussion of how monetary policies can matter for economic performance given the type of expectations. A model with stabilizing monetary policy Often policy makers perceive an agenda of stabilizing the economy. They set …scal and monetary policy rules in order to achieve that. The reason they do so is that consumers/households are believed to be risk averse. Therefore, they might be better-o¤ if the economy is more stable. The intuition behind such a results is that risk averse consumers prefer steady income over volatile income. This (sub-)section discusses properties and e¤ects of a monetary policy rule which attempts to stabilize the economy. The discussion follows a simpli…ed version of the AD-AS Model. Since the focus of the section is on monetary policy, aggregate demand (in logarithms) is simpli…ed to yt = m t t pt + t: can be thought to be the velocity of money. In such a case, this equation is the Quantity Equation of Money. In the reminder we will assume that t is a random variable which follows a simple AR(1) process of the form t where 2 (0; 1) is a parameter and t = t 1 + t; is an i.i.d. random variable with 0 mean and the unconditional expected value of Denote with (10) t, E[ = Denote with 2 unconditional variance of 2 t 1] + E [ t] + 0: t, V [ t ]. Unconditional variance of = E[ t E [ t ]]2 = E [ t ]2 Plug t 1 + t for t, E [ t ]2 = E [ t 1] 2 + 2E [( 37 variance. E [ t ]. It is straightforward to see that is equal to 0, E [ t] = 2 t 1 ) ( t )] + E [ t ]2 t is Given that t is i.i.d. E [( t 1 ) ( t )] = 0. Therefore, 2 2 = 2 1 Conditional on information available at time t Et 1 [ t] = Et = : 1 mean and variance of 1 [ t 1] + Et t are 1 [ t] (11) t 1; and Vt 1 [ t] = Et 1[ t = Et 1[ Et t 1 + 2 1 [ t ]] t 1] t 2 = 2 : Aggregate supply in this model is given by yt = pt wt ; where wt are wages. Aggregate supply increases with prices and declines with wages, which summarize/represent the costs of the …rms. In this model long-run level of output, prices, and money supply are normalized to 1 so that in long-run y = m = p = w = 0: Agents are assumed to live for 2 periods. Their wage contracts …x wages for 2 periods. In this sense contracts are for a long term and wages are sticky. Wages are …xed according to the agents’ expectation of prices for 2 periods. At birth (denote by t 2) they inherit information about previous periods of time from their parents and require wage at time t according to wt = 1 e p 2 tjt 1 + petjt 2 : (12) Monetary authority pursues an agenda of stabilizing the economy. It tries to set money supply so that to reduce the e¤ect of shocks on output conditional on information it has. The shock arrives after monetary policy has been implemented. In other words, monetary policy authority sets money supply at the beginning of time t not knowing the realization of the shock at time t, but knowing its realizations in earlier periods In this sense monetary policy authority does not have superior information. Suppose that monetary policy rule/money supply is mt = ht + (1 38 ) ht 1; (13) where and Et 2 [0; 1] and < 0. 1 [ t] = ht = t 1; With this policy rule monetary policy authority can reduce the expected shock t 1 since when increases money supply declines. t 1 In equilibrium aggregate demand is equal to aggregate supply, pt wt = mt pt + t: Therefore, 1 (mt + wt + 2 1 (mt wt + 2 pt = yt = t) ; (14) t) : (15) Plugging monetary policy rule (13) and wages (12) into output equation gives yt = In this expression, t 1 1 2 t 1 and t 2 + (1 ) t 2 + 1 e p 2 tjt t 1 + petjt : 2 are known at time t. The dynamic path of the model is given by pt = t 1 2 t 1 = yt = t 1 + 1 2 + (1 ) t 2 + t ) t 2 + t + 1 e p 2 tjt 1 + petjt 2 ; 1 e p 2 tjt 1 + petjt 2 : t; t 1 We have, therefore, unknowns yt , + (1 t, pt , and petjt 1 and petjt 2 and need equations for the latter two. Suppose that agents have rational expectations in this model. In such a case, expected prices need to be …gured out from the model and petjt 1 = Et e 1 [pt ] ; p tjt 2 = Et 2 [pt ] : Take the expectation of price pt (14) conditional on information at time t Et 1 [pt ] = 1 (Et 2 1 [mt ] + Et 1 [wt ] + Et 1 [ t ]) : According to (13) and (11) the conditional expectation of mt is Et 1 [mt ] = = Et 1 [ t 1] + (1 + (1 ) t 1 39 ) Et t 2: 1 [ t 2] 1, Et 2 [pt ] is a given number at time t 1, which means that Et 1 [Et 2 [pt ]] = Et 2 [pt ]. According to (12), then, the conditional expectation of the wage rate is given by Et 1 [wt ] 1 Et 1 [Et 1 [pt ] + Et 2 [pt ]] 2 1 (Et 1 [pt ] + Et 2 [pt ]) : 2 = = Therefore, the expected value of pt conditional on information from time t Et 1 [pt ] = 1 ( + 2 ) + (1 t 1 ) t 2 + 1 (Et 2 1 [pt ] 1 is + Et 2 [pt ]) 2 [pt ] : ; or equivalently, Et 1 [pt ] = 2 ( + 3 ) t 1 + (1 ) t 2 1 + Et 2 (16) This means that expected price is a function of previous realizations of shock and expectations. We still need an equation for Et information at time t 2 [pt ]. Take the expectation of price pt (14) conditional on 2, Et 2 [pt ] = 1 (Et 2 According to (10), the expected value of Et 2 [ t] 2 [mt ] + Et 2 [wt ] + Et 2 [ t ]) : conditional on information at time t t = Et 2 = 2 2 t 2 + t 1 + 2 is t t 2: Conditional expectation of monetary policy rules is Et 2 [mt ] = Et = [1 2 [ t 1] (1 + (1 ) ] ) Et 2 [ t 2] t 2: In turn, according to (12) and the Law of Iterated Expectations conditional expectation of wage rate is Et 2 [wt ] 1 Et 2 [Et 2 = Et 2 [pt ] : = 1 [pt ] + Et 2 [pt ]] Therefore, Et 2 [pt ] = 1 [1 2 (1 ) ] t 2 + Et 2 [pt ] + or equivalently, Et 2 [pt ] = (1 ) + 40 + 2 t 2; 2 t 2 ; The expected value of price conditional on time t Et 2 ( + 3 1 (16) can be rewritten as 1 3 (1 3 ) + + 2 1 1 [( + ) t 1 + (1 ) t 2] + t 2 2 1 1 2 ( + ) t 1+ 3 (1 ) + 4 3 3 1 (1 )+ + 2 t 2: 4 + 2 1 [pt ] = ) t 1 + t 2: In turn, output (15) can be rewritten as yt = Group together the coe¢ cients of t 1 and t 2 and use the condition that t 2 t 1 = t 2 + t 1 to rewrite output as yt = 1 ( + 3 ) t 1 + 1 : 2 t The …rst term in this expression is the e¤ect of shocks in previous periods on output. Monetary policy, for example, can eliminate their e¤ect setting = and reduce the e¤ect of shocks on output. In such a case yt = where t 1 ; 2 t is shock in period t which is not observed by the monetary policy authority before it sets and runs the policy rule. Policy rule, therefore, cannot a¤ect t. Such a policy rule implies that the variance of output is V [yt ] = Notice that if is not selected to eliminate V [yt ] = which is larger than 1 2 . 4 1 4 t 1 2 1 4 2 : then 1 ( + 3 + 2 ) 2 ; In this sense this policy rule stabilizes economy eliminating some of the in‡uence of shocks. From aggregate supply equation it follows that pt This implies that positive shocks t wt = 1 : 2 t imply reduction of real wage rate wt pt . What would happen if monetary policy authority knew the realization of t before setting and running its policy and had an agenda to stabilize the economy? The answer turns to be straightforward. If it sets mt = 41 t; then there are no shocks in this economy and no uncertainty. Aggregate output does not ‡uctuate and is at its long-run level, 1. w = p; y = p p = w; y: It seems natural to derive also monetary policy rule in case when wages are not sticky and are set for each period. In such a case, the model can be summarized as [AD] : yt = mt pt + [AS] : yt = pt [W ages] : wt = Et [Shock] : t = t; wt ; 1 [pt ] ; t 1 + (17) t: Therefore, in equilibrium pt = yt = 1 (mt + wt + 2 1 (mt wt + 2 t) ; t) : To …nd equilibrium level of output derive the expected level of prices and wages using (17) wt = Et 1 [pt ] =( + ) t 1 + (1 ) t 2: Therefore, yt = = 1 f 2 1 : 2 t t 1 + (1 ) t 2 [( + ) t 1 + (1 ) t 2] + t 1 + tg Clearly, yt in this case does not depend on money supply, which implies that money supply rule selected above cannot reduce variability of output. Is this a general inference for any money supply rule? Yes. To see that take again expected price level now assuming that we have a general (deterministic) money supply rule mt wt = Et 1 [pt ] = mt + t 1: This implies that aggregate output is yt = 1 : 2 t Monetary policy has no e¤ect in this general case too. Therefore, there is nothing to derive/any 42 rule suits. The reason why this happens is that there are no rigidities since wages adjust in a period. Higher money supply simply alters prices. Another possible extension of the model is to consider static expectations wt = pt 1. In such a case in equilibrium it has to be that 1 (mt + wt + 2 1 (mt wt + = 2 = pt 1 ; pt = t) ; yt t) ; wt t = t 1 mt = + t 1 t; + (1 ) t 2: From equations for pt and wt it follows that 1 2 pt = In turn, from t = t 1 + t t 1X 2 t p0 + 1 2 =0 t (m + ): is follows that t = t 0 + t X t : =0 Assume that 0 = 0 = 0. Aggregate output, therefore, is yt = 1 2 ( [( + t 1 11X 22 =2 1 2 1 2 ) 1 2 t 1 + (1 t 1 ) ] ( t 2 X t +( + ) t 1 =0 [( + ) + (1 ) ] X2 k=0 1 p0 + t : 2 ) k k ! +( + ) 1 + ) The exercise which computes the variance of aggregate output requires tedious algebra. Instead of that notice that in this case too monetary policy can eliminate some of the e¤ect of previous shocks on output setting + = 0 or ( + ) + (1 ) = 0. In this version of the setup monetary policy matters for aggregate output since, again, wages are rigid. They are indexed to previous levels of prices. Monetary policy alters the prices within period which alters the output of …rms but keeps their costs (wages) …xed. 43 A model with constant growth of money In the previous (sub-)section we considered a policy which attempted to stabilize the economy. In this (sub-)section we consider a di¤erent policy and keep the setup of the economy unchanged. The policy that we consider bears the name of its main proponent: Milton Friedman. Friedman Rule (or Policy) is monetary policy which makes nominal interest rate zero. Consider Fisher Equation, i=r+ : Friedman Rule then sets = r: A motivation behind such a policy can be derived from cash-advance models, which are slightly more advanced to be covered in this course. In short in these models money is an inferior asset in the sense that it does not earn interest, whereas bonds earn nominal interest i. The agents are forced to keep money since they use it for their purchases. Intuitively, though, such a policy works since, it manages to set nominal interest rate to 0 and eliminates the di¤erence between keeping money and bonds. Therefore, it makes money less inferior (or not inferior at all). Assume that output and velocity of money are constant. In such a case, from the Quantity Equation, v + m = p + y; it follows that in‡ation is equal to the growth rate of money t = mt mt Suppose that real interest rate is constant and denote 1: r = g. Therefore, in terms of this exposition Friedman Rule is mt = mt 1 + g: This is the monetary policy rule which we consider in this section. The remainder of the model is [AD] : yt = mt pt + t; [AS] : yt = pt wt ; 1 [W ages] : wt = (Et 1 [pt ] + Et 2 [Shock] : t = t 1 + t : 2 [pt ]) ; From [AD] and [AS] it follows that pt = yt = 1 (mt + wt + 2 1 (mt wt + 2 44 t) ; (18) t) : (19) Assuming rational expectations and using [W ages] gives Et 1 [pt ] 1 (Et 1 [mt ] + Et 1 [wt ] + Et 1 [ t ]) 2 1 1 mt 1 + g + (Et 1 [pt ] + Et 2 [pt ]) + 2 2 1 1 mt + (Et 1 [pt ] + Et 2 [pt ]) + t 1 ; 2 2 = = = (20) t 1 ; Therefore, Et 1 [pt ] 1 (Et 1 [mt ] + Et 1 [wt ] + Et 1 [ t ]) 2 1 1 mt 1 + g + (Et 1 [pt ] + Et 2 [pt ]) + 2 2 2 1 mt + Et 2 [pt ] + t 1 : 3 2 = = = Notice that Et 1 [mt ] = mt 1 (21) t 1 ; + g since money supply follows a deterministic/…xed rule. To …nd the expected price level conditional on information available at time t Et 2 [pt ] = = 1 (Et 2 [mt ] + Et 2 [wt ] + Et 2 1 mt + Et 2 [pt ] + 2 t 2 : 2 2, consider 2 [ t ]) ; Therefore, Et 2 [pt ] = mt + 2 t 2: Plugging back into the expression for expected price (20) gives Et 1 [pt ] = 2 3 3 mt + 2 t 1 + + 2 3 1 2 2 t 2 ; and from [W ages] equation it follows that wt = mt + 1 3 = mt + t 1 t 1 2 3 2 t 2 t 1: According to (19), income level then is yt = 1 2 t + 1 3 t 1: The second term in this expression highlights the persistence of shocks. This policy does not 45 eliminate t 1. We have relatively volatile output and expected in‡ation at rate g, E [ t ] = E [pt = mt pt mt 1] 1 = mt mt 1 2 3 +E t 1 + 1 3 2 E t 2 2 3 t 2 + 1 3 2 t 3 = g: Consider a slight extension of the discussion. Suppose that the monetary policy authority/central bank can cheat the agents and unexpectedly increase money supply from g = g to g0 (g0 > g). Suppose for simplicity that there are no shocks in the economy, so that t = t 0. This implies that agents expect prices and prices in general are Et 1 [pt ] = Et 2 [pt ] = pt = mt 1 +g and wages are given by wt = mt 1 + g: Therefore, aggregate supply is at its long-run level yt = 0. This happens because there is no uncertainty. Aggregate demand is given by yt = 1 [mt 2 (mt 1 + g)] : Therefore, if the monetary policy authority increases the growth of money supply to g0 then it increases output at least for a short period, yt = 1 (g0 2 g) : Suppose policy makers’ agenda is to boost output. In such a case it would be tempting for the policy makers to surprise public with sudden increases of in‡ation. In other words, the policy makers although might have announced a …xed monetary policy rule they might want to deviate from that commitment. In such a case, would the public believe that policy makers will stick to the announced policy? The answer to the question is "No" (most probably). We will continue exploring this commitment problem more formally in the next sections. Discretionary monetary policy Suppose now that the central bank’s agenda is to maximize output and minimize in‡ation. To formalize that assume that the central bank has a loss function, Lt = yt + 2 (pt 46 pt 2 1) ; which it attempts to minimize choosing the monetary policy rule. Clearly this is equivalent to maximizing Lt . Therefore, the central bank’s optimal problem is Lt = max yt 2 mt (pt pt 2 1) : Aggregate demand and supply are the same as in previous section. In equilibrium we have then that 1 (mt + wt + 2 1 (mt wt + 2 pt = yt = t) ; t) : In turn, assume that wages and shocks are given by wt = petjt = t 1; t 1 t: + Suppose that the central bank sets its policy prior to observing the shock. However, it has an important advantage in the sense that it can set its monetary policy rule conditional to the wages that prevail in the economy. Therefore, the central bank minimizes the expected loss and solves, Et 1 [Lt ] ( = max Et mt 1 " 1 (mt 2 wt + t 1 + 1 (mt + wt + 2 2 t) 2 t 1 The solution of this problem is given by @Et 1 [Lt ] = 0; @mt which (in this setting) is equivalent to Et 1 @Lt = 0; @mt or simply Et 1 1 2 1 (mt + wt + 2 2 t 1 + t) Therefore, mt = 2 + 2pt 1 47 wt t 1: pt 1 = 0: + t) pt 1 #) : Plugging this monetary policy into the expressions for output and prices gives pt = yt = 1 1 + pt 1 + pt 1 + 1 ; 2 t wt + 1 : 2 t The expression for prices implies that expected in‡ation is E [ t] = 1 : Therefore, increasing the weight on in‡ation in the loss function reduces in‡ation. In other words, if central bank values more lower in‡ation then it sets monetary policy rule to make sure in‡ation is lower. In turn, in order to …nd output assume that the agents have rational expectations and derive wage rate from the equation for prices. wt = 1 + pt 1; Wages depend on parameter . This is because in this setting monetary policy rule is contingent on prevailing wages. Therefore, output is given by yt = 1 : 2 t This monetary policy therefore does not limit the e¤ect of shocks on output but a¤ects in‡ation. Political cycles and discretionary monetary policy Consider a modi…cation of the previous setup. Suppose that there are two political parties A and B in the economy and it is an election year. Currently it is period mid-t end of period t 1 and elections are at the 1. Suppose these parties have di¤erent tastes for (weights on) in‡ation A and B and appoint monetary policy authority accordingly. Party A has lower tolerance to in‡ation than party B, A > B. The wages for period t are set before the election contingent on prices that prevail in the next period. Therefore, wages are set without knowing which party wins the election and which tastes for in‡ation prevail in the economy in period t. The probability that party A wins the elections is A. The reminder of the model is the same as the model in previous section. After the elections there is one party in the economy. Let it be party i where i is either A or B. 48 Therefore, the economy is described by 1 mit + wt + t ; 2 1 mit wt + t ; 2 2 + 2pt 1 wt pit = yti = mit = E i t t 1 i 1 = ; i 1 wt = + pt 1: i Clearly, expected in‡ation is lower in case party A won the election. Combining expressions for price level and monetary policy gives 1 pit = + pt 1 1 : 2 t + i This is price level conditional that party i has won the elections. With probability at time t is pA t and with probability 1 pt = it is A 1 A pB t , A) + (1 A A price level therefore, unconditional price level is 1 + pt 1 + B 1 : 2 t This implies that wages are given by wt = 1 A + (1 1 A) A + pt 1: B Wages are a weighted average of tolerances for in‡ation and the previous level of prices. Therefore, even though only one party is elected at time t the preferences of both parties matter. In turn, expected in‡ation rate is given by E [ t] = 1 A + (1 1 A) A : B To determine the level of output, use the expressions for money supply and wages yti = 1 i 1 A + (1 A) A 1 + B 1 : 2 t Therefore, if A is elected the expected output is E [yt ] = (1 A) 1 A 1 ; B which is a negative number. This number increases with the probability that the party B wins elections 1 A. The expected output level is negative since party A tolerates in‡ation less than 49 party B. The existence of party B pushes in‡ation up. To keep it low party A reduces money supply and output. Output declines to a negative number If B is elected then the expected output is positive and given by E [yt ] = 1 A 1 B > 0: A Monetary policy under commitment and discretion Previous sections discussed the properties of a monetary policy rule which reduces the variance of output. The discussion in these sections suggested that in certain cases there might a commitment problem with an announced policy in the sense that policy makers might want public to believe in the implementation of a policy but they might deviate later from it. The latter situation arises especially when the policy makers pursue two opposing policy goals at the same time. The monetary policy authorities/central banks usually have exactly two policy goals: stabile in‡ation and output. Consider an economy where the central bank has these two policy goals. To formalize this and keep the matter relatively simple, assume that the central bank selects monetary policy rule to minimize Lt = 2 t y)2 : + (yt Monetary policy tries to minimize the deviation of output from its long-run level y and the deviation of in‡ation t from its long-run level 0. is a positive parameter. It o¤ers the importance of deviation of output from its long-run level for the central bank and therefore for monetary policy. In this context L is the central bank’s loss function. There is no trend in the model which we consider here, deviation of output from the long-run level stem from exogenous shocks, and ideally long-run in‡ation is 0. Therefore, minimizing the 2 t y)2 amounts to minimizing their variance. and (yt Suppose that the reminder of the economy is given by versions of AD and AS curves with expected wages [AD] : yt = a (mt [AS] : yt = b (pt [W ages] : wt = petjt where d t and s t pt ) + wt ) + d t; s t; 1; are uncorrected i.i.d. random variables with 0 mean and variance 2 d and 2, s correspondingly. a and b are positive parameters. Further, suppose that agents’have rational expectations, so that petjt 1 = Et 1 [pt ] : For the purposes of the current discussion it is more convenient to write [AS] curve with in‡ation 50 rates [AS] : yt = b (pt = b( Et t Et 1 [pt ] 1 [ t ]) Et 1 [pt 1 ] + Et 1 [pt ]) + s t; s t + In this model we will give an important advantage to the central bank assuming that it sets monetary policy after observing the shock s t d. t and Usually, monetary policy can react quickly, although hardly without a lag. However, what we actually need here is that it reacts faster to shocks than the private sector/prices and wages. This seems to be a realistic assumption and opens a door for monetary policy to have real e¤ects. Monetary policy can have real e¤ects in this case since it can mitigate at least within-period shocks. We will further assume that monetary policy is given by [P olicy] : where , and t = d t + s t; + are policy parameters which the central bank chooses. Clearly this is an indirect formulation of monetary policy rule. One way to think about it is that the central bank sets money supply rule so that t is given by equation [P olicy]. Once t is determined this money supply can be determined from [AD] equation. With this monetary policy rule, agents’expectations about in‡ation are e tjt 1 Therefore, whatever value of = : the central bank would choose, the agents know it and will adjust their expectations accordingly. In this sense, the central bank can in‡uence the agents’expectations. It will make use of this in its optimization problem. Monetary policy under commitment In this case the central bank announces its policy a period ahead (at t commits to it. This means that the central bank selects at time t minimize its expected loss function Et know the possible realization of s. t 1 [Lt ]. At time t 1 period for period t) and 1 parameters , and to 1, however, similar to agents it does not Therefore, Et 1 [ t] = : From equations [AS], [W ages], and [P olicy] and the expression for loss function it follows that the central bank solves: Et 1 [Lt ] = min Et ; ; 1 + d t + s t 2 + Et 1 b + d t + s t + s t y 2 : In order to solve this problem, open up the brackets and use the de…nition of variance for random 51 variables with 0 mean, e.g., Et Et + 1 d t + s t 2 1 h d 2 t i = Et 1 2 + = 2. d = 2 d t +2 2 2 d + 2 2 s; + s t y 2 d t + +2 + d t s t) ( +( s 2 t) Similarly, Et b 1 + d t s t + 2 = (b )2 2 d + (b + 1)2 2 s + y2: Therefore, the optimal problems of the central bank is Et 1 [Lt ] = min ; ; n 2 2 2 d + 2 2 s + h (b )2 + 2 d + (b + 1)2 2 s = 0: 2 s + y2 io : The …rst order conditions of this optimal problem are [ ] : 2 = 0; The optimal conditions for [ ] : 2 2 d + 2b2 [ ] : 2 2 s + 2 (b + 1) b and imply that 2 d = = 0; = 0. However, = b . 1+ b2 The policy rule, therefore, is t = b s t; 1 + b2 From modi…ed [AS] it follows then that output is yt = 1 1 + b2 s t: The variance of in‡ation and output under this rule are V [ t] = V [yt ] = b output to vary with 2 s; 1 + b2 1 1 + b2 If the central bank does not value stability of output, s. t 2 2 2 s: = 0, then it sets in‡ation to 0 and allows If, however, it places very large weight on the stability of output, then it allows in‡ation to vary with s t = +1, but keeps output constant so that it has 0 variance. In general, it is easy to show that the variance of in‡ation increases with and the variance of output declines with it. We don’t we have the in‡uence of is that shock d t d t in monetary policy. The intuition behind such a result pushes prices and output proportionately and in the same direction. Therefore, 52 it does not create a trade-o¤ between o¤setting in‡ation and output volatilities. In turn, shock s t pushes prices and output in di¤erent directions. For example, positive shock s t increases output, but reduces prices. Therefore, it creates a trade-o¤. s t Although the central bank reacts to shocks to it sets = 0. Therefore, unconditional expectation of in‡ation and expected in‡ation for agents’ are 0. Under this policy the expected value of the loss function of the central bank is Et 1 [Lt ] 1 1 + b2 = 2 s + y2 : Monetary policy under discretion (without commitment) A problem with the commitment equilibrium is that at time t the policy announced at t 1 may not longer be the optimal rule. In other words, if policy maker is able to change the announced policy at time t then it might achieve lower loss. This is because at that time, in‡ation expectations have been formed and can be treated as given. The central bank then could exploit this. Formally, assume that agents believe that the central bank will deliver Et case there is no Et 1 [ t] 1 [ t] = 0. In such a in [AS]. Therefore, from [AS], [W ages], and [P olicy] and the expression for loss function it follows that the central bank solves: Et 1 [Lt ] = min Et ; ; + 1 d t 2 s t + + Et b 1 + d t + s t 2 s +( b + s t y 2 : This can be equivalently written as Et 1 [Lt ] = min ; ; n 2 + 2 2 d + 2 2 s + h (b )2 2 d + (b + 1)2 y)2 io : The …rst order conditions in this case are [ ] : 2 + 2 b( b [ ] : 2 2 d + 2 b2 [ ] : 2 2 s + 2 (b + 1) b Since the central bank sets the same and 2 d = 0; 2 s = 0: it has to have lower loss setting = instead of y) = 0; b 1 + b2 y = 0. Use [AS] - where expected in‡ation is zero - and the policy rule to compute 53 expected value of the loss function for under this policy Et t = 1 [ t] = yt = Clearly, expected loss with Et 1 [Lt ] = b y 1+ b2 = b 1+ set = so that Et 1 [ t] b y s t; 1 + b2 b y; 1 + b2 b2 1 y+ 2 1+ b 1 + b2 1 y2 + 1 + b2 is lower than with incentive to cheat the agents: Announce that b y 1+ b2 b2 s t; 2 s : = 0. Therefore, the central bank has = 0 so that agents perceive Et 1 [ t] = 0 but later > 0. If the central bank announces policy but alters it after the announcement then the policy rule is not “time consistent” and is not credible. The agents will not believe that the central bank is committed and the commitment equilibrium falls apart. Assume now that the central bank cannot commit to a rule and look for a policy which is optimal at period t when expectation of in‡ation is taken as given by the central bank. Call this a discretionary policy. Clearly, if this policy does not appear to be the same as the one above, then there is time inconsistency problem. With discretionary monetary policy, the central bank solves Lt = min t n 2 t + [b ( Et t 1 [ t ]) + s t o y]2 : The central bank minimizes loss function but not its expected value, since it is not committed to any rule and makes the decision after the realization of shocks The …rst order condition then is t To …nd out Et 1 [ t] + b [b ( t Et 1 [ t ]) + s t y] = 0: take the expected value of this expression Et which means that at period t 1 [ t] = by; 1 the agents expect positive in‡ation at period t. In turn, in‡ation and output are t = yt = b s + by; 1 + b2 t 1 s + b2 y: 1 + b2 t Therefore, the central bank runs higher in‡ation than in case if it could commit to a rule. 54 In terms of previous choice parameters, clearly, this situation corresponds to = b 1+ b2 and = by. The central bank’s expected value of the loss function at time t 1 then is (use Et 1 [ t] = by in [AS]) Et 1 [Lt ] = Et 1 " b 1 + b2 = = 1 1 + b2 2 s + 2 s t + by h # + Et 1 " 1 1 + b2 2 s t y # i ( b)2 + 1 y 2 Denote the expected values of loss in case of credible commitment, cheating, and discretion as EtCM 1 [Lt ], EtCH1 [Lt ], and EtD 1 [Lt ]. EtCM 1 [Lt ] = EtCH1 [Lt ] = EtD 1 [Lt ] = 1 2 + y2 ; 1 + b2 s 1 y 2 + 2s ; 1 + b2 1 2 2 + ( b)2 y 2 : s +y 1 + b2 It is clear that CH EtD 1 [Lt ] > EtCM 1 [Lt ] > Et 1 [Lt ] : Since the loss of the central bank is higher in case of discretion and if it fails to commit then discretionary equilibrium prevails, then the central bank might want to use some mechanisms to show the public/agents that it is committed. Such mechanisms are readily available in case of repeated interactions when the public can punish the central bank (in terms of changing their beliefs for example) if it deviates from its announced policy. Another way to align incentives is to increase the independence of the central bank which often means reduction of . In such a case, however, the central bank although would commit to a rule it will allow for uncontrolled changes in output. Business cycles The term business cycles is used to coin the ‡uctuations in aggregate output and other activity (e.g., unemployment, trade) over medium-term. In an economy, medium-term is usually associated with a period of several months or years. Fluctuations can be both upward and downward. Upward changes in output are called economic booms, while downward changes are called economic busts or recessions. Relatively long lasting recessions are called a depression. These ‡uctuations are measured relative to the long-term growth trend of the output and are largely unpredictable. 55 The explanation of business cycles is one of the primary issues in macroeconomics. Business cycles have been studies starting from the time of Adam Smith and David Ricardo. Economists tend to di¤er a lot, however, in terms of their explanations of causes of business cycles and proposed remedies. This is such a central topic that the economists even form schools of thought in explanation of business cycles. Some of the most highlighted shocks to aggregate output and other aggregate variables in an economy are Technology shocks: 100 years ago travelling from Barcelona to New-York would take much more time than now. This is a drastic example how production functions can change over time. New technologies like PCs and robots alter the production process and usually raise productivity. Sometimes, production facilities break down or employees use too much facebook, so productivity falls. Such a technological change is not always smooth; it often comes in some of form of shocks. Weather shocks: Agriculture and tourism industries are very weather-dependent. Other industries could also depend on weather if their employees work e¤ort depends on weather. Fluctuations in weather then a¤ect output in these industries. Monetary shocks: We have seen that in certain cases money supply and in‡ation a¤ect output. This implies that random changes to monetary policy or liquidity in the economy can lead to output ‡uctuations as well. Political shocks: The government can in‡uence the real economy through public expenditure, and regulations. If it changes expenditures, tax laws, antitrust regulation, and expectations then that can cause ‡uctuations in aggregate output. 56 Taste shocks: All the examples above are rather about the supply of goods. There could be also shocks to the demand for goods and services in terms of shifts in preferences. Such shifts can cause ‡uctuations and can come for example from introduction of new products that makes others obsolete in terms of the preference. Usually none of these shocks can explain large shifts in output (and other aggregate variables) such as observed in real economies. However, there are mechanisms in the economy that can amplify these shocks. These shocks can be ampli…ed because of, for example, intra- and intertemporal substitution. If a negative shock hits the economy and output declines then consumers might wish to work less and enjoy more leisure (intra-temporal substitution). This would reduce output further. Moreover, if consumers like smoothing their consumption then they would save less (inter-temporal substitution) so that capital would decline and output would be lower in future. Price stickiness could be another amplifying mechanism. If wages are sticky, for example, then after a negative shock to productivity …rms would like to pay lower wages but they cannot. Instead of lowering wages and keeping rather steady-level of output they would …re labor force and reduce output. Financial frictions, in terms of inability to lend and borrow freely, can amplify the e¤ects of shocks too. If there are …nancial frictions then even small shocks can force …rms into bankruptcy. This will a¤ect …nancial sector that lent money to the bankrupt …rms and reduce credit. Often then additional …rms have to declare bankruptcy, and sometimes even banks fail. Bank failures reduce liquidity and credit. They can a¤ect all creditors and debtors and therefore can have large economic consequences. The classical and neo-classical (fresh-water) school of thought tends to believe that the origins of business cycle ‡uctuations are completely exogenous processes which a¤ect aggregate output through changes in technological e¢ ciency (e.g., introduction of computers and facebook) and other real variables. They hypothesize that the economy is frictionless so that there are no price rigidities and/or …nancial frictions. In this respect, they hypothesize that the cycles which follow these shocks are the optimal/best response of the economy. Therefore, these schools of thought believe that policies might not be e¤ective in tackling business cycles. Neo-classical theories include the Real Business Cycle theory by Kydland and Prescott (1982), which is built-around, in particular, the rational expectations assumption. These theories tend to have solid micro-foundations. However, they tend to under emphasize the importance of frictions and distortions in the real economy. The Keynesian and neo-Keynesian (saltwater) schools of thought tend to believe that the origins of business cycle ‡uctuations also include shocks to nominal variables. They hypothesize that the economy involves frictions so that there are price rigidities, …nancial frictions, and other failures in the economy. In such frameworks nominal variables a¤ect real variables. Moreover, the cycles which follow these shocks are not the best response of the economy. Therefore, these schools of thought tend to believe that there is a scope of policy intervention (…scal and/or monetary). A standard example of Keynesian model is the AD-AS model (IS-LM-Phillips curve model). We will see two "extensions" of it in the next two sections. These models, however, lack solid microfoundations. Neo-Keynesian theories tend to alleviate that problem. Prominent economists such 57 as Michael Woodford and Gregory Mankiw have contributed to the development of neo-Keynesian theories. These theories are usually presented in a form of Dynamic Stochastic General Equilibrium (DSGE) models, which are very far away from the scope of this course because of complexity. Such models, however, are commonly used in the Central Banks and other (…nancial) institutions. There are other schools of thought too. For example Monetarist school of thought (largely due to Milton Friedman) and Austrian school of though (largely due to Friedrich Hayek). Business cycles - The Carlin and Soskice (2005) model This section presents a very classical Keynesian model of business cycles research by Carlin and Soskice (2005). This model is an extension/analogue of the standard AD-AS model. The standard AD-AS model consists of 3 equations. These are IS and LM equations, which describe the aggregate demand (AD), [IS] : Y = C + I (r) + G; M [LM ] : = L (r + e ; Y ) : P and aggregate supply equation (AS). In Macroeconomics 1 and 2, AD-AS model is presented in a static form. It is presented using levels of prices, but not changes of prices (in‡ation). This model also consists of 3 equations. Its …rst equation is the IS curve. IS curve in this model, however, is written in a somewhat reduced form and in terms of logarithms of variables [IS] : y = A where A and ar; are positive parameters. For example, A depends on the level of consumption thriftiness. In turn, measures the magnitude of the e¤ect of interest rate on income in the goods market. This model is dynamic and incorporates time dimension. In particular it is assumed here that income and interest rate are time dependant. Moreover, this model assumes that interest rate (because of some imperfect adjustment mechanisms) a¤ects the level of income with time lag so that yt = A art 1: There exists interest rate level rs that leads to equilibrium level of output ye (stabilizing level of interest rate). Graphically, IS curve looks like this 58 The second equation in this model is the aggregate supply curve, which is written in the form of the Phillips curve (where agents have static expectations) t = t 1 + (yt ye ) : Graphically, this Phillips curve (PC) in short run and in long run is presented below Here, the upward sloping curve (line) is the short run Phillips curve and the vertical curve (line) is the long run Phillips curve. T is the Central Bank’s targeted level of in‡ation. The targeted level of in‡ation is one of the parameters of the Central Bank’s monetary policy. In this model the Central Bank designs its monetary policy so that to minimize its loss function ye ) 2 + L = (yt where T 2 t ; is a positive parameter which highlights the importance of in‡ation stabilization for the Central Bank. The Central Bank sets its monetary policy so that to solve n L = min (yt t ye )2 + s:t: t = t 1 + (yt 59 ye ) : t T 2 o ; Substituting Phillips curve order condition t into L and di¤erentiating with respect to yt gives the following …rst @ L = 0 , (yt @yt ye ) + t 1 + (yt T ye ) =0 Substituting Phillips curve back into this equation gives (yt ye ) = T t : This equation summarizes the monetary policy rule. Carlin and Soskice call it MR-AD equation. Graphically the solution of problem can be presented in the following manner. The Central Bank’s indi¤erence curves are ellipses (circles if = 1) with a bliss point at ye ; T . In sum, the model is given by the following 3 equations [IS] : yt = A [P C] : [M R Plugging [M R = AD] : (yt t 1 1; + (22) (yt ye ) = ye ) ; t (23) T : (24) AD] into [P C] gives the level of in‡ation t Moreover, expressing t from [M R = 1 1+ t 1 2 + 2 T :14 (25) AD] and plugging it into [P C] gives [Y R] : yt = ye 14 t art 1 1 + t 1 T : (26) Denote ! = 1+1 2 and rewrite this equation as t = ! t 1 + (1 !) T . The solution of this equation is t = P ! t 0 + (1 !) T t =0 ! t . As t increases the …rst term tends to 0. In turn, the second term tends to T . Substituting this into [M R AD] gives the dynamics of output. From the dynamics of output and [IS] the dynamics of interest rate can be found. 60 Call this equation Y R: output rule. In turn, from IS equation it follows that yt ye = a (rt 1 rs ) : Therefore, [T R] : rt 1 rs = 1 a 1 1 + t 1 T : (27) This is an analogue of Taylor rule (it does not include output gap) which we call T R. This rule directly follows from the Central Bank’s optimization rule. Therefore, it is the policy rule of the Central Bank in terms of adjusting the interest rate. In this respect, T R suggests by how much the interest rate should be adjusted if in‡ation deviates from its target. Lets now turn to the analysis of ‡uctuations and policy responses. Without loss of generality, in our example economy which we discuss below we will assume that in‡ation target is set to 2% (i.e., T = 2). Moreover, time starts at t = 0 and at t = 0 the economy is in its long run (equilibrium) level of output and (targeted) in‡ation. Shocks causing ‡uctuations Shock to the IS curve: Consider a positive to shock to the IS curve which raises A to A0 at t = 0 (and remains there forever). The Central Bank can do nothing to a¤ect the increase the output in t = 0 since its monetary policy rule a¤ects interest rate which has time lag in a¤ecting output. However, it can a¤ect changes in the economy in the second period. In order to do so it needs to make a forecast of Phillips in t = 1. With the forecast of Phillips curve it will identify its constraint for designing its optimal problem, and …nd the solution of the optimal problem. Suppose further that the shift of A to A0 has triggered (through Phillips curve) increase in the current level of in‡ation to 0 = 4%. From Phillips and IS curves, therefore, we have that the change in A is given by 4=2+ i.e., A0 = 2 A0 A ; + A. Given that expectations are static the forecast of next period (t = 1) in‡ation is 4. For any level of output, this implies that in‡ation is going to be higher in the next period implying that next period’s Phillips curve has shifted up. The graphical representation of this process is as follows 61 Notice that if the Central Bank decides to lower in‡ation in t = 1 then it will cause a recession. This is because any level of in‡ation below 4 corresponds to lower output level (lower than ye ). Moreover, it can a¤ect in‡ation at t = 0 and move along the new Phillips curve. According to the Taylor rule the Central Bank will calculate in‡ation at t = 0 so that it increases interest rate at t = 0. In the …gure o¤ered above this corresponds to choosing points B and B 0 . Clearly, therefore, the Central Bank causes a recession in t = 1. What happens next? At t = 1 since output is lower than ye according to the Phillips curve in‡ation at t = 2 will be lower than at t = 1. This implies that the forecast of Phillips curve for t = 2 at t = 1 is to the right of the dashed P C in the …gure above. The Central Bank then will set slightly lower interest rate and in‡ation. This will increase output. This adjustment process will continue till the point in time when in‡ation is back to its targeted level of 2. Of course, it will imply that the new stabilizing level of interest rate is higher than the old stabilizing level of interest rate, rs0 > rs . Supply Shock: Consider a permanent shift of the level output at t = 0. This corresponds to a shift of the long run level of output ye to some ye0 such that ye0 > ye and can happen, for example, because of technical progress. Suppose that the level of shock is such that it triggers the in‡ation to decline to 0. From the Phillips curve, therefore, the magnitude of the shock has to be 0=2+ ye 62 ye0 ; i.e., ye0 = 2 + ye . Clearly, in this case given that long run Phillips curve shifts to the right (see the …gure above for graphical representation) it has to be that in the short run Phillips curve in the long run shifts to the right too. To have such a shift there needs to be a new M R AD curve since it has to go through the intersection of long run and short run Phillips curves. Using the forecast of Phillips curve the Central Bank knows that this shock will imply lower in‡ation at t = 1. It will design its optimal problem accordingly. The solution of its optimal problem implies Y R equation. According to Y R, the Central Bank would like to have higher output in period t = 1 since yt increases with ye . From T R equation and IS curve this implies that the Central Bank will reduce in‡ation at t = 0 from 2% and the interest rate from rs . In the …gure o¤ered above it will move to points C and C 0 . Later, it will gradually increase in‡ation and interest rate so that to reach ye0 and T. Endogenous business cycles - The Goodwin (1967) model The Goodwin (1967) model combines not so orthodox growth models (Harrod, 1939; Domar, 1946) with Phillips curve in order to generate endogenous business cycles. Fluctuations are due to cyclical relationship between employment and wages. This model is not well micro-founded, however, it can serve for a nice illustration of endogenous emergence of cycles. This model has origins in Biology. In particular, it is related to a large class of Predator-Prey models which describe dynamic biological systems. A common outcome of these systems is a vicious cycle of large prey population. Imagine increasing the number of predators. That would reduce 63 the sample of prey and therefore reduce the number of predators. These dynamics give a raise to Lotka-Volterra di¤erential equation. Similar equation is derived from this model. At any time t, the aggregate output is assumed to be given by Yt = max where t is the productivity of labor Lt , and Kt t Lt ; ; pins down capital output ratio Kt Yt when Kt t Lt . This production function implies that labor and capital are complementary. Moreover, it is clearly not a neo-classical production function. For example, it violates diminishing returns property of the neo-classical production functions. In equilibrium …rm will be reluctant to hire more than t Lt Kt amount of e¢ ciency adjusted labor since hiring more would not increase their production. Moreover, they would be reluctant to hire more than t Lt amount of Kt . We will focus on the case when they hire so that t Lt = Kt : This implies that Yt = t Lt = Kt : Further, assume that time is continuous and labor productivity grows at a rate of a. The growth this expression is t t+1 rate of labor productivity in discrete time is given by t . The continuous time analogue of 1 d t . t dt Let total population be Nt and grow at a rate of , i.e., 1 dNt Nt dt = . Therefore, employment rate is given by "t = Lt : Nt Denoting growth rates by letter g, this implies that employment rate grows at a rate of g" = gL ; (28) where gL is the growth rate of labor. Notice that when the (equilibrium) amount of labor and population grow at the same rate then the rate of growth of employment g" is zero. In case, however, when population grows at a higher (lower) rate than labor then g" is negative (positive). The rate of growth of labor can be found from the production function. Under the assumption that t Lt = Kt the rate of growth of labor is gL = gY a: (29) Clearly, when K is …xed the amount of labor should decline with labor productivity since the amount of labor in e¢ ciency units should be constant. The Phillips curve in this model depicts relationship between percentage changes of wages w 64 (price of labor) and employment (output). The Phillips curve is assumed to be given by g w = "t where and ; (30) are positive parameters that characterize labor market conditions (e.g., existence of minimum wage, labor unions). Denote the share of worker compensation by t Apparently, 1 equilibrium t Lt t = t wt Lt : Yt = would be compensation of capital. Now, since we focus on the case when in Kt the share of worker compensation is t = wt ; t which has a growth rate of g = "t a: (31) The last thing which needs to be determined to close the model is the growth rate of output. Clearly, since Yt = Kt output and capital grow at the same rate. Assume that there are two types of agents in this economy: workers and agents who own capital and supply no labor. Call the latter type of agents "capitalists." Workers consume their income immediately and do not save. In turn, capitalists save a constant fraction of s of their income. This means that total savings are given by St = s (1 t ) Yt : 15 Capital accumulation rule, therefore, is given by dKt dt = St Kt = s (1 s (1 = t ) Yt t) Kt Kt : The growth rate of capital and output then is gK = gY = Notice that for sharp increase of t, s (1 t) : (32) which corresponds to sharply increasing the share of worker’s income, the growth rate of Y can become negative according to (32). This would imply that the 15 Recall that in the Solow (1956) model there is no distinction between workers and "capitalists." In this sense, Solow (1956) assumes that both agents save the same percentage of their income. 65 growth rates of labor and employment become negative according to (29) and (28). Negative growth rate of employment would reduce employment. Therefore, it would reduce wages according to the Phillips curve (30) and according to (31). This is the predator-prey mechanism in this model. This model can be summarized by two di¤erential equations g" = g s (1 = t) "t a ; a: These equations are known as Lotka-Volterra di¤erential equations. Clearly, g" declines as creases. This is because higher t t in- reduces investments and growth of output. Therefore, it reduces growth of employment. Moreover, higher employment increases g . This is because increasing employment increases output and wages. Higher wages imply higher share of worker compensation. In the steady-state equilibrium there is no dynamics in the system. Therefore, in the steady-state equilibrium 0 = s (1 0 = ) " a ; a: Solving for the share of worker compensation and employment gives SS = 1 "SS = s +a ( + a + ); : Consider the di¤erential equations in order to see the dynamic adjustment in this model. The following …gure o¤ers the phase diagram (time evolution of the system). 66 Suppose that "t > "SS and at "t = "SS t < SS . This implies that the point ("t ; t ) is in quadrant I. Since we have that g = 0 and g increases with "t , it has to be that g > 0 for the points ("t ; t ) in quadrant I. This implies that over time t by the arrow pointing to the right (i.e., increasing t ). g" = 0 and g" declines with t, increases in quadrant I, which is depicted Moreover, since at t = SS we have that it has to be that g" > 0 for the points ("t ; t ) in quadrant I. This implies that over time "t increases in quadrant I, which is depicted by the arrow pointing to up (i.e., increasing "t ). The intuition behind the arrows in the remaining quadrants follows a similar logic. Imagine the economy starts at some point in quadrant I. In such a case, over time it will gravitate to quadrant II then to quadrant III, quadrant IV , and return back to quadrant I. This is illustrated by the circle of arrows around the steady-state. It turns out that dynamics are periodic ‡uctuations in this model in the sense that this process never converges to the steady state nor diverges to 1. An economy which is at the steady-state can appear in quadrant I for example because of a positive shock to employment (e.g., population declines temporarily) and/or negative shock to labor income share (e.g., labor productivity increases temporarily). After such a shock it will experience ‡uctuations along the cycle and never converge back to the its steady-state. These ‡uctuations are endogenous business cycles. Such ‡uctuations we have encountered for prices in the Cobweb Model. In the Cobweb Model there would be never ending ‡uctuations if 2 =1 A Real Business Cycles Model This section mostly follows (1) Chapter 9 in Doepke et al. (1999) and King and Rebelo (1999). 67 Before Kydland and Prescott (1982) economists thought that classical models can be used for studies of long-term phenomena such as economic growth. However, short- and medium-term ‡uctuations are not well explained with classical models. Kydland and Prescott (1982) revolutionized business cycles theory and economics in two ways. First, they showed that realistic ‡uctuations can emerge in classical (well micro-founded) macroeconomic models as an optimal response to exogenous shocks. By doing so they basically originated the real business cycles (RBC) theory. Second, they o¤ered ways for evaluating the predictions of the models so that to gauge their relevance and …t to the real world. Usually it is impossible to analytically solve the models used in real business cycles theory. Instead numerical/simulation methods are used. In order to resort to numerical methods the values of model parameters should be known. Kydland and Prescott (1982) advocated the use of real world data for calibrate (estimating) the parameters of the models. In these models, it is also very hard to derive analytical comparative statics for understanding the qualitative predictions of the models. Numerical comparative statics are used instead of analytical comparative statics. The numerical comparative statics are also useful for evaluation of quantitative predictions. In particular, researchers evaluate the response of the model variables over time to exogenous shocks (impulse-response functions) and compare them to the patterns in real world data. In line with Kydland and Prescott (1982), the comparison is in terms of the direction and the shape of the response of model variables; the magnitude of response in terms of mean and standard deviation; and the signs and magnitude of correlations between model variables. The table below o¤ers business cycle statistics (moments) for main macroeconomic variables from US data. This table is taken from King and Rebelo (1999). It is used for the latter two points. All data are quarterly and are for the period of 1947 Q1-1992 Q4. Variables representing quantities such as output (Y), consumption (C), investment (I), and hours worked are in per capita terms. Consumption includes consumption of non-durables. Investments include private …xed capital formation and expenditures on durables. Wage rate is the real compensation per hour. Interest rate is basically the interest rate paid on treasury bill minus in‡ation. A is Hicks-neutral productivity in aggregate output. It is de…ned as Solow residual (ln A = ln Y 68 ln K (1 ) ln L). The …rst column o¤ers sample standard deviation of relevant variables. The standard deviation of a variable X with observations from fX1 ; X2 ; :::; XT g is de…ned as v u u ST D (X)T = t 1 T 1 T X Xt t=1 T 1X Xt T t=1 !2 : It is the statistical/empirical analogue of the variance (more precisely the square root of the variance). The second column o¤ers the standard deviation of the relevant variables relative to the standard deviation of output (Y). Clearly consumption of non-durables is much less volatile than output. Investments, which include expenditures on durables, are around three times more volatile than output. The number of hours worked has around the same volatility as output. However, wages (and interest rate) vary much less than output and output per hour worked. The second column of the table shows the …rst order autocorrelations among variables. The …rst order autocorrelation of the variable X can be found using the following regression Xt = where is a constant and t + 1 Xt 1 + t; is by construction orthogonal to Xt 1. Coe¢ cient 1 is the …rst order autocorrelation of the variable X. It is called …rst order autocorrelation because we have the 1st order time di¤erence (i.e., Xt 1) and the correlation of X with itself (i.e., auto-correlation). 69 Alternatively, 1 can be computed as 1 T 2 1 T X Xt XT Xt 1 XT 1 t=2 = ST D (X)T ST D (X)T : 1 This coe¢ cient shows how much the observations of variables are interrelated (linearly). In other words it shows how good is the current value of the variable for predicting its future value. Now imagine that there is a shock to X because of some exogenous reasons ( ). In such a case, if close to 1 from below then this shock will stay in X for a long time. If stay in X forever. If 1 1 1 is is equal to 1 then it will is higher than 1 then it will stay in X forever moreover it will propagate and get larger over time. From the second column it is clear that all quantity variables are very highly autocorrelated. Perhaps, one of the most important autocorrelations for RBC theory is that of A. It is fairly large, which means that previous values of A predict current values with high precision and a shock to A will persist in A for a long time. The last column of this table o¤ers contemporaneous correlations between variables and output. The correlation between variables X and Z can be found using the following regression Xt = where is a constant and t + Zt + t; is by construction orthogonal to Zt . Coe¢ cient between variables X and Z. Alternatively, 1 T 1 = T X is the correlation can be computed as Xt XT Zt ZT t=1 ST D (X)T ST D (Z)T : This coe¢ cient measures the degree two variables are related (linearly). This is the reason it is called "co"-rrelation. More precisely, it corresponds to contemporaneous correlation because it is for the observations from the same period of time. The third column suggests that output and other quantities are very highly correlated. This implies that the driving processes behind them might be quite the same. However, output and real wage rate are not well correlated. The Basic RBC model Consider a closed economy which is populated by a very large number of identical and in…nitelylived households of mass one. In a period, the representative household is endowed with 1 unit of time which it can use for work/labor l and leisure 1 l. It derives instantaneous utility from consumption c and leisure u (ct ; 1 lt ) = ln ct + ln (1 70 lt ) : The lifetime utility of the household at time zero is the expected discounted sum of the instantaneous utilities U (c; 1 l) = E0 " +1 X t # u (ct ; 1 lt ) ; t=0 where = 1 1+ is the discount factor and > 0 is the discount rate. The representative household has rational expectations and E0 is the expectation operator given all the available information at the beginning of the economy, time 0. The assumption that households live forever can be justi…ed thinking that we have families of altruistic households who care about the utilities of their o¤springs as they care about their utilities. It turns out that this assumption makes calculus easier. In turn, the assumption that there are a very large number of households implies that each households is atomistic and can be ignored in the analysis Therefore, non of the households can dictate prices and quantities and takes them as given. The representative household earns market wage w for each labor unit. It owns the capital stocks in the economy which at time t are at the level of kt . The household earns market interest rate of r for each unit of supplies capital. At time t, the proceeds from labor and capital are equal to wt lt + rt kt . The household uses these proceeds to fund its consumption and savings. Therefore, its budget constraint is given by wt lt + rt kt = ct + st : Savings are translated into investments st = it : Investments create new capital according to the law of motion of capital: kt+1 = it + (1 where ) kt ; 2 (0; 1) is the rate of depreciation of capital. This implies that the budget constraint of the household can be rewritten as wt lt + rt kt + (1 ) kt ct kt+1 = 0: The household maximizes its lifetime utility with respect to the budget constraint. Formally, it solves the following problem. max fct ;lt ;kt+1 g+1 t=0 ( E0 s:t: " +1 X u (ct ; 1 t=0 wt lt + rt kt + (1 k0 > 0 t ) kt given: 71 ct #) lt ) kt+1 = 0; We will use Lagrangian to solve this problem. We de…ne the Lagrangian as L = E0 " +1 X t u (ct ; 1 lt ) + qt [wt lt + rt kt + (1 ) kt ct kt+1 ] t=0 # : Imagine that it is now some time t so that the household knows all the values of the variables (including random variables) for time t but does not know the t + 1 values. In such a case, take the derivative of the Lagrangian with respect to ct and lt and set it to zero @L =0 @ct @L =0 @lt : : 1 = qt ; ct 1 lt = qt wt : Take also the derivative of the Lagrangian with respect to kt+1 and set it to zero. Notice that kt+1 shows up in the Lagrangian at time t as kt+1 and at time t + 1 in rt+1 kt+1 + (1 @L = 0 : qt = Et qt+1 [rt+1 + (1 @kt+1 We have Et in front of qt+1 [rt+1 + (1 ) kt+1 )] : )] because t + 1 variables are subject to random shocks and the household uses rational expectations to predict them. Notice that given the values of kt and it the value of kt+1 is uniquely determined. The …rst equation tells that the marginal utility of consumption 1 c is equal to q, which is called the shadow value of the marginal unit of capital. The second equation is the supply of labor. The bene…t of supplying a marginal unit of labor is wt , which is the real wage rate and measures the number of consumption goods. In the second equation. wt is scaled by qt and qt wt is the bene…t of supplying a marginal unit of labor in terms of utility. In equilibrium, the household should be indi¤erent between marginally increasing its labor supply or marginally increasing its leisure time. Therefore, bene…t in terms of more consumption because of labor should be equal to the utility loss because of less time in leisure. The marginal disutility of labor is given by 1 lt . Plug the …rst equation into the second equation to …nd 1 1 = Et [rt+1 + (1 ct ct+1 )] : This equation is called Euler Equation. It equates the marginal utility of consumption to the discounted value of the marginal utility of consumption times the earned interest. It states that in equilibrium the household should be indi¤erent between consuming a marginal unit of good right now or delaying its consumption to the next period and saving to earn a net interest of rt+1 +(1 ). In short it tells that in equilibrium the household should be indi¤erent between consumption and saving. For any given level of prices, the household has three variables to solve for (ct ; lt ; kt+1 ) and three 72 equations: ct ; wt 1 1 [consumption] : = Et [rt+1 + (1 ct ct+1 [investment] : wt lt + rt kt + (1 ) kt ct [labor supply] : lt = 1 )] ; kt+1 = 0: Notice that labor supply increases with real wage and declines with consumption. This is because household’s consumption re‡ects how well o¤ it is. If the household anticipates higher wealth then it would accordignly increase consumption and reduce its current labor supply (have you ever seen lazy millionaires?). In the standard neoclassical model, labor supply is inelastic. Therefore, changes for example in government expenditures have no real e¤ects on output. However, in this framework they can have. Imagine an increase in government expenditures. The household knows that this is going to be followed by a tax hike to cover the expenses. Therefore, it expects to have lower wealth. The anticipation of lower wealth would lead to lower consumption and would increase current labor supply. Higher labor supply would increase output, which we discuss below. A very large number of identical …rms produce consumption goods. The representative …rm has a Cobb-Douglas production technology: yt = At kt1 where lt ; 2 (0; 1) and At is the technology level. Higher/lower At implies higher/lower level of output for given levels of capital and labor. At each period of time, the representative …rm solves the following problem t = max fyt kt ;lt wt lt rt kt g s:t: yt = At kt1 lt : Therefore, its demand for capital and labor are given by @ t =0 @kt @ t =0 @lt : rt = (1 : wt = ) yt ; kt yt : lt These equations characterize the production/supply side. In equilibrium, supply and demand are equal. This corresponds to plugging these two equations (supply prices) into the three equations 73 above (equations for demand side). If we do so then we obtain the following three equations: lt = 1 ct " 1 lt 1 At k t " 1 1 = Et (1 ct ct+1 At kt1 lt + (1 (33) lt 1 At+1 kt+1 lt+1 ) + (1 kt+1 ) kt ct ## ) ; kt+1 = 0: (34) (35) In these equations, the variables are c, k, l, and A. We need additional equation for A to have 4 equations and 4 variables. We will assume that ln At = ln At where t 1 is an i.i.d. random variable with 0 mean and + 2 t; (36) variance. These 4 equations constitute the basic real business cycles model. The shocks originate in equation (36) and are t. As discussed before, ln At is computed using capital and labor series and the following formulae ln At = ln Yt The mean and variance of the shocks and ln Kt (1 ) ln Lt : are obtained running a regression that has the form of (36). One of the major points of Kydland and Prescott (1982) is that many of the real world business cycles summarized in Table 1 above can be matched using this model and the estimated shocks. It is important to stress that in this model shocks propagate to other variables and persist over time. A shock that arrives at time t persists in the economy for several periods because of two reasons. Suppose that a positive shock has arrived, so that the values of and A are higher than expected. First reason that the shock persists is that the arrival of positive shock increases the marginal product of labor for a given value of k and increases output. Some of that increased amount of output will be consumed and the remainder will be used for investments. Higher investments will create higher amount of capital in the next period. Therefore, in the next period the amount of capital will be higher than if there was no positive shock. This will imply higher marginal product of labor and higher output, leading again to higher investments than there would have been without the positive shock. The second reason is that A has autocorrelation as measured by . Higher than expected shock will persist then in A. It turns out that usually this channel is quantitatively the most important one. Therefore, the magnitude of the persistence is very important for RBC models. How do we solve equations (33)-(36) for c, k, l, and A? For a fairly general set of parameters we cannot actually derive the analytic solution of this model. To …nd solution of this model then we run computer simulations.16 There are certain important insights, however, that we can draw 16 Usually this is done for the log-linearized versions of these equations around the steady-state of the system. For a function f (x) this involves writing a …rst order logarithmic approximation: f (x) = f (x ) + f 0 (x ) (ln x ln x ). 74 from this system of equations without explicitly solving them. We will say that our model economy is in a "steady Steady state and technology shocks: state" when all variables in equations (33)-(36) are time invariant. We will say that the economy is in a "deterministic" steady state if there are no shocks, , in the economy. Suppose that in the long-run there are no shocks. It is possible to show that then in the long term all variables in equations (33)-(36) are time invariant if j j < 1. Let’s assume that j j < 1. According to the table from King and Rebelo (1999) this is …ne at least for the US.17 Suppose that in the steady state we have a given value for A (e.g., A = 1). Using (33)-(35) it is easy to show that in the steady state we have l = k = " h 1 (1 1 1 1 (1 ) (1 c = h (1 i h 1 ) + #1 A ) 1 ) (1 i ) (1 i; ) l; (1 " ) 1 (1 1 ) (1 ) A #1 l: Let’s consider now a permanent increase in the value of A to A0 . Such a permanent increase would imply a new steady state where l = k0 = " 1 c0 = h 1 1 h (1 1 (1 ) (1 ) 1 (1 i h 1 ) + #1 A0 (1 i ) ) (1 i; ) l; (1 ) 1 " (1 1 ) (1 ) A0 #1 l: Clearly, in this new steady state capital stock and consumption are higher than in the old steady state: k 0 > k and c0 > c. Therefore, positive (negative) technology shocks increase (reduce) the steady state levels of capital, output, and consumption. Transition and technology shocks: Consider a model economy which starts at a deterministic steady state and receives positive technology shock so that A increases to A0 . We know that the new steady state of this economy features higher levels of capital, output, and consumption. How does the economy get to the new steady state? If there are no further shocks than it can be shown 17 If j j > 1 then we will simply need to consider a "detrended" version of the model where we subtract the growth of A from A, Y , K, and C. 75 from (33)-(35) that the economy will gradually transit toward the new steady state. During this transition, capital, output, and consumption will increase. For a more general discussion, we need to consider an economy that starts at a deterministic steady state and periodically receives technology shocks which imply di¤erent values of K, Y , and C in future steady state. After each shock the economy starts adjusting/transiting toward the new steady state. Since we are not able to solve for the variables (33)-(36) we will need to simulate the model economy on a computer in order to assess its performance in terms of generated cycles in output, investments, consumption, and other variables. In order to simulate it we need to use values for parameters. The usual values of the parameters (for the US) are = 0:667; = 0:984; = 0:025; = 3:48; = 0:979; = 0:0072:18 The simulations provide the simulated values of model variables. We use then the de…nitions of the standard deviation and correlations to check the model predictions. The following table, borrowed from King and Rebelo (1999), summarizes the results from this exercise. The simulation results indicate that the model closely matches the volatility of output, investments, consumption, and wages. It matches also autocorrelations and the observed procyclicality (positive correlations with output) of almost all variables. Although this "simple" model preforms astonishingly well for matching these data moments it fails in matching the cyclicality of labor hours and interest rate. Hansen (1985) has suggested a way to circumvent the problem with labor supply making it an indivisible choice at the individual level so that the at macro level the elasticity of labor supply to shocks is very high. However, 18 See Table 2 in King and Rebelo (1999). 76 that might create excess volatility in wages. To match the volatility of the interest rate and wage rates then this simple real business cycle model is complemented with price rigidities. Adding price rigidities, however, makes the model much more intractable and brings back Keynesian arguments. We discuss one of the ways price rigidities are usually modelled in these frameworks in the next section.19 Price Rigidities - The Calvo (1983) model Classical and Keynesian schools di¤er the most in their view of how markets work. Classical (and neo-classical) school of thought conjectures that markets are perfect (i.e., frictions and distortions are insigni…cant). In this respect, it conjectures that prices freely adjust to equilibrate demand and supply in all markets. Classical dichotomy holds under this conjecture, i.e., money supply does not matter for real variables. Keynesian (and new-Keynesian) school of thought conjectures that there are insigni…cant frictions and distortions. In particular, these imperfections create price rigidities. Therefore, prices do not adjust (fully) to equilibrate demand and supply in all markets. This breaks classical dichotomy. Moreover, in such a setup shifts in demand and supply can a¤ect output through prices too.20 There are two common ways for modelling price rigidity. The easy way to do so assumes that changes in prices depend on time (time-dependent models.) For example, Taylor (1980) assumes that …rms change their prices each n-th period and that in each period 1 n -th of …rms change their prices. Calvo (1983) assumes that in each period with some probability some of the …rms can change their prices. More precisely, Calvo (1983) assumes that at the beginning of each period a random event decides which of the …rms can change their prices and which of the …rms cannot. A more cumbersome, but more appealing, way for modelling price rigidity assumes that prices depend on the state of the economy (menu cost models.) In these models …rms change prices when the expected bene…t of changing their prices is higher than the cost of changing prices (menu cost.) The complications arise because expected bene…t of changing prices depends on the current and future states of the economy. DSGE models with Calvo (1983) style price rigidities tend to be the main workhorses in the central banks. We will now cover a version of the Calvo (1983) model. It will provide us with an upward sloping supply curve/Phillips curve. Time is discrete. The economy is populated by a continuum of mass one of identical and in…nitely lived households. The representative household derives instantaneous utility from consumption of a basket of goods. The lifetime utility of the household is given by U= +1 X t Ct ; t=0 19 Another criticism that applies to the RBC models is their reliance on Solow residual which is usually not very well estimated. 20 The assumption that prices are very rigid …nds limited support in microeconomic data. 77 where C is a constant elasticity of substitution basket of goods i; 2 1 31 Z C = 4 C~i di5 (37) 0 where it by . 2 (0; 1). The elasticity of substitution between any pair C~i and C~j (i 6= j) is 1 1 .21 Denote The households spend their entire income on purchases of C. Therefore, the representative household’s budget constraint it PC C = Z1 Pi C~i di; (38) 0 where PC is the price of C and Pi is the price of good i. The household chooses its demand for di¤erent goods to maximize its lifetime utility. Since it has no dynamic decisions (and therefore no inter-temporal trade-o¤s), the maximization of lifetime utility is equivalent to the maximization of instantaneous utility streams. The problem of the representative household can divided to two steps. In the …rst n step o the representative household chooses C to maximize its utility. In the second step it chooses C~i to maximize C. Therefore, i2[0;1] in the …rst step it solves 8 < max C C : 0 @PC C Z1 0 19 = Pi C~i diA ; ; for any time t. The solution of this problem gives the shadow value of marginally relaxing the budget constraint 1 = PC : In the second step the household solves the following problem 8 2 31 > Z1 < max PC 4 C~i di5 > ~ C f i gi2[0;1] : 0 Z1 0 9 > = ~ Pi Ci di : > ; The solution of this problem is given by …rst order conditions for all goods i. Treating the integrals as sums, these …rst order conditions are 21 "ci ;cj = ~i =C ~j ) d ln(C d ln @C ~ @C j = @C ~ @ Ci = ~i =C ~j ) d ln(C 1 ~ 1) ~ d ln(C =Ci j h i C~ 1 C~i : Pi = PC C i : C = 1 1 : 78 (39) where C = Z1 C~i di. This expression together with (37) and budget constraint (38) implies that 0 Pi 1 PC C C = 1 C~i : Therefore, it implies that 2 1 Z 4 P 0 1 i 31 di5 Finally, 1 PC C C = 1 2 1 31 Z 4 C~i di5 0 1 1 = PC 2 1 Z 4 PC = Pi C 1 0 1 C 3 1 1 1 = PC 1 : 1 di5 ; (40) which means that the aggregate level of price is a basket of prices of goods i. Moreover, the (inverse) demand function implies that 2 1 31 Z Pi = PC 4 C~i di5 C~i 1 0 Therefore, C~i = PC Pi = PC Pi 1 1 C C: Each …rm produces an i good. Firms have monopoly in their product and set prices. Firms choose prices to maximize their real pro…ts. In case when there are no price rigidities …rms solve max Pi PC Pi ~ Ci PC 'i C~i ; s:t: C~i = PC Pi 1 1 C: where 'i is the marginal cost of producing C~i amount of good i. Plugging the demand function 79 Pi PC into pro…t and taking the …rst order condition with respect to Pi PC : gives Pi ' = i; PC This expression is the (inverse) supply of good i. Under perfect competition the relative price is equal to marginal cost. Here, because we have monopolists, the price is equal to than M C C~i since 2 (0; 1). 'i which is greater This relation should hold for any time t. Assuming that …rms are symmetric, in logarithms, the expression o¤ered above can be written as pt = ln 't where ln < 0 since ln ; 2 (0; 1). Suppose now that there are price rigidities. At any time t any …rm i with a probability of a sticky price and cannot change its price. With probability 1 has it does not have a sticky price and can change its price. In this case …rms set their prices not knowing when they’ll be able to reset them. Therefore, expectations of future prices will matter for their current decisions. A …rm which does not have a sticky price knows that if it sets price then with probability the next period (exactly one period). Moreover, with probability 2 (1 2 periods and exactly for 3 periods with probability (1 it will last for ) it will last exactly for ). The expected length of time until price reset is given by +1 X t 1 = 1 t=0 : Firms in this case maximize their present discounted value of pro…t streams whenever they can adjust their prices. Assume that at time t …rm i is able to adjust its prices then it solves its problem taking into account that for certain period of time (indexed by k below) it will not be able to reset the price. Therefore, it solves at time t Vt = max fPi;t g ( Et s:t: C~i;t+k = " +1 X ( ) k=0 PC;t+k Pi;t Pi;t k PC;t+k C~i;t+k 'i;t+k C~i;t+k #) ; (41) Ct+k : To solve this problem plug C~i;t+k into Vt and take the …rst order condition with respect to Pi;t . This exercise gives [Pi;t ] : 0 = Et " +1 X k=0 ( ) k (1 ) Pi;t PC;t+k + 'i;t+k Pi;t PC;t+k 1 1 PC;t+k Ct+k # Given that Pi;t does not depend on k it can be taken out of the sum and this expression can be 80 rewritten as 0 = Et " +1 X )k (1 ( ) k=0 Pi;t PC;t+k # + 'i;t+k PC;t+k Ct+k ; Therefore, the price of …rm i at time t relative to aggregate price PC;t is given by Pi;t = PC;t 1 +1 X k=0 +1 X PC;t+k PC;t )k Et 'i;t+k ( Ct+k : k ( ) Et 1 PC;t+k PC;t Ct+k k=0 Let all the …rms that are able to set price at time t be symmetric, i.e., 'i;t+k 't+k . Moreover, let also all the …rms which are not able to set prices at time t be symmetric. Therefore, denoting the price of …rms which are able to adjust it at time t by Pta gives Pta = PC;t 1 +1 X k=0 +1 X PC;t+k PC;t )k Et 't+k ( Ct+k : )k Et ( 1 PC;t+k PC;t (42) Ct+k k=0 The aggregate price which prevails is the weighted average of prices of …rms which adjust their price and …rms which do not adjust their price. It is given by (40): 1 Pt 1 = Pt 1 ) (Pta ) + (1 1 ; (43) where we have dropped subscript C. Equations (42) and (43) constitute new-Keynesian Phillips curve. To see this, …rst divide (43) by Pt . Next, consider the …rst order log-linear approximations of these equations around the steadystate, where all …rms are able to set prices. To do so denote Pt 1 f (Pt 1 ; Qt ) = g (Pt 1 ; Qt ) = f ePt 1 ; eQt = 1 + (1 ) Qt The …rst order approximation of g around ln P^t g (Pt 1 ; Qt ) = ^ ln ePt ^ + ePt + (1 1 1 1 1 ^ ^ ) eQt ln ePt 1 1 ^ 1 ; ln Qt 1 P^t 1 1 1 ^ eQt 81 1 1 1 1 ) ln eQt + (1 1 1: point is ) ln eQt 1 1 1; ^ + (1 ePt 1 1 1 ^ ln ePt ^ Q t 1 1 1 Pt 1 ^ 1 ln eQt ln P^t Qt 1 ^t ln Q At the steady-state P^t 1 ^ t = 1. Therefore, around the steady-state it has to be that =Q g (Pt Since f (Pt 1 ; Qt ) = g (ln Pt 1 ; Qt ) 1 ; ln Qt ) = 1 and f (Pt 1 1 + (1 1 ; Qt ) 0 = pt Notice that pt Pt 1 Qt : 0 we have that around the steady-state + (1 1 ) ) qt : = ln PPt t 1 which is approximately minus 1 times in‡ation rate. Therefore, t 1 = qt : Now consider log-linear approximation of (42) around the steady-state assuming for simplicity that in the steady-state C = 1. To do so rewrite it as Qt +1 X )k Et ( k=0 " # 1 PC;t+k PC;t Ct+k = +1 X 1 " PC;t+k PC;t )k Et 't+k ( k=0 # Ct+k : The approximation of the left had side around the steady-state point is as follows. Denote g~ (Pt 1 ; Qt ) = ln eQt +1 X )k Et ( k=0 The …rst order log-linear approximation of g~ (Pt PC;t+k g~ Qt ; Ct+k ; PC;t + Qt ^t ln Q +1 X k=0 ^ + ln eQt +1 X ( k=0 ^ + ln eQt +1 X k=0 ( ^ ln eQt 2 ( 2 )k Et 4( ! ^ ^ P t+k ^ P t ! 1) ln eCt+k 82 3 ^ ln eCt+k 5 Ct+k ln e ^ P t+k ^ P t 1 1 # 1 Pt+k Pt ln eCt+k : ^ t ; ln C^t+k ; ln around ln Q )k Et 4 ln e k=0 ^ P t+k ^ P t 1 ; Qt ) ln e 2 ( )k Et 4 ln e )k Et 4 ln e 2 +1 X " ! 1 ^ ! 1 e point is 3 ln eCt+k 5 3 ln C^t+k 5 ^ P t+k ^ P t P^t+k P^t ^ P t+k ^ P t e ^ P t+k ^ P t Pt+k Pt !3 ^ Pt+k 5 ln P^t ^ 0 this is given by Around the the steady-state point 0; C; PC;t+k g~ Qt ; Ct+k ; PC;t C^ 1 + Qt C^ 1 1) C^ +( +1 X 1 + 1 )k Et [Ct+k ] ( ln C^ k=0 Pt+k : Pt )k Et ( +1 X k=0 Therefore, the approximation of the left hand side is Qt +1 X )k Et ( k=0 " 1 Pt+k Pt Ct+k # 1 + 1 +( 1) 1 qt + 1 +1 X ( )k Et [ct+k ] k=0 +1 X )k (Et [pt+k ] ( pt ) : k=0 In turn, the approximation of the right hand side is 1 +1 X " k ( Pt+k Pt ) Et 't+k k=0 Ct+k # 1 1 + + 1' Equate these expressions and notice that qt + pt = (1 ) +1 X + 1 1 ' 1 +1 X ' 1 zt k=0 ( )k (Et [pt+k ] pt ) : = 1 to get )k Et pt+k + ln ( 't+k : ' = qt + pt . The above equation is the solution of the following di¤erence equation where z0 = 0; Et zt+1 Et 1 zt = (1 ) pt + ln 't : ' Solving for qt from this equation gives qt = Using the relation )k Et 't+k k=0 k=0 Denote by Et ( )k Et [ct+k ] ( k=0 +1 X +1 X 1 t Et [qt+1 + t+1 ] + (1 ) ln 't : ' = qt gives t = Et [ t+1 ] + 1 83 (1 ) ln 't : ' This is the new-Keynesian Phillips curve. The di¤erence between it and Keynesian Phillips curve is that it contains no backward-looking terms but contains forward-looking (expected) in‡ation matters. 84 Expectations and …nancial markets Bonds Bonds are (usually) very primitive …nancial instruments. They are forms loans (or I owe you/IOU). More precisely, a bond is a certi…cate which states that the issuer is indebted to the owner of the certi…cate. It states the amount of the debt to be paid back (principal), the time when the debt has to be paid (maturity date). Depending on the type of the bond, the issuer might also be obliged to pay the owner interest (coupon). Interest is usually payable at given and …xed intervals (e.g., monthly or annual). Quite often bonds are negotiable in the sense that the ownership can be transferred. This makes certain types of bonds highly liquid. Firms tend to issue bonds in order to meet their …nancial requirements. According to the Pecking Order Theory (of …rm’s capital structure), issuing debt/bonds is …rms’second most preferred way to raise …nance for investments. First most preferred way is to use internal …nance, and the least preferred way is to raise …nance through issuing equity. In this respect, equity (stocks) and bonds are quite much alike. They are both …nancial instruments for …rms. However, there is a big di¤erence between bonds and equity/stocks. The stockholders of a …rm are investors since they have equity stakes (ownership rights) in the …rm. In turn, the bondholders have creditor stakes in the …rm since they have lent money to the …rm. Bondholders are creditors, therefore, they usually have (absolute) priority over stockholders and will be repaid …rst in the event of bankruptcy. Another and less important di¤erence is that bonds usually have a de…ned maturity (consoles or annuities don’t have maturity but are very rarely issued). In contrast, stocks are typically for inde…nite period. Not many …rms, however, issue bonds. Moreover, …rms which issue bonds typically are quite large. Almost in all countries one of the most signi…cant issuers of bonds is the government. The government uses bonds in order to take loans. These loans, together with taxes, the government uses in order to cover its expenditures. For example, usually during recessions tax proceeds decline. In turn, the governments implement counter-cyclical policies and increase their expenditures. These expenditures they …nance then using loans/bonds. Government bonds tend to be sold on auctions, where participants are, for example, banks, investment and hedge funds, various speculators, the central bank, etc. (You can imagine something alike a stock/derivative exchange. In fact, bond futures are massively traded on derivative exchanges.) These markets are usually very liquid.22 One of the most liquid markets is the market for US government bonds (US Treasury Bonds). Currently (18.03.2014), US debt issued in various forms of bonds and IOUs amounts to $17,546,814,482,078.90 (yes! more than $17 trillion.) There are many (and very complicated) types of bonds. For our purposes it is su¢ cient to consider the following simplistic examples. Hereafter, in our discussion in this chapter we will 22 Very recently, because of the …nancial crisis (and, perhaps, reckless landing/borrowing) in several EU countries bond markets dried up. The governments of these countries then faced tough …nancial constraints and asked for loans (i.e., for bond purchases) from the ECB, etc. 85 consider perfect competition in all (bond) markets and no lending constraints (i.e., think about very liquid bond market where some of the participants can print money.) Bond 1 matures in 1 year and pays principal P . What is the fair market value/price of such a bond? Let yearly nominal interest rate be i11 , where subscript stands for the compounding time (1 year) and superscript stands for the time when compounding will happen (in 1 year). Then the price of this bond is the discounted value of the principal P : 1 + i11 PB1 = Bond 2 matures in 3 years and pays principal P . What is the fair market value/price of Bond 2? Let the yearly interest rate be the same across years and, as in previous example, let it be equal to i11 . The price of Bond 2 is the discounted value of the principal. It is P PB2 = 1 + i11 3: Clearly, as long as i11 > 0, PB1 > PB2 since Bond 1 matures earlier than Bond 2. Notice that bank loans seem to have somewhat di¤erent structure. Usually, when an individual takes L (EUR) loan for a year from a bank, the loan speci…es the yearly nominal interest rate i. ~ In such a case taking a Therefore, the individual has to repay (1 + i) L. Denote the latter by L. ~ Therefore, there is almost no di¤erence. loan is equivalent to issuing a bond with principal L. Bond 3 matures in T years and pays principal P . Let the yearly interest rate be i11 . The price of Bond 3 is P PB3 = 1 + i11 T : Bond 4 matures in T years and pays principal P . Moreover it pays yearly coupons C. Let the yearly interest rate be i11 . The price of Bond 4 is PB4 = P 1+ T i11 + T X t=1 C 1 + i11 t: Bond 5 is exactly the same as Bond 4 but its coupon payments vary over time so that we have Ct instead of C. The price of Bond 5 is PB5 = P 1+ T i11 + T X t=1 Ct 1 + i11 t: Consider again Bond 2. Assume now, however, that interest rate during the …rst year is i11 , in the second year it is i21 , and in the third year it is i31 . In such a case, the price of Bond 2 is PB2 = 1+ i11 P 1 + i21 86 1 + i31 : In such a case the prices of Bond 3, Bond 4, and Bond 5 would be PB3 PB4 PB5 = = = T Y t=1 T Y t=1 T Y t=1 P ; 1 + it1 T t XY C P + ; 1 + it1 1 + i1 P + 1 + it1 t=1 =1 T Y t X t=1 =1 Ct : 1 + i1 Bonds 1, 2, and 3 are called zero coupon bonds since they pay no coupons. In …nance interest rate is usually called yield since it indicates what a loan (an investment) yields. The time sequence of (similar) interest rates is termed as yield curve or term structure of interest rate. In our examples, the yield curve is i11 , i21 , i31 , ..., iT1 . Imagine a market where zero coupon bonds are traded. which promise the same principal P . Let there be bonds of all possible maturities: 1 year, 2 years, ..., and T years. It is straightforward to determine the yield curve from the prices of these bonds. With slight abuse of previous notation, let the prices of bonds with maturities 1 year, 2 years, ..., and T years be PB1 , PB2 , ..., and PBT , correspondingly. Therefore, interest rates are given by i11 = P PB1 1; P 1; 1 + i11 PB2 P = 1; 1 1 + i1 1 + i21 PB3 ::: ! TY1 P 1 1: = t 1 + i1 PBT i21 = i31 iT1 t=1 Call this algorithm ( ). Usually yield curves are upward sloping. The following …gure illustrates the yield curve of US Treasury bonds as of 9th of February 2005. It has interest rate/yield on Y-axis and time to maturity on X-axis. 87 Clearly, in this example the yield curve is upward sloping with diminishing marginal rate of increase. One of the widely accepted explanations for upward sloping yield curves is that longer maturities entail greater risks for creditors/lenders. Lenders then demand a risk premium and demand more premium for longer maturities. This explanation depends on the notion that currently creditors (as well as we) know less about the distant future than about the very near term. Another commonly accepted explanation is related to the IS-LM Model. Imagine for example, that investors are expecting either a gradual shift of IS to the right or a gradual shift of LM to the left (or both). In such case, they would expect to have higher interest rates in future and trade accordingly. This explanation points a de…ciency in our pricing formulas since it points out that interest rates might be highly variable/random. In such a case in all our formulas we need to replace interest rates with their expected values. For example in the most general case we have for the price of Bond 5 PB5 = T Y t=1 P 1+ it;e 1 + T Y t X t=1 To make things simple consider the case when Ct C : 1 + i1 ;e =1 0 and T = 3. Clearly, this corresponds to Bond 2. The price of the bond is then PB2 = P 1+ i1;e 1 1 + i2;e 1 1 + i3;e 1 : Notice that the price of the bond depends negatively on expected interest rates. Therefore, if creditors are expecting higher rates in future the price of the bond declines. This is because if investors are expecting higher rates in future they would rather wait and invest in future. Borrowers in such a case receive less for the same principal. Therefore, again, expectation matter. 88 Money supply and risk-free bonds Central banks usually announce their monetary policy in terms of a level of nominal interest rate. They adjust their money supply so that they meet the targeted rate. How do they do that? They buy or sell bonds in the market. To see how this works, consider, again, a deterministic world and zero coupon bonds. Lets take as an example Bond 1, PB1 = P : 1 + i11 If the central bank buys many such bonds, PB1 would increase. Higher PB1 implies lower i11 . Conversely, if the central bank sells many such bonds, PB1 would decline which would imply higher i11 . When the central bank buys bonds it does so using money. Therefore, it increases money supply. In contrast, when it sells bonds it receives money. Therefore, it reduces money supply. This implies that if the central bank announces that it plans to reduce interest rate then it plans to increase money supply. Of course, if it announces that it plans to increase interest rate then it plans to reduce money supply. The central banks are (usually) public institutions. Therefore, they try to avoid excess risk. The central banks then usually buy and sell government bonds, which tend to be thought to be relatively safe assets. (After all, the government can almost always avoid defaulting on bonds denominated in national currency with printing money.) It is commonly thought that one of the most safe government bonds are the short maturity US Treasury Bonds. Economists usually take them as the risk-free assets/bonds. The di¤erence between the interest paid on US Treasury Bonds and interest paid on equivalent (e.g., in terms of maturity and coupons) bonds to certain extent represents risk premium (i.e., the premium the investors demand to hold more risky asset.). Yield curve and zero coupon bonds in reality In reality there are almost no zero coupon bonds. Usually, zero coupon bonds are only those that have paid their last coupon and are very close to maturity. This renders our algorithm ( ) almost useless. For …guring out the yield curve a special algorithm is used. This algorithm is called bootstrap. The following example illustrates this algorithm. Imagine we have a bond very close to its maturity, which has paid all its coupons. Basically we have a bond like B1 . We observe its price and know its principal. Therefore, interest rate (in a perfect market) is i11 = P PB1 1: Further, imagine we have a bond which matures in two years and pays coupon in a year. We know its price (PB0 2 ), principal (P ), and the value of coupon that it pays (C). Therefore, in a perfect 89 market it has to be that, P PB0 2 = 1 + i11 + 1 + i21 C : 1 + i11 It is easy to …gure out i21 from this expression, i21 = 1+ P PB0 2 i11 C 1: Stocks, stock prices, and stock markets The stock of a …rm is the equity stake of its owners. It represents the residual assets of the …rm that would be paid to stockholders after discharge of all senior claims (e.g., bonds/debt). A stockholder/shareholder is an individual who legally owns one or more shares of stock in a …rm. Firms (usually) maximize their value appropriately designing production and marketing. It turns out that this is equivalent to maximizing their shareholder value. To see this, suppose that we have a …rm which lives T periods and has real pro…t stream t in each period t. Denote real interest rate by r and, for simplicity, assume that it is constant over time. At time t = 1, the present value of this …rm is given by V1 = = 1 + 1 1+r 2 T X 1 (1 + r) =1 + 1 1 (1 + r)2 3 ::: + 1 (1 + r)t 1 t ::: + 1 (1 + r)T 1 T : The present value of the …rm V1 is the total present value of its stocks. Clearly, V1 can be rewritten in the following manner V1 = where 1 1 + 1 V2 ; 1+r are the pro…ts in current period (t = 1) and 1 1+r V2 are the discounted future capital gains. For simplicity, assume that this …rm has zero net debt. A stockholder/shareholder owns a share/fraction of V1 . Let this fraction be some ! from (0; 1). Therefore, in other words, a shareholder has an entitlement to !V1 amount of value (real money). It is clear that the maximization of V1 is equivalent to maximization of !V1 . In practice, shareholders usually call pro…ts dt instead of t. t dividends. In such a case it is practical to write In this respect, the value of the …rm is simply discounted sum of dividends. If this …rm is privately or publicly traded in a perfect market then its market value is equal to its value. Therefore, the price of a 1 percent ownership of the stock of the …rm (or the price of 1% share simply) is 0:01V1 . In practice, S is used to denote stock price, instead of V . 90 Stock markets and speculators Firms are publicly traded (usually) in stock markets/exchanges. Examples of stock markets are New York Stock Exchange and London Stock Exchange. In a stock market, speculators engage in a trade of stocks of many …rms simultaneously. Hereafter, we will assume that the values of …rms are exogenously given keeping in mind that they depend on the pro…ts of the …rms. For publicly traded …rms, this is equivalent to assuming that the stock prices of the …rms are given. In a deterministic world, speculators invest in the stocks of those …rms which will have the highest returns. For example, if there are two …rms, Firm 1 and Firm 2, which currently have stock price of S and are expected to have stock price S2 < S1 , then the speculators would buy stocks of Firm A. This is because the return on buying the stock of Firm 1 (R1 ) is higher than the return on buying the stock of Firm 2 (R2 ), R1 = S1 S S > S2 S S = R2 : Of course, in complete markets, because of arbitrage the price of the stocks of Firm 1 would increase so that after all R1 = R2 . We have assumed however that the stock prices are given. Therefore, S does not change in our example. Stochastic returns and portfolios of stocks: The world is not deterministic in the sense that future pro…ts of …rms are usually not very predictable. Therefore, the future prices of stocks are not very predictable. Speculators/investors then decide upon expected returns. Moreover, they usually hold portfolios of stocks. For example, if N …rms are traded in the stock market and there are M speculator/investor, then speculator j holds a portfolio Pj = N X i;j Si ; i=1 where i;j is the amount of stock Si in the portfolio of speculator j. If does not have stocks of Firm i. If stocks of Firm i. If i;j = i;j i;j = 0 then speculator j = 1 then speculator j holds real money equivalent to all the 1 then speculator j does not have stocks of Firm i. Moreover, speculator j is in debt to deliver real money equivalent to all stocks of Firm i. In other words, speculator j has short-selled stocks of Firm i. Can i;j be higher than 1 or lower than 1? Yes, it can. This is because Si is the real value of the stock of Firm i. In such a case contracts can be written promising multiples of the value of that stock. Consider, for example, a situation when speculator j has all the stocks of Firm i as well as a certi…cate promising to pay exactly twice the value of the stock whenever exercised. In such a case the speculator has real value of 3Si . In other words 91 i;j = 3. Clearly, for any …rm it has to be that the percentage of its stocks sums to 1 across all speculators, M X i;j = 1: j=1 In terms of portfolios, speculators/investors select the amounts of stock ownership and focus on the returns of their portfolios. If the price of a selected portfolio is P then the return on it is percentage change of its price over time. Using the notation of our previous example, the return is Rj = Pj P P : These returns are stochastic (are random variable) because the future value of portfolio Pj is a random variable. Utility maximization and mean-variance trade-o¤ Speculators buy stocks to maximize utility. It turns out that generally the problem of speculators to design an appropriate portfolio can be summarized in terms of expected value and risk trade-o¤ given that the future value of portfolio is a random variable. The following example illustrates this point. Consider a speculator who has utility function u (:). Let u (:) be strictly-increasing and concave in portfolio returns. Suppose the returns on portfolios have normal distribution with E expected value and 2 variance. In such a case, the expected utility of the speculator is +1 Z E [u (R)] = u (R) f (R; E; ) dR; 1 where 1 2 e 2, where f (R; E; ) = p Clearly, the expected utility depends on E and Use Z to denote Z= R E 1 2 (R 2 E 2 ) : is a measure of risk. : Given the properties of expectation and variance operators (see Appendix), Z is a normal random variable with expected value 0 and variance 1. Z is usually called standardized return. Replace R with Z in the expression for expected utility (notice that the limits of integration don’t change): +1 Z E [u (E + Z)] = u (E + Z) ' (Z) dZ; 1 92 where ' (Z) is the density function of standard normal distribution, 1 ' (Z) = p e 2 1 2 Z 2 : Next, we take the derivative of the expected utility E [u] with respect to the standard deviation of return . Assuming that E [u] is …nite, the derivative is dE [u] d +1 Z d u (E + Z) ' (Z) dZ d = 1 +1 Z u0 (E + Z) = dE + Z ' (Z) dZ: d 1 An indi¤erence curve is de…ned as the (locus of) points where +1 Z 0= u0 (E + Z) dE[u] d = 0: dE + Z ' (Z) dZ: d 1 This expression can be further rewritten as dE = d +1 Z u0 (E + Z) Z' (Z) dZ 1 +1 Z ; u0 (E + Z) ' (Z) dZ 1 where E is the expected return of the portfolio and is its standard deviation. E ( ) represents indi¤erence curve in expected value E and standard deviation (variance) space . It is increasing and convex in . In other words, dE d > 0 and d2 E d 2 > 0 (see Appendix). Utility increases moving the indi¤erence curve from right to left in variance and mean space. The following …gure shows these indi¤erence curves 93 Measures of returns and risk Expected value of a random variable is its concentration point. If the random variable has continuous distribution function then almost never it will be equal to its expected value. However, most likely its realization will be closer to its expected value than other points in distribution. Therefore, expected value of a random variable tends to be the best guess available. In our previous example, we used variance as a measure of risk. Variance of random variable shows how dispersed its outcomes may be. Therefore, it is one of the most commonly applied risk measures. It is especially relevant for normally distributed random variables. Digression (Central Limit Theorem): weights 1 N. Consider a portfolio consisting of i.i.d. stocks with According to the Central Limit Theorem the returns on this portfolio will be approxi- mately normally distributed. The Central Limit Theorem is very powerful result. This result is one of the reasons why normal distribution is used so often. There are many other measures of risk. Examples are the range, the semi-inter-quartile range, the semi-variance, and the mean absolute deviation. Each of these measures can have slightly di¤erent implication (scale) for risk. The range (RANGE) is de…ned as h the di¤erence i between the highest and lowest outcomes. Let min max the returns on portfolio Rj be from Rj ; Rj . Then the range is RAN GE = Rjmax Rjmin : The semi-inter-quartile range (SRANGE) is usually de…ned as the di¤erence between the 75th and 25th quantiles of the random variable. In our example, SRAN GE = Rjq75 94 Rjq25 : The variance is a central moment in the sense that considers deviations from the mean/expected value, i.e., V [Rj ] = E [Rj E [Rj ]]2 . In this sense it gives equal weight to deviations from the mean/expected value. However, risk averse investors might be more concerned about returns below the mean (i.e., downside risk but not upside risk). The semi-variance (SEMIVAR) is a measure of risk that relates just to that risk. It de…ned as ~j = R ( Rj E [Rj ] if Rj < E [Rj ] 0 if Rj h i2 ~j : SEM IV AR = E R E [Rj ] Variance and semi-variance can be sensitive to observations distant from the mean/expected value (i.e., outliers). The mean absolute deviation (MAD) avoids this problem. It is de…ned as M AD = E [jRj E [Rj ]j] : Hereafter, we will maintain an assumption that returns on stocks/portfolios have normal distribution. Therefore, we will characterize the risk with variance (or its square root: standard deviation). Measuring portfolio return and risk to assemble a portfolio An expected utility maximizing speculator/investor assembles its portfolio of stocks so that to achieve the highest possible utility. It assembles the portfolio selecting the number of each type of stocks ( in our previous example). The expected return and risk of portfolio are directly linked to the expected returns and risks of underlying assets/stocks. The following very simplistic example illustrates this point. Let there be only 2 …rms and 2 stocks Firm 1 and Firm 2, and S1 and S2 . The current value of the portfolio of the speculator is P = where 1 is the number of S1 stocks and 2 1 S1 + 2 S2 ; is the number of S2 stocks. Slightly abusing the notation, let the next period prices of stocks 1 and 2 be S11 and S21 , correspondingly. Therefore, next period price of the portfolio is P1 = 1 1 S1 + 1 2 S2 : The return on portfolio then is S1 + 2 S21 P P 1 1 S1 S2 S2 S2 1 S1 S1 = + 2 P S1 P S2 = ! 1 RS1 + ! 2 RS2 ; R = P1 P = 1 S11 95 S2 where RS1 is the return on stock S1 , RS2 is the return on stock S2 . ! 1 is the weight of stock S1 in the portfolio, and ! 2 is the weight of stock S2 in the portfolio. These weights sum up to 1, 1 = !1 + !2: In this respect this formula implies that the return on portfolio R is the weighted sum of returns on underlying assets/stocks RS1 and RS2 . Notice that selecting the number of stocks in portfolio 1 and/or 2 the speculator selects weights of stocks ! 1 and ! 2 in the portfolio. The expected value of the return on this portfolio is E [R] = E [! 1 RS1 + ! 2 RS2 ] = ! 1 E [RS1 ] + ! 2 E [RS2 ] : In turn, the variance is V [R] = ! 21 V [RS1 ] + ! 22 V [RS2 ] + 2! 1 ! 2 COV [RS1 ; RS2 ] ; where COV [RS1 ; RS2 ] is the covariance of RS1 and RS2 ; COV [RS1 ; RS2 ] = E [(RS1 E [RS1 ]) (RS2 E [RS2 ])] : Intuitively, the portfolio P = ! 1 RS1 + ! 2 RS2 varies because so do RS1 and RS2 . However, we need to take into account that RS1 and RS2 co-vary or vary together and that can amplify or reduce the variance of the portfolio. The following two examples illustrate the latter point. Suppose S1 and S2 are almost the same (may be because they are stocks of vertically or horizontally interrelated …rms) and RS1 = RS2 : In such a case V [RS1 ] = V [RS2 ] ; COV [RS1 ; RS2 ] = V [RS1 ] ; and V [R] = ! 21 V [RS1 ] + ! 22 V [RS1 ] + 2! 1 ! 2 V [RS1 ] = ! 21 + 2! 1 ! 2 + ! 22 V [RS1 ] = (! 1 + ! 2 )2 V [RS1 ] = V [RS1 ] : This inference should have followed since if RS1 = RS2 then P = RS1 . This example corresponds to 96 the case when S1 and S2 co-vary perfectly (they are almost the same). Suppose now that S1 and S2 are extremely di¤erent (may be because they are stocks of competing …rms) and when RS1 = 1 then RS2 = 0 but when RS1 = 0 then RS2 = 1. Let further RS1 be equal to 1 with probability 12 , which implies that E [RS1 ] = E [RS2 ] = 21 . Moreover, V [RS1 ] = E [RS1 = E RS2 1 E [RS1 ]]2 = E RS1 RS1 + 1 2 2 1 1 = : 4 4 Apparently, the same holds for RS2 : V [RS2 ] = 41 . However, notice that COV [RS1 ; RS2 ] = E RS1 = E [RS1 RS2 ] = 1 2 RS2 1 1 1 RS2 RS1 + 2 2 4 1 1 1 1 1 E [RS1 ] + = 0 E [RS2 ] E [RS1 ] + 2 4 2 2 4 1 2 = E RS1 RS2 1 E [RS2 ] 2 1 : 4 Therefore, since the expected value of the return on portfolio is simply the weighted sum of expected returns on stocks, we have that E [R] = E [! 1 RS1 + ! 2 RS2 ] = ! 1 E [RS1 ] + ! 2 E [RS2 ] 1 : = 2 In turn, the variance of the portfolio is 1 1 V [R] = ! 21 + ! 22 4 4 = (! 1 2! 1 ! 2 1 4 1 ! 2 )2 ; 4 which can be strictly less than the variance of any of the stocks in portfolio. Let ! 1 = ! 2 then V [R] = 0 < 41 . However, if ! 1 = 1 or ! 2 = 1 then V [R] = 41 . This happens because of negative covariance between RS1 and RS2 and is called diversi…cation of risk. This example conveniently illustrates a very important point. In case RS1 and RS2 don’t vary together perfectly, speculators can choose the weights of S1 and S2 in their portfolios so that to minimize variance/risk for the same level of expected return. The magnitude of the covariance is not easy to interpret since it depends on the possible realizations of random variables. Usually, therefore, another measure is used to describe (linear) relation between random variables. It is the normalized version of the covariance and is called correlation 97 coe¢ cient. It is de…ned as COV [RS1 ; RS2 ] : =p V [RS1 ] V [RS2 ] S1 ;S2 By de…nition, V [RS1 ] = 2 S1 RS1 ;RS2 2 [ 1; 1]. Using the correlation coe¢ cient and denoting V [R] = 2 S2 and V [RS2 ] = 2 P 2, P we have that the variance of the portfolio returns is = ! 21 2 S1 + ! 22 2 S2 + 2! 1 ! 2 S1 S2 S1 ;S2 : A speculator chooses the weights ! 1 and ! 2 to construct its portfolio and makes its choices upon expected value and variance of returns. Therefore, it is interesting to see how expected value and variance of the portfolio returns depend on ! 1 and ! 2 . First of all notice that ! 2 = 1 ! 1 . Therefore, it is enough to choose only one of the weights, e.g., ! 1 . Let for simplicity E [RS1 ] > E [RS2 ]. In such a case apparently the expected value of portfolio returns in linear and increasing function of !1 E [R] = E [RS2 ] + ! 1 (E [RS1 ] E [RS2 ]) : In turn, 2 P = ! 21 2 S1 ! 21 = ! 1 )2 + (1 2 S1 + 2 S2 2 2 S2 + 2! 1 (1 S1 S2 S1 ;S2 !1) 2! 1 S1 S2 S1 ;S2 S2 S2 S1 S1 ;S2 + 2 S2 : This is an upward opening parabola since 2 S1 for any value of + 2 S2 RS1 ;RS2 . 2 S1 S2 S1 ;S2 =( S1 S2 ) 2 +2 S1 S2 1 S1 ;S2 >0 The following …gures illustrate the relationships between E [R] and 2 P and ! 1 In the second …gure ! mv 1 is the ! 1 which delivers the lowest variance of returns. It can be found 98 from the …rst order condition: @ 2P = 0 , 2! 1 @! 1 2 S1 + 2 S2 2 S1 S2 S1 ;S2 2 S2 S2 S1 S1 ;S2 = 0: Therefore, ! mv 1 = S2 2 S1 + S2 2 S2 S1 S1 ;S2 2 : S1 S2 S1 ;S2 Given that portfolio returns are linear in ! 1 , we can easily replace ! 1 in the last …gure with E [R]. In such a case we would have In this …gure the curve 2 P is the minimum variance opportunity set. This is the set of variance and expected return points which o¤ers the minimum variance for a given expected rate of return. In turn, E [Rmv ] is the expected return of minimum variance portfolio. Given that we have for indi¤erence curves expected returns as functions of standard deviation, usually this …gure is transposed. The speculators’optimal choice of portfolio/weights is given by the point of tangency of indi¤erence curves and minimum variance opportunity set. This is because indi¤erence curves are convex 99 and utility increases as these curves shift to the left. Moreover, exactly because of that no portfolio will be selected below the horizontal line that corresponds to E [Rmv ]. This analysis easily proceeds for the case when the number of stocks/assets is greater than 2. We have not de…ned what is S2 precisely. We needed simply that it has expected value and variance. Therefore, it can be for example a linear combination of stocks of many …rms. Even more, it can include bonds. Hereafter, we will assume that portfolios can include many types of assets/…nancial instruments: stocks, bonds, options, etc. Therefore, we will let speculators to assemble their portfolios using not only stocks. Moreover, we will assume that all speculators know the correct variances and expected returns of assets. Lets assume now that S2 is risk free asset with returns RS2 , which for convenience we will denote by Rf . For example it is US Treasury bond or combination of perfectly negatively correlated assets. In turn, S1 is itself the portfolio of all risky assets with returns RS1 which have expected value and variance E [RS1 ] and 2 . S1 The portfolio consisting of S1 and S2 has expected return and variance of returns E [R] = ! 1 E [RS1 ] + ! 2 Rf = Rf + ! 1 (E [RS1 ] Rf ) : and 2 P = ! 21 2 S1 : This implies that the expected value of this portfolio is linear function of its standard deviation: E [R] = ! 1 E [RS1 ] + ! 2 Rf = Rf (E [RS1 ] Rf ) P : S1 The following …gure o¤ers the minimum variance set in this case together with choices of ! 1 . The minimum variance set is graphed using solid lines. In turn, possible choices are in dashed lines. 100 Portfolios along all these dashed lines are possible. However, only one dominates in terms of expected return and variance. It is the portfolio at point M . With the presence of risk free asset the speculators form their portfolios taking the tangency point of their indi¤erence curves and the upward sloping solid line called capital market line, CM L. In this respect their portfolios are combinations of Rf and portfolio M . How do the speculators construct CM L? To do so they need to know Rf and portfolio M . Rf is the return on risk free asset which might be safely thought to be US Treasury bonds. What about M ? M is the portfolio of all risky assets (yes, it is S1 in this example, if you were wondering). Therefore, it is at the tangency point of CM L and the convex minimum variance sets. The market is in equilibrium when prices are such that markets clear. All assets then are held. In other words prices adjust so that excess demand and supply of assets are zero. This market clearing condition implies that equilibrium is attained at a single-tangency portfolio, M , which all investors combine with risk free asset and is a portfolio where all assets are held according to their market value weights. (At this point the weight of Rf is zero. Moreover, this portfolio is called the market portfolio.) Let the market value of an asset i be Vi and there be N assets then the market value weight of the asset i is wi = Vi N X ; Vi i=1 where N X Vi is the total market value of all assets. Market equilibrium is not attained until the i=1 tangency portfolio M is the market portfolio. Moreover, the value of the risk free rate of return should be such that the aggregate borrowing and lending are equal. The upward sloping solid line is called Capital Market Line. In turn, the result that in equilibrium investors hold a weighted average of risk free asset and market portfolio M is called Two-fund Separation. 101 Market price of risk and the CAPM The Capital Asset Pricing Model, CAPM in short, is a model which allows us to determine the market price of risk and the appropriate measure of risk for a single asset. It rests on several very hard assumptions (which actually we have maintained thus far). These assumptions are that 1. There are no market distortions and frictions (e.g., there are no price setters and information is costlesly available to all) 2. Speculators/investors maximize their expected utility and are risk averse. 3. There exists an unlimited supply of risk free asset at risk free rate 4. The quantities of assets are …xed. Moreover, all assets are tradable and perfectly divisible 5. All investors have the same information about the markets The CAPM also needs that the market portfolio is e¢ cient. This follows from utility maximization and common knowledge: the market is the sum of all individual holdings and all individual holdings are e¢ cient. In equilibrium, for any asset i it has to be that its weight in the market portfolio M is equal to wi = Vi N X : Vi i=1 Consider a portfolio which consists of ! i percentage of asset i and 1 ! i percentage of the market portfolio M . The expected return on such a portfolio is E [R] = ! i E [Ri ] + (1 ! i ) E [Rm ] ; where E [Ri ] and E [Rm ] are the expected returns on asset i and market portfolio M . The standard deviation of this portfolio is P where i;m h = ! 2i 2 i + (1 ! i )2 2 m + 2! i (1 !i) i1 2 i;m ; is the covariance between asset i and market portfolio m. Clearly, the opportunity set in terms of expected value and variance for various combinations of asset i and portfolio M is given by the convex Minimum Variance Opportunity set. In turn, the changes of the expected return and risk (here standard deviation) of this portfolio with ! i are given 102 by @ E [R] = E [Ri ] @! i @ ! i 2i P = @! i h ! 2i E [Rm ] ; (1 2 i 2 m !i) + (1 ! i )2 + (1 2 m 2! i ) i;m + 2! i (1 !i) i;m i 1 2 The main insight of this model is that the market portfolio already contains the asset i. Therefore, ! i is the excess demand for it. In equilibrium it should be zero and we would evaluate and @ @! i P @ @! i E [R] at ! i = 0. This gives @ E [R] @! i @ @! i = E [Ri ] E [Rm ] ; ! i =0 2 m i;m = P : m ! i =0 Therefore, the slope of the expected return and risk trade-o¤ evaluated at the market portfolio is @ @! i E [R] @ @! i P ! i =0 = E [Ri ] ( i;m E [Rm ] : 2 )= m m This expression shows how an individual asset a¤ects the return and risk of market portfolio. The …nal insight is that this slope is equal to the slope of CM L, which is E [Rm ] Rf : m Equating these two we have that E [Ri ] ( i;m E [Rm ] Rf E [Rm ] = ; 2 )= m m m or equivalently, E [Ri ] = Rf + (E [Rm ] Rf ) i;m : 2 m This equation is known as the Capital Asset Pricing Model. It states that the required rate of return on any asset i is equal to the risk free rate of return plus a risk premium. The risk premium is the price of the risk (E [Rm ] Rf ) times the quantity of risk = i;m 2 m . The quantity of risk is denoted by i;m : 2 m This is the contribution of asset i to the portfolio risk. Most importantly, the variance of the asset i a measure of risk does not matter in this context. Whatever matters is how its returns correlate 103 with market portfolio returns. The risk free asset has = 0 since it does not contribute to the risk of market portfolio (i.e., its covariance with market portfolio is zero). In turn, the market portfolio has = 1 since it is perfectly correlated with itself. The following …gure illustrates the CAPM where it is depicted by the blue line. This model has several important properties. First, in equilibrium because of arbitrage every asset should be priced so that its risk adjusted return is exactly on the blue line. Next the total risk can be separated to two Total risk = systematic risk + unsystematic risk, where systematic risk is how an asset covaries with the market portfolio and unsystematic risk is the risk not dependent on the market/economy. The speculators/investors are willing to pay a risk premium to avoid the systematic risk. Unsystematic risk cannot be avoided. How to …nd of an asset and the magnitude of unsystematic risk. The CAPM implies that one can estimate those from the following empirical speci…cation to …nd Ri = a + i Rm i + "; where a is a constant and " is a random variable which does not correlate with the market. The variance of Ri then is 2 i where 2 2 i m is the systematic risk and 2 " = 2 2 i m + 2 "; is the unsystematic risk. A second important property of the CAPM is that the measures of risk of individual assets are linearly additive when the assets are combined in portfolios. For instance, if an asset i has a risk 104 of i and asset j has a risk of j and they are combined with ! i proportion, then total risk of the portfolio is p = !i + (1 i !i) j: To show this use the properties of covariance: p = E [(! i Ri + (1 E [Ri ]) (Rm V [Rm ] = ! i i + (1 ! i ) j : = !i E [(Ri ! i ) Rj E [! i Ri + (1 ! i ) Rj ]) (Rm V [Rm ] E [(Rj E [Rm ])] + (1 ! i ) E [Rm ])] E [Rj ]) (Rm V [Rm ] E [Rm ])] Black-Scholes model The Black-Scholes model, is a model which allows us to determine the market price of European options. This mode maintains the following assumptions 1. There are no market distortions and frictions 2. Speculators/investors maximize their expected utility 3. There exists an unlimited supply of risk free asset at risk free rate rf 4. All assets are tradable and perfectly divisible 5. Stocks pay no dividends during the life-time of the options European options are widely used …nancial instruments. De…nition 1 Option is a contract which gives its owner an option (the right but not obligation) to buy or sell an underlying asset at a speci…ed price (strike price) before a speci…ed date (time to maturity). 1. Options giving the right to sell are called Call Options 2. Options giving the right to buy are called Put Options 3. Options which can be exercised only at a speci…ed date (but not before it) are called European Options. To derive Black-Scholes model lets consider a stock which has random returns and a European Call option written on it. Denote, the price of the stock at time t by St . Assume that the price of the stock follows a geometric Brownian motion: dSt = t St dt + St dWt ; 105 where d stands for the operator of in…nitesimally small change over time. and is a positive constant. Both t and t is called drift coe¢ cient will acquire meaning when Wt is presented. This (di¤erential) equation equation states that the change of the value of the stock happens because of deterministic shifts in the mean and because of a random process Wt . Wt and, therefore, its in…nitesimally small change dWt , is a random variable. More precisely, Wt is a Wiener process: It is a random variable which changes over time so that the expected change over any time interval is 0 (e.g., E [dWt ]) and its variance over time T is equal to T (e.g., V [dWt ] = dt). A discrete analogue for W is a simple random walk. The above equation can be rewritten as dSt = St where dSt St t dt + dWt ; is the rate of return on the stock during a very short time of dt. The properties of Wt random variable imply that dSt St dSt V ar St E = t dt; = 2 dt: European option written on this stock speci…es the time when it can be exercised. Denote it by T . It speci…es also the strike price. Denote it by K. As we will see, at time t < T the price of this option V is a (complicated) function of S, , , t, K, T , and rf : V = V (S; ; ; t; K; T; rf ) : From It¯o’s lemma it follows that dV = S @V 1 @V + + @S @t 2 2 2@ S 2V @S 2 dt + S @V dW: @S Combining this di¤erential equation with the di¤erential equation for stock price gives dV = @V 1 dt + @t 2 2 2@ S 2V @S 2 dt + @V dS: @S To derive the price of this option Black-Scholes model uses a non-arbitrage argument. This involves constructing a risk free portfolio and equating its returns to the risk free returns. Consider a portfolio of one option on this stock and a short position of value of the portfolio is then Pt = Vt Choose the St : so that to hedge this portfolio (reduce its variance). 106 in this stock. The The change in the price of the portfolio is given by dPt = dVt dSt @V @V 1 2 2 @2V = dt + dt + S dS 2 @t 2 @S @S dS: The …rst two terms are deterministic. Therefore, they don’t matter for hedging strategy and to hedge the portfolio against (volatility) randomness select @V @S = (this is called delta/dynamic hedging). This implies that 1 @V + @t 2 dPt = 2 2@ S 2V @S 2 dt: Since this portfolio entails no risk its returns should be equal to the returns on risk-free asset rf . In other words, dPt = rf dt:23 Pt Therefore, dPt = Pt rf dt and 1 @V + @t 2 2 2@ S 2V @S 2 dt = (Vt St ) rf dt: The latter implies that rf V = @V 1 + @t 2 2 2@ S 2V @S 2 + rf S @V : @S This is a second order partial di¤erential equation. In order to solve it one needs to have boundary conditions. First, let this be a call option. At the time to maturity T , the price of call option is easy to derive. It is VT = max fST K; 0g : This is because time T is the exercise date. The owner of this option would exercise it only if ST is higher than the strike price and would make a net gain of ST K. Second, V (0; ; ; t; K; T; rf ) = 0: Third, lim V (S; ; ; t; K; T; rf ) = S: S!+1 Solving this di¤erential equation is much beyond the frames and focus of this course. 23 This is because the return on portfolio is given by Pt+1 Pt . Pt 1 dPt . Pt dt 107 If the time is discrete then this becomes 1 Pt+1 Pt Pt t+1 t = Appendix Appendix - Reminder of Statistics 0 Probability Theory There are at least two ways to approach probability: Classical (or a priori) Probability – Classical Probability can be de…ned in the following way: If a random experiment can result in n mutually exclusive and equally likely outcomes and if nA of these outcomes have an attribute A, then the probability of A is the fraction nA =n. Axiomatic Probability (or so-called Kolmogorov’s Axiomatics): – Probability Space is a triple 1. ; A; Pr (:) , where is the sample space: A collection of all possible outcomes of an experiment Any given experiment result is an element of . 2. A is the event space, or algebra of events. It is a collection of subsets of , including . 3. Pr (:) is the probability function de…ned on A: For any event A 2 A, Pr (:) is a quantitative (numerical) measure of the likelihood that this event A is observed once the experiment is completed. Basic de…nitions and theorems The following de…nitions are needed to construct probability space: De…nition 2 [Sample space] The sample space, denoted by possible outcomes of a conceptual experiment. Often, , is the collection or the totality of all is called the sure event since it includes any outcome that can occur. De…nition 3 [Event and event space] An event is a subset of the sample space. The class of all events associated with a given experiment is de…ned to be the event space. Any event space A possesses the following properties: 1. 2A 2. If A 2 A, then the complement set A 2 A 3. If A1 ; A2 ; ::: 2 A, then [i 1 Ai 2A 108 De…nition 4 Spaces which satisfy properties 1, 2, and 3 are called algebra. De…nition 5 [Probability function] A probability function Pr (:) is a set function with domain A and counterdomain of [0; 1] interval. It which satis…es the following axioms: Pr (:) 0 for 8A 2 A Pr ( ) = 1 If A1 ; A2 ; ::: 2 A are pairwise disjoint, then Pr ([i De…nition 6 The triple 1 Ai ) = X Pr (Ai ) i 1 ; A; Pr (:) is called probability space. The following de…nitions are for conditional probability and independence of events De…nition 7 Let A; B 2 A and Pr (B) > 0. The conditional probability of event B given the occurrence of A is denoted and given by Pr ( Bj A) = Pr(B\A) Pr(A) . Theorem 8 (The law of total probability) Let A1 ; A2 ; :::; An 2 A form a partition of Then Pr (B) = n X . Let B 2 A. Pr ( Bj Ai ) Pr (Ai ) : i=1 Theorem 9 (Bayes’formula) Let A1 ; A2 ; :::; An 2 A form a partition of , for any i let Pr (Ai ) > 0. Then for any event B 2 A with Pr (B) > 0 Pr ( Aj j B) = Pr ( Bj Aj ) Pr (Aj ) Pr (B \ Aj ) = n : X Pr (B) Pr ( Bj Ai ) Pr (Ai ) i=1 De…nition 10 Events A1 and A2 are independent i¤ Pr (A1 \ A2 ) = Pr (A1 ) Pr (A2 ) : Similarly, events A1 ; A2 ; :::; An 2 A are independent i¤ the probability of any intersection of any sub-sample of fA1 ; A2 ; :::; An g is the multiplication of probabilities of the terms in the intersection. Random variables De…nition 11 (Intuitive de…nition) A random variable x is a real-valued function of the elements $ of a sample space . For example, a random variable x is the sum of the two numbers that occur when we roll a pair of fair dice one. The events are the numbers. The space of possible events is comprised of 109 36 elements. The value of a random variable depends on the outcome of the experiment being observed. Each possible value X of a random variable x de…nes an event: function x ($) assigns values X 2 R to the set of sample space outcomes $. De…nition 12 (Technical de…nition) For a given probability space ; A; Pr (:) , a function x ($) : ! R is said to be a random variable if for 8X 2 R event that occurs the event $ on which x ($) < X, AX = f$ : x ($) < Xg, belongs to the events space A, AX 2 A. De…nition 13 The cumulative distribution function of a random variable x de…ned on the probability space ; A; Pr (:) is Fx (X) = Pr (AX ) = Pr (x ($) < X) : Notice that the cumulative distribution function would be a non-decreasing function Fx (X) : R ! [0; 1] since higher X implies bigger space of $ events where x ($) < X. More precisely, cumulative distribution functions satisfy the following properties. limX! 1 Fx (X) =0 limX!+1 Fx (X) = 1 For any X1 and X2 where X1 < X2 , Fx (X1 ) Fx (X2 ) limh!0 Fx (X + h) = Fx (X) for any X : R De…nition 14 Discrete density function or probability mass function of a discrete random variable x is fx (X) = ( p xi x = xi ; i = 1; 2::: 0 otherwise : De…nition 15 Continuous density function or probability mass function of a continuous random variable x is fx (X) = dFx (X) : dX Density functions satisfy the following properties. fx (X) RX 0 1 fx (z) dz R +1 1 = Fx (X) fx (z) dz = 1 110 Basic properties of expectation, variance, and covariance 1. The basic properties of expectation operator are N N For any random variables fXi gN i=1 , real functions fhi gi=1 , and real numbers f i gi=1 E "N X # i hi (Xi ) = i=1 N X [hi (Xi )] : i=1 If for any hi and hj it is the case that hi (X) E [hi (X)] iE hj (X) then E [hj (X)] : 2. The basic properties of variance and covariance operators are N N For any random variables fXi gN i=1 , real functions fhi gi=1 , and real numbers f i gi=1 "N X V # i hi (Xi ) i=1 " = COV = N X N X i hi (Xi ) ; i=1 2 iV N X # i hi (Xi ) i=1 [hi (Xi )] + i=1 X i j COV [hi (Xi ) ; hj (Xj )] : i6=j where for any hi and hj COV [hi (Xi ) ; hj (Xj )] = E [(E [hi (Xi )] and the second term consists of N (N hi (Xi );hj (Xj ) hi (Xi )) (E [hj (Xj )] hj (Xj ))] : 1) items. Correlation is de…ned as COV [hi (Xi ) ; hj (Xj )] =p : V [hi (Xi )] V [hj (Xj )] Correlation shows linear relation. It is from 1 to 1, hi (Xi );hj (Xj ) 2 [ 1; 1]. Given its de…nition covariance operator satis…es the following properties COV [Xi ; Xi ] = V [Xi ] ; COV COV i + i Xi + i k + + i Xi ; j + j Xj = i j COV [Xi ; Xj ] ; k Xk ; j + j Xj = i j COV [Xi ; Xj ] + Therefore, if Xi = + Xj ; then COV [Xi ; Xj ] = COV [ + Xj ; Xj ] = V [Xj ] : 111 k j COV [Xi ; Xj ] : Mean-variance trade-o¤ dE d It can be shown that indi¤erence curve E ( ) is increasing and convex. In other words, d2 E d 2 > 0 and > 0. This follows from that utility function u (:) is increasing and concave. To show that +1 Z u0 (E + Z) Z' (Z) dZ dE = d 1 +1 Z >0 u0 (E + Z) ' (Z) dZ 1 consider the numerator +1 +1 Z Z0 Z 0 0 u (E + Z) Z' (Z) dZ = u (E + Z) Z' (Z) dZ + u0 (E + Z) Z' (Z) dZ: 1 Denote by Z~ = 1 0 Z, and rewrite +1 Z u0 (E + Z) Z' (Z) dZ = 1 = Z0 0 u E Z~ 1 +1 Z u0 E 0 = +1 Z Z~ ' Z~ d Z~ + +1 Z u0 (E + Z) Z' (Z) dZ 0 +1 Z ~ ~ ~ ~ Z Z' Z dZ + u0 (E + Z) Z' (Z) dZ 0 u0 (E + Z) u0 (E Z) Z' (Z) dZ: 0 Clearly, since u00 < 0 it has to be that +1 Z u0 (E + Z) Z' (Z) dZ < 0; 1 which implies that dE d > 0. In turn, to show that and their average d2 E d 2 E1 +E2 ; 2 > 0, consider two points on the indi¤erence curve (E1 ; 1+ 2 2 . Notice that by construction u (E1 + The indi¤erence curve would be convex if for any two points (E1 ; 1 u (E1 + 2 1 Z) 1 + u (E2 + 2 2 Z) <u 112 E1 + E2 + 2 1 Z) 1) 1 + 2 and (E2 ; = u (E2 + and (E2 ; 2 1) Z : 2) 2 Z). and Z 2 ), This clearly holds since u is a concave function, which implies that E u E1 + E2 + 2 1 + 2 2 Z > E [u (E1 + 113 1 Z)] = E [u (E2 + 2 Z)] : References Cagan, P. (1956). The monetary dynamics of hyperin‡ation. In M. Friedman (Ed.), Studies in the Quantity Theory of Money. Chicago: University of Chicago Press. Calvo, G. A. (1983). Staggered prices in a utility-maximizing framework. Journal of Monetary Economics 12 (3), 383–398. Carlin, W. and D. Soskice (2005). The 3-equation New Keynesian Model— a graphical exposition. Contributions in Macroeconomics 5 (1). Doepke, M., A. Lehnert, and A. Sellgren (1999). Macroeconomics. Available online (last accessed 29.03.2015). http://faculty.wcas.northwestern.edu/~mdo738/book.htm. Domar, E. D. (1946). Capital expansion, rate of growth, and employment. Econometrica 14 (2), 137–147. Goodwin, R. (1967). A growth cycle. In C. H. Feinstein (Ed.), Socialism, Capitalism & Economic Growth, pp. 54–58. London: Macmillan. Hansen, G. D. (1985). Indivisible labor and the business cycle. Journal of Monetary Eco- nomics 16 (3), 309–327. Harrod, R. F. (1939). An essay in dynamic theory. Economic Journal 49 (193), 14–33. Kaldor, N. (1934). A classi…catory note on the determinateness of equilibrium. Rreview of Economic Studies 1 (2), 122–136. King, R. G. and S. T. Rebelo (1999). Resuscitating real business cycles. Volume 1, Part B of Handbook of Macroeconomics, pp. 927–1007. Elsevier. Kydland, F. E. and E. C. Prescott (1982). Time to build and aggregate ‡uctuations. Econometrica 50 (6), 1345–1370. Nagel, R. (1995). Unraveling in guessing games: An experimental study. American Economic Review 85 (5), 1313–1326. Solow, R. M. (1956). A contribution to the theory of economic growth. Quarterly Journal of Economics 70 (1), 65–94. Taylor, J. B. (1980). Aggregate dynamics and staggered contracts. Journal of Political Economy 88 (1), 1. 114
© Copyright 2024