60. The Permanent Income Model#

60.1. Overview#

This lecture describes a rational expectations version of the famous permanent income model of Milton Friedman [Friedman, 1956].

Robert Hall cast Friedman’s model within a linear-quadratic setting [Hall, 1978].

Like Hall, we formulate an infinite-horizon linear-quadratic savings problem.

We use the model as a vehicle for illustrating

  • alternative formulations of the state of a dynamic system

  • the idea of cointegration

  • impulse response functions

  • the idea that changes in consumption are useful as predictors of movements in income

Background readings on the linear-quadratic-Gaussian permanent income model are Hall’s [Hall, 1978] and chapter 2 of [Ljungqvist and Sargent, 2018].

Let’s start with some imports

import matplotlib.pyplot as plt
plt.rcParams["figure.figsize"] = (11, 5)  #set default figure size
import numpy as np
import random
from numba import jit

60.2. The Savings Problem#

In this section, we state and solve the savings and consumption problem faced by the consumer.

60.2.1. Preliminaries#

We use a class of stochastic processes called martingales.

A discrete-time martingale is a stochastic process (i.e., a sequence of random variables) {Xt} with finite mean at each t and satisfying

Et[Xt+1]=Xt,t=0,1,2,

Here Et:=E[|Ft] is a conditional mathematical expectation conditional on the time t information set Ft.

The latter is just a collection of random variables that the modeler declares to be visible at t.

  • When not explicitly defined, it is usually understood that Ft={Xt,Xt1,,X0}.

Martingales have the feature that the history of past outcomes provides no predictive power for changes between current and future outcomes.

For example, the current wealth of a gambler engaged in a “fair game” has this property.

One common class of martingales is the family of random walks.

A random walk is a stochastic process {Xt} that satisfies

Xt+1=Xt+wt+1

for some IID zero mean innovation sequence {wt}.

Evidently, Xt can also be expressed as

Xt=j=1twj+X0

Not every martingale arises as a random walk (see, for example, Wald’s martingale).

60.2.2. The Decision Problem#

A consumer has preferences over consumption streams that are ordered by the utility functional

(60.1)#E0[t=0βtu(ct)]

where

  • Et is the mathematical expectation conditioned on the consumer’s time t information

  • ct is time t consumption

  • u is a strictly concave one-period utility function

  • β(0,1) is a discount factor

The consumer maximizes (60.1) by choosing a consumption, borrowing plan {ct,bt+1}t=0 subject to the sequence of budget constraints

(60.2)#ct+bt=11+rbt+1+ytt0

Here

  • yt is an exogenous endowment process.

  • r>0 is a time-invariant risk-free net interest rate.

  • bt is one-period risk-free debt maturing at t.

The consumer also faces initial conditions b0 and y0, which can be fixed or random.

60.2.3. Assumptions#

For the remainder of this lecture, we follow Friedman and Hall in assuming that (1+r)1=β.

Regarding the endowment process, we assume it has the state-space representation

(60.3)#zt+1=Azt+Cwt+1yt=Uzt

where

  • {wt} is an IID vector process with Ewt=0 and Ewtwt=I.

  • The spectral radius of A satisfies ρ(A)<1/β.

  • U is a selection vector that pins down yt as a particular linear combination of components of zt.

The restriction on ρ(A) prevents income from growing so fast that discounted geometric sums of some quadratic forms to be described below become infinite.

Regarding preferences, we assume the quadratic utility function

u(ct)=(ctγ)2

where γ is a bliss level of consumption.

Note

Along with this quadratic utility specification, we allow consumption to be negative. However, by choosing parameters appropriately, we can make the probability that the model generates negative consumption paths over finite time horizons as low as desired.

Finally, we impose the no Ponzi scheme condition

(60.4)#E0[t=0βtbt2]<

This condition rules out an always-borrow scheme that would allow the consumer to enjoy bliss consumption forever.

60.2.4. First-Order Conditions#

First-order conditions for maximizing (60.1) subject to (60.2) are

(60.5)#Et[u(ct+1)]=u(ct),t=0,1,

These optimality conditions are also known as Euler equations.

If you’re not sure where they come from, you can find a proof sketch in the appendix.

With our quadratic preference specification, (60.5) has the striking implication that consumption follows a martingale:

(60.6)#Et[ct+1]=ct

(In fact, quadratic preferences are necessary for this conclusion 1.)

One way to interpret (60.6) is that consumption will change only when “new information” about permanent income is revealed.

These ideas will be clarified below.

60.2.5. The Optimal Decision Rule#

Now let’s deduce the optimal decision rule 2.

Note

One way to solve the consumer’s problem is to apply dynamic programming as in this lecture. We do this later. But first we use an alternative approach that is revealing and shows the work that dynamic programming does for us behind the scenes.

In doing so, we need to combine

  1. the optimality condition (60.6)

  2. the period-by-period budget constraint (60.2), and

  3. the boundary condition (60.4)

To accomplish this, observe first that (60.4) implies limtβt2bt+1=0.

Using this restriction on the debt path and solving (60.2) forward yields

(60.7)#bt=j=0βj(yt+jct+j)

Take conditional expectations on both sides of (60.7) and use the martingale property of consumption and the law of iterated expectations to deduce

(60.8)#bt=j=0βjEt[yt+j]ct1β

Expressed in terms of ct we get

(60.9)#ct=(1β)[j=0βjEt[yt+j]bt]=r1+r[j=0βjEt[yt+j]bt]

where the last equality uses (1+r)β=1.

These last two equations assert that consumption equals economic income

  • financial wealth equals bt

  • non-financial wealth equals j=0βjEt[yt+j]

  • total wealth equals the sum of financial and non-financial wealth

  • a marginal propensity to consume out of total wealth equals the interest factor r1+r

  • economic income equals

    • a constant marginal propensity to consume times the sum of non-financial wealth and financial wealth

    • the amount the consumer can consume while leaving its wealth intact

60.2.5.1. Responding to the State#

The state vector confronting the consumer at t is [btzt].

Here

  • zt is an exogenous component, unaffected by consumer behavior.

  • bt is an endogenous component (since it depends on the decision rule).

Note that zt contains all variables useful for forecasting the consumer’s future endowment.

It is plausible that current decisions ct and bt+1 should be expressible as functions of zt and bt.

This is indeed the case.

In fact, from this discussion, we see that

j=0βjEt[yt+j]=Et[j=0βjyt+j]=U(IβA)1zt

Combining this with (60.9) gives

(60.10)#ct=r1+r[U(IβA)1ztbt]

Using this equality to eliminate ct in the budget constraint (60.2) gives

bt+1=(1+r)(bt+ctyt)=(1+r)bt+r[U(IβA)1ztbt](1+r)Uzt=bt+U[r(IβA)1(1+r)I]zt=bt+U(IβA)1(AI)zt

To get from the second last to the last expression in this chain of equalities is not trivial.

A key is to use the fact that (1+r)β=1 and (IβA)1=j=0βjAj.

We’ve now successfully written ct and bt+1 as functions of bt and zt.

60.2.5.2. A State-Space Representation#

We can summarize our dynamics in the form of a linear state-space system governing consumption, debt and income:

(60.11)#zt+1=Azt+Cwt+1bt+1=bt+U[(IβA)1(AI)]ztyt=Uztct=(1β)[U(IβA)1ztbt]

To write this more succinctly, let

xt=[ztbt],A~=[A0U(IβA)1(AI)1],C~=[C0]

and

U~=[U0(1β)U(IβA)1(1β)],y~t=[ytct]

Then we can express equation (60.11) as

(60.12)#xt+1=A~xt+C~wt+1y~t=U~xt

We can use the following formulas from linear state space models to compute population mean μt=Ext and covariance Σt:=E[(xtμt)(xtμt)]

(60.13)#μt+1=A~μtwithμ0 given
(60.14)#Σt+1=A~ΣtA~+C~C~withΣ0 given

We can then compute the mean and covariance of y~t from

(60.15)#μy,t=U~μtΣy,t=U~ΣtU~

60.2.5.3. A Simple Example with IID Income#

To gain some preliminary intuition on the implications of (60.11), let’s look at a highly stylized example where income is just IID.

(Later examples will investigate more realistic income streams.)

In particular, let {wt}t=1 be IID and scalar standard normal, and let

zt=[zt11],A=[0001],U=[1μ],C=[σ0]

Finally, let b0=z01=0.

Under these assumptions, we have yt=μ+σwtN(μ,σ2).

Further, if you work through the state space representation, you will see that

bt=σj=1t1wjct=μ+(1β)σj=1twj

Thus, income is IID and debt and consumption are both Gaussian random walks.

Defining assets as bt, we see that assets are just the cumulative sum of unanticipated incomes prior to the present date.

The next figure shows a typical realization with r=0.05, μ=1, and σ=0.15

r = 0.05
β = 1 / (1 + r)
σ = 0.15
μ = 1
T = 60

@jit
def time_path(T):
    w = np.random.randn(T+1)  # w_0, w_1, ..., w_T
    w[0] = 0
    b = np.zeros(T+1)
    for t in range(1, T+1):
        b[t] = w[1:t].sum()
    b = -σ * b
    c = μ + (1 - β) * (σ * w - b)
    return w, b, c

w, b, c = time_path(T)

fig, ax = plt.subplots(figsize=(10, 6))

ax.plot(μ + σ * w, 'g-', label="Non-financial income")
ax.plot(c, 'k-', label="Consumption")
ax.plot( b, 'b-', label="Debt")
ax.legend(ncol=3, mode='expand', bbox_to_anchor=(0., 1.02, 1., .102))
ax.grid()
ax.set_xlabel('Time')

plt.show()
_images/d901c758c1c26db5aa12ae5ad5c88d07775f1c92d3b66f116f3fe9e5aed4ad69.png

Observe that consumption is considerably smoother than income.

The figure below shows the consumption paths of 250 consumers with independent income streams

fig, ax = plt.subplots(figsize=(10, 6))

b_sum = np.zeros(T+1)
for i in range(250):
    w, b, c = time_path(T)  # Generate new time path
    rcolor = random.choice(('c', 'g', 'b', 'k'))
    ax.plot(c, color=rcolor, lw=0.8, alpha=0.7)

ax.grid()
ax.set(xlabel='Time', ylabel='Consumption')

plt.show()
_images/d04f298b838f78ef1faed8f9f9e7bd70baf707469d160cad198293d94d8b1323.png

60.3. Alternative Representations#

In this section, we shed more light on the evolution of savings, debt and consumption by representing their dynamics in several different ways.

60.3.1. Hall’s Representation#

Hall [Hall, 1978] suggested an insightful way to summarize the implications of LQ permanent income theory.

First, to represent the solution for bt, shift (60.9) forward one period and eliminate bt+1 by using (60.2) to obtain

ct+1=(1β)j=0βjEt+1[yt+j+1](1β)[β1(ct+btyt)]

If we add and subtract β1(1β)j=0βjEtyt+j from the right side of the preceding equation and rearrange, we obtain

(60.16)#ct+1ct=(1β)j=0βj{Et+1[yt+j+1]Et[yt+j+1]}

The right side is the time t+1 innovation to the expected present value of the endowment process {yt}.

We can represent the optimal decision rule for (ct,bt+1) in the form of (60.16) and (60.8), which we repeat:

(60.17)#bt=j=0βjEt[yt+j]11βct

Equation (60.17) asserts that the consumer’s debt due at t equals the expected present value of its endowment minus the expected present value of its consumption stream.

A high debt thus indicates a large expected present value of surpluses ytct.

Recalling again our discussion on forecasting geometric sums, we have

Etj=0βjyt+j=U(IβA)1ztEt+1j=0βjyt+j+1=U(IβA)1zt+1Etj=0βjyt+j+1=U(IβA)1Azt

Using these formulas together with (60.3) and substituting into (60.16) and (60.17) gives the following representation for the consumer’s optimum decision rule:

(60.18)#ct+1=ct+(1β)U(IβA)1Cwt+1bt=U(IβA)1zt11βctyt=Uztzt+1=Azt+Cwt+1

Representation (60.18) makes clear that

  • The state can be taken as (ct,zt).

    • The endogenous part is ct and the exogenous part is zt.

    • Debt bt has disappeared as a component of the state because it is encoded in ct.

  • Consumption is a random walk with innovation (1β)U(IβA)1Cwt+1.

    • This is a more explicit representation of the martingale result in (60.6).

60.3.2. Cointegration#

Representation (60.18) reveals that the joint process {ct,bt} possesses the property that Engle and Granger [Engle and Granger, 1987] called cointegration.

Cointegration is a tool that allows us to apply powerful results from the theory of stationary stochastic processes to (certain transformations of) nonstationary models.

To apply cointegration in the present context, suppose that zt is asymptotically stationary 3.

Despite this, both ct and bt will be non-stationary because they have unit roots (see (60.11) for bt).

Nevertheless, there is a linear combination of ct,bt that is asymptotically stationary.

In particular, from the second equality in (60.18) we have

(60.19)#(1β)bt+ct=(1β)U(IβA)1zt

Hence the linear combination (1β)bt+ct is asymptotically stationary.

Accordingly, Granger and Engle would call [(1β)1] a cointegrating vector for the state.

When applied to the nonstationary vector process [btct], it yields a process that is asymptotically stationary.

Equation (60.19) can be rearranged to take the form

(60.20)#(1β)bt+ct=(1β)Etj=0βjyt+j

Equation (60.20) asserts that the cointegrating residual on the left side equals the conditional expectation of the geometric sum of future incomes on the right 4.

60.3.3. Cross-Sectional Implications#

Consider again (60.18), this time in light of our discussion of distribution dynamics in the lecture on linear systems.

The dynamics of ct are given by

(60.21)#ct+1=ct+(1β)U(IβA)1Cwt+1

or

ct=c0+j=1tw^jforw^t+1:=(1β)U(IβA)1Cwt+1

The unit root affecting ct causes the time t variance of ct to grow linearly with t.

In particular, since {w^t} is IID, we have

(60.22)#Var[ct]=Var[c0]+tσ^2

where

σ^2:=(1β)2U(IβA)1CC(IβA)1U

When σ^>0, {ct} has no asymptotic distribution.

Let’s consider what this means for a cross-section of ex-ante identical consumers born at time 0.

Let the distribution of c0 represent the cross-section of initial consumption values.

Equation (60.22) tells us that the variance of ct increases over time at a rate proportional to t.

A number of different studies have investigated this prediction and found some support for it (see, e.g., [Deaton and Paxson, 1994], [Storesletten et al., 2004]).

60.3.4. Impulse Response Functions#

Impulse response functions measure responses to various impulses (i.e., temporary shocks).

The impulse response function of {ct} to the innovation {wt} is a box.

In particular, the response of ct+j to a unit increase in the innovation wt+1 is (1β)U(IβA)1C for all j1.

60.3.5. Moving Average Representation#

It’s useful to express the innovation to the expected present value of the endowment process in terms of a moving average representation for income yt.

The endowment process defined by (60.3) has the moving average representation

(60.23)#yt+1=d(L)wt+1

where

  • d(L)=j=0djLj for some sequence dj, where L is the lag operator 5

  • at time t, the consumer has an information set 6 wt=[wt,wt1,]

Notice that

yt+jEt[yt+j]=d0wt+j+d1wt+j1++dj1wt+1

It follows that

(60.24)#Et+1[yt+j]Et[yt+j]=dj1wt+1

Using (60.24) in (60.16) gives

(60.25)#ct+1ct=(1β)d(β)wt+1

The object d(β) is the present value of the moving average coefficients in the representation for the endowment process yt.

60.4. Two Classic Examples#

We illustrate some of the preceding ideas with two examples.

In both examples, the endowment follows the process yt=z1t+z2t where

[z1t+1z2t+1]=[1000][z1tz2t]+[σ100σ2][w1t+1w2t+1]

Here

  • wt+1 is an IID 2×1 process distributed as N(0,I).

  • z1t is a permanent component of yt.

  • z2t is a purely transitory component of yt.

60.4.1. Example 1#

Assume as before that the consumer observes the state zt at time t.

In view of (60.18) we have

(60.26)#ct+1ct=σ1w1t+1+(1β)σ2w2t+1

Formula (60.26) shows how an increment σ1w1t+1 to the permanent component of income z1t+1 leads to

  • a permanent one-for-one increase in consumption and

  • no increase in savings bt+1

But the purely transitory component of income σ2w2t+1 leads to a permanent increment in consumption by a fraction 1β of transitory income.

The remaining fraction β is saved, leading to a permanent increment in bt+1.

Application of the formula for debt in (60.11) to this example shows that

(60.27)#bt+1bt=z2t=σ2w2t

This confirms that none of σ1w1t is saved, while all of σ2w2t is saved.

The next figure displays impulse-response functions that illustrates these very different reactions to transitory and permanent income shocks.

r = 0.05
β = 1 / (1 + r)
S = 5   # Impulse date
σ1 = σ2 = 0.15

@jit
def time_path(T, permanent=False):
    "Time path of consumption and debt given shock sequence"
    w1 = np.zeros(T+1)
    w2 = np.zeros(T+1)
    b = np.zeros(T+1)
    c = np.zeros(T+1)
    if permanent:
        w1[S+1] = 1.0
    else:
        w2[S+1] = 1.0
    for t in range(1, T):
        b[t+1] = b[t] - σ2 * w2[t]
        c[t+1] = c[t] + σ1 * w1[t+1] + (1 - β) * σ2 * w2[t+1]
    return b, c


fig, axes = plt.subplots(2, 1, figsize=(10, 8))
titles = ['permanent', 'transitory']

L = 0.175

for ax, truefalse, title in zip(axes, (True, False), titles):
    b, c = time_path(T=20, permanent=truefalse)
    ax.set_title(f'Impulse reponse: {title} income shock')
    ax.plot(c, 'g-', label="consumption")
    ax.plot(b, 'b-', label="debt")
    ax.plot((S, S), (-L, L), 'k-', lw=0.5)
    ax.grid(alpha=0.5)
    ax.set(xlabel=r'Time', ylim=(-L, L))

axes[0].legend(loc='lower right')

plt.tight_layout()
plt.show()
_images/4381d045c3efc27c41f9d914e2e97ba32546b3539896b199c4e681aaa93e0c46.png

Notice how the permanent income shock provokes no change in assets bt+1 and an immediate permanent change in consumption equal to the permanent increment in non-financial income.

In contrast, notice how most of a transitory income shock is saved and only a small amount is saved.

The box-like impulse responses of consumption to both types of shock reflect the random walk property of the optimal consumption decision.

60.4.2. Example 2#

Assume now that at time t the consumer observes yt, and its history up to t, but not zt.

Under this assumption, it is appropriate to use an innovation representation to form A,C,U in (60.18).

The discussion in sections 2.9.1 and 2.11.3 of [Ljungqvist and Sargent, 2018] shows that the pertinent state space representation for yt is

[yt+1at+1]=[1(1K)00][ytat]+[11]at+1yt=[10][ytat]

where

  • K:= the stationary Kalman gain

  • at:=ytE[yt|yt1,,y0]

In the same discussion in [Ljungqvist and Sargent, 2018] it is shown that K[0,1] and that K increases as σ1/σ2 does.

In other words, K increases as the ratio of the standard deviation of the permanent shock to that of the transitory shock increases.

Please see first look at the Kalman filter.

Applying formulas (60.18) implies

(60.28)#ct+1ct=[1β(1K)]at+1

where the endowment process can now be represented in terms of the univariate innovation to yt as

(60.29)#yt+1yt=at+1(1K)at

Equation (60.29) indicates that the consumer regards

  • fraction K of an innovation at+1 to yt+1 as permanent

  • fraction 1K as purely transitory

The consumer permanently increases his consumption by the full amount of his estimate of the permanent part of at+1, but by only (1β) times his estimate of the purely transitory part of at+1.

Therefore, in total, he permanently increments his consumption by a fraction K+(1β)(1K)=1β(1K) of at+1.

He saves the remaining fraction β(1K).

According to equation (60.29), the first difference of income is a first-order moving average.

Equation (60.28) asserts that the first difference of consumption is IID.

Application of formula to this example shows that

(60.30)#bt+1bt=(K1)at

This indicates how the fraction K of the innovation to yt that is regarded as permanent influences the fraction of the innovation that is saved.

60.5. Further Reading#

The model described above significantly changed how economists think about consumption.

While Hall’s model does a remarkably good job as a first approximation to consumption data, it’s widely believed that it doesn’t capture important aspects of some consumption/savings data.

For example, liquidity constraints and precautionary savings appear to be present sometimes.

Further discussion can be found in, e.g., [Hall and Mishkin, 1982], [Parker, 1999], [Deaton, 1991], [Carroll, 2001].

60.6. Appendix: The Euler Equation#

Where does the first-order condition (60.5) come from?

Here we’ll give a proof for the two-period case, which is representative of the general argument.

The finite horizon equivalent of the no-Ponzi condition is that the agent cannot end her life in debt, so b2=0.

From the budget constraint (60.2) we then have

c0=b11+rb0+y0andc1=y1b1

Here b0 and y0 are given constants.

Substituting these constraints into our two-period objective u(c0)+βE0[u(c1)] gives

maxb1{u(b1Rb0+y0)+βE0[u(y1b1)]}

You will be able to verify that the first-order condition is

u(c0)=βRE0[u(c1)]

Using βR=1 gives (60.5) in the two-period case.

The proof for the general case is similar.


1

A linear marginal utility is essential for deriving (60.6) from (60.5). Suppose instead that we had imposed the following more standard assumptions on the utility function: u(c)>0,u(c)<0,u(c)>0 and required that c0. The Euler equation remains (60.5). But the fact that u<0 implies via Jensen’s inequality that Et[u(ct+1)]>u(Et[ct+1]). This inequality together with (60.5) implies that Et[ct+1]>ct (consumption is said to be a ‘submartingale’), so that consumption stochastically diverges to +. The consumer’s savings also diverge to +.

2

An optimal decision rule is a map from the current state into current actions—in this case, consumption.

3

This would be the case if, for example, the spectral radius of A is strictly less than one.

4

See [John Y. Campbell, 1988], [Lettau and Ludvigson, 2001], [Lettau and Ludvigson, 2004] for interesting applications of related ideas.

5

Representation (60.3) implies that d(L)=U(IAL)1C.

6

A moving average representation for a process yt is said to be fundamental if the linear space spanned by yt is equal to the linear space spanned by wt. A time-invariant innovations representation, attained via the Kalman filter, is by construction fundamental.