-
Notifications
You must be signed in to change notification settings - Fork 1
/
Copy pathblkstack_2.tex
1084 lines (962 loc) · 49.6 KB
/
blkstack_2.tex
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
\input grafinp3
%\input grafinput8
\input psfig
%\eqnotracetrue
%\showchaptIDtrue
%\def\@chaptID{7.}
%\eqnotracetrue
%\hbox{}
\def\toone{{t+1}}
\def\ttwo{{t+2}}
\def\tthree{{t+3}}
\def\Tone{{T+1}}
\def\TTT{{T-1}}
\def\rtr{{\rm tr}}
\chapter{Dynamic Stackelberg Problems\label{stackel}}
\footnum=0
\section{History dependence}
Except for chapter \use{optax}, previous chapters described decision problems that are recursive in
what we can call ``natural'' state variables, meaning state variables that
describe stocks of capital, wealth, and information that helps forecast
future values of prices and quantities that impinge on utilities
or profits. In problems that are recursive in the natural state variables,
optimal decision rules are functions of the natural state variables.
\index{time consistency}
% This chapter studies a class of
% problems that are not recursive in the natural state variables.
Kydland and Prescott (1977) and Calvo (1978) gave
macroeconomic examples of decision problems
that %whose solutions exhibited {\it time inconsistency\/}
are not recursive in natural state variables.
At time $0$, a government
chooses actions for all $t \geq 0$, knowing that it confronts a competitive market composed of many small private
agents
whose decisions are influenced by their
{\it forecasts\/} of the government's future actions. In particular, what private agents choose to do at date $t$ depends
partly on what they expect the government to do at dates $t+j, \ \forall t \geq 0$. In a rational expectations equilibrium in a nonstochastic setting,
the government's actions at time $t \geq 1$ equal private agents' earlier forecasts of
those actions.
Knowing that, the government uses its time $t \geq 1$ actions
to influence earlier actions by private agents. The rational expectations equilibrium concept requires that the government confirm
private sector forecasts. That
prevents the government's decision problem from being
recursive in natural state variables and makes the government's
decision rule at $t$ depend on the {\it history\/} of the natural
state variables from time $0$ to time $t$.
\auth{Kydland, Finn E.} \auth{Prescott, Edward C.}%
It took time for economists to learn how to formulate policy problems of this type
recursively.
Prescott (1977) asserted that
recursive optimal control theory (i.e., dynamic programming) did not apply
to problems with this structure. This chapter and chapters \use{optaxrecur},
\use{socialinsurance}, and \use{credible} describe
how Prescott's initial pessimism about the inapplicability
of optimal control theory was overturned.\NFootnote{The important contribution by Kydland and Prescott (1980)
dissipated
Prescott's initial pessimism.}
An important finding is that if the natural state variables
are augmented with appropriate `forward looking' state variables, this class of problems
can be made recursive. This affords
computational advantages and yields substantial insights.
This chapter displays these
within the tractable framework of linear-quadratic problems.
\section{The Stackelberg problem}
To exhibit the essential structure of
the decision problems that concerned Kydland and Prescott (1977) and Calvo (1979),
this chapter uses the optimal linear regulator problem of chapter \use{dplinear}
to solve a linear-quadratic version of
what is known as a dynamic Stackelberg problem.\NFootnote{In some settings
it is called a Ramsey problem. See chapters \use{optax} and \use{optaxrecur}.}
For now we refer to the Stackelberg leader as a government and
the Stackelberg follower as a representative agent or
private sector. In section \use{sec:monopfringe}, we'll give
an application with another interpretation of these two types of agent.
Let $z_t$ be an $n_z \times 1$ vector of natural state variables,
$x_t$ an $n_x \times 1$ vector of endogenous forward-looking variables, and $u_t$ a vector of government choice variables.
Included in $x_t$ are prices and quantities that adjust
instantaneously to clear markets at time $t$.
The $z_t$ vector is inherited from the past.
The vector $x_t$ is determined purely by {\it future} values of $z$ and $u$.
Nevertheless, at $t \geq 1$, $x_t$ is inherited from the past because
values of $z$ and $u$ for all $ t \geq 0$ are set by a Stackelberg plan devised at time $0$.
% The model determinesunder a Stackelberg plan,
% the ``jump variables'' $x_t$ at time
% $t$.
\specsec{Remark:} For $t \geq 1$, $x_t$ will turn out to be both a {\it forward-looking\/} and a {\it backward-
looking\/} variable. It is forward looking because it depends on forecasts of future actions of the Stackelberg leader.
It is backward looking because it is a promise about time $t$ outcomes that was chosen earlier by the Stackelberg leader.
Let $ y_t = \left[\matrix{z_t \cr x_t \cr} \right]$. Let $u_t$ be a vector of variables chosen by
the government at $t$.
Define the
government's one-period loss function\NFootnote{The
problem assumes that there
are no cross products between states and controls in the
return function. There is a simple transformation that converts a problem
whose return function has cross products into an equivalent problem
that has no cross products. For example, see Hansen and Sargent (2008, chapter 4, pp. 72-73).}
$$ r(y, u) = y' R y + u' Q u . \EQN target $$
Subject to an initial condition for $z_0$, but not for
$x_0$, a government at time $0$ wants to % chooses $(x_0, \{u_t, y_t\}_{t=0}^\infty)$ to
maximize % chooses $\{y_{t+1}, x_t, u_t \}_{t=0}^\infty$ to maximize
$$ - \sum_{t=0}^\infty \beta^t r(y_t, u_t) \EQN new1 $$
in light of the model
$$ \left[\matrix{I & 0 \cr
G_{21} & G_{22} \cr}\right]
\left[ \matrix{ z_{t+1} \cr x_{t+1} \cr} \right]
= \left[ \matrix{ \hat A_{11} & \hat A_{12} \cr
\hat A_{21} & \hat A_{22} \cr} \right]
\left[\matrix{ z_t \cr x_t \cr} \right]
+ \hat B u_t . \EQN new2$$
We shall explain the meaning of the second block of equations in system \Ep{new2} soon.
We assume that the matrix on the left is invertible,
so that we can multiply both sides of the above equation by its
inverse to obtain
% \NFootnote{We
% have assumed that the matrix on the left of \Ep{new2} is invertible
% for ease of presentation.
% However, by appropriately using the
% invariant subspace methods described
% under step 2 below (see Appendix \the\chapternum\use{appBblkstack}),
% it is straightforward to adapt the computational
% method when this assumption is violated.}
$$ \left[ \matrix{ z_{t+1} \cr x_{t+1} \cr} \right]
= \left[ \matrix{ A_{11} & A_{12} \cr
A_{21} & A_{22} \cr} \right]
\left[\matrix{ z_t \cr x_t \cr} \right]
+ B u_t \EQN new3$$
or
$$ y_{t+1} = A y_t + B u_t . \EQN new30 $$
At time $0$, the government maximizes \Ep{new1} by choosing
$\{u_t, x_t, z_{t+1}\}_{t=0}^\infty$ subject to
\Ep{new3} and an initial condition for $z_0$.\NFootnote{Miller and Salmon (1982, 1985),
Hansen, Epple, and Roberds (1985), Pearlman, Currie, and Levine (1986),
Sargent (1987a, chapter XIII),
Pearlman (1992), and others have studied
versions of this problem.}
\medskip
The optimal decision rule is history dependent, meaning that
$u_t$ depends not only on $z_t$ but also on lags of $z$.
History
dependence has two sources: (a) the government's
commitment
to a sequence of rules at time
$0$,\NFootnote{The government would
make different choices were it
to choose sequentially, that is, were it to select its time $t$ action
at time $t$.} and
(b) the forward-looking behavior of the
private sector embedded in the second block of equations
\Ep{new3}.
The second
block of equations of \Ep{new2} or \Ep{new3}
typically includes first-order conditions
for private agents' optimization problems (i.e., their Euler equations). These
summarize the forward-looking aspect of private agents' behavior.
We shall provide an example in section \use{sec:monopfringe} in which
the last $n_x$ equations
of \Ep{new3} or \Ep{new30} constitute {\it implementability constraints} that
are formed by the Euler equations of a competitive fringe or
private sector.
When combined with a stability condition to be imposed below, these
Euler equations
summarize the private sector's best responses to the
sequence of actions by the government. The government uses its understanding of these responses
to manipulate private sector actions.
To indicate features of the problem that make $x_t$ a vector of forward-looking
variables, write the second block of system \Ep{new2}
as
$$ x_t = \phi_1 z_t + \phi_2 z_{t+1} + \phi_3 u_t + \phi_0 x_{t+1} , \EQN eqn:xlawforward $$
where $\phi_0 = \hat A_{22}^{-1} G_{22}$.
The models we study in this chapter typically satisfy
\specsec{Condition A:}
The eigenvalues of $\phi_0$ % = \hat A_{22}^{-1} G_{22}$
are bounded in modulus by $1$.\NFootnote{It will suffice if the eigenvalues of $\phi_0$ are bounded in modulus
by $\beta^{-.5}$.} %${\frac{1}{\sqrt{\beta}}}$.
\medskip
\noindent Condition A makes equation explosive if ``solved backward'' but stable if ``solved forward''.\NFootnote{See
appendix \use{appa1} of chapter \use{timeseries} and chapter \use{dplinear}. Also, see Sargent (1987a, chapter IX).} So we solve
equation \Ep{eqn:xlawforward} forward to get
$$ x_t = \sum_{j=0}^\infty \phi_0^j \left[ \phi_1 z_{t+j} + \phi_2 z_{t+j+1} + \phi_3 u_{t+j} \right] . \EQN bell101 $$
In choosing $u_t$ for $t \geq 1$ at time $0$, the government takes into account how future $z$'s and $u$'s affect earlier
$x$'s through equation \Ep{bell101}.
%
% Let $X^t$ denote the history of any variable $X$ from
% $0$ to $t$. Miller and Salmon (1982, 1985),
% Hansen, Epple, and Roberds (1985), Pearlman, Currie, and Levine (1986),
% Sargent (1987),
% Pearlman (1992), and others have all studied
% versions of the following problem:
% \medskip
% \noindent {\bf Problem S:}
% The {\it
% Stackelberg problem\/}
% is to maximize \Ep{new1} by choosing an $x_0$ and
% a sequence of decision rules, the time $t$ component of which
% maps the time $t$ history of the state
% $z^t$ into the time $t$ decision $u_t$ of the Stackelberg leader.
% The Stackelberg leader commits to
% this sequence of decision rules at time $0$.
% The maximization is subject to a given initial condition
% for $z_0$.
% %But $x_0$ is among the objects to be chosen by the Stackelberg leader.
% %The jump variables $x_t$ are functions of both the natural state variables $z_t$ and
%the multipliers $\mu_{xt}$.
\specsec{Remark:}
We can regard $x_t$ in equation \Ep{eqn:xlawforward} or \Ep{bell101} as indexing the optimal behavior of the followers
in response to a Stackelberg plan $\{u_t\}_{t=0}^\infty$. In chapters \use{optaxrecur} and \use{chang}, we use counterparts to $x_t$ to index
competitive equilibria with distorting taxes. In equation \Ep{eqn:xlawforward}, the effects of $\{u_{t+j}\}_{j=1}^\infty$ on $x_t$ are all
intermediated through $x_{t+1}$.
\specsec{Remark:}
The certainty equivalence principle
stated in chapter \use{dplinear} allows us
to work with a nonstochastic model.
We would attain the
same decision rule for the Stackelberg leader if we were to replace
$x_{t+1}$ with the forecast $E_t x_{t+1}$ and to add a shock process
$C \epsilon_{t+1}$ to the right side of \Ep{new3}, where
$\epsilon_{t+1}$ is an i.i.d.\ random vector
with mean zero
and identity covariance matrix.
\section{Recursive formulation}\label{recurstack}%
For any vector $a_t$, define $\vec a_t = [a_t, a_{t+1}, \ldots]$.
\specsec{Definition:} Given $z_0$, the Stackelberg problem is to choose $\vec x_0, \vec z_1, \vec u_0$ that maximize
criterion \Ep{new1} subject to \Ep{new3} for $t \geq 0$. A Stackelberg plan is a $(\vec x_0, \vec z_1, \vec u_0)$ that solves the Stackelberg problem starting from a given $z_0$. To formulate a Stackelberg problem recursively,
we formulate two Bellman equations in two sets of state variables.
\subsection{Two Bellman Equations}
Define a feasible set of $(\vec y_1, \vec u_0)$ sequences
% $$
% \Omega(y_0) = \left\{ (\vec y_1, \vec u_0) : - \sum_{t=0}^\infty \beta^t r(y_t, u_t) > -\infty \ {\rm and} \ y_{t+1} = A y_t + B u_t, \forall t \geq 0 \right\} $$
$$
\Omega(y_0) = \left\{ (\vec y_1, \vec u_0) : \ y_{t+1} = A y_t + B u_t, \forall t \geq 0 \right\} $$
In the definition of $\Omega(y_0)$, $y_0$ is taken as given.
We express the Stackelberg problem in terms of two subproblems:
\medskip
\specsec{subproblem 1}
$$ v(y_0) = \max_{(\vec y_1, \vec u_0) \in \Omega(y_0)} - \sum_{t=0}^\infty \beta^t r(y_t, u_t)
\EQN stacksub1 $$
%\vfil\eject
\medskip
\specsec{subproblem 2}
$$
w(z_0) = \max_{x_0} v(y_0) \EQN stacksub2 $$
\subsection{Subproblem 1}
The value function $v(y)$ in subproblem 1 satisfies the Bellman equation
$$ v(y) = \max_{u, y^*} \left\{ - r(y,u) + \beta v(y^*) \right\} \EQN bell1
$$
where the maximization is subject to
$$ y^* = A y + B u,
\EQN bell2 $$
where $y^*$ denotes next period's value of $y$.
% The optimal value function $v(y) = - y' P y$ is attained by
% the linear decision rule
% $$ u = - F y $$
% where XXXX
% he linear regulator is
% $$ v(y_0) = -y_0' P y_0
% = {\rm max}_{\{ u_t, y_{t+1}\}_{t=0}^\infty} - \sum_{t=0}^\infty \beta^t
% \left( y_t' R y_t + u_t' Q u_t \right) \EQN olrp1a $$
% where the maximization is subject to a fixed initial condition for
% $y_0$ and the law of motion\NFootnote{In step 4, we acknowledge that the $x_0$ component
% is {\it not\/} given but is to be chosen by the Stackelberg leader.}
% $$ y_{t+1} = A y_t + B u_t . \EQN new30a $$
The problem takes the form of a linear quadratic dynamic programming problem, also called an optimal linear regulator problem in
chapter \use{dplinear}. Substituting the (correct) guess $v(y) = - y'P y$ into Bellman equation \Ep{bell1} gives
$$ - y' P y = {\rm max}_{ u, y^*} \left\{ - y' R y - u'Q u - \beta y^{* \prime} P y^* \right\} , $$
which as in chapter \use{dplinear} gives rise to the algebraic matrix Riccati equation
$$ P = R + \beta A' P A - \beta^2 A' P B ( Q + \beta B' P B)^{-1} B' P A
\EQN bell3 $$
and the optimal decision rule % the formula for $F$ in the decision rule
$$ u_t = - F y_t,
\EQN bell5 $$
where
$$ F = \beta( Q + \beta B' P B)^{-1} B' P A . \EQN bell4 $$
% Thus, we can solve problem \Ep{new1}, \Ep{new30} by iterating
% to convergence on the difference equation counterpart to the algebraic Riccati equation \Ep{bell3}, or by using
% a faster computational method that emerges as a by-product in
% step 2. This method is described in Appendix \the\chapternum\use{appBblkstack}.
\subsection{Subproblem 2}
The value function $v(y_0)$ satisfies $v(y) = - y_0' P y_0$ or
$$ v(y_0) = - z_0 ' P_{11} z_0 - 2 x_0' P_{21} z_0 - x_0' P_{22} x_0
\EQN valuefny $$
where
$$P = \bmatrix{P_{11} & P_{12} \cr P_{21} & P_{22} }.$$
Choose $x_0$ by equating to zero the gradient of $v(y_0)$ with respect to $x_0$:
$$ - 2 P_{21} z_0 - 2 P_{22} x_0 =0, $$
which implies that
$$ x_0 = - P_{22}^{-1} P_{21} z_0. \EQN king6x0 $$
We have solved subproblem 2.
\specsec{Remark:}
From chapter \use{dplinear}, recall the formula $\mu_t = P y_t$ for the vector of
shadow prices $\mu_t = \bmatrix{ \mu_{zt} \cr \mu_{xt} }$ on the transition equations.
The shadow price $\mu_{xt}$ evidently equals
$$ \mu_{xt} = P_{21} z_t + P_{22} x_t. \EQN eqnmux $$
So \Ep{king6x0} is equivalent with
$$ \mu_{x0} = 0 . \EQN mu0condition $$
The Lagrange multiplier $\mu_{xt}$ measures the cost to the government at $t \geq 0$
of confirming expectations about its time $t$ action that the followers had
at dates $s < t$.
Setting
$\mu_{x0}=0$ means that at time $0$ there are no prior expectations to confirm. But
when $\mu_{xt} \neq 0 $ for $ t \geq 1$, it indicates that it is costly to firm the private sector's expectations about time $t \geq 1$ actions.
The government takes these costs into account when it weighs the costs and benefits
of using its choice $ u_{t+j}, j >0$ to influence $x_t \ \forall t \geq 0$.
\subsection{Timing protocol}
Equations \Ep{bell5} and \Ep{king6x0} form a recursive representation of a Stackelberg plan that
features the following timing protocol:
\smallskip
\item{1.} At times $t \geq 1$, the government takes $(z_t, x_t)$ as given and chooses $ (u_t, z_{t+1}, x_{t+1}) $.
\medskip
\item{2.} At time $0$, the government takes $z_0$ as given and chooses $(x_0, u_0, z_1, x_1)$.
\smallskip
\noindent In this timing protocol, the entirely forward looking vector $x_t$ that obeys \Ep{eqn:xlawforward} is part of the state vector confronting
the government at times $t \geq 1$ but not at time $t=0$. It is presented to the government
at times $t \geq 1$ as a promise to be kept. The time $t$ government delivers $x_t$ by choosing $u_t, z_{t+1}, x_{t+1}$.\NFootnote{See
exercise \the\chapternum.2 for a timing protocol that builds in time consistency.}
\subsection{Time inconsistency}
The two subproblems in section \use{recurstack} express the time inconsistency of the optimal rule. In the recursive representation of
the Stackelberg program, different state variables confront the government at $t=0$, on the one hand, and dates $t \geq 1$, on the other.
At $t =0$, the government faces $z_0$ as a state vector and chooses the forward looking
vector $x_0$ as well as the forward looking vector $x_1$ that will confront the
government at time $1$. At dates $t \geq 1$, the government confronts the state vector $x_t$ as values promised at time $t-1$ that must
be confirmed at $t$.\NFootnote{Another manifestation of time-inconsistency is that $\mu_{xt}$ is zero at $t=0$ and different
from zero at $t \geq 1$.}
%The multiplier $\mu_{xt}$ accounts for how the optimal plan used $u_{t+j}, j \geq 0$ to influence the followers' choice of
%$x_s$ at dates $s < t$ and assures that the Stackelberg leader confirm those expectations.
Define $\vec a_1$ as the continuation of the sequence $\vec a_0$. Recall that a
Stackelberg plan is a $(\vec x_0, \vec z_1, \vec u_0)$ that solves the Stackelberg problem starting from a given $z_0$.
\specsec{Time inconsistency:}
A concise way
to say that a Stackelberg plan is {\it time inconsistent\/} is to note that a continuation of a Stackelberg plan is not
a Stackelberg plan.\NFootnote{Why? Because $x_1$ does not solve subproblem 2 at $z_1$.}
\index{continuation!of a sequence}%
\index{continuation!of a Stackelberg plan}%
%
%\subsection{Digression on determinacy of equilibrium}
%Appendix \the\chapternum\use{appBblkstack} describes methods for solving a system of difference
%equations of the form \Ep{new2} or \Ep{new3} with an arbitrary feedback rule that
%expresses the decision rule for
%$u_t$ as a function of current and previous values of $y_t$ and perhaps previous values
%of itself. The difference equation system has a unique solution
%satisfying the stability condition $\sum_{t=0}^\infty \beta^t y_t \cdot y_t$
%if the eigenvalues of the matrix \Ep{symplec2} split, with half being greater than
%unity and half being less than unity in modulus. If more than half are less than
%unity in modulus, the equilibrium is said to be indeterminate \index{indeterminacy!of equilibrium}%
% in the sense that there are multiple equilibria starting from any initial condition.
%
%If we choose to represent the solution of a Stackelberg or Ramsey problem in the form
%\Ep{vonzer3}, we can substitute that representation for $u_t$ into
%\Ep{new3}, obtain a difference equation system in $y_t, u_t$, and ask whether
%the resulting system is determinate. To answer this question, we would use the method
%of Appendix \the\chapternum\use{appBblkstack}, form system \Ep{symplec2}, then check whether the generalized
%eigenvalues split as required. Researchers have used this method to study the determinacy
%of equilibria under Stackelberg plans with representations like \Ep{vonzer3} and have discovered
%that sometimes an equilibrium can be indeterminate.\NFootnote{The existence of a Stackelberg plan is not at
%issue because we know how to construct one using the method in the text.} See Evans
%and Honkapohja (2003) for a discussion of determinacy of equilibria under commitment
%in a class of equilibrium monetary models and
%how determinacy depends on how the decision rule of the Stackelberg leader is represented. Evans and
%Honkapohja argue that casting a government decision rule in a way that leads to indeterminacy is a bad idea.
%\auth{Evans, George W.}
%\auth{Honkapohja, Seppo}
%\index{implementation!of Stackelberg plan}
\section{Large firm facing a competitive fringe}\label{sec:monopfringe}%
As an example, this section studies the equilibrium of an industry with
a large firm that acts as a Stackelberg leader with respect to a competitive
fringe.\NFootnote{Sometimes the large firm is called `the monopolist' even though there are
actually many firms in the industry.}%
The industry produces a single nonstorable homogeneous good. One
large firm produces $Q_t$ and a representative firm in a competitive
fringe produces $q_t$. The representative firm in the competitive
fringe acts as a price taker and chooses sequentially. The large firm
commits to a policy at time $0$, taking into account its ability to
manipulate the price sequence, both directly through the effects of
its quantity choices on prices, and indirectly through the responses of
the competitive fringe to its forecasts of prices.\NFootnote{Hansen and Sargent
(2012) use this model as a laboratory to illustrate an equilibrium concept featuring
robustness in which at least one of the agents has doubts about the stochastic specification
of the demand shock process.}
The costs of production are
${\cal C}_t = e Q_t + .5 g Q_t^2+ .5 c (Q_{t+1} - Q_{t})^2 $
for the large firm
and $ \sigma_t= d q_t + .5 h q_t^2 + .5 c (q_{t+1} - q_t)^2$
for the competitive firm,
where $d>0, e >0, c>0, g >0, h>0 $ are cost parameters.
There is a linear inverse demand curve
$$ p_t = A_0 - A_1 (Q_t + \overline q_t) + v_t, \EQN oli1 $$
where $A_0, A_1$ are both positive and $v_t$ is a disturbance
to demand governed by
$$ v_{t+1}= \rho v_t + C_\epsilon \check \epsilon_{t+1} \EQN oli2 $$
and where $ | \rho | < 1$ and $\check \epsilon_{t+1}$ is an i.i.d.\
sequence of random variables with mean zero and variance $1$.
In \Ep{oli1}, $\overline q_t$ is equilibrium output of the representative
competitive firm. In equilibrium, $\overline q_t = q_t$, but we
must distinguish between $q_t$ and $\overline q_t$ in posing the optimum
problem of a competitive firm.
\subsection{The competitive fringe}
The representative competitive firm regards $\{p_t\}_{t=0}^\infty$
as an exogenous stochastic process and chooses
an output plan to
maximize
$$ E_0 \sum_{t=0}^\infty \beta^t \left\{
p_t q_t - \sigma_t
% .5 c(q_{t+1} - q_t)^2 -.5h q_t^2 - d q_t
\right\}, \quad \beta \in(0,1) \EQN oli3 $$
subject to $q_0$ given, where %$c>0, d>0, h>0$ are cost parameters,
$E_t$ is the mathematical expectation based on time
$t$ information.
Let $i_t = q_{t+1} - q_t.$ We
regard $i_t$ as the representative firm's control at $t$. The
first-order conditions
for maximizing \Ep{oli3} are
$$ i_t = E_t \beta i_{t+1} -c^{-1} \beta h q_{t+1}
+ c^{-1} \beta E_t( p_{t+1} -d) \EQN oli4 $$
for $t \geq 0$.
We appeal to the certainty equivalence principle stated on
page \use{certequiv} to justify working with
a non-stochastic version of \Ep{oli4} formed by dropping
the expectation operator and the random term $\check \epsilon_{t+1}$
from \Ep{oli2}. We use
an insight of Sargent (1979) and Townsend
(1983).\NFootnote{They used this method to compute a rational expectations
competitive equilibrium. The key step was
to eliminate price and output by substituting
from the inverse demand curve and the production function into
the firm's first-order conditions to get a difference equation
in capital.}
We shift \Ep{oli1} forward one period, replace conditional
expectations with realized values, use \Ep{oli1} to substitute
for $p_{t+1}$ in \Ep{oli4}, and set $q_t = \overline q_t$ and $i_t = \overline i_t$ for all
$t\geq 0$ to get
%{\ninepoint
$$ \overline i_t = \beta \overline i_{t+1} - c^{-1} \beta h \overline q_{t+1}
+ c^{-1} \beta (A_0-d) - c^{-1} \beta A_1 \overline q_{t+1}
- c^{-1} \beta A_1 Q_{t+1} + c^{-1} \beta v_{t+1}. \EQN oli5 $$
%}%endninepoint
Given sufficiently stable sequences $\{Q_t, v_t\}$, we can solve \Ep{oli5}
and $\overline i_t = \overline q_{t+1} - \overline q_t$ to get a second-order difference equation
in $\bar q_t$, then use the method for constructing a stable solution
of a second order linear difference equation described in appendix A of chapter \use{timeseries} to
express the competitive fringe's
output sequence as a function of the (tail of the)
large firm's output sequence:
$$ \bar q_{t+1} = \lambda \bar q_t +k_0 + k_1 \sum_{j=0}^\infty (\beta \lambda)^j Q_{t+j+1} + k_2 \sum_{j=0}^\infty (\beta \lambda)^j v_{t+j+1} , $$
where $\lambda \in (0,1)$ and the $k_i$s are constants that are functions of demand and cost parameters.
The dependence of $\bar q_{t+1}$ on future $Q_{t+j+1}$'s opens an avenue for the large firm to influence $\bar q_{t+1}$ by its choice
of future $Q_{t+j+1}$'s.
It is this feature that makes the large firm's problem fail to be
recursive in the natural state variables $\overline q, Q$. In effect, the large firm arrives
at time $t+j$ {\it not\/} in the position of being able to take past values of $\bar q_t$ as given because these have already
been influenced by the large firm's choice of $Q_{t+j}$. Instead, the large firm
arrives at period $t >0$ facing the constraint that it must
confirm the expectations about its time $t$ decision
upon which the competitive fringe based its decisions at dates
before $t$.
\subsection{The large firm's problem}
The large firm views the competitive firm's sequence of Euler equations
as constraints on its own opportunities.
They are {\it implementability constraints\/} on the
large firm's choices.
Including the implementability constraints \Ep{oli5},
we can represent
the constraints
in terms of the transition law facing the large firm:
%{\ninepoint
$$ \eqalign{ \left[\matrix{ 1 & 0 & 0 & 0 & 0 \cr
0 & 1 & 0 & 0 & 0 \cr
0 & 0 & 1 & 0 & 0 \cr
0 & 0 & 0 & 1 & 0 \cr
A_0 -d & 1 & - A_1 & - A_1 -h & c \cr }\right]
\left[\matrix{ 1 \cr v_{t+1} \cr Q_{t+1} \cr \overline
q_{t+1} \cr i_{t+1} \cr}
\right]
& = \left[ \matrix{ 1 & 0 & 0 & 0 & 0 \cr
0 & \rho & 0 & 0 & 0 \cr
0 & 0 & 1 & 0 & 0 \cr
0 & 0 & 0 & 1 & 1 \cr
0 & 0 & 0 & 0 & {c\over \beta} \cr} \right]
\left[ \matrix{ 1 \cr v_t \cr Q_t \cr \overline
q_t \cr i_t \cr} \right] \cr
& + \left[\matrix{ 0 \cr 0 \cr 1 \cr 0 \cr 0 \cr}\right] u_t
, \cr} \EQN oli6 $$
%}%endninepoint
where $u_t = Q_{t+1} - Q_t $ is the control of the large firm.
The last row portrays the implementability constraints \Ep{oli5}.
Represent \Ep{oli6} as
$$ y_{t+1} = A y_t + B u_t . \EQN oli6a $$
Although we have included the competitive fringe's choice variable $i_t$ as a component
of the ``state'' $y_t$ in the large firm's transition law \Ep{oli6a},
$i_t$ is actually a ``jump''
variable. Nevertheless, the analysis in earlier sections of this chapter
implies that the solution of the large firm's
problem is encoded in the Riccati equation associated with
\Ep{oli6a} as the transition law. Let's decode it.
To match our general setup, we partition $y_t$ as
$y_t' = \left[\matrix{z_t' & x_t' \cr} \right]$ where
$z_t' = \left[\matrix{ 1 & v_t & Q_t & \overline q_t \cr}\right]$
and $x_t = i_t$.
The large firm's problem is
$$
\max_{\{u_t, p_{t+1}, Q_{t+1}, \overline q_{t+1}, i_t\}}
\sum_{t=0}^\infty \beta^t \left\{ p_t Q_t - {\cal C}_t \right\} $$
subject to the given initial condition
for $z_0$, equations \Ep{oli1} and \Ep{oli5} and $i_t = \overline q_{t+1} -
\overline q_t$,
as well as the laws of motion
of the natural state variables $z$. Notice that the large firm in effect chooses the
price sequence, as well as the quantity sequence of the
competitive fringe, albeit subject to the restrictions imposed by
the behavior of consumers, as summarized by the demand curve
\Ep{oli1} and the implementability constraint \Ep{oli5} that
describes the best responses of the competitive fringe.
By substituting \Ep{oli1} into the above objective function,
the large firm's problem can be expressed as
$$
\max_{\{u_t\}}
\sum_{t=0}^\infty \beta^t
\left\{ (A_0 - A_1 (\overline q_t + Q_t) + v_t) Q_t - eQ_t - .5gQ_t^2 -
.5 c u_t^2
\right\} \EQN oli7 $$
subject to \Ep{oli6a}.
This can be written
$$
\max_{\{u_t\}}
- \sum_{t=0}^\infty \beta^t \left\{ y_t' R y_t + u_t' Q u_t
\right\} \EQN oli9 $$
subject to \Ep{oli6a}
where
$$ R = - \left[\matrix{ 0 & 0 & {A_0-e \over 2} & 0 & 0 \cr
0 & 0 & {1 \over 2} & 0 & 0 \cr
{A_0-e \over 2} & {1 \over 2} & - A_1 -.5g
& -{A_1 \over 2} & 0 \cr
0 & 0 & -{A_1 \over 2} & 0 & 0 \cr
0 & 0 & 0 & 0 & 0 \cr} \right] $$
and $Q= {c \over 2}$.
%
% \subsection{Equilibrium representation}
%
% We can use \Ep{king11} to represent the
% solution of the large firm's problem \Ep{oli9} in the form:
% $$ \left[\matrix{z_{t+1} \cr \mu_{x,t+1}\cr}\right]
% = \left[\matrix{m_{11} & m_{12} \cr
% m_{21} & m_{22}\cr}\right]
% \left[\matrix{z_t \cr \mu_{x,t} \cr} \right] \EQN oli11 $$
% or
% $$ \left[\matrix{z_{t+1} \cr \mu_{x,t+1}\cr}\right]
% = m
% \left[\matrix{z_t \cr \mu_{x,t} \cr} \right] . \EQN oli11 $$
% The large firm is
% constrained to set $\mu_{x,0} \leq 0$, but will find it optimal to
% set it to zero.
% Recall that $z_t =\left[\matrix{ 1 & v_t & Q_t & \overline q_t \cr}\right]'$.
% Thus, \Ep{oli11} includes the equilibrium law of motion for the quantity
% $\overline q_t$
% of the competitive fringe. By construction, $\overline q_t$ satisfies the Euler
% equation of the representative firm in the competitive fringe, as
% we elaborate in Appendix \the\chapternum\use{appCblkstack}.
%% TTTTTT
\subsection{Numerical example}
We computed the optimal Stackelberg plan
for parameter settings $A_0, A_1, \rho, C_\epsilon,\hfil\break
c, d, e, g, h, \beta $ = $100, 1, .8, .2, 1, 20, 20, .2, .2,
.95$.\NFootnote{These calculations were performed
by the Matlab program {\tt oligopoly5.m} or the Python program {\tt oligopoly.py}.}\mtlb{olipololy5.m}%
%, a modification of Stijn and Tom's earlier
%program with robustness.XXXXX}
For these parameter values, a recursive representation of the Stackelberg plan is
$$u_t = (Q_{t+1} - Q_t) =\left[\matrix{-83.98 & -0.78 & 0.95 & 1.31 & 2.07 \cr}\right]
\left[ \matrix{z_t \cr x_t \cr}\right] $$
for $ t \geq 0$ and
$$x_0 = \bmatrix{ 31.08 & 0.29 & -0.15 & -0.56 } z_0 .$$
%Note in Figure 4.5 %\Fg{oli30}
%how starting from $0$ the implementation multiplier
%decreases toward its negative steady state value. The negative
%value of the multiplier reflects the cost to the large firm
%of adhering to its plan. The time inconsistency
%of the large firm's plan is reflected in the incentive
%the large firm would have to reset the multiplier to zero
%in any period and thereby reinitialize its plan (see Hansen,
%Epple, and Roberds (1985)). Figure 4.5 %\Fg{oli30}
%and the other sample paths show that the large firm is acting
%to smooth total output $Q+q$, and that it does so by inducing
%a negative contemporaneous covariance between its own output
%and the price.
%%
%%%%%%%%%
%%\midinsert
%%$$ \grafone{oli10.eps,height=2.5in}
%%{{\bf Figure 4.2.}
%%Impulse response of $p, q, Q, \mu_x$ to innovation to demand
%%shock $\epsilon$.} $$
%%\endinsert
%%%%%%%%
%
%\midfigure{oli10f}
%\centerline{\epsfxsize=3truein\epsffile{oli10.eps}}
%\caption{Impulse response of $p, q, Q, \mu_x$ to innovation to demand shock
%$\epsilon$.}
%\infiglist{oli10f}
%\endfigure
%
%%%%%%%%
%%\midinsert
%%$$\grafone{oli60.eps,height=2.5in}{{\bf Figure 4.3.}
%%Impulse response of $q+Q, w, v$ to $\epsilon$.} $$
%%\endinsert
%%%%%%%%%%%%
%
%\midfigure{oli60f}
%\centerline{\epsfxsize=3truein\epsffile{oli60.eps}}
%\caption{Impulse response of $q+Q, w, v$ to $\epsilon$.}
%\infiglist{oli60f}
%\endfigure
%
%%%%%%%%%%%%
%%\midinsert
%%$$
%%\grafone{oli20.eps,height=2.5in}{{\bf Figure 4.4.} Sample path of $q+Q, q, Q$.}
%% $$
%%\endinsert
%%%%%%%%%%%%%%%
%
%\midfigure{oli20f}
%\centerline{\epsfxsize=3truein\epsffile{oli20.eps}}
%\caption{Sample path of $q+Q, q, Q$.}
%\infiglist{oli20f}
%\endfigure
%
%%%%%%%%%%%
%% \midinsert
%%$$\grafone{oli30.eps,height=3in}
%% {{\bf Figure 4.5.} Sample path of $\mu_x, Q, p$.} $$
%% \endinsert
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
%
%\midfigure{oli30f}
%\centerline{\epsfxsize=3truein\epsffile{oli30.eps}}
%\caption{Sample path of $\mu_x, Q, p$.}
%\infiglist{oli30f}
%\endfigure
%
%%%%%%%%%%%%%%%%%%
%% \midinsert
%%$$
%%\grafone{oli50.eps,height=3in}{{\bf Figure 4.6.} Sample path of $v, Q, q, p$.}
%%$$
%% \endinsert
%%%%%%%%%%%%%%%%%%%
%
%\midfigure{oli50f}
%\centerline{\epsfxsize=3truein\epsffile{oli50.eps}}
%\caption{Sample path of $v, Q, q, p$.}
%\infiglist{oli50f}
%\endfigure
\section{Concluding remarks}
We shall
confront other problems in which optimal decision rules are history dependent in chapters \use{optaxrecur}, \use{socialinsurance},
\use{socialinsurance2},
and \use{credible} and shall see in various contexts how history
dependence can be represented recursively by
appropriately augmenting the natural state variables with
forward-looking variables chosen by private agents.\NFootnote{For another application of the techniques
in this chapter and how they related to the method recommended by
Kydland and Prescott (1980), see Evans and Sargent (2013).}
In
chapters \use{socialinsurance}, \use{socialinsurance2}, and \use{credible}, we make dynamic
incentive and enforcement problems recursive by augmenting the
state with continuation values of other decision
makers.\NFootnote{In chapter \use{socialinsurance}, we describe Marcet and Marimon's (1992, 1999) method of
constructing recursive contracts, which is closely related to the method
that we have presented in this chapter.}
\auth{Marcet, Albert}%
\auth{Marimon, Ramon}%
\auth{Kydland, Finn E.} \auth{Prescott, Edward C.}%
\auth{Evans, David} \auth{Sargent, Thomas J.}%
%
%\appendix{A}{History-dependent representation of Stackelberg plan}\label{appa1stack}%
%Substituting $u_t = - F y_t$ into the law of motion \Ep{bell2} gives the following motion
%for $y$:
%$$ \bmatrix{ z_{t+1} \cr x_{t+1} } = (A- BF) \bmatrix{ z_t \cr x_t }
%\EQN king11 $$
%or
%$$ \left[ \matrix{ z_{t+1} \cr x_{t+1} \cr} \right]
% = \left[ \matrix{m_{11} & m_{12} \cr m_{21} & m_{22}\cr} \right]
% \left[\matrix{ z_t \cr x_t \cr} \right] \EQN vonzer1 $$
%\auth{Von Zur Muehlen, Peter}%
% For present purposes, it is useful to eliminate $x_t$ as an argument of
% the decision rule $u_t = - F \bmatrix{z_t \cr x_t }$ by expressing it as a function of
% $[z_{t-1}, x_{t-1}]$ and then expressing
%the decision rule for $u_t$ as a function
%of $z_t, z_{t-1},$ and $u_{t-1}$.
%This can be accomplished as
%follows.\NFootnote{Peter Von Zur Muehlen suggested
%this representation to us.} First write the feedback rule \Ep{bell5} for $u_t$
%$$u_t = -F_1 z_{t} - F_2 x_t . \EQN vonzer2 $$
%Then where $F_2^{-1}$ denotes
%a generalized inverse of $F_2$,
% \Ep{vonzer2} implies $x_t = F_2^{-1}(u_t - F_1 z_t)$.
%Equate the right side of this expression to the right side
%of the second line of \Ep{vonzer1} lagged once and rearrange by using
%\Ep{vonzer2} lagged once to eliminate $\mu_{x,t-1}$
%to get
%% $$ u_t = f_{12} m_{22} f_{12}^{-1} u_{t-1} + f_{11} z_t
%% + f_{12}(m_{21} - m_{22} f_{12}^{-1} f_{11}) z_{t-1}
%% \EQN vonzer3;a $$
%% or
%$$ u_t = \rho u_{t-1} + \alpha_0 z_t + \alpha_1 z_{t-1} \EQN vonzer3 $$
%for $t \geq 1$, where $\rho = F_2 m_{22} F_2^{-1} , \alpha_0 = -F_1 ,
%\alpha_1 = -F_2(m_{21} - m_{22} F_2^{-1} F_1) $,\NFootnote{By making the instrument feed back on itself,
%the form of \Ep{vonzer3} potentially allows for
%``instrument-smoothing'' to emerge as an optimal rule under
%commitment. This insight partly motivated Woodford
%(2003) to use his model to interpret empirical evidence about
%interest rate smoothing in the United States.} while
%for $t =0$ the decision rule is
%$$ u_0 = -(F_1 + F_2 P_{22}^{-1} P_{21}) z_0. \EQN vonzer4 $$
%The difference equation \Ep{vonzer3} can be solved backwards subject to the initial condition \Ep{vonzer4} to deduce a sequence
%of history-dependent decision rules
%$$ u_t = \sigma_t(z^t), \ t \geq 0 , $$
%where $z^t = \left[ z_t, z_{t-1}, \ldots, z_0 \right]$.
%% For $t=0$, the initialization $\mu_{x,0}=0$ implies
%% that
%% $$ u_0 = f_{11} z_0. \EQN vonzer3;c $$
%
%\section{Exercises}
\showchaptIDfalse
\showsectIDfalse
\section{Exercises}
\showchaptIDtrue
\showsectIDtrue
\medskip
% \noindent{\it Exercise \the\chapternum.1} \quad There is no uncertainty.
% For $t \geq 0$, a monetary authority sets the growth of the (log)
% of money according to
% $$ m_{t+1} = m_t + u_t \leqno(1) $$
% subject to the initial condition $m_0>0$ given. The demand for money
% is
% $$ m_t - p_t = - \alpha (p_{t+1} - p_t), \alpha > 0, \leqno(2) $$
% where $p_t$ is the log of the price level. Equation (2) can be
% interpreted as the Euler equation of the holders of money.
%
% \medskip
% \noindent{\bf a.} Briefly interpret how equation
% (2) makes the demand for real balances vary inversely with
% the expected rate of inflation.
% Temporarily (only for this part of the exercise) drop
% equation (1) and assume instead that $\{m_t\}$ is a given sequence
% satisfying $\sum_{t=0}^\infty m_t^2 < + \infty$.
% Please solve the difference equation (2) ``forward''
% to express $p_t$ as a function of current and future values of $m_s$.
% Note how future values of $m$ influence the current price level.
%
% \medskip
% At time $0$, a monetary authority chooses a possibly
% history-dependent strategy for setting $\{u_t\}_{t=0}^\infty$. (The monetary
% authority commits to this strategy.) The monetary authority orders
% sequences $\{m_t, p_t\}_{t=0}^\infty$ according to
% $$ - \sum_{t=0}^\infty .95^t \left[ (p_t - \overline p)^2 +
% u_t^2 + .00001 m_t^2 \right]. \leqno(3) $$
% Assume that $m_0=10, \alpha=5, \bar p=1$.
% \medskip
% \noindent{\bf b.} Please briefly interpret this problem
% as one where the monetary authority wants
% to stabilize the price level, subject
% to costs of adjusting the money supply and some implementability
% constraints. (We include the term $.00001m_t^2$ for purely technical
% reasons that you need not discuss.)
%
% \noindent {\bf c.} Please write and run a Matlab program
% to find the optimal sequence
% $\{u_t\}_{t=0}^\infty$.
% \medskip
% \noindent {\bf d.} Display the optimal decision rule for $u_t$
% as a function of $u_{t-1}, m_t, m_{t-1}$.
% \medskip
% \noindent{\bf e.} Compute the optimal $\{m_t, p_t\}_t$
% sequence for $t=0, \ldots, 10$.
%
% \medskip
% \noindent{\it Hint:} The optimal $\{m_t\}$ sequence must satisfy
% $ \sum_{t=0}^\infty (.95)^t m_t^2 < +\infty$.
% You are free to apply the Matlab program {\tt olrp.m\/}.
% % that is available
% %from the course web site or from Yongs Shin.
\noindent{\it Exercise \the\chapternum.1} \quad There is no uncertainty.
For $t \geq 0$, a monetary authority sets the growth of the (log)
of money according to
$$ m_{t+1} = m_t + u_t \leqno(1) $$
subject to the initial condition $m_0>0$ given. The demand for money
is
$$ m_t - p_t = - \alpha (p_{t+1} - p_t), \quad \alpha > 0, \leqno(2) $$
where $p_t$ is the log of the price level. Equation (2) can be
interpreted as an Euler equation of holders of money.
\medskip
\noindent{\bf a.} Briefly interpret how equation
(2) makes the demand for real balances vary inversely with
the expected rate of inflation.
Temporarily (only for this part of the exercise) drop
equation (1) and assume instead that $\{m_t\}$ is a given sequence
satisfying $\sum_{t=0}^\infty m_t^2 < + \infty$. Please verify that
equation (2) implies that $p_t = (1-\lambda ) m_t + \lambda p_{t+1}$,
where $\lambda = {\frac{\alpha}{1 + \alpha}} \in (0,1)$.
Please solve this difference equation ``forward''
to express $p_t$ as a function of current and future values of $m_s$.
{\it Hint:} If necessary, please review appendix A of chapter \use{timeseries}.
\medskip
\noindent At time $0$, a monetary authority chooses a possibly
history-dependent strategy for setting $\{u_t\}_{t=0}^\infty$. (The monetary
authority somehow commits to this strategy once and for all at time $0$.) The monetary authority orders
sequences $\{m_t, p_t\}_{t=0}^\infty$ according to
$$ - \sum_{t=0}^\infty .95^t \left[ p_t^2 +
u_t^2 + .00001 m_t^2 \right]. \leqno(3) $$
%$Assume that $m_0=10, \alpha=5$.
\medskip
\noindent{\bf b.} Please briefly interpret this problem
as one where the monetary authority wants
to stabilize the price level, subject
to costs of adjusting the money supply rapidly and a set of implementability
constraints. (We include the term $.00001m_t^2$ for purely technical
reasons that you need not discuss.)
\medskip
\noindent {\bf c.} Please formulate a `dynamic programming squared' problem
to find the optimal sequence
$\{u_t\}_{t=0}^\infty$. Please tell why it can be called a `dynamic programming squared' problem.
\medskip
\noindent{\bf d.} Define a plan and a continuation of a plan.
\medskip
\noindent {\bf e.} Describe a recursive representation of the optimal plan.
\medskip
\noindent{\bf f.} Tell whether you agree or disagree with the following statement.
``A continuation of an optimal plan is an optimal plan.'' Please describe the logic that causes you to agree or to disagree.
\medskip
\noindent{\bf g.} Please describe formulas to compute all elements of an optimal plan. (You don't have to write
a Matlab or Python program to implement those formulas, but a Matlab or Python programmer should be able to write a program based on your formulas.)
\medskip
\noindent{\it Exercise \the\chapternum.2} \quad {\bf Markov perfect policy makers}
\medskip
\noindent Now let's redo the optimal policy problem in exercise \the\chapternum.1 with timing protocols like those in
the Markov perfect equilibrium concept introduced in chapter 7.
There is a sequence of monetary policy authorities, each in office for only one period. Let
$$ L(p_t, m_t, u_t) = \left[ p_t^2 +
u_t^2 + .00001 m_t^2 \right]. $$
The
policy authority in office at time $t$ chooses $u_t$ to maximize
% $$ - \sum_{j=0}^\infty .95^t \left[ p_{t+j}^2 +
% u_{t+j}^2 + .00001 m_{t+j}^2 \right]. \leqno(3') $$
$$ - \sum_{j=0}^\infty .95^j L(p_{t+j}, m_{t+j}, u_{t+j}) \leqno(1) $$
subject to
$$ m_{t+1} = m_t + u_t ,$$
taking as given $m_t$. To make the time $t$ decision maker's problem well posed, we must attribute views about $\{p_{t+j}, m_{t+j}\}_{j=1}^\infty$ to the time
$t$ decision maker. In the Markov perfect spirit, we assume that the time $t$ policy maker takes as given a policy rule $u_{t+j} = \overline g m_{t+j}$ that
it assumes will be chosen by all
successor monetary policy authorities $j \geq 1$. We also assume that the date $t$ policy authority believes that $p_{t+j} = \overline h m_{t+j}$ for all $j \geq 1$.
What about the public? If it were to believe that the law of motion of the money supply
is
$$ m_{t+1} = (1+ \overline g) m_t $$
for all $t \geq 0$, then to satisfy the difference equation $p_t = (1-\lambda) m_t + \lambda p_{t+1}$, it would act to set the price level according to
$$ p_t = \overline h m_t, \quad {\rm where} \ \overline h = {\frac{(1-\lambda) }{1 - \lambda (1+\overline g)}}, \leqno(2) $$
and where we computed $\lambda \in (0,1)$ in the previous problem.
\medskip
\noindent{\bf a.} Please verify that equation (2) solves $p_t = (1-\lambda) m_t + \lambda p_{t+1}$ .
\medskip
\noindent{\bf b.} Consider the value function
$$ w(m_t) = - \sum_{j=0}^\infty \beta^j L (p_{t+j}, m_{t+j}, u_{t+j}) $$
where
$$ \eqalign{ p_{t+j} & = \overline h m_{t+j} \cr
u_{t+j} & = \overline g m_{t+j} \cr
m_{t+j+1} & = m_{t+j} + u_{t+j} .} $$
Please interpret this value function in terms of the behavior that it assumes about (i) the sequence of monetary policy makers who choose
$\{u_{t+j}\}_{j=0}^\infty$, and (ii) the money holders who choose $\{p_{t+j}\}_{j=0}^\infty$.
\medskip
\noindent{\bf c.} Now please consider the following functional equation:
$$ v(m_0) = \max_{u_0} \left\{ - L(p_0, m_0, u_0) + \beta w(m_1) \right\} \leqno(3) $$
where the maximization is subject to
$$ \eqalign{ m_1 & = m_0 + u_0 \cr
p_0 & = \lambda \overline h [ m_0 + u_0] + (1 - \lambda) m_0 . } $$
Please interpret the equation $p_0 = \lambda \overline h [ m_0 + u_0] + (1 - \lambda) m_0 $ in terms of what it assumes about the
beliefs of the money holders who set the price level at time $0$.
Please interpret the functional equation (3) in terms of the beliefs of the time $0$ monetary authority who chooses $u_0$, in particular,
its beliefs about the decisions of successor monetary authorities.
\medskip
\noindent{\bf d.} Let the optimizer of the right side of equation (3) be $u_0 = g m_0$.
Please define a Markov perfect equilibrium. Tell who chooses what when. Also tell what each decision maker assumes about
other pertinent decision makers.
\medskip
\noindent{\bf e.} Please describe a computer algorithm for computing a Markov perfect equilibrium, being careful first to describe all of the objects comprising
a Markov perfect equilibrium.
\noindent{\bf f.} Is a continuation of a Markov perfect equilibrium a Markov perfect equilibrium?
\medskip
\noindent{\it Exercise \the\chapternum.3} \quad {\bf Duopoly}
\medskip
\noindent