Äîêóìåíò âçÿò èç êýøà ïîèñêîâîé ìàøèíû. Àäðåñ îðèãèíàëüíîãî äîêóìåíòà : http://belyakov.imec.msu.ru/papers/ThesisBelyakov.pdf
Äàòà èçìåíåíèÿ: Wed Sep 10 19:56:37 2014
Äàòà èíäåêñèðîâàíèÿ: Sat Apr 9 22:27:49 2016
Êîäèðîâêà:
´ Universite catholique de Louvain ´ ´ Departement des Sciences economiques ` ´ ´ These presentee en vue de lobtention du grade de ´ docteur en sciences economiques

Essays on economic dynamics under heterogeneity
Anton O. Belyakov

Composition du jury: Julio Davila (promoteur) Raouf Boucekkine Carmen Camacho Vladimir Veliov Mathieu Parenti



Acknowledgements
First of all I would like to thank my supervisors Raouf Boucekkine and Julio Davila. I'm grateful to other members of my doctoral jury: Mathieu Parenti, Carmen Camacho, and Vladimir Veliov. It has been a pleasure and a privilege to collaborate with Vladimir Veliov, Josef Haunschmied, and Tsvetomir Tsachev. Results of this collaboration are presented in the first two chapters of this thesis. I would like to thank Alain Venditti, David de la Croix, and all participants of Workshop Overlapping Generations Days, Marseille (May 14-16, 2012) for their helpful comments on the results that constitute the first chapter. This chapter has also greatly benefited from the discussions at doctoral workshops in UCL. For that I'm thankful to the participants and especially to the discus´` sants of my talks Helene Latzer and Carmen Camacho. I also want to express my gratitude for fruitful discussions to Michael Kuhn, Klaus Prettner, and to all my colleagues from the Institute of Mathematical Methods in Economics in the Vienna University of Technology. I'm grateful to Shlomo Weber and Alexey Savvateev for recommendations they gave me for a Marie Curie Fellowship at the Center for Operations Research and Econometrics. For the fellowship I'm thankful to Jean Hin´ driks. My gratitude goes also to Claude d'Aspremont, Celine Poilly, Michel de Vroey, Yuri Yatsenko, Vincent Vannetelbosch, and all professors whom I learned from during my stay in UCL. I'm thankful to Francois Maniquet and ¸ all participants of lunch seminars at the CORE for their useful comments on the results presented in the third chapter. My thoughts also go to my colleagues and friends, that have made the time spent at the CORE especially inspiring: Paolo, Salome, Maia, Rafael, Peter, ´ Besik, Olivier, Jean-Sebastien, Joel, Denis, Deniz, Aysegul, Asel, Kenmei, Santanu, Hiroshi, Andrea, Michal, Gilles, Ying, Christian, Bastien, Thang, Alejandro, the two Tanjas, and many more. This research was partly financed by Wiener Wissenschafts-, Forschungsund Technologiefonds (WWTF) under grant No MA07-002 and by the Aus-


iv trian Science Foundation (FWF) under grants No I 476-N13 and P 25979-N25.


Contents
General Introduction 1 Heterogeneous consumption in OLG model with horizontal innovations 1.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.1.1 Population and labor . . . . . . . . . . . . . . . . . . 1.1.2 Consumption and savings . . . . . . . . . . . . . . . 1.1.3 Production sector . . . . . . . . . . . . . . . . . . . . 1.1.4 R&D sector . . . . . . . . . . . . . . . . . . . . . . . 1.1.5 Financial sector . . . . . . . . . . . . . . . . . . . . . 1.1.6 Market clearing . . . . . . . . . . . . . . . . . . . . . 1.1.7 Succinct format of the general equilibrium equations . 1.2 Analysis of general equilibrium . . . . . . . . . . . . . . . . . 1.2.1 Aggregated state variables . . . . . . . . . . . . . . . 1.2.2 Utility of an agent . . . . . . . . . . . . . . . . . . . 1.2.3 Benchmark case of homogeneous consumption with =1 . . . . . . . . . . . . . . . . . . . . . . . . . . 1.2.4 Homogeneous consumption with < 1 . . . . . . . . 1.2.5 Heterogeneous consumption . . . . . . . . . . . . . . 1.3 Discussion of the model and extensions . . . . . . . . . . . . 1.3.1 Efficiency and fiscal policy . . . . . . . . . . . . . . . 1.3.2 Sources of heterogeneity . . . . . . . . . . . . . . . . 1.3.3 Economic growth . . . . . . . . . . . . . . . . . . . . 1.4 Conclusions and prospects . . . . . . . . . . . . . . . . . . . 1.5 Proofs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.5.1 Proposition 1 . . . . . . . . . . . . . . . . . . . . . . 1.5.2 Proposition 2 . . . . . . . . . . . . . . . . . . . . . . 1.5.3 Corollary 1 . . . . . . . . . . . . . . . . . . . . . . . 1.5.4 Lemma 1 . . . . . . . . . . . . . . . . . . . . . . . . i 1

3 3 6 7 11 12 13 15 15 16 16 20 20 22 23 27 27 30 32 32 33 33 34 34 35


ii 1.5.5 Proposition 3 . . . . . 1.5.6 Proposition 4 . . . . . Calculations . . . . . . . . . . 1.6.1 Expression (1.56) . . . 1.6.2 Expression (1.66) . . . 1.6.3 Expression (1.84) . . . 1.6.4 Expression (1.83) . . . No-arbitrage condition for m = Iterative procedure to calculate .... .... .... .... .... .... .... m0 ( , t , general ....... ....... ....... ....... ....... ....... ....... Q) m1 (q) . equilibrium . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35 37 38 38 39 39 39 40 41

1.6

1.7 1.8

2 Optimal Control of Heterogeneous Systems main of Heterogeneity 2.1 Introduction . . . . . . . . . . . . . . . 2.2 Formulation of the problem . . . . . . . 2.3 The optimality conditions . . . . . . . . 2.4 A numerical approach . . . . . . . . . . 2.5 Two examples of non-differentiability of 2.6 An economic example . . . . . . . . . 2.7 Conclusion . . . . . . . . . . . . . . .

with Endogenous Do.. .. .. .. the .. .. ..... ..... ..... ..... objective ..... ..... ..... ..... ..... ..... function ..... ..... in. . . . . . . . . . 77 77 80 80 81 83 87 88 89 93 95 97 43 43 45 48 55 56 62 72

3 On the relation of country size to political system and wealth equality 3.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . 3.2 Mechanisms of border moving by land trade . . . . . . . . . 3.2.1 Territory exchange mechanism and timing . . . . . . 3.2.2 Forms of governments and decision making . . . . . 3.3 Fair land trades . . . . . . . . . . . . . . . . . . . . . . . . 3.3.1 Welfare analysis . . . . . . . . . . . . . . . . . . . 3.3.2 Two Monarchies choose the stable social optimum . 3.4 Effect of possibility to cheat and conquest . . . . . . . . . . 3.5 Effect of additional costs . . . . . . . . . . . . . . . . . . . 3.6 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . References


General Introduction
Heterogeneity plays an important role in modern economic modeling. First of all, it is worth to mention the models of overlapping generations (OLG), where age distribution of the population is not homogeneous. For example, age distribution of a country population can influence the pension system and may lead to adoption of an optimal age-selective migration policy by the state, see e.g. (Simon, Belyakov, & Feichtinger, 2012) or (Belyakov, Feichtinger, & Simon, 2011). Besides age distribution, heterogeneity can be present in preferences, technologies, or/and wealth of economic agents. Heterogeneity of space, age, or size distribution of a renewable product is considered in the models of optimal fishery or forestry, see e.g. (Belyakov, Davydov, & Veliov, 2013; Belyakov & Veliov, 2014) and references therein. This thesis consists of three essays studding the influence of heterogeneity upon the economic dynamics. In the first model of R&D-based economic growth the agents have heterogeneous preferences over a growing continuum of consumption goods. In the second model a monopolist firm or an industry distributes investments in physical capital among a growing set of technologies the payoff of which is heterogeneous. In the third model of solving territorial disputes between countries the income from the land use of a country is not equally (heterogeneously) distributed among the citizens. Each model is presented in a separate chapter. Chapter 1 develops a general equilibrium endogenous growth model involving heterogeneous consumption by an age-structured population with uncertain but limited life span and balanced life-time budget without bequests. The heterogeneity is introduced via weights which the individuals attribute in their utility function to consumption of different goods depending on the vintage of the good. The goods are produced by monopolistically competitive firms and the variety of available goods/technologies is determined endogenously through R&D investments. The general equilibrium is characterized by a system of functional equations and is analytically or numerically determined for several particular weight functions. It is shown that the investments 1


2 by agents may be insufficient to sustain growth. Therefore additional investments are provided by a competitive financial sector. The resulting imbalance between agents' assets and total value of firms can grow unboundedly in the case of homogeneous consumption. The results exhibit the qualitative difference between the dynamics of the model with heterogeneous versus homogeneous consumption. In particular heterogeneous consumption (when old goods are discounted) reduces the additional investments by the financial sector so that the values of firms become balanced by the assets of agents in the long run. These results are obtained in collaboration with Josef Haunschmied and Vladimir Veliov, (Belyakov, Haunschmied, & Veliov, 2012). Chapter 2 deals with optimal control of heterogeneous systems, that is, families of controlled ODEs parameterized by a parameter running over a domain called domain of heterogeneity. The main novelty here is that the domain of heterogeneity is endogenous: it may depend on the control and on the state of the system. This extension is crucial for several economic applications and turns out to rise interesting mathematical problems. A necessary optimality condition is derived, where one of the adjoint variables satisfies a differential inclusion (instead of equation) and the maximization of the Hamiltonian takes the form of "min-max". As a consequence, a Pontryagin-type maximum principle is obtained under certain regularity conditions for the optimal control. A formula for the derivative of the objective function with respect to the control from L is presented together with a sufficient condition for its existence. A stylized economic example is investigated analytically and numerically. These results are obtained in collaboration with Tsvetomir Tsachev and Vladimir Veliov, (Belyakov, Tsachev, & Veliov, 2011). Chapter 3 studies a mechanism governing the territory exchange between groups of people such as countries or municipalities is proposed based on land trading with approval of both sides under particular voting rule (majority rule, veto rule, etc.). Voting rules can model different forms of government such as monarchy, oligarchy, and democracy. Under these forms of government the influence of hererogeneity of wealth distribution among citizens on the territory exchange result is studied. Conquest of the territory is considered as a special case of trading, when the buyer country pays not to the seller country but to the army which conquer territory for the buyer. Conditions at which countries prefer to trade land rather than fight for it are found using a game theoretic approach. This is an extension of the author's master thesis (Belyakov, 2007a, 2007b), the results of witch were presented in (Belyakov, 2009).


Chapter 1

Heterogeneous consumption in OLG model with horizontal innovations
1.1 Introduction
In this chapter we present and investigate a general equilibrium model comprised of the following components: firms producing consumption goods, R&D firms developing new goods, bank sector, and age-structured consumers. A main new feature of the model is, that the consumption is heterogeneous and the consumers' preference may depend on the vintage of a particular good, as well as on the current technological frontier. This, together with the involvement of the bank sector, allows to reveal interesting new results about the economic growth. In the proposed model, agents consume a continuum of perishable goods produced by monopolistically competitive firms. Each firm possesses a permanent patent which the firm buys on perfectly competitive R&D market. The needed investments are provided by a competitive bank sector, and are secured by agents' savings and future firms profits. We consider any improvement in the quality of a good as the invention of a new good by some R&D firm that increases the variety of goods, thus implying only horizontal innovations. The variety of goods increases with a speed proportional to the labor employed by perfectly competitive R&D firms with allowance for knowledge spillover, like in Romer (1990). The consumers, that is, the population in the considered closed economy, have a finite (but uncertain) life span and no bequest intention. This com3


4

Heterogeneous consumption in OLG model with horizontal innovations

ponent of the model adapts the Cass and Yaari (1967) framework and differs from the perpetual youth models (e.g., Blanchard, 1985), and from more general (possibly realistic) demographic setup in (Faruqee, 2003), since finite life span facilitates rigorous mathematical proofs. A general equilibrium model with realistic survival probability is considered in (Boucekkine, de la Croix, & Licandro, 2002). We consider differentiated goods as final consumption goods, like in Judd (1985), and disentangle the concepts of productivity growth and growth of the variety of goods. In other words, scientists increase the variety of products rather than the productivity of labor. Indeed, the invention of a new car design does not mean that the sells of cars would increase in total or that new car is produced with less labor. The result of the invention can be that people would just buy cars of new design instead of old one. Although old fashioned cars may still be attractive to some customers and expensive, the production of such cars would decrease. Hence, the total production may stay the same, while the utility of a consumer increases because of the increasing variety of designs, since the consumer can find the most suitable car for her among newly invented designs. We do not specify how productivity of labor (and therefore per capita consumption) depends on R&D activity. This contrasts idea-based growth models (e.g., Romer, 1990) where the invention of a new good (technology) immediately increases productivity of labor and hence per capita consumption. That results in the scale effect prediction when the long run growth rate depends on the population of the country (scale of the economy), which is strongly at odds with 20th-century empirical evidence, see (Jones, 1995a, 1995b). We can still have a scale effect for the growth rate of agents' utilities, because the instantaneous utility depends on the increasing variety of goods. But, per capita consumption may not depend on the variety of goods in our model. Thus, we avoid determining the real growth rate of per capita consumption, included in GDP per capita, and its scale effect predictions. We study the growth of lifetime discounted utility of generations. All agents are born with zero assets and should be insured from dying indebted. Agents can borrow money from other coexisting generations like in d'Albis ´ and Augeraud-Veron (2011) which is a continuous time generalization of discrete pure exchange OLG models (e.g., Samuelson, 1958; Gale, 1973). In our model we introduce production without physical capital as in Sorger (2011), so that agents can invest only in patents. But, in contrast to Sorger (2011), the agents are not the only investors in the model. The reason for that is the absence of infinitely living agents' cohorts (Blanchard, 1985) or households (Romer, 1990; Sorger, 2011) to own infinitely living firms. The wish of


1.1. Introduction

5

agents to consume all their bounded lifetime income would hamper their investments in firms and stop the growth. Aggregated assets of agents are proved to be bounded and could be even negative like in pure exchange OLG models (Samuelson, 1958; Gale, 1973) depending on the income pattern over the life of an agent. So in the case of endogenous growth there could be imbalance between aggregated assets and the total value of all patents in the economy. To resolve this issue we introduce competitive banking sector which plays a role of infinitely living institution investing in new firms. Banks give loans to the startup firms for purchasing the patents under the pledge of these very patents. Because of the difference between savings and patents' value the economy needs money (liquidity) provided by the banks. Thus, the banks create additional liquidity equal to the change of the value of the patents in the economy. Hence, there are two asset markets in the economy. The one is the market of firms' loans balanced with the values of their patents. The other is the market of agents' savings balanced with their life insurances, according to which in the case of sudden death of an agent her debt is repayed or her deposit is taken by the insurance company. The banks keep deposits of agents under the same interest rate as loans for the firms because of the no-arbitrage condition. But we do not require the balance between agents' savings and firms' loans. Moreover, it will be seen that such requirement is not needed in our model for determining a general equilibrium and could make the growth impossible. A substantial novelty of the proposed model is that it is a hybrid of continuous time OLG model and growth model with a continuum of consumption goods. It happens that optimal consumption of an agent can be obtained by a two-stage procedure similar to the one in Dixit and Stiglitz (1977). The main result is that for the agents' savings alone may be not enough for growth. Additional money is needed, which is provided by banks. One can interpret this money as a negative bubble. It is proven that aggregated assets of agents are bounded while the total value of intellectual property in the economy with homogeneous consumption can grow unboundedly so the difference between agents' savings and the value of patents can also be unlimited. The heterogeneity of consumption (discounting old goods) brings qualitative difference in the model dynamics compared to the homogenous case. The heterogeneity can make the total value of patents be bounded with zero limit, thus reducing the imbalance between savings of agents and the value of patents. The growth of the lifetime discounted utility of agents' generations could be bounded in the heterogeneous case in contrast to the case of homogeneous consumption, where the growth of utility is unbounded. It means that in the long run agents preferring new goods do not appreciate further increase of the variety of goods.


6

Heterogeneous consumption in OLG model with horizontal innovations

In the case of homogeneous consumption we prove that the real interest rate tends to zero which is the biological interest rate (the population growth rate which is zero in the long run), see Samuelson (1958). While the general equilibrium with heterogeneous consumption can be inefficient and the real interest rate can become negative as it may happen in OLG models (e.g., Blanchard, 1985; Diamond, 1965). We observed that in both heterogeneous and homogeneous cases the variety of goods can grow unboundedly. The chapter is organized as follows. Section 1.1.1 introduces population and labor dynamics. Section 1.1.2 solves the agent's problem of finding her optimal consumption and investment profiles. Section 1.1.3 considers the monopolistically competitive production of continuum of goods and finds the price of goods in equilibrium. Section 1.1.4 introduces competitive R&D sector that increases the variety of goods. Section 1.1.5 finds the equilibrium conditions in financial sector (zero profit of banks) and in R&D sector (zero profit of R&D firms). Section 1.1.6 describes the market clearing including the full employment condition. Section 1.1.7 presents final succinct format of the general equilibrium equations. Section 1.2 studies dynamics of aggregated variables and life time aggregated utility of an agent. Section 1.3 discusses the model considering another source of heterogeneity in section 1.3.2 with introduction of heterogeneous production of labor that also allows for discussion on the economic growth in section 1.3.3. Section 1.3.1 introduces the government and shows that its fiscal policy is efficient in the heterogeneous case in contrast to the homogeneous one. Section 1.4 concludes the chapter. The proofs are located in the Appendix as well as the numerical procedure is described in Appendix 1.8.

1.1.1 Population and labor
The population is time-varying and exogenous: n( , t ) denotes the size at time t of the population cohort born at time . It is assumed that there is a maximal age , so that n(t , ) = 0 for t - , but n( , t ) > 0 for 0 t - < . That is, there are always some people in every cohort until it reaches age . For the population function n(·, ·) it is assumed that it is continuous in for any fixed t and is non-increasing and absolutely continuous in t for every fixed . Then the function S( , t ) n( , t ) n( , )

is the survival probability at time t of an individual of cohort . Survival prob-


1.1. Introduction

7

ability defines mortality rate as

µ ( , t ) -

S( , t ) n( , t ) =- S( , t ) n( , t )

(1.1)

(the dot above a symbol denotes the derivative with respect to the time t ). Thus, we have the expression t (1.2) S( , t ) = e- µ ( , ) d for the survival probability, thus n( , t ) = n( , ) e
-
t


µ ( , ) d

.

(1.3)

The total population at t is obtained by the integration of n over all currently living cohorts N (t )
t
t -

n( , t ) d .

We assume that each individual is endowed with l ( , t ) units of homogeneous labor per time, so that L(t )
t
t -

l ( , t ) n( , t ) d

(1.4)

is the total amount of available labor units in the economy at time t .

1.1.2 Consumption and savings
The evolution of the assets is determined by the consumption/saving decisions of the individuals. The assets are homogeneous, while the consumption goods are heterogeneous: products labeled by the numbers q [0, Q(t )] are available at time t . These products are ordered according to the invention time of the respective technology q, so that Q(t ) is the newest product at time t ­ the one just created at t . It is assumed that each individual has perfect foresight for the wage w(t ), the real return rate on asset r(t ), the available consumption goods q [0, Q(t )], and their real prices p(t , q), for all t within her life-horizon. Under particular normalization of wage and productivity of labor, the prices p will be proven to be constant over both t and q. To write everything in real terms we will use labor as numeriare, that is, we set the wage equal to unity, w(t ) 1. The wage, normalized to 1, is assumed to be equal for all jobs, which is reasonable in a model where qualification is not taken into account.


8

Heterogeneous consumption in OLG model with horizontal innovations

A representative agent born at time chooses her consumption c( , t , q) of good q [0, Q(t )] at time t so that her expected total discounted utility u( )
+


e

- (t - )

S( , t )

Q(t )
0

m( , t , q, Q(t )) c( , t , q) dq dt

(1.5)

is maximized subject to the dynamic budget constraint a( , t ) = l ( , t ) + (r(t ) + µ ( , t )) a( , t ) -
Q(t )
0

p(t , q) c( , t , q) dq,

(1.6)

with the following boundary conditions, due to the absence of bequest, a( , ) = 0, a( , + ) = 0, (1.7) (1.8)

where a( , t ) is the real amount of assets that the agent has at time t [ , + ], (0, 1) is an elasticity parameter, m( , t , q, Q(t )) 0 is the weight function, with which an agent aggregates her utilities from consumption of different products. The weight function m( , t , q, Q(t )) represents agent's preferences among available goods q [0, Q(t )] and is assumed to be strictly positive at least on some subset of [0, Q(t )] with positive measure. That will result in positive demand for new goods. The weight function m( , t , q, Q(t )) makes the goods heterogeneous if it depends on q. Notice that it can depend on the current technological frontier Q and on the age t - of the agent. We will show how the shape of the function m qualitatively determines the outcome of general equilibrium. We assume that individuals are insured against the risk of dying with positive assets by a fair life-insurance company in the spirit of Yaari (1965) that redistributes wealth of individuals who died to those who are still alive in the same cohort1 . Therefore the real rate of return r(t ) is augmented by the mortality rate µ ( , t ). Thus, functional (1.5) is an extension of that in Judd (1985). The main novelty is the function m( , t , q, Q(t )) which makes goods heterogeneous2 . Since the insurance should ensure that each cohort ultimately consumes all its assets we add the end-point condition (1.8). The absence of bequest implies the initial condition (1.7).
The equal saving/debt redistribution of deceased agent only within her cohort satisfies the balance of fare insurance, because agents from the same cohort have the same assets and probability of death. This redistribution is possible due to the assumption that there are always some people in the cohort until it becomes years old (n( , t ) > 0 for 0 t - < ). 2 The functional form m( , t , q, Q(t )) is given exogenously, thus it differs from the quality in quality-adjusted Dixit-Stiglits consumption index used in some models with vertical innovations (e.g., Dinopoulos & Thompson, 1998).
1


1.1. Introduction

9

For any fixed problem (1.5)­(1.8) has an optimal control c( , ·, ·) which can be obtained by a two-stage procedure similar to the one in Dixit and Stiglitz (1977). In the inner stage we define the total real expenditures for consumption at time t as E ( , t )
Q(t ) 0

p(t , q) c( , t , q) dq.

(1.9)

Then we fix an arbitrary non-negative function E (·) and determine for fixed and t the optimal distribution c( , t , ·) that maximizes the inner integral in (1.5) subject to (1.9). The solution is ( c( , t , q) = where G( , t ) The resulting value be written as (G( , survival probability the following outer u( ) = subject to a( , t ) = l ( , t ) + (r(t ) + µ ( , t )) a( , t ) - E ( , t ), a( , ) = 0, a( , + ) = 0. E ( , t ) 0, (1.13) (1.14)
+


m( , t , q, Q(t )) p(t , q)

)

1 1-

E ( , t ) , G( , t )

(1.10)

) Q(t ) ( m( , t , q, Q(t ))
0

1 1-

p(t , q)

p(t , q) dq.

(1.11)

of the instantaneous utility (the inner integral in (1.5)) can t ))1- (E ( , t )) . Then, inserting expression (1.2) for the and solution (1.10) of the inner problem in (1.5), we obtain problem (for a fixed cohort )
(t - )-
t


e-

µ ( , ) d

(G( , t ))1

-

(E ( , t )) dt max , (1.12)
E ( ,·)

Using the Pontryagin maximum principle we obtain the following first-order conditions for problem (1.12)­(1.14): (



E ( , t ) G( , t )

)

-1

- ( , t ) = 0,

(1.15) (1.16)

- ( , t ) = ( , t ) (r(t ) - ) ,

where ( , t ) is the adjoint variable representing the marginal utility per unit of income. Note that like in Sorger (2011) the adjoint equation (1.16) does


10

Heterogeneous consumption in OLG model with horizontal innovations

not depend on the mortality rate µ because the agent has fair life insurance. Adjoint equation (1.16) has the solution

( , t ) = 0 ( ) e

-

t


(r( )- ) d

,

(1.17)

which defines along with (1.15) the solution of the outer problem ( E ( , t ) = G( , t ) e
t


(r( )- ) d

0 ( )

)

1 1-

,

(1.18) has to this it

where 0 ( ) is the initial value (at time t = ) of the adjoint variable that be adjusted to ensure the end-condition a( + ) = 0 in (1.14). In doing will be notationally convenient to define the discount factors ( t ) Rµ ( , t ) exp - (r( ) + µ ( , )) d , ( t ) - r( ) d . R ( , t ) exp - 1-

(1.19) (1.20)

Standard calculations using the Cauchy formula for the solution of (1.13) and notations (1.19)­(1.20) give the following expression for 0 ( ), for which conditions (1.14) are satisfied (

0 ( )

)

1 1-

+

=

+



Rµ ( , t ) l ( , t ) dt . R ( , t ) Rµ ( , t ) G( , t ) dt


(1.21)

Then expression (1.18) with the use of (1.20) and (2.3) takes the explicit form R ( , t ) + Rµ ( , s)l ( , s) ds E ( , t ) = G( , t ) + . R ( , s)Rµ ( , s)G( , s) ds (1.22)

Using (1.10) and (1.22), one can obtain the consumption of product q at time t by an agent born at : ( c( , t , q) = m( , t , q, Q(t )) p(t , q) )
1 1-

+


R ( , t ) h( ) . R ( , s)Rµ ( , s)G( , s) ds

(1.23)

It is proportional to the agent's human wealth defined as in Blanchard (1985) h( )
+


Rµ ( , s)l ( , s) ds,

(1.24)


1.1. Introduction

11

which is the present value of agent's income flow from labor. The propensity to consume the product q decreases with its relative price p(t , q) m( , t , q, Q(t )) (
+

) R ( , s)Rµ ( , s)G( , s) ds

1-

and (due to the multiplier R ( , t ) in (1.23)) with the age of the agent t - , provided that > r(·) in (1.20). There is no nonhuman wealth in the model because other factors of production (besides labor, e.g. physical capital available for agents to invest into) are absent. Thus, agents' assets only redistribute values among generations in time. Note that the optimal consumption profile (1.23) is completely defined by the price p(·, ·), the real interest rate r(·), and the frontier of the product variety Q(·). This is the key difference from the models with infinitely living households (like in Sorger, 2011) or OLG models with bequests, where consumption also depends on some free variable (0 ( ) in Sorger, 2011) which is to be defined in equilibrium.

1.1.3

Production sector

Further we assume that each product q [0, Q(t )] available at time t is produced by a single firm, that is, we implement the concept of monopolistic competition. Starting from its creation, the firm producing product q may exist forever.3 Moreover, we assume, similarly as Sorger (2011), that the production of each good involves only labor, and that the production of one unit of any good requires one unit of labor. Here we denote by C(t , q) the production of good q and use the fact that at equilibrium it is equal to the aggregated consumption of good q: C(t , q)
t
t -

c( , t , q) n( , t ) d .

(1.25)

The firm holding permanently the patent for product q sets the price p = p (t , q). Inserting (1.23) into (1.25) one can express the demanded quantity at price p as F (t , q) C(t , q) = , (1.26) 1 p 1-
There is no scrap time presented in classical vintage models. Nevertheless the firm may stop production of its product, when demand for it becomes zero, because of the function m.
3


12 where the function

Heterogeneous consumption in OLG model with horizontal innovations

t

F (t , q) =

t -

(m( , t , q, Q(t ))) 1- R ( , t ) h( ) n( , t ) d + R ( , s)Rµ ( , s)G( , s) ds

1

does not depend on the price p(t , q) of any particular good q due to the assumption of monopolistic competition. It follows from the definition of G( , t ) in (1.11) that it depends on the prices of the goods in an integrated form, so that no single firm may influence F (t , q). The q-th firm produces quantity C(t , q), the production cost of which in real terms is C(t , q), according to the assumption that the production of one unit of any good requires one unit of labor. Then the real operating profit of the q-th firm at time t is

(t , q) = p C(t , q) - C(t , q).
Due to (1.26) the firm's profit takes the form

(1.27)

(t , q) = ( p - 1) p

-1 1-

F (t , q),

which attains its maximum with respect to the price p at p(t , q) = 1 . (1.28)

Thus, all goods have the same price (1.28), although the consumption of these goods (1.23), (1.25) can be different due to different preferences. The reason for such simple (constant) prices is the normalization of wage and productivity of labor to unity, i.e. that any product q is measured in labor units spent for its production. For the optimal operating profit of the q-th firm at time t (with q [0, Q(t )]) we obtain from (1.27) and (1.28) the expression

(t , q) =

1- C(t , q),

(1.29)

relating this profit with the production C(t , q), which is equal to the aggregated consumption according to (1.25).

1.1.4 R&D sector
The R&D sector produces new technologies increasing in this way the variety [0, Q(t )] of available consumer's goods. The R&D industry sells patents of infinite life for new productions. The R&D sector is a perfectly competitive industry which requires only labor and shares the labor market with the


1.1. Introduction

13

production sector, so that the wage at time t is w(t ) = 1. Similarly as Jones (1995b); Kortum (1997); Segerstrom (1998) we assume that the dynamics of the technological frontier Q(t ) is given by the equation Q(t ) = (Q(t )) LQ (t ), (1.30)

where LQ (t ) is the total labor employed in R&D, > 0 is a productivity parameter, is the parameter determining how productivity depends on Q(t ), that is on the already existing technologies. Thus, we allow for knowledge spillover if > 0 4 . With > 0, formulation (1.30) implies increasing returns to scale in R&D, when previous inventions raise the productivity of current research effort. Alternately, with < 0, the formulation allows for diminishing returns in R&D, as if past inventions make it more difficult to find new ideas. For models with different values of the parameter see Jones (1999) and the references therein. For example, with = 1, in Romer (1990); G. M. Grossman and Helpman (1991a); Aghion and Howitt (1992) models, each unit of research effort can produce a proportionate increase in the technological variety. Entrepreneurs buying new patents have perfect foresight. An entrepreneur knows future demand for the product and (1.26) thus, her future operational profit (1.29). If weight function m is strictly positive in the vicinity of q = Q(t ), then at least during some nonzero time interval, the operational profit flow from production of new goods is nonzero. Hence, entrepreneurs have incentives to buy new patents and found firms, because people will consume new goods.

1.1.5 Financial sector
The entrepreneur, willing to establish a firm for production of a new good, takes a loan from a bank at real interest rate r(t ) in order to buy a patent from the R&D sector. The bank cannot make higher interest rate for the firm than that on agents' deposits because in this case the firm can attract investments from other bank (because of the free entry condition in the bank sector) or directly from people offering them the same rate r(t ) of return as the bank does. Banks can "create money" taking patents as security, thus giving more loans than the total deposits of people. Actually, there is no restriction in the model on how much loans a bank can give since the firms for sure will repay
4 Different extension of equation (1.30) are possible, including the dependence on a weight average of the distribution of labor across different technologies, which is reasonable (see e.g. Aghion & Howitt, 1996, 1998; Sorger, 2011). This would cause only some technical burden, therefore below we focus only on dynamics (1.30).


14

Heterogeneous consumption in OLG model with horizontal innovations

their debts in the future. These debts are balanced in the bank accounting by the values of patents that firms put in pledge. Similar to Sorger (2011) we denote the present value of the real profit flow for the firm producing product q [0, Q(t )] over interval [t , +) as ) ( s v(t , q) = (1.31) exp - r( ) d (s, q) ds,
t t

which, on the other hand, is the debt that the q-th firm owes to the bank. Since the market for patents is competitive we assume that the R&D industry as a whole makes no profit, and the firm who buys a patent makes zero net profit5 too, because of the free entry assumption. This idea of intertemporal zero net profit constraint (as in Romer, 1990) is taken from G. M. Grossman and Helpman (1989). Thus, the market price v(t , Q(t )), of the patents produced at time t is determined by the zero-profit condition v(t , Q(t )) Q(t ) = LQ (t ), (1.32)

which results from the fact that the R&D industry creates Q(t ) patents in a unit of time at t and the cost is what is paid for the involved labor LQ (t ). The time-derivative of expression (1.31) for the price of a new patent q = Q(t ) yields the following no-arbitrage condition
d dt

v(t , Q(t )) (t , Q(t )) Q(t ) + = r(t ) + e- v(t , Q(t )) v(t , Q(t )) v(t , Q(t )) t

s
t

r( ) d

(s, q) q

ds .
q=Q(t )

rate of return on the ownership

rate of novelty premium

(1.33) It states that the rate of return on the ownership of a firm -- consisting of
( Q( )) t the rate of capital gain dv(t ,Q(t )) and the profit rate (tt,,Q(tt)) -- has to equal v the real interest rate r(t ) plus the rate of novelty premium (as we call the last term). It differs from the no-arbitrage condition in the literature (e.g. Romer, 1990; G. M. Grossman & Helpman, 1991a, Chapter 3) by the rate of novelty premium that appears because of the heterogeneity of goods. In special cases the improper integral in (1.33) can be expressed via v(t , Q(t )). Then, no-arbitrage condition (1.33) yields an explicit expression for the real interest rate r(t ). Such expression is more convenient for calculating the general equilibrium, than the original integral equation (1.31) in zero-profit condition (1.32). v(t ,Q(t ))
d

The net profit of the firm is its operating profit minus taxes (that will be introduced later) and minus interest on debt.
5


1.1. Introduction

15

1.1.6 Market clearing
We have already taken into account the clearing on product market (1.25) where production equals consumption. In view of the technological assumption in Section 1.1.3 we can express the total labor in production as
Q(t )

LP (t ) =

C(t , q) dq.
0

(1.34)

The labor market is cleared by the full employment condition LP (t ) + LQ (t ) = L(t ), (1.35)

where the total labor L(t ) is given exogenously by (1.4). Financial market is cleared by intertemporal zero-profit condition (1.32) for the firms, meaning that all invented technologies are purchased by the firms whose debts to the banks are balanced with the values of their patents. The banks have zero profit too, because of the absence of entry barriers for new banks. Thus, the real interest rate r(t ) on firms' debts is the same as that on agents' deposits.

1.1.7 Succinct format of the general equilibrium equations
With the use of (1.31) and (1.29) the zero-profit condition (1.32) in the R&D sector takes the form LQ (t ) = (Q(t )) LQ (t )



t

e-

s
t

r( ) d

1- C(s, Q(t )) ds.

(1.36)

Due to (1.34) the labor balance equation (1.35) becomes LQ (t ) = L(t ) -
Q(t )

C(t , q) dq.
0

(1.37)

The dynamics of the variety frontier Q is as in (1.30) Q(t ) = (Q(t )) LQ (t ), Q(0) = Q0 > 0. (1.38)

The above three equations involve the total consumption C(t , q) given by (1.25), which in view of (1.23) takes the form C(t , q) =
t
t -

(m( , t , q, Q(t ))) g( )

1 1-

R ( , t ) h( ) n( , t ) d .

(1.39)


16 Here

Heterogeneous consumption in OLG model with horizontal innovations

g( )

+


R ( , s)Rµ ( , s)M ( , s) ds
1-

(1.40)

and M ( , s) is a substitution for G( , s) = M ( , s)
Q(s)
0

M ( , s), so that
1 1-

(m( , s, q, Q(s)))

dq.

(1.41)

The integral M ( , s) represents how much the agent in cohort enjoys the variety of goods at time s , and g( ) is the present discounted value of this propensity for a newborn at . So we have the three equations (1.36)­(1.38) for three unknown functions, Q, LQ , and r which determine the general equilibrium. Further we use these equations for analytical and numerical investigation of the economy.

1.2

Analysis of general equilibrium

Notice that because of the assumption that for any time t there is a subset of goods in [0, Q(t )] with nonzero measure, such that m( , t , q, Q(t )) > 0, the integrals M ( , t ) and g( ) are strictly positive for all . Thus, the integral in (1.37) of consumption (1.39) is also strictly positive since functions R , h and n are strictly positive in the integration domain. Hence, there are always workers in the production sector, meaning LQ (t ) < L(t ). We also mention that expression (1.22) for the consumption expenditure can be written in terms of h, g, and M defined in (1.24), (1.40), and (1.41) E ( , t ) = R ( , t ) h( ) M ( , t ) . g( ) (1.42)

1.2.1 Aggregated state variables
For better understanding of the model dynamics we introduce aggregated state variables and derive corresponding aggregated equations. Integration of the profit expression (1.27) over all existing products [0, Q(t )] yields (t ) = I (t ) - LP (t ), where we introduce the total profit and income I of the firms (t ) I (t )
Q(t ) Q(t )
0

(1.43)

(t , q) dq,
p(t , q) C(t , q) dq.

(1.44) (1.45)

0


1.2. Analysis of general equilibrium

17

Thus, (1.43) reads as the total profit of the firms equals their total income minus the total labor expenses in production. At equilibrium the total income of all firms in the economy is equal to the aggregated expenditure for consumption,
t

I (t ) =

t -

E ( , t ) n( , t ) d ,

(1.46)

where the expenditure E ( , t ) of each cohort is defined in (1.42). It follows from (1.34), (1.45), and (1.28) that the labor in production LP (t ) has the following simple relation with the total firms' income I (t ) LP (t ) = I (t ), and defines the total profit of firms (t ) = 1- LP (t ). (1.48) (1.47)

Now let us aggregate the assets of individuals at time t , A(t )
t
t -

a( , t ) n( , t ) d ,

(1.49)

that are on deposits in the bank. Recalling that newborns have no initial assets, see (1.7), we have the following formula for the derivative of A(t ) obtained by time-differentiation of (1.49) with the use of expression (1.2) A(t ) =
t
t -

a( , t ) n( , t ) d -

t
t -

a( , t ) µ ( , t ) n( , t ) d .

(1.50)

The aggregation of the budget constraints (1.13) over age cohorts gives
t
t -

a( , t ) n( , t ) d =

t
t -

[l ( , t ) + (r(t ) + µ ( , t )) a( , t ) - E ( , t )] n( , t ) d .

Then equations (1.4), (1.46), and (1.50) yield the following asset balance equation: A(t ) = r(t )A(t ) + L(t ) - I (t ). (1.51) Thus, the aggregated assets A rise at the amount of interest rA and what is left from the labor income L after the consumption expenses (equal to the total income of all firms I ).


18

Heterogeneous consumption in OLG model with horizontal innovations

We also introduce the aggregated value of all patents in the economy, V (t )
Q(t )

v(t , q) dq,
0

(1.52)

where v(t , q) is given by (1.31). Since the value V (t ) is equal to the firms' debt to the bank we have the following differential equation for V (t ): V (t ) = r(t )V (t ) + LQ (t ) - (t ), (1.53)

meaning that the debt (besides interest rV ) raises with the loan for research LQ and falls with the current firms' profit payed to the banks. Equation (1.53) can also be obtained by differentiation of (1.52) with respect to time, taking into account expression (1.31), definition (1.43), and the R&D zeroprofit condition (1.32). Proposition 1 The total value of patents V (t ) and the total assets of the individuals A(t ) are governed by the same differential equation 1 A(t ) = r(t )A(t ) + L(t ) - LP (t ), and are related as follows: V (t ) = A(t ) + e where is a constant. Proof See Appendix 1.5.1. Although the dynamics of the aggregated assets and the firms' debts is described by the same differential equation (1.54) they are not equal in general ( = 0), as it will be shown by the next proposition.6 It shows that the aggregated real assets, A(t ), are bounded due to the constant wages, and the no-bequest scenario of the model. Proposition 2 If the real interest rate r and total labor endowment L are ¯ bounded (for all t , |r(t )| < r and L(t ) L), then the aggregated assets of the ¯ ¯ individuals are also bounded: |A(t )| < A.
We do not require the aggregated assets of agents to be equal to the total debts of firms like it is done in (Sorger, 2011), because in our model the optimal consumption profile and investments of a finitely living agent are completely defined via initial condition (1.7) and terminal ´ conditions (1.8), like in (Cass & Yaari, 1967; d'Albis & Augeraud-Veron, 2011), which is not the case for infinitely living households in (Sorger, 2011), where he needs additional condition V (t ) A(t ) to specify general equilibrium.
6

(1.54)

t
0

r(s) ds

,


1.2. Analysis of general equilibrium

19

Proof See Appendix 1.5.2. ¯ Corollary 1 If the value of all patents in the economy V is bounded (V (t ) V for all t ) and the real interest rate converges to a constant, r(t ) r with t , ^ then r 0 or V (t ) = A(t ) for all t . ^ Proof See Appendix 1.5.3. Bounded growth ¯ If we assume bounded growth of the variety frontier Q(t ) Q, (1.38) we have LQ (t ) 0, then asset markets are balanced (A(t the long run interest rate is positive (r(t ) r > 0) as shown by ^ proposition. Thus, the imbalance between asset markets (A(t ) < implies growth. so that due to ) V (t )) and the following V (t ) for all t )

Proposition 3 Let there exist a general equilibrium with bounded Q, with bounded real interest rate r and bounded total labor endowment L. Let also ^ r(t ) r and L(t ) L with t . Then the value of patents V is balanced by ^ aggregated assets A for all t and has the following limit A(t ) = V (t ) ^ 1- L > 0, r ^

where limit of the real interest rate is positive r > 0. ^ Proof See Appendix 1.5.5. Obviously, utility in (1.56) is also bounded in this case. Notice that the corner solution (LQ (t ) 0) is the special case ¯ ¯ growth Q(t ) Q, where LQ (t ) = 0 for all t t , i.e. there is ¯ since R&D is absent. Q(t ) = Q From now on we will consider only unbounded growth of we assume that there is always R&D activity, LQ (t ) > 0 for all (2.2) and (1.32) we obtain the price of a new patent v(t , Q(t )) = 1 . (Q(t ))

of the bounded no growth of Q. Moreover, t . Then, from

(1.55)

The newest patent becomes cheaper as Q because of the knowledge spillover, > 0, while without spillovers, = 0, its price stays constant v(t , Q(t )) = 1/ .


20

Heterogeneous consumption in OLG model with horizontal innovations

1.2.2 Utility of an agent
We are to analyze the growth of the expected life-time aggregated utility (1.12) of a representative individual in cohort (see Appendix 1.6.1) u( ) = ( h( )) (g( ))1- , (1.56)

where h and g are defined in (1.24) and (1.40). This expression shows that u( ) depends positively on the human wealth of the agent and on the discounted integral M defined in (1.41) representing how much the agent values the variety of goods.

1.2.3

Benchmark case of homogeneous consumption with = 1

Let us consider the usually studied in the literature (e.g. Dixit & Stiglitz, 1977) the case when m( , t , q, Q) 1, meaning that all existing products are equally appreciated by the consumer. We will show that in the case of knowledge spillover with = 1, as in Romer (1990); G. M. Grossman and Helpman (1991a); Aghion and Howitt (1992), growth of variety frontier is exponential and a balanced growth path is found analytically. Remark 1 Because of the homogeneity (m( , t , q, Q) 1), the consumption in (1.39) LP (t ) C(t , q) = , Q(t ) the profit in (1.29)

(t , q) =
and, consequently, the patent value v(t , q) = defined in (1.31), do not depend on q.

(t ) , Q(t )

V (t ) , Q(t )

Hence, for an internal solution LQ (t ) > 0, we can calculate, using equation (1.55), the value of patents in the economy as follows V (t ) = Q(t ) v(t , Q(t )) = Q(t ) LQ (t ) 1 =, Q(t ) (1.57)


1.2. Analysis of general equilibrium

21

that happens to be constant. No-arbitrage condition (1.33) yields the following expression for real interest rate ( ) LP (t ) ^ r(t ) = -L . (1.58) The next proposition finds a "balanced growth path" corresponding to the exponential growth of the variety of goods in the economy. First we find relation between labor endowment L defined in (1.4) and human wealth h defined in (1.24) when r(t ) 0. For the sake of simplicity we consider the stationary population and labor endowment. Definition 1 We say that an economy has stationary population and labor endowment if the functions n( , t ) and l ( , t ) depend only on the age t - [0, ] of the agent. Equivalently we can say that the and the mortality rate µ ( , t ) l^(t - ) depend only on the age t number of newborns n( , ) n0 is constant ^ µ (t - ) and the labor endowment l ( , t ) - .

Lemma 1 Let the population and labor endowment be stationary (see Definition 1). If the real interest rate is zero (r(t ) 0) in the equilibrium, then the ^ human wealth, given by (1.24), is constant h(t ) h, having the expression: ^ h=


e
0

-


0

^ µ ( ) d

l^( ) d ,

(1.59)

^ ^ and the total labor endowment is also constant and equals L(t ) L = n0 h, where n0 is the number of newborns. Proof See Appendix 1.5.4. The following Proposition finds an explicit solution of our general equilibrium model, when Q grows exponentially. Proposition 4 In the economy with stationary population and labor endowment if goods are homogeneous (m( , t , q, Q) 1) and the knowledge spillover parameter is equal to one ( = 1), then: ^ (i) the triple LP (t ) L, r(t ) 0, Q(t ) = Q0 e solution of the system (1.36)­(1.38),
^ (1- )L t

is a steady state

(ii) g in (1.40) grows exponentially w.r.t. time of births : g( ) = g e ^
^ (1- )L

,

(1.60)


22

Heterogeneous consumption in OLG model with horizontal innovations



where constant g = Q ^

0

e(

^ (1- )L-

1-

)

-


0

^ µ ( ) d

0

d .

Proof See Appendix 1.5.6. It follows from Lemma 1 and (1.60) that for = 1 the growth of the lifetime aggregated utility (1.56) is unbounded as the birth time of generations : ( ) 2^ ^ u( ) = h (g)1- e (1- ) L . ^ Aggregated assets A cannot grow infinitely because of the Proposition 2, ^ thus converge to a constant A. For smaller values of spillover parameter , we show that the real interest rate r and the shares of labor L p and LQ converge to those of the balanced growth path.

1.2.4 Homogeneous consumption with < 1
In this case and we can obtain the real interest rate so we have ( ) 1- r(t ) = LP (t ) - LQ (t ) , (Q(t ))1-

(1.61)

since LQ (t ) > 0, see Appendix 1.7 for general proof. We can calculate the value of patents in the economy with the use of (1.32) and (1.38) as
Q(t )

V (t ) =
0

v(t , q) dq

= Q(t ) v(t , Q(t )) LQ (t ) = Q(t ) Q(t ) (Q(t ))1- = ,

(1.62)

where we take into account that, because of the homogeneity (m( , t , q, Q) 1), the consumption C(t , q) in (1.39), the profit (s, q) in (1.29), and, consequently, the patent value v(t , q) defined in (1.31), do not depend on q, see Remark 1. Because of the assumed unbounded growth of the variety frontier, Q(t ) as t , see Fig. 1.1 b), and the first equation in (1.61), we have that r(t ) 0, see Fig. 1.1 a). Thus, from equation (1.54) we obtain the convergence ^ LP (t ) L, ^ LQ (t ) (1 - )L, (1.63)


1.2. Analysis of general equilibrium

23

see Fig. 1.1 c). The product variety frontier, Q(t ), grows asymptotically as driven by the equation ^ Q(t ) = (1 - )L (Q(t )) , which implies the unbounded growth of patents' value (1.62): V (t ) = (Q(t ))1
-

,

as t , see Fig. 1.1 d). Hence, V (t ) > A(t ) for all t since the assets A are bounded due to Proposition 2 and the patents' value V cannot intersect A transversely because both V and A are governed by the same ODE (1.54), see Proposition 1. The growth of the life-time aggregated utility u( ) = ( h( )) (g( ))1- is also unbounded since g( ) Q( ) due to (1.40)­(1.41) and h( ) ^ h (0, h] due to (1.24). The unbounded growth of the utility u( ) means that the future generations will live unrestrictedly better than their presently living ancestors. This happens because in equilibrium the agent enjoys equal consumption of all goods, and her instantaneous utility can be described via the common consumption level c(t ) = c( , t , q) and variety frontier Q(t ) as Q(t ) c(t ) . It is clear, that the Q(t )-elasticity of the instantaneous utility is greater than its c(t )-elasticity, since 1 > . Thus, the gain in utility from the increase of the variety frontier Q(t ) overweighs the loss from the decrease of the consumption level c(t ) so that the utilities of agents grow proportionally to (Q( ))1- .

1.2.5 Heterogeneous consumption
We assume that at equilibrium LQ (t ) > 0 for all t and agents have heterogeneous preferences, such that m( , t , q, Q) = e-
(Q-q)

,

where > 0 is the parameter of heterogeneity7 . This corresponds to the special case, where m0 ( , t , Q) = e- Q and m1 (q) = e q . Thus, we can calculate the real interest rate r from (1.102), obtained in Appendix 1.7, as follows ) ) ( ( (1 - ) r(t ) = LP (t ) - LQ (t ) - LQ (t ) , Q(t ) ~ (Q(t ))1- 1- M (Q(t )) (1.64)
7

The case of = 0 corresponds to homogeneous goods studied above.


24 where

Heterogeneous consumption in OLG model with horizontal innovations

- 1- ~ M (Q(t )) = (1 - e 1- Q(t ) ). (1.65) The value of patents can be expressed in the following way, with the use of (1.32) and (1.38) (see Appendix 1.6.2)

V (t ) =

1 - 1 - e 1- Q(t ) . Q(t )

-

(1.66)

If we assume convergence to a steady state r(t ) r, then it follows from ^ (1.64) that (1 - ) LP (t ) - LQ (t ) 0, ~ 1- M (Q(t )) ~ because otherwise r(t ) as Q(t ) . Since M (Q(t )) when Q(t ) , we have LP (t ) -
1-

in (1.65)

LQ (t ) 0 1-

and, taking into account labor balance (1.37), we obtain LP (t ) - L(t ) 0. The value of patents in the economy (1.66) tends to zero V (t ) 0 as time t . According to Corollary 1, A(t ) V (t ) or r 0. We study the second ^ type of solution, r 0, that is confirmed by the numerical calculation depicted ^ in Fig. 1.2. The life-time utility of an agent u( ) = ( h( )) (g( ))1- in (1.56) is ~ bounded, because h( ) const and M (Q(t )) = M (Q(t )) 1- (therefore g( ) const ). The intuition for the boundedness of the expected life time utility u is as follows. When the variety frontier Q increases the consumption profile, roughly speaking, shifts to the newer goods, thus exponentially (in Q) decreasing the consumption of the older goods, because of the function m(q, Q) = e- (Q-q) in (1.39). Exponential decrease in consumption of the older goods prevails over the increase of their variety Q. As a result the instantaneous utility converges to a limit constant so, consequently, does the life time utility u( ). We have qualitatively different behavior of V and u in the homogeneous (m 1) and the heterogeneous (m e- (Q-q) ) cases. Mathematically, this ~ difference is conditioned by the properties of the integral M (Q). Intuitively speaking, the reason for vanishing V and bounded u in the heterogeneous case is the abandonment of older goods because of the agent's preferences.


1.2. Analysis of general equilibrium

25

3.5

x 10

-3

16000 14000

3 12000 2.5 10000 2 Q(t) r(t) 8000 6000 1.5 4000 1 2000 0.5 0

0

200

400 time t

600

800

1000

0

200

400 time t

600

800

1000

a)
0.705 7 x 10
5

b)

Assets A Patents V 0.7 6

5 0.695 A(t), V(t) 0 200 400 time t 600 800 1000 LQ / L 4

0.69

3

0.685 2 0.68

1

0.675

0

0

200

400 time t

600

800

1000

c)

d)

Figure 1.1: The case of homogeneous goods m 1. Labor endowment is l ( , t ) = 0.3 for t - 45 and l ( , t ) = 0 after retirement (t - > 45), where we consider the agent's life from her adulthood. Mortality µ ( , t ) = 0 for t - < , life horizon = 80, age concentration of people n( , t ) = n0 = 100, initial frontier of the products' variety Q0 = 1000, = 0.0002, = 0.5, individual discounting = 0.01. a) Real interest rate r. b) The variety frontier Q. c) The relative labor employed in R&D LQ /L (solid line with dashed line depicting the linear extrapolation) and its limit value 1 - (dashed-dotted line). d) Dynamics of total assets A (solid line) and value of patents V (dashed line).


26

Heterogeneous consumption in OLG model with horizontal innovations

2

x 10

-3

18000 16000 14000

1.5

1 12000 0.5 Q(t) 0 -0.5 4000 -1 2000 0 -1.5 0 200 400 time t 600 800 1000 r(t) 10000 8000 6000

0

200

400 time t

600

800

1000

a)
0.706 0.704 12 0.702 0.7 A(t), V(t) 0.698 LQ / L 0.696 0.694 0.692 0.69 2 0.688 0.686 0 200 400 time t 600 800 1000 0 0 200 400 10 14 x 10
4

b)

Assets A Patents V

8

6

4

600 time t

800

1000

c)

d)

Figure 1.2: The case of heterogeneous goods = 0.0004. Labor endowment is l ( , t ) = 0.3 for t - 45 and l ( , t ) = 0 after retirement (t - > 45), where we consider the agent's life from her adulthood. Mortality µ ( , t ) = 0 for t - < , life horizon = 80, age concentration of people n( , t ) = n0 = 100, initial frontier of the products' variety Q0 = 1000, = 0.0002, = 0.5, individual discounting = 0.01. a) Real interest rate r. b) The variety frontier Q. c) The relative labor employed in R&D LQ /L (solid line with dashed line depicting the linear extrapolation) and its limit value 1 - (dashed-dotted line). d) Dynamics of total assets A (solid line) and value of patents V (dashed line).


1.3. Discussion of the model and extensions

27

1.3 Discussion of the model and extensions
1.3.1 Efficiency and fiscal policy
In this subsection we investigate how the government can improve the utility of the agents in the long run by taxing the firms and paying subsidies and pensions to the agents. We assume that there is a constant tax rate < 1 payed by each firm from its profit (t , q) and the sum of the collected taxes (t ) are distributed among the currently living agents of different generations in the form of lump subsidies. For the sake of simplicity, we assume that the collected taxes are divided equally among the newborns and put on their deposits. The equation for government balance reads as a( , ) n( , ) = ( ). (1.67)

Taking the expression (1.48) for the total profit ( ) we obtain the following initial assets of the agents (instead of (1.7)): a( , ) = 1 - LP ( ) . n( , ) (1.68)

The value v(t , q) in (1.31) of the patent to produce product q decreases to ) ( s v(t , q) = (1 - ) exp - r( ) d (s, q) ds, (1.69)
t t

while the agent's human wealth h( ) in (1.24) is augmented by nonhuman wealth from the subsidies a( , ) 0. Thus, in all the previous formulas we should replace h( ) with the total wealth a( , ) + h( ). Taking into account these changes, the aggregated equations (1.51) and (1.53) take the form A(t ) = r(t )A(t ) + L(t ) - I (t ) + (t ), V (t ) = r(t )V (t ) + LQ (t ) - (1 - ) (t ), and similarly as in (1.54) we have 1 - (1 - ) A(t ) = r(t )A(t ) + L(t ) - LP (t ). (1.72) (1.70) (1.71)

Following the same arguments as before we can study the general equilibrium ^ under simplifying assumptions, L(t ) L, etc. Real interest rate We can state that if LQ (t ) > 0 the real interest rate can be calculated as follows.


28

Heterogeneous consumption in OLG model with horizontal innovations

Homogeneous consumption (m( , t , q, Q) 1): ( ) (1 - ) (1 - ) r(t ) = LP (t ) - LQ (t ) . (Q(t ))1-

(1.73)

Heterogeneous consumption (m( , t , q, Q) e- (Q-q) with > 0): ( ( ) ) (1 - ) (1 - ) r(t ) = Q(t ) LP (t ) - LQ (t ) - LQ (t ) , ~ (Q(t ))1- 1- M (Q(t )) (1.74) - ~ where M (Q(t )) = 1- (1 - e 1- Q(t ) ). Labor shares With both homogenous and heterogeneous consumptions we have LP (t )

^ L, 1 - (1 - )

LQ (t )

(1 - ) (1 - ) ^ L, 1 - (1 - )

(1.75)

extending (1.63) to the case of taxation and distribution of subsidies. Hence, the tax increases the amount of labor in production and decreases it in R&D, so that the product variety frontier Q(t ) grows asymptotically slower with the tax > 0 then without it = 0, which can be seen from its asymptotical differential equation (1 - ) (1 - ) ^ Q(t ) = L (Q(t )) . 1 - (1 - ) Using the explicit formula for the solution of this equation we can write Q(t )| Q(t )| ( 1+ ( exp
1- 1- ^ L(1- ) 1- (1- )

)

1

=0 >0

t , =1

)

, <1

,

(1.76)

where Q(t )| >0 denotes the variety frontier, when tax rate is positive, > 0, while Q(t )| =0 is the variety frontier for economy without taxation. Aggregated utility Now we can ask the question: Can the fiscal policy improve the expected life-time aggregated utility (1.56) of a representative agents in the long run?


1.3. Discussion of the model and extensions

29

Substitution of the expression for nonhuman capital (1.68) in the utility of cohort brings it to the form: u( ) = (a( , ) + h( )) (g( ))1- . (1.77)

We see that u( ) depends positively on the agents' human wealth and on the discounted integral M defined in (1.41). Further we assume that population is stationary and labor is fixed, i.e. density n( , t ) and labor endowment l ( , t ) l^(t - ) depend only on the age t - , see Definition 1. Hence, the total labor endowment is constant, so we can take L(t ) = n0 l¯, where n0 = n( , ) is the constant number of newborns, and l¯ is the constant effective labor of a newborn: l¯

0

l^( )e

-


0

^ µ ( ) d

d ,

(1.78)

^ which coincides with the limit of human wealth l¯ = h in the case of r = 0, see ^ expression (1.59) in Lemma 1. Thus, when we have from (1.68) and (1.75) the initial assets converging as follows: a( , )

(1 - ) ¯ l. 1 - (1 - )

(1.79)

In order to answer the question of the efficiency of fiscal policy we consider the asymptotics of the utility u( ) in (1.77) separately in homogeneous and heterogeneous cases. Long run efficiency of the fiscal policy We will show, that the fiscal policy in hand can improve well-being (u( )) of the future generations with heterogeneous consumption, which is not the case for homogeneous consumption. We check if, for all sufficiently high times of birth , life time utility in the economy with taxation, u( )| >0 , is grater than that without taxation, u( )| =0 . Homogeneous consumption. , hence, according to Lemma use of (1.76) and (1.79) we can of utilities with tax and without u( )| u( )|
=0 >0

In the homogeneous case r(t ) 0 when t ^ 1, h( ) h = l¯ when . Thus, with the write for [0, 1) the following limit for ratio tax (see Appendix 1.6.3) (

(1 - (1 - ))



1+ 1-

)

1- 1-

>

1 - (1 - ) (1 - )1
-

1. (1.80)


30

Heterogeneous consumption in OLG model with horizontal innovations

When = 1 this utility ratio tends to infinity with t , due to the ratio in (1.76). Thus, in the homogeneous case the fiscal policy cannot improve agents' utilities in the long run.8 Heterogeneous consumption. Below we show that in contrast with the homogeneous case an appropriate fiscal policy of the government may improve the agents' utility in the long run. It is shown in Section 1.2.5 that r(t ) r 0. ^ The limit of human wealth (1.24), as , can be calculated as follows h( )

0

l^( )e

-r - ^


0

^ µ ( ) d

¯ d h

(1.81)

¯ where h does not depend on . The integral in (1.77), in contrast to the homogeneous case, converges to a constant independent of tax rate : 1- g( )


e
0

-

- r ^ 1-

-


0

^ µ ( ) d

d g. ¯

(1.82)

Then, we can make the following approximations of the asymptotic expected utility (see Appendix 1.6.4) ( ) (1 - ) ¯ ¯ 1- u( )| ¯ . (1.83) l +h g 1 - (1 - ) where l¯ is defined in (1.78). It is seen from (1.83), that the implementation of the fiscal policy ( > 0) increases the agents' expected life-time aggregated utilities in the long run u( )| =0 lim 1. (1.84) u( )| >0 ^ Notice, that these results hold for any reasonable functions of mortality, µ , and labor endowment, l^.

1.3.2 Sources of heterogeneity
There are two, mathematically equivalent, sources of heterogeneity of goods. The first is that we have already described, where goods are weighted heterogeneously in the agent's utility function. The other source of heterogeneity is the potential difference in productivity of labor allocated to different firms.
Note that for < 0 even small tax rate > 0 improves agents' utilities in the long run, u( )| = 1- becomes negative. A value < 0 means that because the derivative dd u( )| =0 1- >0 =0 past inventions make it more difficult to find new ideas, which we think to be unlikely.
8


1.3. Discussion of the model and extensions

31

That can be described by the same model after renormalization. Indeed, let (t , q, Q(t )) be the amount of physical units of the good q that can be produced with one unit of labor, then equation for the profit (1.27) would take the form ~ C(t , q) ~ (t , q) = p(t , q) C(t , q) - ~ , (1.85) (t , q, Q(t )) ~ where p(t , q) is the price of one physical unit of the good q, C(t , q) is the ~ aggregated production (and consumption) in physical units of the good q. Let the agent value all goods equally maximizing her following expected lifetime utility u( ) =
+


e

- (t - )-

t


µ ( , ) d

Q(t )
0

c( , t , q) dq dt , ~

(1.86)

subject to the dynamic budget constraint a( , t ) = l ( , t ) + (r(t ) + µ ( , t )) a( , t ) - with boundary conditions a( , ) = 0, a( , + ) = 0, (1.88)
Q(t )
0

p(t , q) c( , t , q) dq, ~ ~

(1.87)

where c( , t , q) is her consumption in physical units. Then, if we change the ~ variables as follows p(t , q) = ~ so that ~ C(t , q) = C(t , q) (t , q, Q(t )), then equation (1.85) for the profit will coincide with (1.27) and problem (1.86)­ (1.88) will take the form (1.5)­(1.8), where m ( (t , q, Q(t ))) . Thus, the performed analysis is also applicable to the problem with heterogeneous productivity of labor, but with a different meanings of the function m. p(t , q) , (t , q, Q(t )) c( , t , q) = c( , t , q) (t , q, Q(t )), ~


32

Heterogeneous consumption in OLG model with horizontal innovations

1.3.3 Economic growth
So far we have studied only the growth of agents' utilities as indicator of prosperity. However, measuring goods in physical units as we did in Section 1.3.2 we can also discuss economic growth in terms of production and consumption. If we assume limited labor endowment L the economy can grow only due to increase in productivity of labor (t , q, Q(t )), introduced in Section 1.3.2. The growth can be endogenous if the productivity of labor depends on the state variable Q(t ). We can consider a function m in problem (1.5) in the form m( , t , q, Q(t )) ( (t , q, Q(t ))) m( , t , q, Q(t )), ~ (1.89)

where the function m, inducing the actual agent's preferences, is bounded, ~ while the function (q, t , Q(t )), describing the dependence of the productivity upon the variety frontier Q(t ), may grow unboundedly when Q(t ) . Thus, we would have an unbounded growth of per capita consumption (in physical units) i.e. an infinite economic growth. The functional form of (t , q, Q(t )) is supposed to be chosen to fit an observed path of the per capita consumption (Jones, 1995b), but this is beyond the scope of the present study.

1.4 Conclusions and prospects
We suggest an endogenous growth model of an economy, where technological growth is promoted by the entrepreneurial activity of new firms. The liquidity for these new enterprises is provided by the banking sector that de jure owns all intellectual property. We interpret the technological growth as a growth of the variety of goods. The question of how productivity of labor depends on the variety of available technologies (goods) still needs to be answered. One can try to determine this dependence empirically, but we believe that the productivity of labor should be related with qualification, hence education should be explicitly included in the model as a new decision variable of the individuals. Heterogeneity of consumption (m( , t , q, Q) = e- (Q-q) , where > 0) brings the following qualitative effects. Firstly, the total value of patents V become bounded with zero limit V (t ) 0 when t (in case of knowledge spillovers, > 0), thus balancing savings of agents and values of patents in the long run (V (t ) - A(t ) 0). Secondly, the growth of the expected life-time aggregated utilities of agents becomes bounded. Finally, the general equilibrium with heterogeneous consumption can loose its dynamic efficiency. So that it becomes


1.5. Proofs

33

possible to introduce a fiscal policy that improves all agents' utilities in the long run. Dynamic inefficiency is observed in OLG models quite often, but in our model general equilibrium is inefficient only when consumption is heterogeneous. In the case of constant labor endowment and stationary population consuming homogeneous goods we have found the simple steady steady state solution with zero real interest rate and (r 0) and constant share of labor in production (LP /L ) with knowledge spillover parameter = 1. This solution sustain exponential growth of goods' variety. The same interest rate and labor distribution happen to be an attractor for the solutions in homogeneous cases with < 1 and heterogeneous cases with < 0. Thus, in all cases the variety of goods can grow unboundedly. Since the general equilibrium can be numerically calculated without any steady state assumptions and with arbitrary exogenous population dynamics the model allows to investigate the effect of different shocks including demographical changes. The proposed model allows for introduction of functional dependence of labor productivity on the goods' variety frontier, thus describing the endogenous growth of production. Finding such dependence that would mitigate the scale effect could be the topic for further research.

Appendix 1.5 Proofs
1.5.1 Proposition 1

Proof Let us substitute profit expression (1.43) into differential equation (1.53) for V and use labor balance (1.35). Thus, we obtain exactly the same equation (1.51) as for the aggregated assets A. One can check that the substitution of the relation V (t ) = A(t ) + e
t
0

r(s) ds

into equation (1.53) gives equation (1.51). Moreover, due to (1.47) equation (1.51) can be written in terms of labor, (1.54).


34

Heterogeneous consumption in OLG model with horizontal innovations

1.5.2 Proposition 2
Proof Expression (1.49), with a( , t ) taken from the solution of personal budget constraints (1.13) same as in (Cass & Yaari, 1967), has the following form
t

A(t ) = = = = =

t - t t - t

n( , t ) Rµ ( , t ) n( , t ) n( , t )
t

t


Rµ ( , s) (l ( , s) - E ( , s)) ds d
s
t

t t


e- e
t
s

(r( )+µ ( , )) d

(l ( , s) - E ( , s)) ds d

r( ) d

t

t -

n( , s) (l ( , s) - E ( , s)) ds d n( , t )

t - t t -

e

t
s

r( ) d

n( , s) (l ( , s) - E ( , s)) ds d n( , s) (l ( , s) - E ( , s)) d ds (1.90)

e

t s

r( ) d

s
t -

Since l ( , s) 0 and E ( , s) 0 we have the following chain of inequalities |A(t )| e e = e
r ¯

t

s

r ¯

t - t - t t t - t - t t - t

n( , s) (l ( , s) + E ( , s)) d ds n( , s) (l ( , s) + E ( , s)) d ds

(L(s) + I (s)) ds ( ) LP (s) ¯ = e r L(s) + ds t - ) t ( ¯ r ¯ ¯ + L ds e L t - 1+ ¯ ¯ e r L, where we use expression (1.4) for total labor, balance equation (1.46) and relation (1.47).

r ¯

1.5.3 Corollary 1
Proof Proposition 2 claims that the aggregated assets A in the economy are always bounded. Since both A and V are bounded, the term with exponent in


1.5. Proofs

35

the relation

V (t ) = A(t ) + e

t
0

r(s) ds

from Proposition 1 is also bounded. This happens only if ) constant = 0 (then ( t V (t ) = A(t ) for all t ) or when the exponentexp 0 r(s) ds is bounded. But we t have that r(t ) r, hence boundedness of 0 r(s) ds occurs only if r 0. ^ ^

1.5.4 Lemma 1
Proof First we show that the human wealth h defined in (1.24) is constant: h( ) = = =
0

+ + +


Rµ ( , s) l ( , s) ds e- e-

0

s ^ µ ( - ) d s- ^ µ ( ) d 0

l^(s - ) ds l^(s - ) ds (1.91)

e

-

^ µ ( ) d

^ l^( ) d h,

where we used the definition of Rµ ( , s) in (1.19) along with r(t ) 0. Then, from the definition of L in (1.4) and expression (1.3) we have L(t ) =
t
t - t -

l ( , t ) n( , t ) d l^(t - ) n( , ) e l^(t - ) e-

0

t

-

t ^ µ ( - ) d

d

t
t - 0

= n0 = n0

t - ^ µ ( ) d 0

d (1.92)

l^( ) e-

^ µ ( ) d

^ d = n0 h.

1.5.5

Proposition 3

Proof We consider two cases: inner and corner solutions. ¯ ¯ Inner solution. If Q(t ) Q and Q(t ) < Q for all t , then due to LQ (t ) 0 equation (1.102) yields a bounded real interest rate r with strictly positive limit:
1 ¯ 1- ¯ 1 - m1 (Q) L > 0. ^ lim r(t ) = r = Q ^ ¯ t M1 (Q)


36

Heterogeneous consumption in OLG model with horizontal innovations

From expression (1.31) and equation (1.29) we have bounded V due to the boundedness of r and L
Q(t )
0

V (t ) = =

0

v(t , q) dq
Q(t )
t
s
t

e-

s
t

r( ) d

(s, q) ds dq
Q(t )

= =

t t

e

t t

- -

r( ) d r( ) d r( ) d r( ) d

e

s
t

e- e-

s
t

s
t

1- 1- 1- 1-



C(s, q) dq ds
Q(s)
0

C(s, q) dq ds
0

LP (s) ds L(s) ds = 1-

t

e-

s
t

r( ) d

L(s) ds < .

Since r > 0 the upper bound of V (t ) is finite and has the following limit with ¯ t ^ 1 - - ts r( ) d 1 - ^ -r(s-t ) d 1- L . e L(s) d L e^ ds = r ^ t t ¯ ¯ Corner solution. If there exist a time t such that LQ (t ) = 0 and Q(t ) = Q for all ^ ¯ t t . Convergence of r(t ) r and L(t ) L > 0 results in bounded A, accord^ ing to Proposition 2. Hence, the total amount of assets A also converges to a ^ constant, A(t ) A, due to equation (1.54) from which we have the following 1- ^ L, ^ ¯ so that r = 0 and A = 0. If r > 0 then for t t ^ ^ 0 = rA - ^^
Q(t ) ¯ Q ¯ Q

(1.93)

V (t )

=
0

v(t , q) dq =
0

v(t , q) dq =
0t ¯ Q

e-

s
t

r( ) d

(s, q) ds dq

=

1- 1-


t

e- e-

s
t

r( ) d

C(s, q) dq ds
0


t

=

s
t

r( ) d

LP (s) ds ^ 1- L ^ = A > 0. r ^



1- ^ L


t

^ e-r(s-t

) d

ds =


1.5. Proofs

37

The case of r < 0 contradicts Proposition 1. It follows from equation (1.93) ^ ^ < 0 when r < 0. But according to the relation that A ^ V (t ) = A(t ) + e
t
0

r(s) ds

^ ^ in Proposition 1 we have V (t ) A so that A > 0 because V (t ) > 0 by definition. Then, in both inner and corner solutions we have r > 0 and bounded V . ^ Thus, the balance A(t ) V (t ) follows from Corollary 1.

1.5.6 Proposition 4
^ ^ Proof We need to check if LP (t ) L, r(t ) 0, and Q(t ) = Q0 e (1- )L t satisfy equations (1.36)­(1.38). It is easy to check that (1.36) is satisfied due to the no-arbitrage condition in (1.61). The solution of (1.38) is obviously

Q(t ) = Q0 e

^ (1- )L t

.

Let us check (1.37). From (1.41) it follows that M ( , s) = Q(s) = Q0 e
^ (1- )L s

.

Then from (1.40) we obtain expression (1.60) for g( ) as follows g( )
+


R ( , s)Rµ ( , s)M ( , s) ds
+


= Q0 e =

^ (1- )L

e(

^ (1- )L-

1-

)(s

- )-

s- ^ µ ( ) d 0

ds

^ ^ (1- )L Q0 e e( (1- )L-
0 ^ (1- )L

1-

)

-


0

^ µ ( ) d

d

= Q0 e

g, ^

where the last integral is a constant denoted by g. Thus, from (1.39) we have ^ C(t , q) = h( ) R ( , t ) n( , t ) d t - g( ) t t - 1 - (t - ) ^ = h e 1- n( , ) e- 0 t - g( ) t 1 - (t - ) - 0t - µ ( ) ^ ^0 = hn e 1- e g( ) t -
t

^ µ ( ) d d

d (1.94)

d .


38

Heterogeneous consumption in OLG model with horizontal innovations

After substitution of g( ) from (1.60) and canceling the constant we obtain the ^^ following expression with the use of the relation L = hn0 from Lemma 1 C(t , q) = = = = = = ^ hn0 t - (1- )L - (t - )-0t - µ ( ) d ^ ^ e e 1- d Q0 g t - ^ ^ hn0 - (1- )L t t (1- )L (t - )- (t - )-0t - ^ ^ 1- e e Q0 g ^ t - ^ hn0 ^ ^ e( (1- )L- 1- ) - 0 µ ( ) d d ^ (1- )L t g 0 Q0 e ^ ^0 hn ^ Q0 e (1- )L t ^ ^ LP (t ) hn0 L = = Q(t ) Q(t ) Q(t ) ^ - LQ (t ) L . Q(t )


^ µ ( ) d

d

(1.95)

Thus (1.37) is also satisfied.

1.6 Calculations
1.6.1 Expression (1.56)
With the use of consumption (1.23) and relation G( , s) = u( ) =
+ +
1-

M ( , s)

e

- (t - )- µ ( , ) d


t

Q(t )

m( , t , q, Q(t )) c( , t , q) dq dt
-

0 - (t - )- µ ( , ) d


t

=

e
+

(G( , t ))1
1-

(E ( , t )) dt
1-

= ( =


e

- (t - )- µ ( , ) d


t

(

(

M ( , s))

M ( , t ) R ( , t ) h( ) g( )

)



dt

h( ) g( )

) + R ( , t ) Rµ ( , t ) M ( , t )dt


= ( h( )) (g( ))1- , we have (1.56).




1.6. Calculations

39

1.6.2 Expression (1.66)
Q(t )

V (t ) =
0

v(t , q) dq
Q(t )

=
Q(t )
0 t

e- e
-

s
t

r( ) d

(s, q) ds dq
( m(q, Q(s)) m(Q(t ), Q(s))
1-

=
Q(t )
0 t

s
t

r( ) d

)

1 1-

(s, Q(t )) ds dq

=

0 t

e-

s
t

r( ) d -

e

(Q(t )-q) -

(s, Q(t )) ds dq
1-

=
t

e-

s
t

r( ) d

~ = M (Q(t ))


t

(s, Q(t ))
s
t

Q(t )

e
0

(Q(t )-q)

dq ds

e-

r( ) d

~ (s, Q(t )) ds = M (Q(t )) v(t , Q(t ))
-

~ LQ (t ) M (Q(t )) 1 - 1 - e 1- Q(t ) ~ = M (Q(t )) = . = Q(t ) Q(t ) Q(t )

1.6.3 Expression (1.84)
u( )| u( )|
=0 >0


(1- ) 1- (1- )





l¯ + l¯

( (

(

1+ 1-
) )

)

1- 1-

= > =

(1 - (1 - )) (1 - (1 - )) 1 - (1 - ) (1 - )1
-



1+ 1- 1+ 1-

1- 1-



1-

1.

1.6.4

Expression (1.83)

From expression (1.68) and limits (1.79), (1.81), and (1.82) we have the limit of aggregated utility u( ) = (a( , ) + h( )) (g( ))1-


40

Heterogeneous consumption in OLG model with horizontal innovations

) 1 - LP ( ) = + h( ) (g( ))1- n( , ) ( ) (1 - ) ¯ ¯ 1- = l +h g ¯ . 1 - (1 - )


(

1.7 No-arbitrage condition for m = m0 ( , t , Q) m1 (q)
Let us rewrite expression (1.39) using the assumption m( , t , q, Q) = m0 ( , t , Q) m1 (q). Then we have C(t , q) = (m1 (q)) = (m1 (q))
1 1-

t
t -

(m0 ( , t , Q(t ))) g( )

1 1-

R ( , t ) h( ) n( , t ) d (1.96)

1 1-

f (t ),

where function f denotes the integral and function m1 (q) > 0 is differentiable for all q Q0 . Then from (1.34) and (1.96) we have
Q(t ) Q(t )

LP (t ) =

C(t , q) dq = f (t )
0 0

(m1 (q))

1 1-

dq = f (t ) M1 (Q(t )). (1.97)

We can derive from (1.96) and (1.97) the following relations for C(s, q) and, due to (1.29), for (s, q): C(s, q) = (m1 (q)) 1- LP (s), M1 (Q(s)) 1- 1 - (m1 (q)) 1- LP (s), C(s, q) = M1 (Q(s))
Q
0
1 1

(1.98) (1.99)

(s, q) =

where we introduce notation M1 (Q) =

m1 (q)

1 1-

dq,

(1.100)

so that the integral defined in (1.41) takes the form M ( , t ) = m0 ( , t , Q(t ))
1-
1 1-

M1 (Q(t )).

Thus, we have the expression for the derivative of C(s, q) with respect to q

C (s, q) q

=
q=Q(t )

(m1 (Q(t )))

m1 q

(Q(t ))

(1 - ) M1 (Q(s))

m1 C(s, Q(t )) q (Q(t )) LP (s) = , 1 - m1 (Q(t ))


1.8. Iterative procedure to calculate general equilibrium

41

and, due to (1.29), the derivative of (s, q):

(s, q) q

=
q=Q(t )

1 - C (s, q) q

=
q=Q(t )

m1 (s, Q(t )) q (Q(t )) . (1.101) 1 - m1 (Q(t ))

It is assumed that there is always R&D activity LQ (t ) > 0. Due to (1.101), (1.99), and (1.55) no-arbitrage condition (1.33) takes the form ) ( ( ) m1 LQ (t ) q (Q(t )) 1- LP (t ) r(t ) = (Q(t ))1- Q(t ) M(Q(t ))) - 1- m1 (Q(t )) - LQ (t ) , (1.102) ~ where M1 (Q(t )) ~ M (Q(t )) = 1. m1 (Q(t )) 1- This condition is easier to use instead of zero-profit condition (1.36) in the R&D sector. Although condition (1.102) does not depend explicitly on the function m0 ( , t , Q), it is still affected by m0 through expressions (1.39), (1.40), and (1.41). However, if the function m0 depends only on , then it does not influence the solution (LQ (t ), r(t ), Q(t )) of system ((1.37), (1.38), (1.102)) and matters only for the utilities of the agents. Indeed, consumption profile (over goods q and times t ) of an agent would remain the same if she have changed her weight function m proportionately for all available goods and times.

1.8

Iterative procedure to calculate general equilibrium

The expression for the real interest rate (1.102) allows to construct a simple iterative procedure for numerical calculation of a general equilibrium in the case of interior solution (LQ (t ) > 0 for all t ). In order to solve system (1.37), (1.38), (1.102) first, we give the initial labor in research LQ [1](t ) in the time interval [0, T ], where T . With LQ [1](t ) we calculate the product variety domain Q[1](t ) in [0, T ] from (1.38). Then, we calculate the real interest rate r[1](t ) in [0, T ] with the use of (1.102). The functions Q[1](t ) and r[1](t ) determine the aggregated consumption of agents C[1](t , q) in [ , T - ] by (1.39). Knowing C[1](t , q) we obtain the labor in R&D for the next iteration, LQ [2](t ), t [ , T - ] with the formula (1.37) and extrapolate it linearly to the whole interval [0, T ]. Then we continue in the same way. If the sequence of so defined iterations converges, then as a result we have a numerical approximation of the general equilibrium with two allowances:


42

Heterogeneous consumption in OLG model with horizontal innovations

finiteness of time horizon T and linear extrapolation to the ends of the time interval. There are two cases to be considered separately. The first case is when consumers evaluate all goods homogenously ( m1 (q) 0) and the second case q is when consumers value newest goods more (
m1 q

(q) > 0 for all q [0, Q(t )]).


Chapter 2

Optimal Control of Heterogeneous Systems with Endogenous Domain of Heterogeneity
2.1 Introduction
Generally speaking, in our terminology heterogeneous control systems are represented by a family of controlled ODEs parameterized by a parameter q varying in a measurable space Q , called domain of heterogeneity. The family is coupled by "aggregated states" (involving integration over Q ) that enter in all ODEs and in the respective initial conditions. For each q Q the respective ¯ ODE has its own time-interval [ (q), (q)] in which it matters, at the beginning of which an initial condition is posed. A basic theory of optimal control problems for such heterogeneous systems is presented in (Veliov, 2008). Applications in many areas are indicated in this study, including problems in population dynamics, epidemiology, air pollution control, and several problems in economics. In (Veliov, 2008) the initial time (q) for the q-th ODE is exogenously given. In order to explain what are the motivations for the present research, let us consider a particular economic context originating from (Jones, 1976; Dixit & Stiglitz, 1977) in which the parameter q is interpreted as an available technology (or consumption good) and (q) is the time at which the q-th technology emerges. Then, in the spirit of the endogenous economic growth literature , 43


44

Optimal Control of Heterogeneous Systems with Endogenous Domain of Heterogeneity

e.g. (Romer, 1990; G. M. Grossman & Helpman, 1991b; Jones, 1995a), (q) depends on the rate of technological advancement which depends, in its turn, on the investments in R&D. That is, the time, (q), of appearance of the q-th technology is endogenous. Such a situation, where the domain of heterogeneity at each time is endogenously determined, is not covered by the results in (Veliov, 2008) and by the existing literature. In the endogenous growth literature involving a variety of technologies/products it is assumed that the products are indistinguishable, therefore the amount of physical capital allocated to production of each of them is equal (Barro & Martin, 2004). This reduces the originally distributed optimal control problem to a lumped one. In the present research we address the issue of endogenous domain of heterogeneity. The class of problems that we consider is relatively simple and does not contain many of the economic applications. Such will be presented in follow-up publications in economics-oriented journals. This is because here we want to stress the mathematical challenges that the endogenous domain of heterogeneity brings in the optimal control context. The main trouble is caused by the fact that the objective value considered as a function of the control is, in general, non-differentiable (in any reasonable space setting). This effect does not arise in standard optimal control problems with smooth data if the set of admissible controls is a subset of L . As a result of this intrinsic nondifferentiability, the necessary optimality condition of Pontryagin's type takes a non-standard form in which the adjoint systems is represented by a differential inclusion (rather than equation) although the data are assumed smooth with respect to the state variables. However, this situation may happen only if the optimal control is "irregular" with respect to the parameter of heterogeneity q, which is hard to exclude a priori. The "irregularity" of the optimal control that requires the abovementioned non-standard form of the maximum principle does not happen in the economic application we have in mind, thus it is perhaps mainly of "academic interest". If the optimal control is "regular" enough, the optimality conditions take a form corresponding to the heuristic application of the Lagrange principle. For example, it is possible to prove that the optimal control in the more specific problem, considered in (Skritek, Tsachev, & Veliov, 2014), satisfies the regularity condition discussed above. As a consequence of this the optimality conditions of Pontryagins type, which are good enough for analytic and numerical investigation can be obtained. However, the trouble with non-differentiability of the objective function still remains. It creates certain difficulties in the derivation of the optimality condition in the form of global maximum principle, therefore we present it


2.2. Formulation of the problem

45

below in detail. The possible non-differentiability also requires a special care about the numerical approaches to the problem based on gradient-type methods. The chapter is organized as follows. In Section 2.2 we formulate the problem and the assumptions. In Section 2.3 we derive a necessary optimality condition without any a priori assumptions for the optimal control. In this optimality condition the adjoint variable satisfies a differential inclusion and the maximization of the Hamiltonian takes the form of "min-max". From here, under certain regularity of the optimal control we derive also a Pontryagin-type maximum principle. Section 2.4 presents a formula for the derivative of the objective function with respect to the control from L and a sufficient condition for its existence. Moreover, a version of the gradient projection method in the control space is briefly described, which at each iteration involves only controls with respect to which the objective function is differentiable. In Section 2.5 we give two examples of non-differentiability which justify the special treatment in the preceding two sections. Section 2.6 gives a stylized economic example. In a simple case we obtain the solution analytically and show that the existence issue is complicated: an optimal solution exists for some configurations of the parameters and fails to exist for others. Also, numerical results are presented and interpreted. One longer proof is shifted to Appendix.

2.2 Formulation of the problem
¯ Let [0, T ] be a fixed time-interval and let [0, Q] be an interval where the param¯ eter of heterogeneity, q, will take values (here Q > 0 could be +, in which ¯ case the interval should be interpreted as [0, )). Denote D = [0, T ] â [0, Q]. 1 The state variables in the model below will be the functions ¯ x : D Rn , Q : [0, T ] [0, Q], y : [0, T ] Rm , while u : D U Rr will be a control function. The optimal control problem we consider reads as follows:
T Q(t )

max
u 0 0

L(t , q, x(t , q), Q(t ), y(t ), u(t , q)) dq dt ,

(2.1)

subject to the equations Q(t ) = g(t , Q(t ), y(t )),
1

Q(0) = Q0 0, t [0, T ],

(2.2)

In economics q, for instance, can denote technology, Q the technological frontier, and y can be some additional externality.


46

Optimal Control of Heterogeneous Systems with Endogenous Domain of Heterogeneity

q ¯ Q6 Q(·) q xb (t ) - x(t , q- f - )=
-

Q

0

t =Q

[u]-1

-t

(q)

T

Figure 2.1: Increasing domain of heterogeneity
Q(t )

y(t ) =
0

h(t , q, u(t , q)) dq, f (t , q, x(t , q), Q(t ), y(t ), u(t , q)), x(0, q) = x (q), q [0, Q ],
0 0

(2.3) (2.4)

x(t , q) =

x(t , Q(t )) = xb (t ), t (0, T ], u(t , q) U . Here ¯ L : D â Rn â [0, Q] â Rm â U R, ¯ f : D â Rn â [0, Q] â Rm â U Rn , g : D â Rm R, h : D â U Rm , x(t , q) is the derivative with respect to t . The informal meaning is as follows, see Figure 2.1. Given a control function u with values in U , equations (2.2) and (2.3) define the interval [0, Q(t )] in which the parameter q takes values at time t . The state y(t ) represents an aggregated (over the domain of heterogeneity [0, Q(t )]) quantity. Equation (2.4) with the respective side conditions defines the distributed state x. Then the objective functional (2.1) is to be maximized with respect to the control u. Before giving the formal definition of the problem we enlist Standing Assumptions (i) ­ (vi) which will hold throughout the chapter: (i) The set U Rr is compact. (ii) The functions L, f , g, h are measurable in (t , q) and continuous in the rest (2.5)


2.2. Formulation of the problem

47

of the variables, locally essentially bounded, differentiable in (x, Q, y), with locally Lipschitz partial derivatives, uniformly with respect to u U and (t , q) D. The function h is locally Lipschitz continuous in u uniformly with respect to (t , q) D. ¯ (iii) g(t , Q, y) 0 > 0 for every (t , Q) [0, T ] â [Q0 , Q] and every y = 0Q h(t , q, u) dq for some u U . (iv) xb : [0, T ] Rn is continuously differentiable, x0 : [0, Q0 ] Rn is measurable and bounded. Denote U = {u L (D) : u(t , q) U }. Since for any given u U one can represent ( g(t , Q, y(t )) = g t , Q,
0 Q

) h(t , q, u(t , q)) dq

and the function in the right-hand side is Lipschitz in Q, equation (2.2) has locally a solution Q = Q[u] and it is unique on its maximal interval of existence in [0, T ]. ¯ Standing Assumption (v): For every u U the solution Q[u] exists in [0, Q] on the whole interval [0, T ]. ¯ Given u U , we define for q [0, Q] 0 Q[u]-1 (q) [u](q) = T if q [0, Q0 ], if q (Q0 , Q[u](T )), ¯ if q [Q[u](T ), Q].

(2.6)

The definition is correct, since Q[u] is invertible according to Assumption (iii) and its image is [Q0 , Q[u](T )], see Figure 2.1. Notice that [u] is Lipschitz continuous with constant 1/0 due to Assumption (iii). We extend the definition of xb by setting {0 x (q) if q [0, Q0 ], b x (t , q) = (2.7) ¯ xb (t ) if q (Q0 , Q]. ¯ Then we may view (2.4) as a family of ODEs (one for each q [0, Q]), where the functions y = y[u] and Q = Q[u] are already defined from (2.2), (2.3) as described above. For each such q the solution x[u] of (2.4) with the additional condition x( [u](q), q) = xb ( [u](q), q) locally exists and is unique on its max¯ imal interval of existence in [ [u](q), T ]. We extend x[u] for q (Q0 , Q] and b (t ). t [0, [u](q)) as x[u](t , q) = x ¯ Standing Assumption (vi): For every u U and for almost every q [0, Q] the solution x[u](·, q) exists on [0, T ].


48

Optimal Control of Heterogeneous Systems with Endogenous Domain of Heterogeneity

Due to the continuous dependence of the solution of the ODE (2.4) on the parameter q and on the initial data, and due to the measurability of a superposition of a measurable and a continuous function, x[u] is measurable with respect to q. Then the meaning of the optimization problem (2.1) is clear. We mention that several of the Standing Assumptions could be relaxed, however at a certain price. In some extensions this price is just a technical complication, as for example considering a function h in (2.3) depending also on x, or requiring only continuity (instead on Lipschitz continuity) with respect to u in Assumption (ii). Other extensions require a substantial additional analysis. For example, dependence of the side condition xb (t ) on y or on an additional non-distributed control v(t ). A third class of extensions require conceptual clarification. This concerns mainly Assumption (iii). How the solution should be defined if Q(t ) is not strictly monotone increasing? Apparently the "right" definition depends on the particular application. All these extensions have clear interpretations in several economic contexts and is investigated for a particular economic models in forthcoming papers, see e.g. (Skritek et al., 2014). In this study, however, we focus on the mathematical complication that the endogenous heterogeneity brings already in the simplest case which is general enough to cover some applications (see Section 2.6 for an example).

2.3 The optimality conditions
In this section we derive necessary optimality conditions of Pontryagin's type for the problem (2.1)­(2.5). What makes this derivation not a routine work, is that the objective functional (2.1) is, in general, non-differentiable with respect to the control function, as it will be demonstrated in Section 2.5. Moreover, the form of the maximum principle a la Pontryagin is non-standard, in general, although under an additional (non-restrictive for the typical applications) condition it takes a form that could be heuristically derived by an appropriate application of the Lagrange multiplier rule. The problem of existence of an optimal solution is not systematically investigated in this chapter, although it is also challenging, as we show in Section 2.6. To make the expressions below more compact we interpret x, y and u as columns, in contrast to the adjoint variables and that will be involved later, which are viewed as row-vectors with corresponding dimensions. We start with a variational analysis the result of which will be summarized in Proposition 1 below. It will be used in the proofs of the three theorems to follow in this and in the next section. Let u U be fixed and let u U be a sequence of controls parame-


2.3. The optimality conditions

49

terized by positive 0. We shall denote by x, Q and y the values of the state variables corresponding to the control u and by x , Q and y ­ the values of the state variables corresponding to the control u . Further, we denote u = u - u, Q = Q - Q, etc. For the sequence of controls u we require that there exists a constant c such that for all sufficiently small uL1
T
0 (D)

+ yL1

(0,T )

+ QC

(0,T )

+ max x(t , ·)
t [0,T ]

¯ L1 (0,Q)

c , (2.8)

u(t , ·)

¯ L (0,Q)

dt + yL

(0,T )

+ max x(t , ·)L
t [0,T ]

¯ (0,Q)

c , (2.9)

where · C(0,T ) denotes the max-norm of the space of the continuous functions on [0, T ] and · L p is the usual L p -norm for p = 1 or p = . There are two essential cases of sequences u for which the above requirements are satisfied: Case 1: L1 (-simple needle)-variation, where { v if (t , q) [ , + ] â [ , + ], u (t , q) = u(t , q) elsewhere ¯ and [0, T ), [0, Q) and v U are arbitrarily fixed. Case 2: L -variation, where u = u + v and v L (D) is such that u (t , q) U for all sufficiently small . Using assumptions (i), (ii), (iii) and (v) it can be easily verified that in both cases requirements (2.8) and (2.9) are fulfilled. Let us denote by J (v) the value of the objective function (2.1) corresponding to v U . By a similar analysis as in the usual proof of the Pontryagin maximum principle for ODE control systems with unconstrained state we obtain the following result. For every function : D Rn which is absolutely ¯ continuous in t for a.e. q [0, Q], with L (D) and (T , q) = 0, for every absolutely continuous function µ : [0, T ] R satisfying µ (T ) = 0, and for every L (0, T ) the following variational representation holds (the proof is not straightforward and will be presented in Appendix): J = +
0

T Q(t ) T[
0 0

[Lx + + fx ]x dq dt
Q(t )
0

µ + µ gQ +

(LQ + fQ ) dq


50

Optimal Control of Heterogeneous Systems with Endogenous Domain of Heterogeneity

] Q (t ) 1 b [L + ( f - x ) + h] dq Q(t ) dt + Q(t ) Q(t ) ] T[ Q(t ) + - + µ gy + (Ly + fy ) dq y dt
T Q(t )
0 0

+
0 0

[u L + u f + u h] dq dt + o( ).
Q (t )

(2.10)

1 In the case Q(t ) = 0 the term Q(t ) Q(t ) may be defined as any real number. Here the arguments (t , q, x(t , q), Q(t ), y(t ), u(t , q)) are skipped, u L = L(u (t , q)) - L (similarly for the other terms u . . .), and as usual o( )/ 0. This notational convention will be systematically used below: the arguments of the functions will appear only if they are different from those mentioned above, or in order to avoid confusion, or for more clarity. Let be defined as the solution of the adjoint equation

- (t , q) = Lx (t , q) + (t , q) fx (t , q),

(T , q) = 0.

(2.11)

Then the first term in the right-hand side of (2.10) disappears. This equation ¯ has a unique solution on [0, T ] in a similar sense as (2.4): for every q [0, Q] L (D), as it is a linear ODE with end condition (T , q) = 0. Clearly, required. Given the control u U and the corresponding solution of (2.2)­(2.4) and adjoint function , let us define for t [0, T ] and µ R the set [ 1 Q(t )+ (t , µ ) = Limsup L(t , q) + (t , q)( f (t , q) - xb (t )) Q(t ) 0, =0 ] + (µ gy (t ) + (t ))h(t , q) dq, (2.12) where
Q(t )
0

(t ) =
0, =0

(Ly (t , q) + (t , q) fy (t , q)) dq

(2.13)

and Limsup G( ) is the Kuratowski upper limit of a function G at = 0, consisting of all condensation points of sequences G(k ) with k 0, k = 0. Thanks to the continuity of the right-hand side in (2.12) with respect to > 0 it is easy to prove that (t , µ ) is a compact interval. For the same reason it is easy to prove that : [0, T ] â R R is measurable in t and Lipschitz in µ (theorems 8.2.8 and 8.2.5 in (Aubin & Frankowska, 1990) are used in the proof of the measurability). The Lipschitz continuity holds due to gy hL (D) < ,


2.3. The optimality conditions

51

which is a Lipschitz constant of with respect to µ . Then the differential inclusion -µ (t ) µ (t )gQ (t ) + (t ) + (t , µ (t )), with

µ (T ) = 0

(2.14)

(t ) :=

Q(t )
0

[LQ (t , q) + (t , q) fQ (t , q)] dq

(2.15)

has at least one solution (see e.g. (Aubin, 1991, Theorem 11.7.1)), therefore its reachable set R(t ) is nonempty for every t [0, T ]. Let us define the measurable functions { 1 if Q (t ) = Q(t ), s (t ) = 0 if Q (t ) = Q(t ), and d (t ) = Q(t ) + 2 s (t ).

Now consider the equation ~ ~ -µ (t ) = µ (t )gQ (t ) + (t ) Q(t )+d (t ) 1 ~ + [L + ( f - xb (t )) + (µ (t )gy (t ) + (t ))h] dq, d (t ) Q(t ) ~ µ (T ) = 0. (2.16) Notice that d (t ) = 0 for any > 0 and t . Since the right-hand side of (2.16) ~ ~ is linear in µ uniformly in t and measurable in t , it has a solution µ (t ). Then denoting ~ ~ (t ) = µ (t )gy (t ) + (t ) (2.17) ~~ and inserting , µ , in (2.10) we obtain J =
T Q(t )
0 0

~ [u L + u f + u h] dq dt + o( ).

(2.18)

We have ( Q(t 1 dist d (t ) Q(t )

)+d (t )

~ [L + ( f - xb (t )) + (µ (t )gy (t ) + (t ))h] dq, ) ~ (t , µ (t )) := (t ) - 0
0

for every t , where is bounded in and t . This easily follows from the continuity with respect to µ of the mappings involved, the uniform boundedness


52

Optimal Control of Heterogeneous Systems with Endogenous Domain of Heterogeneity

~ of µ and the definition of . By the Filippov theorem (Aubin & Frankowska, 1990, Theorem 10.4.1) we obtain that there exists a solution µ (t ) of (2.14) such that ~ |µ (t ) - µ (t )| C
T 0

(t ) dt

- 0

0.

Then using (2.8), (2.17), (2.18), and the Lipschitz continuity of h with respect to u (cf. Standing Assumption (ii)) we obtain that J = where
T Q(t )
0 0

[u L + u f + u h] dq dt + o( ),

(2.19)

(t ) = µ (t )gy (t ) + (t ).

(2.20)

Let us summarize what we have obtained so far. Proposition 1 Let u U be arbitrarily fixed and let u U be a sequence of controls satisfying (2.8), (2.9). Then equation (2.19) holds true with the adjoint function L (D) defined in (2.11) and some solution µ of (2.14) and the corresponding defined by (2.20). Here and µ are absolutely continuous in t with L (D), µ L [0, T ], and are defined by (2.13) and (2.15), respectively. We shall make use of the above finding in two ways. First, in this section, we shall consider the needle variation defined in Case 1 in order to obtain a specific global maximum principle for the problem in consideration, then in Section 2.4 we shall use variations as in Case 2 to express the derivative of the objective functional with respect to the control under additional conditions that ensure its existence. ~ Define H : D â Rn+1+m+r+n+m R as ~ H (t , q, x, Q, y, u, , ) = L(t , q, x, Q, y, u) + f (t , q, x, Q, y, u) + h(t , q, u). Theorem 1 Let u L (D) be an optimal control in the problem (2.1)­(2.5) and let z := (x, Q, y) be the corresponding trajectory. Let be the solution of the adjoint equation (2.11) (it exists and is unique on D) and let , and be defined by (2.13), (2.15) and (2.12), respectively. Then the reachable set R(t ) of the differential inclusion (2.14) is nonempty, and for almost every (t , q) D(u) := {(t , q) : t [0, T ), q [0, Q(t ))} ( ) ~ ~ max min H (t , q, z(t , q), v, (t , q), ) - H (t , q, z(t , q), u(t , q), (t , q), ) 0, v (2.21) subject to v U and R(t )gy (t ) + (t ).


2.3. The optimality conditions

53

Proof We shall sketch the proof emphasizing only the points that are unusual. Assume that the claim of the theorem is not true. Denote by (t , q, v) the function "min . . ." in (2.21). This function is measurable in (t , q), according to (Aubin & Frankowska, 1990, Theorem 8.2.11). In a routine way we obtain that there exists a set D(u) with positive measure, > 0 and v U such that (t , q, v) for (t , q) . Let ( , ) be a Lebesgue point of , that is, meas( B ) = + o( ), (2.22) where B is the box [ , + ] â [ , + ]. Let u be the simple needle control variation defined in Case 1, and let ( , µ , ) be the functions from Proposition 1. Then according to (2.19) J =

B

~ ~ [H (t , q, v, (t )) - H (t , q, (t ))] dq dt + o( ),

where, consistently with our notational convention: ~ ~ H (t , q, v, ) = H (t , q, z(t , q), v, (t , q), ), ~ ~ H (t , q, ) = H (t , q, z(t , q), u(t , q), (t , q), ). Then taking into account that (t ) R(t )gy (t ) + (t ) we obtain from the inequality (t , q, v) and (2.22) that J

B

d(q, t ) +


B \

. . . d(q, t ) + o( ) = + o( ).

Since for sufficiently small > 0 the right-hand side is strictly positive we come to a contradiction. The above theorem gives information about the optimal control only for q [0, Q(t )]. Obviously the values of u for q > Q(t ) are irrelevant for the objective value. In the applications we have in mind the optimal control is regular enough to reduce the inclusion in (2.14) to an equation. Below we elaborate on this case, starting with Assumption (vii): The functions L, Lx , f , fx and h are continuous with respect to q (uniformly in the rest of the variables); the optimal control u is (equivalent to) a function which is continuous from the left with respect to q at q = Q(t ) for a.e. t [0, T ]. Since, as just mentioned, the values of u for q > Q(t ) are irrelevant for the objective function (2.1), we may redefine the optimal u as u(t , q) = u(t , Q(t )) for q > Q(t ). Moreover, because of assumption (vii), the functions x(t , ·) and


54

Optimal Control of Heterogeneous Systems with Endogenous Domain of Heterogeneity

(t , ·) are continuous in q at q = Q(t ), thus x(t , Q(t )) and (t , Q(t )) are well defined. Then the set (t , µ ) is a singleton:
(t , µ ) = L(t , Q(t )) + (t , Q(t ))( f (t , Q(t )) - xb (t )) + (µ gy (t ) + (t ))h(t , Q(t )) and (2.14) becomes an equation. Let µ (·) be its solution and (·) be obtained from µ (·) by (2.20). Then µ (·) and (·) solve the equations -µ (t ) = µ (t )gQ (t ) + L(t , Q(t )) + (t , Q(t ))( f (t , Q(t )) - xb (t )) + (t )h(t , Q(t ))
Q(t )
0

+

[LQ (t , q) + (t , q) fQ (t , q)] dq,
Q(t )
0

µ (T ) = 0, (2.23)
(2.24)

(t ) = µ (t )gy (t ) +

[Ly (t , q) + (t , q) fy (t , q)] dq.

Notice that according to the notational convention and the side condition x(t , Q(t )) = xb (t ) we have L(t , Q(t )) = L(t , Q(t ), xb (t ), Q(t ), y(t ), u(t , Q(t ))) and similarly for the other functions above. Now we introduce the function (having in many respects the traditional meaning of "hamiltonian") H : D â Rn+1+m+r+n+1+m R as H (t , q, x, Q, y, u, , µ , ) = L(t , q, x, Q, y, u) + f (t , q, x, Q, y, u) + µ g(t , Q, y) + h(t , q, u) where the arguments in the left-hand side should be inserted in the right-hand side wherever appropriate. Then Theorem 1 implies that there exists a unique solution of adjoint equations, as states in the following theorem. Theorem 2 Under assumptions (i) ­ (vii) denote by u L (D) an optimal control in the problem (2.1)­(2.5) and by z := (x, Q, y) the corresponding trajectory. Then the adjoint system (2.11), (2.23)­(2.24) has a unique solution := ( , µ , ) and for a.e. t [0, T ] and a.e. q [0, Q(t )] H (t , q, z(t , q), u(t , q), (t , q)) = max H (t , q, z(t , q), u, (t , q)).
uU

In the next section, we elaborate on numerical procedure of finding an optimal control.


2.4. A numerical approach

55

2.4 A numerical approach
We include this section in the chapter for the following reason. It turns out (as shown in the next section by two examples) that the objective functional J (u) is not differentiable in u L (D), in general. Even more intriguing, although the value of J (u) does not depend on the values of u(t , q) for q > Q[u](t ), the differentiability property may depend on how u(t , q) is defined above the graph of Q. This fact requires a special analysis which is the main point in this section. For solving numerically problem (2.1)­(2.5) we apply a version of the gradient projection method in the control space L (D). Denoting as before by J (u) the objective value corresponding to control u U , the gradient projection method consists of the following. Denote by J (u) the derivative of J with respect to u in L (D) (if it exists). Given the current "approximation" uk U of the optimal control, assume that J (uk ) can be represented by a function from L (D) (this will be proved in Theorem 3 below, see equation (2.25)). We find a next approximation uk+1 as u
k+1

= PU (uk + k J (uk )).

where PX means the metric projection on X . Clearly ) ( uk+1 (t , q) = PU uk (t , q) + k J (uk )(t , q) . The choice of the parameter k 0 is a subject of many publications on gradient methods and it is beyond the scope of current discussion. Denote by Ulc the set of those u U that are continuous from the left at q = Q[u](t ) for a.e. t [0, T ] and by Uc the set of those u U that are continuous at Q[u](t ) for a.e. t [0, T ]. First of all, in addition to assumptions (i)­(vii) we introduce the following one. Assumption (viii). The functions L, f , h are continuously differentiable with respect to u, uniformly with respect to the rest of the variables. Theorem 3 Under assumptions (i)­(viii) the objective function J (u) is Gateaux-differentiable at every u Uc and the derivative J (u) can be represented by the function { Hu (t , q), if q Q(t ), J (u) = (2.25) 0 , if q > Q(t ), where ( , µ , ) is the solution of the adjoint system (2.11), (2.23)­(2.24).


56

Optimal Control of Heterogeneous Systems with Endogenous Domain of Heterogeneity

Proof Let u be defined as in Case 2: u = u + v, in Section 2.3, where v L (D). Then from (2.19) and the fact that now does not depend on we obtain J =
T Q(t )
0

=


0

0 T Q(t ) 0

[H (t , q, u(t , q) + v(t , q)) - H (t , q, u(t , q))] dq dt + o( ) Hu (t , q)v(t , q) dq dt + o( ),

due to the uniform continuous differentiability of H in u, which means that the functional J is Gateaux differentiable at u in the space L (D) and its derivative is (represented by) (2.25). In order to ensure that J (u) stays continuous from the left, we shall modify the gradient projection procedure in such a way that uk Uc . Indeed, assume that uk Uc . Then it is easy to verify (using the continuous dependence of the solution of an ODE on parameters) that Hu is also continuous in q at q = Q(t ) for a.e. t . Then J (u) is continuous from the left at q = Q(t ) due to (2.25). Since PU is a continuous mapping, we obtain that uk+1 (t , q) := PU (uk (t , q) + k J (uk )(t , q)) has the same property. Define the ~ operator I : Ulc Uc as { u(t , q) , for q Q[u](t ), I (u)(t , q) = u(t , Q[u](t )) , for q > Q[u](t ). Notice that J (I (u)) = J (u), but J is differentiable at I (u), while it need not be differentiable at u. Then we define the next iteration as ( ( )) uk+1 (t , q) = I PU uk (t , q) + k J (uk )(t , q) . The numerical implementation involves discretization with respect to t and q which contains delicate technical points due to the changing domain [0, Q(t )]. The author wrote in MATLAB a program tackling these points, using adaptive discretization with respect to q. The detailed description of the program is outside the scope of the present study. Some results of calculations by the program are presented in Section 2.6 for an economic example.

2.5 Two examples of non-differentiability of the objective function
As mentioned in the previous section the objective value J (u) in (2.1) considered as a function of the control could be non-differentiable in the space L (D).


2.5. Two examples of non-differentiability of the objective function

57

We distinguish two different cases of non-differentiability: one is harmless, while the other is not and requires the non-standard form of the maximum principle considered in the previous section. 1. Let u U and let Q(t ) = Q[u](t ) be the corresponding solution of (2.2), (2.3). As argued in Section 2.2, the respective solution (x[u], Q[u], y[u]) is independent of the values of u for q > Q(t ), and if u is continuous from the left with respect to q at q = Q(t ) for a.e. t , then u can be redefined to become continuous in q for q = Q(t ), e.g. as u(t , q) = u(t , Q(t )) for q > Q(t ) (we use the notation u# for the redefined control). Then J is Gateaux differentiable at u# . This is the "harmless" case of possible non-differentiability of J (u), which can be avoided by the redefinition of u for q > Q(t ). Although the objective value J (u) does not depend on the values of u for q > Q(t ), its differentiability does, and requires that u be continuous in q also from the right at q = Q(t ) (which means that the redefinition of u as u# is essential). The example below shows this. Example 1. Consider the system Q(t ) = y(t ), y(t ) =
0

Q(t )

Q(0) = 1. u(t , q) dq
1

and the functional
1 Q(t )

J (u) =
0 0

1 dq dt =
0

Q(t ) dt .

This is not interpreted as an optimal control problem. We just study the differentiability of the functional J , defined for u L ([0, 1] â [0, )). Fix the control { 1, if q [0, et ], u(t , q) = a, if q > et , where a is a real number. Then consider the controls uh (t , q) = u(t , q) - h 1 and uh (t , q) = u(t , q) + h, h > 0, and the corresponding solutions (Qh , yh ), and ii 2 compare with the solution for u, which is obviously Q(t ) = et . It can also be directly checked that for h close to zero we have Qh (t ) = e(1-h)t = Q(t ) - ht et + O(h2 ). 1 The function F (t , Q) = 0Q u(t , q) dq is Lipschitz in Q and measurable in t , and Q(t ) and Qh (t ) satisfy the equations 2 Q(t ) = F (t , Q(t )), 2 Qh (t ) = F (t , Qh (t )) + hQh (t ), 2 2 Q(0) = Qh (0) = 1. 2



58

Optimal Control of Heterogeneous Systems with Endogenous Domain of Heterogeneity

By a standard comparison argument (or as a consequence of the viability theory, (Aubin, 1991, Chapter 11)) this implies Qh (t ) Q(t ). Hence 2 yh (t ) = 2
Q(t )
0

uh (t , q) dq + 2

Qh (t ) 2
Q(t )

uh (t , q) dq 2

= (1 + h)Q(t ) + (a + h)(Qh (t ) - Q(t )) 2 = (a + h)Qh (t ) + (1 - a)Q(t ). 2 Then the equation for Qh becomes 2 2 Qh (t ) = (a + h)Qh (t ) + (1 - a)Q(t ), 2 Qh (0) = 1. 2

Using the Cauchy formula we obtain after routine calculations that ( ) h h h t Q2 (t ) = e 1 + - e(a+h)t 1-a-h 1-a-h (t ) h h = Q(t ) + ht eat + O(h2 ). e - eat - 1-a-h 1-a-h Now we consider the case a = 1 (hence u is discontinuous from the right). Then the second last term in the above equality is also O(h2 ), thus for h close to zero we have Qh (t ) = Q(t ) + 2 Then J (u - h) - J (u) = -h while J (u + h) - J (u) = ) h (t e - eat + O(h2 ). 1-a
1
0

t et dt + O(h2 ),

1( ) h et - eat dt + O(h2 ). 1-a 0 The last two equalities show that J is not differentiable if a = 1. In contrast, if a = 1 the objective J is differentiable in the direction ±1 (as it was proved in the previous section) since in this case

Qh (t ) = Q(t ) + ht et + O(h2 ). 2 The above attribute to the 2. Below redefinition of plicable if u is example shows, in particular, that the special attention that we definition of u above Q[u](t ) in the previous section is essential. we give another example, which shows that the "remedy" of u above Q(t ) in order to get differentiability of J (u) is not aptoo "bad" below Q(t ). This example justifies the non-standard


2.5. Two examples of non-differentiability of the objective function

59

form of the maximum principle in the case of an optimal control that is not continuous from the left at Q(t ). Example 2. Consider the system Q(t ) = y(t ), y(t ) =
0

Q(t )

Q(0) = 1, u(t , q) dq,

t [0, 1]

(2.26) (2.27)

and the functional J (u) =

1 Q(t )

u(t , q) dq dt .
0 0

As in Example 1, this is not interpreted as an optimal control problem. We just study the differentiability of the functional J , defined for u L ([0, 1] â [0, )). We shall prove that for some u the functional J is not even directionally differentiable at u in the direction of { ¯ 0 for (t , q) [0, t ] â [0, ) w(t , q) = ¯ -1 for (t , q) (t , 1] â [0, ), ¯ where t (0, 1) is to be chosen later sufficiently close to 1. Namely, we define { u(t , q) u(q) = v(1 - q) 0 for q [0, 1) for q 1

where v(q) is defined on (0, 1] as v(1) = -1 and { v(q) = 1 -1 [1 2) for q [ 3k , 3k ) k = 1, 2, . . . , 23 for q 3k , 3k k = 1, 2, . . . .

The solution of (2.26), (2.27) for u is Q(t ) = 1, y(t ) = 0, since
1

y(t ) =
0

v(1 - q) dq =

1

v(q) dq = 0.
0

1 Denoting V (q) = v(s) ds for q (-, ), we have 0 V (q) 2 q for 0) (3 (2) 2 q [0, 1] and also V 3k = 0 and V 3k = 1 · 3k for k = 1, 2, 3, ... . 2

Now, let u = u + w, where is a "small" positive real number. Denote by Q and y the corresponding solution of (2.26), (2.27). It is clear that ¯ ¯ u (t , q) u(t , q) on [0, t ] â [0, ) and, hence, Q (t ) Q(t ) 1 on [0, t ]. Is is ¯ also clear that Q (t ) Q(t ) for t [t , 1].
q


60

Optimal Control of Heterogeneous Systems with Endogenous Domain of Heterogeneity

¯ For t [t , 1] we have Q (t ) =

Q (t )
0

[u(q) - ] dq = - +

Q (t )
1

[u(q) -

] dq. ¯ ¯ ¯ Defining (t ) := Q (t ) - (1 - (t - t )) for t [t , 1], we have (t ) = 0 and
(t ) =

1
¯ 1- (t -t )+ (t )

[u(q) - ] dq [v(s) - ] ds

=-


0

¯ (t -t )- (t )

¯ ¯ = -V [ (t - t ) - (t )] + [ (t - t ) - (t )]. ¯ Case 1. Assume ( ) > 0 for [t0 , t ] [t , 1] where (t0 ) = 0. Then 0 < ( ) i.e.

t
0

1 ¯ ¯ 2 (µ - t ) dµ 2 ( - t )2 = O( 2 ) for [t0 , t ], (2.28) 2

¯ Q ( ) = 1 - ( - t ) + O( 2 ) for [t0 , t ].

(2.29)

¯ Case 2. Assume ( ) < 0 for [t0 , t ] [t , 1] where (t0 ) = 0. Then 1 V (q) q 2 and

t
0

( ) =
yield

¯ ¯ [-V ( (µ - t ) - (µ )) + ( (µ - t ) - (µ ))] dµ

( 0 < - ( )

1 - 2

)
2

1 ¯ ( - t )2 + 2


t
0

(- (µ )) dµ .

Applying the Gronwall inequality (cf., e.g., (Corduneanu, 1971), p. 14) we obtain ( ) 1 1 21 ¯ ¯ 0 < - ( ) - ( - t )2 et -t0 ( - t )2 e1-t¯ for [t0 , t ]. 2 2 4 This implies 1 ¯ ¯ Q ( ) 1 - ( - t ) - ( - t )2 e1 4
¯ -t

for [t0 , t ].

(2.30)


2.5. Two examples of non-differentiability of the objective function

61

We have J (u ) - J (u) =
1 Q (t )
0 0

[u(q) + w(t , q)] dq dt -
1 1
¯ t ¯ 1- (t -t )

1 1

u(q) dq dt
0 0

¯ = - (1 - t ) -
1 Q (t )

u(q) dq dt
1 1
¯ t Q (t )

+
¯ t

¯ 1- (t -t )

u(q) dq dt +

dq dt .

From (2.29) and (2.30) we obtain that the last term above is O( 2 ). We next estimate the third term in the last row above. From (2.28) and (2.30) we obtain that if the positive is small enough, ¯ |Q (t ) - [1 - (t - t )]| Hence,
1 Q (t )
¯ t ¯ 1- (t -t )

e1-t¯ ¯ ¯ (t - t )2 holds true for every t [1 - t , 1]. 4

u(q) dq dt

e1-t¯ 4

1
¯ t

¯ (t - t )2 d =

e1-t¯ ¯ (1 - t )3 . 12

(2.31)

We thus obtained that ¯ J (u ) - J (u) = - (1 - t ) - Z + Y + O( 2 ). where Z =
1 1

u(q) dq dt and Y =

1

Q (t )

u(q) dq dt .
¯ ¯ t 1- (t -t )

¯ ¯ t 1- (t -t )

In order to prove non-differentiability of J at u in the direction of w it is enough to show that (Z - Y )/ does not converge with 0. We have Z 1
1 1

=

¯ ¯ t 1- (t -t )

1 v(1 - q) dq dt =

( 1 t -t¯)

v(q) dq dt
¯ t 0

=

-t ) (1 ¯ -t ) (1 ¯ ¯ ¯ (1 - t )2 (1 - t )2 v(q) dq - q v(q) dq. (2.32) ¯ ¯ (1 - t ) ( (1 - t ))2
0 0


62

Optimal Control of Heterogeneous Systems with Endogenous Domain of Heterogeneity

We shall consider the last two integrals separately, for values k = k =

(2.32)

3/3k of k = 1, 2, . . . . Easy calculations give for the first integral in ¯ 1-t
( ¯ k1-t ) ( ¯ k1-t )

2/3k and ¯ 1-t

1 (1 - t ) ¯ k

0

1 v(q) dq - ¯ k (1 - t )

v(q) dq =
0

1 1 -0 = . 2 2

More cumbersome calculations, which we skip, give for the second integral in (2.32) 1 ¯ (k (1 - t ))2
( ¯ k1-t )

q v(q) dq -

0

1 ¯ (k (1 - t ))2

( ¯ k1-t )

q v(q) dq =
0

11 1 15 += . 32 8 32

Comparing the expressions for the two integrals in (2.32) we conclude that the ¯ variation of Z remains strictly positive (at least (1 - t )2 /32) for arbitrarily ¯ small . On the other hand, from (2.31) we obtain that if t (0, 1) is close |Y | ¯)2 /64 holds true. Hence (Z - Y )/ does not enough to 1, (1 - t converge with 0. This completes the proof of the non-differentiability of J at u in the direction w. The example, discussed above, can be easily modified to include the case of strictly increasing Q(·) by replacing the equation in (2.26) by Q(t ) = 1 + y(t ). Since the proof of the non-differentiability of J in this case requires longer and more cumbersome calculations, we omit it.

2.6 An economic example
In this section we present a stylized economic model of endogenous economic growth to which the above results can be applied. This economic model is a particular case of a model considered in (Skritek et al., 2014), for which regularity condition for the optimal control is fulfilled. We consider a finite time horizon [0, T ] (presumably rather large, so that T is an "approximation" of the infinity) and a large corporation producing at time t diverse goods labeled by the real number q [0, Q(t )]. Here Q(t ) is the newest good (technology) available at time t . Each of the goods q is produced by a separate firm that at time t has physical capital stock x(t , q). The q-th firm (q [0, Q(t )]) invests at time t an amount u(t , q) that is split in two parts: u(t , q), 0 1, is allocated to increase the capital stock,


2.6. An economic example

63

while (1 - )u(t , q) is the contribution of the q-th firm to the R&D activity of the corporation which results in development of new technologies (goods) and hence in increase of Q(t ). The model reads as follows: x(t , q) = - x(t , q) + u(t , q), Q(t ) = (1 - )y(t ),
Q(t )
0

x(0, q) = x0 (q) for q [0, Q0 ], x(t , Q(t )) = 0 for t > 0,
0

Q(0) = Q ,

y(t ) =

u(t , q) dq.

Here y(t ) is the total investment in R&D, 0 is the depreciation rate of the physical capital, x0 (q) is the initially available capital stock for producing goods q [0, Q0 ], Q0 > 0 is the newest technology available at time t = 0. For all technologies obtained at t > 0, initial capital stock is assumed to be zero, x(t , Q(t )) = 0. The objective function to be maximized is
T
0

e

-rt

Q(t ) [
0

] p(q, Q(t ))x(t , q) - bu(t , q) - cu2 (t , q) dq dt ,

(2.33)

subject to the control constraints u(t , q) 0 and u(t , q) 1. In the model r 0 is the discount rate, p(q, Q) is the market price of the good q [0, Q], given that goods up to level Q are available, bu + cu2 , b 0, c > 0, is the quadratic cost of investments u. The dependence of the price p on q and Q reflects the fact that the market price of any available good decreases when newer products emerge (that is, when Q increases). For the present illustrative purpose we chose the specification p(q, Q) = e- (Q-q) with 0. So the price of the newest product is normalized to one (which is supported by the data for personal computers and mobile telephones, where the price of the new products does not substantially change with time, while the quality increases). Because the integrand of the objective functional (2.33) is strictly concave in u, one may expect that optimal control would be always bounded even without the upper constraint u(t , q) 1. We will give an example, where this is not the case. The reason is that the upper limit Q(t ) of the integral also depends on u, and at particular parameters the unique optimal solution hits the upper bound for sufficiently high time horizon T , see Figure 2.3. Let us write optimality conditions. As proved in (Skritek et al., 2014), there exists a unique optimal control u(t , q) that is continuous from the left at


64

Optimal Control of Heterogeneous Systems with Endogenous Domain of Heterogeneity

q = Q[u](t ). Adjoint equations (2.11), (2.23)-(2.24) for this example, written in ~ ~ ~ the terms of the "current value" adjoint variables = ert , µ = ert µ , = ert read as follows ~ ~ ~ - (t , q) = -( + r) (t , q) + e- (Q(t )-q) , (T , q) = 0, ~ ~ -µ (t ) = -r µ (t ) - [bu(t , Q(t )) + cu2 (t , Q(t ))] ~ ~ + (t , Q(t ))u(t , Q(t )) + (t )u(t , Q(t )) -
Q(t )
0

(2.34)

e

- (Q(t )-q)

x(t , q) dq,

~ µ (T ) = 0,

(2.35) (2.36)

~ ~ (t ) = µ (t )(1 - ), and Theorem 2 implies that { { }} .

~ ~ (t , q) + (t ) - b u(t , q) = min 1, max 0, 2c

(2.37)

In the following we solve the optimal control problem for different parameter sets. In simple cases we obtain exact analytic solutions, in more complicated cases we derive asymptotic solutions and resort to numerics, as described in Section 2.4. 1. Let us consider first the case = 0, where the solution can be studied analytically. It follows from (2.37) and (2.34) that the optimal control does ~ ~ not depend on q, so (t , q) = (t ) and u(t , q) = u(t ). We will solve equa~ tions (2.34)­(2.37) backwards starting from time T . It follows from µ (T ) = 0, ~ (T ) = 0, and (2.37) that u(T ) = 0. Let [t0 , T ] be the maximal interval which ends at T and in which u(t ) = 0. To find t0 we substitute u(t ) = 0 in (2.35) and ~ ~ obtain (using that = 0) the solution µ (t ) = (t ) = 0 for t [t0 , T ]. Now we ~ (t0 ) = b/ and substitute here the solution get from (2.37) that 1 - e-(r+ ~ (t ) = r+ of equation (2.34). Thus, we obtain ( ) 1 +r t0 = T + ln 1 - b , +r
)(T -t )

when

b

+r < 1.

It the case b ( + r)/ 1 the optimal control is identically zero. In a time interval, where the control constraint is not active (u(t ) > 0) the derivative of the hamiltonian must be equal to zero: ~ ~ ~ Hu (t , q) = -b - 2cu(t , q) + (t , q) + (t ) = 0.


2.6. An economic example

65

0.35

0.3

0.25

control u

0.2

0.15

0.1

0.05

0

0

10

20 time t

30

40

50

Figure 2.2: Optimal investments for = 0, = 0, r = 0.3, T = 50, b = 0.4, c = 3, = 0.5, A > 0. Using this condition and equations (2.34)­(2.36) we can derive the following Riccati equation for the optimal control for t [0, t0 ] ( ) r 1- 2 u(t ) - -b - e-( +r)(T -t ) , u(t ) = ru(t ) - 2 2c + r 2c( + r) u(t0 ) = 0, (2.38) which turns out to be positive on [0, t0 ). Notice, that if solution u(t ) reaches the upper bound, u(t1 ) = 1, at first time instance t1 > 0 to the left from t0 , then optimal control u(t ) = 1 for all t [0, t1 ). 1.1. The solution of equation (2.38) can be obtained in special functions. Under the additional condition = 0 it takes the simpler form ( ( )) r A A rc - (t0 - t ) + arctanh , u (t ) = tanh 1 - c (1 - ) 2c A see Figure 2.2, if ( ) A := c r2 c - (1 - )( - b r) > 0.


66

Optimal Control of Heterogeneous Systems with Endogenous Domain of Heterogeneity

1 0.9 0.8 0.7 control u 0.6 0.5 0.4 0.3 0.2 0.1 0 0 10 20 time t 30 40 50

Figure 2.3: Optimal investments hit its upper bound, for = 0, = 0, r = 0.1, T = 50, b = 0.4, c = 3, = 0.5, A < 0. If A < 0 the solution is ( ( )) rc r -A -A (t0 - t ) - arctan u (t ) = + , tan 1 - c (1 - ) 2c -A which tends to infinity, if t0 - t may take sufficiently large values when t [0, t0 ]. This is always the case if T is sufficiently large, as it follows from the expression for t0 . Then the optimal control certainly hits its upper bound, u(t ) = 1 for t [0, t1 ), see Figure 2.3. Thus, in order to ensure boundedness of the solution on any time horizon even without upper control constraint, like in Figure 2.2, the discount rate should be big enough: ( ) 1 2 (1 - )2 + 4 c (1 - ) - b (1 - ) . r> b (2.39) 2c 1.2. In the case > 0 we apply an asymptotic analysis assuming that T is large. Namely, we can find the asymptotic solution of (2.38) considering times


2.6. An economic example

67 the last term in (2.38). Then the ( )] -b , +r ( )] . -b +r

t such that t0 - t is big enough for neglecting two steady state solutions are [ 1- r u1 = r+ r- 1- c [ r 1- u2 = r- r- 1- c

(2.40)

The first of which is an attractor while the second is a repeller. In inverse time t = -t point u2 becomes an attractor with the basin of attraction (u1 , -). Thus, u2 is the horizontal asymptote of the exact solution for t - if it is still in the basin u(t ) (u1 , -) when the last term in (2.38) is already insignificant, see Figure 2.4 (top). For a large time horizon [0, T ] if the optimal control u(t ) exists then most of the time t [0, t0 ] it is close to u2 when u2 > 0, or u(t ) 0 when u2 0. Nonnegativity of the expression under the square root in (2.40) gives us condition for existence of asymptote u2 ) ( 1 2 + 4 c (1 - ) - b (1 - ) - c (b (1 - ) - c) (2.41) r> 2c that generalizes condition (2.39) to all 0. It follows from (2.40) and (2.41) that when > /b even r = 0 allows for bounded solution but this solution is trivial u(t , q) 0. The essence of the above analysis is that even for a very simple economic problem the issue of existence of a solution is not simple. To ensure existence we introduce here upper constraint u(t ) 1, that has not much of economic sense. While inequality (2.41) provides a necessary condition under which a solution exists for any time horizon [0, T ] without upper constraint. 2. The above considerations concern the "degenerate" case = 0, where all data of the problem are independent of q. In the general case 0 we have obtained only numerical results.2 Figure 2.5 presents the optimal investments (top) and the corresponding capital stock (bottom) of the firm with = 0.7. The time horizon is T = 50 and the initial product variety is [0, Q0 ] = [0, 1]. Notice that due to the technological development (Q(t ) increases from 1 to about 2) the firm completely abandons investments in some older technologies. At time t = 30, for example, the firm invests only in technologies q [0.6, 1.63]. As a result, the physical capital
The numerical results are obtained by our own MATLAB solver in which the modification of the gradient projection method described in Section 2.4 is implemented.
2


68

Optimal Control of Heterogeneous Systems with Endogenous Domain of Heterogeneity

of technologies q much smaller than the technological frontier Q(t ) is close to zero.3 Every section x(·, q) has the meaning of diffusion curve of technology q. It starts at the time (q) (see (2.6)) and reaches its maximum at some later time, after which it begins decreasing. 3. In the above numerical example the dependence of the optimal control on the technology q results from the dependence of the price function p(q, Q) on q. However, a minor nonlinearity inserted in the problem may lead to technology-dependent investments even if all data of the problem are technology-independent. Let, for example, the revenue function p(q, Q) x in (2.33) be replaced with p (x + xm ) with (0, 1) and constant p > 0 and xm > 0. This corresponds to the revenue of a firm with market power or scarce non-capital production factors. The numerically obtained optimal control is plotted on Figure 2.6. Clearly, the investments in the newest technologies are larger than those of the older ones due to the higher marginal productivity of the technologies with lower capital stock. But the nature of the higher marginal productivity of new technologies is the nonlinearity of the production function rather than heterogeneity of market prises.

A more profound investigation of the issue of obsolescence in the spirit of (Boucekkine, R´ & Licandro, 2005) are yet to be done elsewhere. io,

3


2.6. An economic example

69

=0.2, r =0.3, T =50, b =0.4, c =3, =0.5, =0, Q0 =1 0.12

0.1

0.08 control u

0.06

0.04

0.02 numerical solution asymptote u2 0 0 10 20 time t =0.2, r =0.3, T =50, b =0.4, c =3, =0.5, =0, Q0 =1 30 40 50

0.4 0.3 capital x 0.2 0.1 15 0 0 10 20 30 40 50 time t 0 5 Q 10

Figure 2.4: Optimal investments (top) and the corresponding capital stock (bottom) for = 0. Numerical results are compared with asymptotic solution (dashed lines).


70

Optimal Control of Heterogeneous Systems with Endogenous Domain of Heterogeneity

=0.2, r =0.3, T =50, b =0.4, c =3, =0.5, =0.7, Q0 =1

0.1 0.08 control u 0.06 0.04 0.02 2 0 0 10 20 30 40 time t =0.2, r =0.3, T =50, b =0.4, c =3, =0.5, =0.7, Q0 =1 0 0.5 Q 1 1.5

0.2 0.15 capital x 0.1 0.05 2 0 0 10 20 30 40 time t 0 0.5 Q 1 1.5

Figure 2.5: Optimal investments (top) and the corresponding capital stock (bottom) for = 0.7


2.6. An economic example

71

=0.2, r =0.3, T =50, b =0.4, c =20, =0.5, =0.5, xm =0.1, Q0 =1

0.04

0.03 control u

0.02

0.01 2.5 0 0 10 20 30 40 time t 0 0.5 Q 1 2 1.5

=0.2, r =0.3, T =50, b =0.4, c =20, =0.5, =0.5, xm =0.1, Q0 =1

0.1 0.08 capital x 0.06 0.04 0.02 2.5 0 0 10 20 30 40 time t 0 0.5 Q 1 2 1.5

Figure 2.6: Optimal investments (top) and the corresponding capital stock (bottom) for nonlinear production function x + 0.1


72

Optimal Control of Heterogeneous Systems with Endogenous Domain of Heterogeneity

2.7 Conclusion
Optimal control of heterogeneous systems, that is, families of controlled ODEs parameterized by a parameter running over a growing domain of heterogeneity is considered. The main novelty, added in this system, is that the domain of heterogeneity is endogenous: it may depend on the control and on the state of the system. This extension is crucial for several economic applications and turns out to rise interesting mathematical problems. A necessary optimality condition is derived, where one of the adjoint variables satisfies a differential inclusion (instead of equation) and the maximization of the Hamiltonian takes the form of "min-max". As a consequence, a Pontryagin-type maximum principle is obtained under certain regularity conditions for the optimal control. A formula for the derivative of the objective function with respect to the control from L is presented together with a sufficient condition for its existence. A stylized economic example of R&D driven economic growth of a corporation consisting of continuum of firms is given. The economic model is investigated both analytically and numerically. Sufficient conditions for the optimal solution being bounded even without upper constraint were derived for homogeneous prices. The effect of obsolescence of old technologies is demonstrated on diffusion curves in the case of heterogeneous prices, when goods of new design cost more than those of old ones. We also show that a minor nonlinearity inserted in the problem may lead to technology-dependent (heterogeneous) investments even if all data of the problem are technology-independent.

Appendix
Here we prove the variational representation (2.10) under the conditions of Section 2.3 and with the notational convention made there. Consider J := J (u ) - J (u) = - =
0

T Q (t )
0 0

L(t , q, x (t , q), Q (t ), y (t ), u (t , q)) dq dt

T Q(t )
0 0 Q (t )

L(t , q, x(t , q), Q(t ), y(t ), u(t , q)) dq dt [L + Lx x + LQ Q + Ly y + u L] dq dt L dq dt + o( ).

T
0 0 T

-

Q(t )
0


2.7. Conclusion

73

The same convention is systematically used below. The above equality follows in a standard way from Assumption (ii) and (2.8), (2.9)4 . Using again (2.8) and (2.9), we obtain J =
T Q (t ) T Q(t )

L dq dt +
0 Q(t ) 0 0

[Lx x + LQ Q + Ly y + u L] dq dt + o( ).

(2.42) ¯ Let : D Rn be absolutely continuous in t for a.e. q [0, Q], L (D) ¯ and (T , q) = 0, q [0, Q]. We remind that is considered as a row-vector. Consider the value
T Q (t )
0 0 T

(t , q)[x (t , q) - xb (t )] dq dt (t , q)[x(t , q) - xb (t )] dq dt

- =
0


0

Q(t )
0

T Q (t )
0 0 T

(t , q)[x (t , q) - xb (t , q)] dq dt (t , q)[x(t , q) - xb (t , q)] dq dt


- =

Q(t )

0 Q (T )

T
(q)

- =- +

Q(T ) T
0

0

[x - xb ] dt dq

(q) Q (T ) T

[x - xb ] dt dq
[x - xb ] dt dq

Q(T )
0

0

(q) T

(q)

[x - xb ] dt dq,

where we use the side conditions for x, x and (see also (2.6) and (2.7)) and have denoted (q) := [u ](q) and (q) := [u](q). Changing back the order of integration and denoting f (t , q) = f (t , q, x , Q , y , u ) we obtain from the above equalities that
T Q (t )
0

-
4


0

0 T

(t , q)[ f (t , q) - xb (t )] dq dt (t , q)[ f (t , q) - xb (t )] dq dt

Q(t )
0

It is to be mentioned that o( ) is not necessarily of second order with respect to . It can be of order 3/2.


74

Optimal Control of Heterogeneous Systems with Endogenous Domain of Heterogeneity

=- +

T Q (t ) T Q(t )
0 0 0 0

(t , q)[x (t , q) - xb (t , q)] dq dt (2.43)

(t , q)[x(t , q) - xb (t , q)] dq dt .

In a similar way as for L and using the same notational convention we represent the left-hand side of (2.43) as
T Q(t )
0


0

0 T

[ fx x + fQ Q + fy y + u f ] dq dt [ f - xb (t )] dq dt + o( ).
(2.44)

Q (t )
Q(t )

+

The right-hand side of (2.43) can be rewritten as -
T Q(t )
0 0

x dq dt -

T Q (t )
0 Q(t )

(t , q)[x (t , q) - xb (t )] dq dt .

(2.45)

Now we shall argue that the second term in the last expression is o( ). We estimate this term by
T Q (t )
0 Q(t ) T Q (t ) Q(t )

|x (t , q) - xb (t )| dq dt L





0

|x (t , q) - x ( (q), q)| dq dt L



T Q (t )
0 Q(t ) T Q (t ) Q(t )

+ +
0

|x ( (q), q) - xb ( (q))| dq dt L |xb ( (q)) - xb (t )| dq dt L .



(2.46)

The second term in the right-hand side equals zero due to the definition of x. Let us consider the first term. Since for a.e. q the function x (·, q) is Lipschitz with a constant C (independent of q) we have
T Q (t )
0

C =C


0

Q(t ) T Q (t ) Q(t )

|x (t , q) - x ( (q), q)| dq dt |t - (q)| dq dt | (Q (t )) - (q)| dq dt |Q (t ) - q| dq dt C T c2 2 , 20

T Q (t )
0 Q(t )



C 0

T Q (t )
0 Q(t )


2.7. Conclusion

75

where we have used that is Lipschitz continuous with constant 1/0 and also (in the last inequality) that QC[0,T ] c (cf. (2.8)). The same argument applies to the last term in (2.46) since xb is Lipschitz. Thus we obtain from (2.43), (2.44) and (2.45) the equality
T Q(t )
0


0

0 T

[ x + ( fx x + fQ Q + fy y + u f )] dq dt

Q (t )
Q(t )

+

[ f - xb (t )] dq dt = o( ),

(2.47)

where is close to zero. Now we introduce an absolutely continuous function µ : [0, T ] R satisfying µ (T ) = 0. By the same argument that we used in obtaining (2.47) we obtain the equality
T
0

[µ (gQ Q + gy y) + µ Q] dt = o( ),

(2.48)

where is close to zero (note that Q and y depend on ). As before, o( ) 3/2 . Finally we introduce an (1 â m)-dimensional function L (0, T ) and obtain from (2.3) by the same argument as before that -
T
0

y dt +

T Q(t )
0 0

u h dq dt +

T Q (t )
0 Q(t )

h dq dt = o( ),

(2.49)

where is close to zero. Summing up the equalities (2.42), (2.47), (2.48), (2.49) we obtain (2.10).



Chapter 3

On the relation of country size to political system and wealth inequality
3.1 Introduction
There are few historical examples of territory trade between countries. The biggest are the purchases by the United States of America of Louisiana (2,147,000 km2 for $15,000,000) from France in 1803 and Alaska (1,717,854 km2 for ´ $7,200,000) from Russia in 1867. In XX-th century the Treaty of Petropolis between Bolivia and Brazil, signed on November 11, 1903 gave Brazil the territory of Acre (191,000 km2 ), in exchange for over 3,000 km2 of Brazilian ^ territory between the Abuna and Madeira rivers, a monetary payment of two million British pounds, and a pledge of a rail-link between the Bolivian city of Riberalta and the Brazilian city of Porto Velho. We study how wealth inequality and the decision making mechanism, adopted by a country, influence its wellbeing, measured by a social welfare function or by the amount of resources (e.g. land) accumulated in the country. We also suggest the condition under which the purchase of the resources (land) happens voluntarily rather than under military threat. We try to imagine why, for example, Alexander II, the Emperor of Russia, could have sold Alaska, and more importantly, why the United States could have bought it. Indeed, one could argue that usually countries conquer land rather than buy it. Territorial disputes, that accompany trade, have a higher probability of going to war than other kinds of disputes, for example policy disputes (see, e.g. Vasquez & Henehan, 2001). We consider conquest of the territory as a special 77


78

On the relation of country size to political system and wealth inequality

case of trading, when the buyer country pays not to the seller country but to the third player, a mercenary army, that conquers territory for the buyer, unless the seller pays to the mercenary at least the same amount to keep its territory. Which tactics would countries prefer: to fight or to deal? The same dilemma has been studied (e.g. H. I. Grossman & Mendoza, 2001a, 2001b) in the economic theory of empire building using examples of the Roman and other empires, where three strategies where considered: Uncoerced Annexation, Coerced Annexation, and Attempted Conquest. In an Uncoerced Annexation the Romans would compensate the Barbarians sufficiently to induce the Barbarians to agree to the annexation of their country by Rome. In a Coerced Annexation the Romans would induce the Barbarians to agree to the annexation of their country under the threat that the Romans will attack and try to conquer the country. In an Attempted Conquest the Romans would attack the Barbarian country. In contrast with choice-theoretic explanation by Grossman and Mendoza we tackle this problem with a game-theoretic model, where both players have the same strategic possibilities, but could have different parameters. We are to find the condition at which countries would trade their territory rather than conquer it. We try to correspond the results with terms Uncoerced Annexation, Coerced Annexation, and Attempted Conquest. We study these questions on the example of two neighboring countries which use their territories as a production factor (capital). So the countries look like firms competing for a limited resource. Land is deviled between two states. Private good produced in a country is distributed among its citizens proportionally to their shares of the state economy. These shares are inherited from the parents and, in general, not equal. The agents are assumed to be selfish and consuming all their product before end of their life. Hence, product is not accumulated though generations. Generations of agents are assumed to be not overlapping. Thus, each generation starts its life with zero amount of product, but with inherited heterogeneous distribution of shares in the future domestic production. Either country can trade part of its territory for the part of other's country production. Countries exchange territory for product, or vice versa, according to the voting rules corresponding to their forms of government. We model three forms of government: monarchy (one monarch gets all the product and makes decisions), oligarchy (citizens vote by their shares of the economy), and democracy (one citizen ­ one vote). Notice, that monarchy, in this definition, is equivalent to the situation when decision can be accepted only unanimously, i.e. every citizen has veto power. Thus, citizens choose the size of the land to buy and the amount of country's product to be given to a group of neighboring


3.1. Introduction

79

country's citizens for their decision to cede the part of their county's territory. In the presented study we try to avoid some not very realistic assumptions such as exogenous income, uniform distribution of the territory among agents, and free will or full mobility of every agent, when one independently decides which country to join. It is typical for the literature to use such assumptions for allocation of public facilities (Cremer, Kerchove, & Thisse, 1985) or for the similar problem of the equilibrium size and number of nations on the continuum of uniformly distributed individuals (Alesina & Spolaore, 1997), where each individual at the border between two countries can choose which country to join with her land. (Alesina & Spolaore, 1997) have also considered the coalition equilibrium as well as (Bolton & Roland, 1997). Charles M. Tiebout in his paper (Tiebout, 1956) considered a concept of equilibrium, where the consumer-voter is fully mobile and will move to one of the fixed number of communities where her preference pattern is best satisfied. This concept has been criticized by (Bewley, 1981). He argued that if one tries to generalize the rigorous version of Tiebout's theory in a number of interesting directions, then equilibria may no longer exist or may no longer be Pareto optimal. The existence proof of "strong Tiebout equilibrium" in (Greenberg & Weber, 1986) makes use of the notion of "consecutive games" which the authors introduce and show that for such games there always exists a partition with a nonempty core. More general research (e.g. Haimanko, Breton, & Weber, 2004) also needs special assumptions about the structure of the model in order to prove existence of equilibrium. Seems that perfect mobility and free will of agents make it difficult to find an equilibrium, therefore we will refrain from such assumptions. The more so because there are always some mobility constraints in the real world. That is why, and also for the sake of simplicity, in the presented model, migration is not allowed. Notice, that in our model a citizen does not own any particular piece of land. She receives her share of country's production, which depends on country's territory. These shares do not need to be equal for all citizens. Thus, we study the problem with heterogeneous wealth distribution among the agents with endogenous income. This work elaborates on some early author's results in (Belyakov, 2007b).


80

On the relation of country size to political system and wealth inequality

3.2 Mechanisms of border moving by land trade
We consider two neighboring countries i {1, 2} as firms with production functions fi (Si ) depending on the countries territories Si in the following way fi (Si ) 0, d fi (Si ) > 0, d Si d 2 fi (Si ) <0 d Si2 for all Si 0. (3.1)

So functions fi are positive, strictly increasing, strictly concave, and defined for the positive territory. Each country i is populated with constant number of citizens Ni > 2 which have their shares j i of domestic production. Constant Ni means that no migration is allowed even if a part of the land is transmitted to the other country and no population growth. One can imagine that all population of a country lives in the city which always stays on the territory of the country. For convenience we assume that j i are sorted in ascending order over index i and normalized, i.e.

ji

j+1 i

for all j = 1, . . . , Ni - 1,

and

j=1



Ni

j i = 1 for all i = 1, 2,

Agents have linear utilities with respect to the produced private good. Private good is the only means of payment in the ceonomy. The territory of the country stays in state property, so its citizens do not own any particular part of it. Thus, besides territory we could similarly consider any other limited natural resource or production factor such as rights to exploit deposits of gas, oil, etc. Citizens can be considered as shareholders.

3.2.1 Territory exchange mechanism and timing
The agents live only one period, and they are selfish, i.e. they do not care about future generations. So citizens of country i inherit from their parents only common country territory Si and rights for the shares j i of domestic production. During the life period of one generation the countries go through the following sequence of major events. 1. Countries produce private good fi (Si ) using their inherited territories Si . 2. Countries transfer the amounts of land Si which they choose in the result of trade or conquest. 3. Countries produce private good fi (Si + Si ) using their changed territories.


3.2. Mechanisms of border moving by land trade

81

Countries produce product twice (events 1 and 3) and may exchange territory in between (event 2).1 In the beginning of the period countries have zero amount of private good, because each generation is selfish and consumes all good to the end of its life. So country can pay for the increase of its territory only after it produces good. Hence payments can be made after any of these three events. Citizens do not discount their consumption during the time period. All land is used and its total amount is constant S1 + S2 = const , hence the sum of territory changes Si is zero, meaning S1 = -S2 . (3.2)

The government of each country can buy land from its neighbor by paying the citizens of the neighboring country compensations for loses of their income due to decreased production. Government of each country can transfer the payment (private good) personally to citizens of the other country. We assume that the payment is made before the product is redistributed among the citizens of the county that buys territory. Thus, we can say that, a proportional tax is imposed on the citizens of the buyer country, while individual compensations to citizens of the seller country look like bribes. These are the key assumptions allowing to compensate only to the selected part of the seller country society its loss of utility, that lead to "asymmetry" between buyer and seller. Sum of product transactions ti to the country should be zero t1 + t2 = 0, unless there is no additional expense. (3.3)

3.2.2 Forms of governments and decision making
The territory exchange needs approval from the citizens of both countries in the case of trade and only from the citizens of attacking country in the case of conquest. The approval is given according to the national decision making system. Due to the territory change Si the production of country i changes at the following amount fi fi (Si + Si ) - fi (Si ). (3.4)
Two production events, instead of one, will be essential in Section 3.4, where due to possibility to cheat, countries should have some good to pay before the exchange. Notice, that exchange of territory after the last production would make no sense for selfish population.
1


82

On the relation of country size to political system and wealth inequality

Let us first consider the country that sells its territory. In order person j of country i to agree with the deal her loss of utility j i fi should be compensated. We consider the following formes of government implying different voting procedures requiring minimal total compensation i fi : · Monarchy implies that only one person possesses the production and makes decisions (meaning absolute monarchy), i.e. Ni i = 1 and j i = 0 for all j = 1, . . . , Ni - 1,. Hence, agent Ni is the Monarch. Thus, in order to buy some land -Si from monarchy i one needs to compensate all loss of its domestic production - fi . The same total compensation is required in the case of unanimous rule when each citizen has veto power and therefore is to get compensation, thus

i =

j=1



Ni

j i = 1.

· Oligarchy means that citizens vote with their shares. A decision is made if people voting for it own together more than a half of domestic production. Here the buyer needs to compensate more than half of the seller country production losses, hence ( ) 1 i = min ji ji > 2 . J {1,...,Ni } jJ jJ · Democracy has the rule of majority voting. It is sufficient for the buyer to compensate the poorest half of the seller country i society (plus one person if Ni is an even number), with the total shares

i =

[Ni /2]+1 j=1



j i.

Thus, the buyer needs to pay for the same amount of land its full cost to a Monarchy, more than half of its cost to an Oligarchy, and the smallest amount to a Democracy. It follows from the following inequalities ( ) [Ni /2]+1 1 (3.5) j i J{min,Ni } j i j i > 2 1. 1,... jJ jJ j=1 Parameter i (0, 1] is the minimal relative share of country loses, that one needs to compensate in order to get agreement of the country to sell its territory. This parameter reflects the decision mechanism in a country. When


3.3. Fair land trades

83

( ) 2 i = 1 then we model country i as a Monarchy, when i [Ni /Ni]+1 , 1 we ( ] consider an Oligarchy, when i 1 , [Ni /2i]+1 it can be either an Oligarchy or N ( 1 ]2 a Democracy, and when i 0, 2 it is a Democracy. Coefficient i can also be a measure of inequality in country i, if it is a Democracy. Situation when country i buys land is different. Since shares j i of future production remain constant all citizens with j i > 0 get benefit j i fi from the increase of country territory Si > 0. Hence, all citizens agree to give their future j i fi for the territory increase Si regardless of the government form in their country. Thus, each country i sells its territory for not less than -i fi > 0 and buys territory for not more than fi > 0, because otherwise the deal would not be approved. It means that if i < 1 then country i can sell territory cheaper than buy it. This "asymmetry", as we mentioned in Section 3.2.1, comes from ability of buyer country to compensate only selected citizens of seller county their losses. Group of country citizens, that needs minimal compensation to promote a deal, will be called the ruling coalition of the country. We can say that government of each country maximizes the wealth of its ruling coalition. Thus, i is the sum of shares of ruling coalition.

3.3 Fair land trades
Let us consider fair land trading, where no conquest or cheating is allowed. So one cannot refuse to pay for the purchased land even after the final production (event 3 in section 3.2.1). Let us suppose that county 1 buys territory from country 2. The consumption of the buyer consists of the production in the first event f1 (S1 ), the production in the third event f1 (S1 ) + f1 , and negative payment t1 0. The maximization problem of the buyer with respect to S1 and t1 can be written omitting the constant 2 f1 (S1 ) f1 + t s.t. f1 + t 2 f1 (S1 ) + f1 + t
1 1 1



S1 [0,-S2 ], t1 [-t2 ,0]

max

,

(3.6) (3.7) (3.8)

0, 0,

(participation constraint) (budget constraint)

where maximization of social welfare gain f1 + t1 is equivalent to the maximization of the ruling coalition welfare gain 1 f1 + 1t1 , because all citizens are ready to pay for the increase of their country's territory, since all of them benefit from it proportionally to their constant shares j 1 . Participation


84

On the relation of country size to political system and wealth inequality

constraint (3.7) implies positive social welfare gain and hence ensures budget constraint (3.8) so we can disregard the budget constraint here. In the seller country, the not ruling part of the society is worse off, because it does not get any compensation for their loss in future production share, in all (1 - 2 ) f2 . Thus, the seller's problem is as follows

2 f2 + t2
s.t. 2 f2 + t2

S2 [-S1 ,0], t2 [0,-t1 ]

max

,

(3.9) (3.10)

0.

Expressions (3.6)­(3.10) form a bargaining problem. We assume that this problem can be solved by the governments, so that they both are interested in maximization of their total revenue, f1 + 2 f2 , from the trade. Indeed, it is easy to show that any reasonable bargaining solu tion (e.g. Nash bargaining solution) defining the payment t1 = -t2 for given territory exchange S2 = -S1 provides incentives for both governments to consider such territory exchange that maximizes the total revenue f1 + 2 f2 . It follows from the positive dependance of the resulting objective functions f1 + t1 and 2 f2 + t2 on the total revenue f1 + 2 f2 . Hence, common maximization problem is the following f 1 + 2 f s.t. f 1 + 2 f
2 2



S1 =-S2 0

max

,

(3.11) (3.12)

0,

where trade possibility condition (3.12) is obtained from the sum of budget constraints (3.7) and (3.10) taking into account relation (3.3). Problem (3.11)­ (3.12) does not contain moneyproduct transfers and defines only territory allocation which is an interest of ours. We can define the price pi

fi fi (S) = Si S

,
S=Si +Si

using expression (3.4). Thus, pi is the maximal price at which country i can buy small part of territory. Then, i pi is the price at which country i can sell small part of its territory. In Fig. 3.1 the prices pi are drown with solid lines and the prices i pi with dashed lines. In these terms the first order condition for problem (3.11) takes the following form p1 - 2 p p1 - 2 p
2 2

=0 0

for S1 > 0, for S1 = 0.

(3.13) (3.14)


3.3. Fair land trades

85 p2

p1

2 p 1 p
1

2

A B C

1

S1

S

2

2

Figure 3.1: Functions p1 and p2 of territories S1 and S2 are the marginal productivities of territory of countries 1 and 2. Functions 1 p1 and 2 p2 are the marginal production shares of ruling coalitions in countries 1 and 2. Condition (3.13) means that if the initial border between the countries is on the left from the absciss of point C in Figs. 3.1 and 3.2 then the countries will move the border to C. Condition (3.14) means that if the border is on the right of C country 1 will not buy any land. In order to solve the same problem when country 2 buys territory from country 1 we need only to change indexes in (3.13) and (3.14) p2 - 1 p p2 - 1 p
1 1

=0 0

for S2 > 0, for S2 = 0.

(3.15) (3.16)

Condition (3.15) means that if the initial border between countries is on the right from the absciss of point B in Figs. 3.1 and 3.2, then the countries will move the border to B. Condition (3.16) means that if the border is on the right of B country 2 will not buy any land. From the solutions (3.13)­(3.16) we can conclude that if the initial border position is on the left from point B, the only option is for country 2 to sell its lend to country 1, so that new border is at point C, and vice versa, if the initial border position is on the right from point C, the only option is for country 1 to sell its lend to country 2, so that new border is at point B. The only question left is at which point, B or C, countries will move their border if its initial position is between B and C. The answer, which country to be the buyer and which the seller, depends on the particular bargaining conditions. But we can assume, as before, that governments maximize the total revenue from the trade. Then


86 p1

On the relation of country size to political system and wealth inequality

p2

2 p 1 p
1

2

A B f
BC 1

+ 2 f

BC 2

C

1

S1

S

2

2

Figure 3.2: Country 1 buys territory from country 2 moving the border from B B point B to point C. Total trade revenue is the gray area f1 C + 2 f2 C . the border near or at point B will be moved to point C, and vice versa, the border near or at point C will be moved to point B, which creates a cycle in generations. For example, when country 1 is the buyer, moving border from B B B to C, the total revenue is f1 C + 2 f2 C , which is the gray area in Fig. 3.2, where we denote fiBC = fi (SC ) - fi (SiB ) according to (3.4). In the next life i period the situation is the opposite, and new generations of citizens move the border from C to B. Thus, we get a cyclical solution which is a sequence of bargaining results on the border position (of events 2, section 3.2.1), one in each generation's life time. Generally speaking, this solution is asymptotically stable in Lyapunov's sense (Liapounoff, 1907). Definition 1 Let x( ) be the solution and x( ) be the perturbed solution, where ~ is the time. Solution x( ) is (asymptotically) stable if for every > 0 there exists = ( ) > 0 such that |x(0) - x(0)| < is sufficient for |x( ) - x( )| < ~ ~ for all > 0 (and lim x( ) = x( )). ~


The only case when solution could be unstable is when the initial border position is at the threshold, where countries can not decide which country should sell or buy, because both options give equal total trade revenue. Let us introduce average size of the the country i Si = SiB + SC i , 2 (3.17)


3.3. Fair land trades

87

where SiB is the size of country when the border is at point B, while SC is that at i C. We can conclude from Figs. 3.1 and 3.2, that minimal territory of country i positively depends on its parameter i , while maximal size of the country does not depend on i
B B d SC d SC d S2 d S1 2 1 > 0, > 0, = 0, = 0. (3.18) d 1 d 2 d 1 d 2 Hence, the average size of country i positively depends on parameter i , i.e.

d Si > 0. d i

(3.19)

It means that, caeteris paribus, Monarchy in average is bigger than Oligarchy, which is, in tern, bigger than Democracy. The greater the inequality in the Democracy, the smaller its territory could be.

3.3.1 Welfare analysis
Even without knowledge about exact values of product transfers ti we can say something about the dependance between social welfare of the country and its type of government. Since agents have linear utilities the utilitarian social welfare function can be evaluated as production plus product transfer Wi = fi (Si ) + ti . Let us introduce the average social welfare of country i as Wi = fi (SiB ) + tiB + fi (SC ) + tiC i , 2 (3.20)

where SiB and tiB are the territory and product transfers of country i, if the border is at point B; while SC and tiC are those at C in Fig. 3.2. Thus, regardless of the i governments' bargaining powers, we can write the lower bounds of countries' welfares B W1 f1 (S1 ), W2 f2 (SC ). (3.21) 2 The global average social welfare can be calculated precisely from (3.20), due to the payment balance (3.3): W1 + W2 =

i=1



2

fi (SiB ) + fi (SC ) i . 2

(3.22)

If we differentiate expression (3.22) with respect to parameters i , then using conditions (3.1) and (3.18), we get that the global average social welfare (3.22) positively depends on both parameters 1 and 2 , i.e. d W1 + W2 > 0 for all i = 1, 2. d i (3.23)


88 p1

On the relation of country size to political system and wealth inequality

p2

A

1

S1

S2

2

Figure 3.3: Functions p1 and p2 of territories S1 and S2 are the marginal productivities of territory of countries 1 and 2. Thus, we can get the unique global social optimum for the corner solution 1 = 2 = 1, which implies that each country is Monarchy or has unanimous voting rule. So the counties locate the border at point A.

3.3.2

Two Monarchies choose the stable social optimum

When two Monarchies trade territory the buyer have to compensate all domestic production loss to the seller since 1 = 2 = 1. That is why maximization problems (3.6)­(3.7) and (3.9)­(3.10) coincide: f1 + f s.t. f1 + f
2 2



S1 =-S2

max ,

(3.24) (3.25)

0,

and have the same unique solution p1 = p2 , positioning the border at point A in Fig. 3.3. This territory allocation is a global social optimum as it actually maximizes the sum of production in both countries
A S1 = arg max S1 =-S2

f1 (S1 + S1 ) + f2 (S2 + S2 ) = arg max f1 + f2 .
S1 =-S
2

That is why this allocation is Pareto efficient. We note that the territory allocation of two Monarchies is a steady state equilibrium which means that if time period starts with such allocation the Monarchies will not change it. This steady state solution is also asymptotically stable in Lyapunov's sense; see definition 1, which means that if period starts with slightly different allocation then it converges to that with border at point A.


3.4. Effect of possibility to cheat and conquest

89

3.4 Effect of possibility to cheat and conquest
Let us add to the model the possibilities of not committing the obligation on a contract (cheating) by the seller and conquest of the territory by the buyer. The conquest is a two step sequential game, see Fig. 3.4. In the first step country a 1 (conqueror) instead of paying for the territory, pays -t1 0 for hiring the army to conquer a part of neighbor's territory. In the second step, if the country a under attack does not pay equal or greater amount of product -t2 to the army to defend its territory, then the territory goes to the conqueror. We recall that private good is produced twice in the time period: in the beginning and in the end (see section 3.2.1). We assume that between these two productions (event 2 in section 3.2.1) countries play the game in Fig. 3.5, which includes two conquest subgames, see Fig. 3.4. Thus, before the game each country is endowed with the good from the first production. Only that amount of good can be used for land purchasing, because no one would rely on the promise to pay in the end of the period if cheating is allowed. Hence, there is the initial product constraint fi (Si ) + tiA 0, (3.26)

which matters only for the conqueror as will be shown later on. Conquest looks like country 1 buys land from Monarchy (see Fig. 3.1), having budget constraint (3.26). The only difference is that the seller does not get the payment { } a A -t1 = min - f2 , f1 (S1 ) , (3.27) which goes to the Army. The Army could be treated as the third player, which can not gain utility from the land itself but only conquers territory for the country which pays the highest price. In the conquest game in Fig. 3.4, country 2 will wish to defend its territory a a a S2 with minimal military expanses -t2 = -t1 if conqueror pays -t1 < - f2 , a = 0. otherwise country 2 retreats with t2 Proposition 1 Initial production fi (Si ) of country i is greater than loss - fi > 0 from any territory decrease Si [-Si , 0), i.e. fi (Si ) > - fi . Proof I t follows from definition (3.4) of symbol fi and the first property of function fi (Si ) in (3.1). Corollary 1 In order to conquer the territory S1 country 1 should transfer a t1 = f2


90

On the relation of country size to political system and wealth inequality

defend

aa t1 , t2

1

a t1

2

retreat
a f1 +t1 , f 2

Figure 3.4: Conquest game. Payoffs of countries are sums of payoffs of all its a citizens. Conqueror chooses military transfer t1 < 0. Attacked country defends a a a a spending t2 = t1 < 0 if t1 > f2 , or retreats spending t2 = 0 if t1 f2 . That a = f . is why in SPNE conqueror chooses t1 2 Proof I t follows from proposition 1, that country 2 has enough initial endowa ment f2 (S2 ) to defend its territory, which it wishes to do only if -t1 < - f2 . A = - f is the minimal payment for Hence, inequality (3.26) is strict and -t1 2 conquest. Using corollary 2, we can write the maximization problem of the conqueror in the game Fig. 3.4 f1 + f s.t. f1 (S1 ) + f
2 2



S1 =-S2 0

max

,

(3.28) (3.29)

0,

which differs from (3.24)­(3.25) only by constraint (3.29). Constraint (3.29) can be active. To specify the Subgame Perfect Nesh Equilibrium (SPNE) of the conquest game in Fig. 3.4 in cases of active and inactive constraint (3.29) we introduce the following definition. Definition 2 We will say that country 1 is rich if its initial production is greater than total loss of production in country 2 after moving the border to point A, i.e. the following inequality is satisfied
A f1 (S1 ) > - f2 .

(3.30)

Thus, country 1 is not rich if inequality (3.30) is not satisfied.


3.4. Effect of possibility to cheat and conquest

91

t1 + f1 , t2 +2 f fulfil buy 1 conquer 2 cheat 1
a a f 1 + f 2 , 2 f a 2

2

stay

t1 , t2

conquer
a a t1 + f1 + f2 , t2 +2 f a 2

Figure 3.5: The game territory exchange is played each time period by new generation. Payoffs of country 1 are sums of payoffs of all its citizens. Payoffs of country 2 are sums of payoffs of its ruling coalition members. The SPNE of the game in Fig. 3.4 consists of the payments to the army a (3.27), t2 = 0, and territory changes Si which can be found in both cases of rich and not rich country 1.
A 1. If country 1 is rich (see definition (2)), then it conquers S1 . ConA A A queror's profit is f1 + f2 0 while the profit of country 2 is f2 0 A 0. and profit of its ruling coalition is 2 f2

2. If country 1 is not rich, then inequality (3.30) is not satisfied, which a A implies that country 1 conquers such amount of territory S1 S1 that f1 (S1 ) = - f2 . We will analyze the game in Fig. 3.5 for both internal solution for rich conqueror and corner solution for not rich one. If the government of country 1 wants to buy land S1 , then it proposes to its citizens a project of purchase, for which amount of product -t1 is collected from the citizens who will benefit from the territory increase. It means that everybody, who have strictly positive share j 1 of production, would agree to pay not more than j 1 f1 and, hence, would vote for the project. Then the government of country 1 gives product t2 = -t1 min {- f1 (S1 ), f1 } to the ruling coalition of country 2. The ruling coalition of country 2 receives the amount of product t2 and can execute its duty, giving required territory -S2 to the buyer, or it can cheat, refusing to give the territory. In the first case the profit of the seller's ruling coalition is t2 + 2 f2 , profit of the buyer is t1 + f1 , where transfers are balanced according to (3.3), and the game is finished. In the case of cheating, the profit of seller is t2 , while the profit of buyer is t1 < 0. But the buyer has the


92

On the relation of country size to political system and wealth inequality

a a possibility to conquer part S1 of the disputed territory, paying for this f2 as much as the seller maximally can pay to keep that territory. In that case, being rational, the seller country retreats without fight, and its ruling coalition have a a a profit t2 + 2 f2 while the profit of the buyer is t1 + f1 + f2 , and the game is finished.

Proposition 2 Country 1 buys only such amount of land S1 , which it would be able to conquer in the case of cheating by county 2. Country 1 buys land when it surely is not going to be cheated, because otherwise it would conquer it in the first place. If county 1 buys more than it could conquer after has been cheated, then it gives country 2 incentives to cheat, because country 2 would have more land rest after conquest than after executing its duties. Corollary 2 Country 1 buys land of amount S1 , not exceeding the amount A S1 , that locate the border at point A, like it would trade with Monarchy. Since point A is the optimal position of the border for unconstrained conquest A (3.28) country 1 would never conquer more than S1 after being cheated. It A follows from proposition 2 that country 1 also do not buy more then S1 . It follows from corollary 2, that maximization problem for buying the lend f1 + t1 s.t. f1 (S1 ) + f2 + t1
S1 0, t1 0

max

,

(3.31) (3.32)

0,

has more restrictive budget constraint (3.32) than (3.7). If both countries prefer to trade the bargaining problem is composed by (3.31)­(3.32) and (3.9)­(3.10) with conquest (3.28)­(3.29) as a disagreement point. 1. When country 1 is not rich (see definition (2)) then inequality (3.32) a becomes an equality. Hence, S1 is also a solution of maximization a a problem (3.31)­(3.32), which yields profit f1 + f2 + t1 , where t1 0. Country 1 chooses between conquest and purchase, comparing their a a a a profits f1 + f2 and f1 + f2 + t1 , therefore, it would buy only for free, i.e. t1 = 0, which is the disagreement point for seller country 2. Thus, not rich country conquers territory rather then buys it.
A 2. When country 1 is rich (inequality (3.30) is satisfied) it buys S1 acA cording to corollary 2, with payment t2 = -t1 f1 (S1 ) + f2 .


3.5. Effect of additional costs

93

These strategies are SPNE of the game in Fig. 3.5 which can be described in terms introduced in (H. I. Grossman & Mendoza, 2001a, 2001b). We distinguish the type of annexation according to the values of t2 .
A · We can say that if t2 + 2 f2 0, then it is an Uncoerced Annexation, because the ruling coalition of country 2 is better off in the end of the period. A · If t2 + 2 f2 < 0 while t2 > 0, then it is a Coerced Annexation, because the ruling coalition of country 2 is worse off after the trade. Which means that country 1 has threatened country 2 with possibility of conquest. Though, we have not any additional costs introduced for Coerced Annexation, like the cost of deploying Legions in (H. I. Grossman & Mendoza, 2001a, 2001b).

· If the countries have nothing to bargain about, because country 1 is not a rich, then it simply conquers the territory S1 , which we could call an Attempted Conquest, where t2 = 0. Thus, not rich yet more efficient country conquers territory generation by generation until it becomes rich enough to buy territory of the globally optimal size SiA . The main result of this section is that the possibility to cheat and conquest completely removes the dependence of the country size on its type of government. More then that, this possibility results in the Pareto efficient and asymptotically stable steady state territory allocation like that between two Monarchies with border at point A in Fig. 3.1. Another interesting result is that "not rich" countries expand by concurring territory. While "rich" countries buy territory, because other countries would not dare to concur it. That could be the reason why Alaska was sold to the United States rather than absorbed in North-West Territories of future Canada. Although Russian-American Company, that was a monopoly in Alaska, was evaluated 3,721,400 silver rubles, see (Okun, 1939), while Alaska was sold for approximately 11 million silver rubles. Thus, clearly we have a case of Uncoerced Annexation.

3.5 Effect of additional costs
Until now we believed that the there is no costs for adaptation of new territory or moving the border. Let us assume that there are adaptation cost d 0 per


94 p1

On the relation of country size to political system and wealth inequality

p2

2 p 1 p
1

2

B

A1 ~ B

A

A

1

~ C

C

1

S1

S

2

2

Figure 3.6: Country 1 buys territory from country 2 moving the border from ~ ~ ~ B~ B~ B~ point B to point C. Total trade revenue f1 C + 2 f2 C - c - d S1 C is the gray area minus constant c. unit of traded territory and constant cost c 0 of rebuilding the border, then product balance equation (3.3) takes the following form: t1 + t2 = -c - d |S1 | . Hence, maximization problem (3.11)­(3.12) also changes: f1 + 2 f2 - c - d |S1 | s.t. f1 + 2 f2 - c - d |S1 |
S1 =-S2 0

(3.33)

max

,

(3.34) (3.35)

0,

The first order condition for problem (3.34) takes the following form: p1 - 2 p2 - d = 0 for S1 > 0, p1 - 2 p
2

(3.36) (3.37)

0 for S1 = 0.

We see that only per unit adaptation cost d influences the first order condition ~ ~ (3.36)­(3.37), and hence, moves points B and C to B and C, see Fig. (3.6), while constant c in the maximization constraint (3.12) determines whether trade will happen or not. Thus, oscillatory solutions can occur only if both of the following inequalities are satisfied ( ) ~ ~ ~ ~ B~ B~ B 0, (3.38) f1 C + 2 f2 C - c - d SC - S1 1 ( ) ~ ~ ~ ~ C~ C~ B f2 B + 1 f1 B - c - d S2 - SC 0. (3.39) 2


3.6. Conclusion

95

It is easily seen form Def. 1, that if inequalities (3.38) and (3.39) are strict for all points of a solution, then the oscillatory solution is asymptotically stable. Oscillatory solutions could coexist with multiple steady state solutions, these are all such points Si for which the following inequalities are satisfied: ( ) ~ ~ ~ C C f1 + 2 f2 - c - d SC - S1 < 0, (3.40) 1 ( ) ~ ~ ~ B B B < 0. (3.41) f2 + 1 f1 - c - d S2 - S2 These steady state solutions are stable but not asymptotically. Here it means that if we change the initial point a little (so that inequalities (3.40) and (3.41) are still satisfied), then the perturbed solution neither goes further away from the unperturbed solution, nor converges back to it. If cheating and conquest are allowed, then the same game theoretic reasoning, as in the previous section, gives us the existence of a steady state points Si , which should satisfy the following inequality:
A A f1 + f2 - c - d SiA - Si ~ ~ ~

< 0.

(3.42)

It means that all points between A1 and A2 are stable steady state solutions. We can conclude that additional linear costs reduce the amplitude of territory oscillations. Hence, they decrease dependence between size of the country and its type of government. Such costs can also cause the appearance of multiple stable steady state positions of the border.

3.6 Conclusion
In this work the mechanism governing the territory exchanging between groups of people such as countries is proposed, based on trading with approval of both sides under particular voting rule (veto rule, majority rule,...). We have obtained a cyclical in time asymptotically stable solution, which is the sequence of bargaining results on the border position, one in each generation life time. We also have found that the average size and social welfare of the Monarchy are bigger than those of Oligarchy, which in turn bigger than those of Democracy. The greater inequality in a Democracy, the smaller its territory could be. Possibility to cheat and conquest completely removes the dependence of the country size on its type of government. More than that, this possibility results in the Pareto efficient and asymptotically stable steady state territory


96

On the relation of country size to political system and wealth inequality

allocation, like one between two Monarchies or countries with veto voting rules (when decisions can be made only unanimously). It was found that only rich countries, that produce more product than sellers could lose, Definition 2, can afford to buy territory rather then conquer it. Monarchy sells its territory only when it is to difficult to protect, for instance, because of the high distances, like in the case of Alaska, that would have been difficult to defend form British Empire. We may say, that it was the US, that was rich enough to be capable of defending its new territory. Additional linear costs reduce the amplitude of territory oscillations. Hence, they decrease the dependence of the size of a country on its form of government. Such costs can also cause the appearance of multiple stable steady state positions of the border.


References
Aghion, P., & Howitt, P. (1992). A model of growth through creative destruction. Econometrica, 60(2), 323­351. Aghion, P., & Howitt, P. (1996). Research and development in the growth process. Journal of Economic Growth, 1(1), 49-73. Aghion, P., & Howitt, P. (1998). Endogenous growth theory. Cambridge, MA: MIT Press. Alesina, A., & Spolaore, E. (1997). On the number and size of nations. The Quarterly Journal of Economics, 112(4), 1027­1056. ¨ Aubin, J.-P. (1991). Viability theory. Boston, Basel, Berlin: Birkhauser. Aubin, J.-P., & Frankowska, H. (1990). Set-valued analysis. Boston, Basel, ¨ Berlin: Birkhauser. Barro, R., & Martin, X. Sala-i. (2004). Economic growth. Cambridge, MA: MIT Press. Belyakov, A. O. (2007a). About mathematical description of the people's association development processes (ethnoses, firms staff etc.). Economics and Mathematical Methods, 43(2), 118­122. Belyakov, A. O. (2007b). On the dynamics of people's unions (Working Paper No. BSP/2007/094E). New Economic School. Available from http://www.nes.ru/dataupload/files/programs /econ/preprints/2007/Belyakov eng.pdf Belyakov, A. O. (2009, June). On the relation of country size to the form of government. In Ecore summer school on market evolution and public decision. Brussels. Available from http://www.ecares.org/ecaresdocuments/ws /summerschool2009/papers/belyakov.pdf Belyakov, A. O., Davydov, A., & Veliov, V. M. (2013). Optimal cyclic exploitation of renewable resources [Research Report]. (2013-07). Available from http://orcos.tuwien.ac.at/fileadmin/t/orcos /Research Reports/2013-07b.pdf Belyakov, A. O., Feichtinger, G., & Simon, C. (2011). General 97


98

On the relation of country size to political system and wealth inequality

equilibrium model with horizontal innovations and heterogeneous products (Research Report No. 2011-13). Operations Research and Control Systems, Institute of Mathematical Methods in Economics, Vienna University of Technology. Available from http://orcos.tuwien.ac.at/fileadmin/t/orcos /Research Reports/2011-13.pdf Belyakov, A. O., Haunschmied, J. L., & Veliov, V. M. (2012). General equilibrium model with horizontal innovations and heterogeneous products (Research Report No. 2012-01). Operations Research and Control Systems, Institute of Mathematical Methods in Economics, Vienna University of Technology. Available from http://orcos.tuwien.ac.at/fileadmin/t/orcos /Research Reports/2012-01 Bely-VV-HS.pdf Belyakov, A. O., Tsachev, T., & Veliov, V. M. (2011). Optimal control of heterogeneous systems with endogenous domain of heterogeneity. Applied Mathematics & Optimization, 64(2), 287­311. Belyakov, A. O., & Veliov, V. M. (2014). Constant versus periodic fishing: age structured optimal control approach [Research Report]. (2014-01). Available from http://orcos.tuwien.ac.at/fileadmin/t/orcos /Research Reports/2014-01.pdf Bewley, T. F. (1981). A critique of tiebout's theory of local public expenditures. Econometrica, 49(3), 713­740. Blanchard, O. (1985). Debt, deficits and finate horizons. Journal of Political Economy, 93, 223­247. Bolton, P., & Roland, G. (1997). The breakup of nations: A political economy analysis. The Quarterly Journal of Economics, 112(4), 1057­1090. Boucekkine, R., de la Croix, D., & Licandro, O. (2002). Vintage human capital, demographic trends, and endogenous growth. Journal of Economic Theory, 104(2), 340 - 375. Boucekkine, R., R´ F. del, & Licandro, O. (2005). Obsolescence and modio, ernization in the growth process. Journal of Development Economics, 77 (1), 153­171. Cass, D., & Yaari, M. E. (1967). Individual saving, aggregate capital accumulation, and efficient growth. In K. Shell (Ed.), Essays on the theory of optimal economic growth (p. 233-268). Cambridge, MA: MIT. Corduneanu, C. (1971). Principles of differential and integral equations. Boston: Allyn and Bacon. Cremer, H., Kerchove, A.-M. D., & Thisse, J.-F. (1985). An economic theory


3.6. Conclusion

99

of public facilities in space. Mathematical Social Sciences, 9(3), 249­ 262. ´ d'Albis, H., & Augeraud-Veron, E. (2011). Continuous-time overlapping generations models. In R. Boucekkine, N. Hritonenko, & Y. Yatsenko (Eds.), Optimal control of age-structured population in economy, demography, and the environment. routledge explorations in environmental economics (pp. 45­69). Taylor and Francis. Diamond, P. A. (1965). National debt in a neoclassical growth model. American Economic Review, 55, 1126­1150. Dinopoulos, E., & Thompson, P. (1998). Schumpeterian growth without scale effects. Journal of Economic Growth, 3(4), 313­335. Dixit, A. K., & Stiglitz, J. E. (1977). Monopolistic competition and optimum product diversity. American Economic Review, 67 (3), 297­308. Faruqee, H. (2003). Debt, deficits, and age-specific mortality. Review of Economic Dynamics, 6(2), 300 - 312. Gale, D. (1973). Pure exchange equilibrium of dynamic economic models. Journal of Economic Theory, 6, 12­36. Greenberg, J., & Weber, S. (1986). Strong tiebout equilibrium under restricted preferences domain. Journal of Economic Theory, 38(1), 101­117. Grossman, G. M., & Helpman, E. (1989). Product development and international trade. Journal of Political Economy, 97 (6), 1261­1283. Grossman, G. M., & Helpman, E. (1991a). Innovation and growth in the global economy. MIT Press. Grossman, G. M., & Helpman, E. (1991b). Innovation and growth in the global economy. Cambridge, MA: MIT Press. Grossman, H. I., & Mendoza, J. (2001a, February). Annexation or conquest? the economics of empire building (Working Paper No. 8109). National Bureau of Economic Research. Available from http://www.nber.org/papers/w8109 Grossman, H. I., & Mendoza, J. (2001b). Butter and guns: Complementarity between economic and military competition. Economics of Governance, 2(1), 25­33. Haimanko, O., Breton, M. L., & Weber, S. (2004). Voluntary formation of communities for the provision of public projects. Journal of Economic Theory, 115(1), 1­34. Jones, C. I. (1976). Product selection, fixed costs, and monopolistic competition. Review of economic studies, 43(2), 217­235. Jones, C. I. (1995a). R&d-based models of economic growth. Journal of Political Economy, 103(4), 759­784.


100

On the relation of country size to political system and wealth inequality

Jones, C. I. (1995b). Time-series tests of endogenous growth models. Quarterly Journal of Economics, 110, 495­525. Jones, C. I. (1999). Growth: With or without scale effects? American Economic Review, 89, 139­144. Judd, K. L. (1985). On the performance of patents. Econometrica, 53, 567­ 585. Kortum, S. S. (1997). Research, patenting, and technological change. Econometrica, 65(6), 1389-1419. ` ´´ ´ Liapounoff, A. (1907). Probleme general de la stabilite du mouvement. Annales de la faculte des scienses de Toulouse Ser. 2, 9, 203­474. ´ ´ Okun, S. B. (1939). Russian-american company. Moscow - Leningrad: SotsEkGiz. Romer, P. M. (1990). Endogenous technological change. Journal of Political Economy, 98, 71­102. Samuelson, P. A. (1958). An exact consumption-loan model of interest with or without the social contrivance of money. Journal of Political Economy, 66, 467­482. Segerstrom, P. S. (1998). Endogenous growth without scale effects. American Economic Review, 88(5), 1290­1310. Simon, C., Belyakov, A. O., & Feichtinger, G. (2012). Minimizing the dependency ratio in a population with below-replacement fertility through immigration. Theoretical Population Biology, 82(3), 158-169. Skritek, B., Tsachev, T., & Veliov, V. M. (2014). Optimality conditions and the hamiltonian for a distributed optimal control problem on controlled domain. Applied Mathematics & Optimization, 1-24. Sorger, G. (2011). Horizontal innovations with endogenous quality choice. Economica, 78(312), 697­722. Tiebout, C. M. (1956). Voluntary formation of communities for the provision of public projects. Journal of Political Economy, 64(5), 416­424. Vasquez, J., & Henehan, M. T. (2001). Territorial disputes and the probability of war, 1816-1992. Journal of Peace Research, 38(2), 123-138. Veliov, V. M. (2008). Optimal control of heterogeneous systems: Basic theory. Journal of Mathematical Analysis and Applications, 346(1), 227 - 242. Yaari, M. E. (1965). Uncertain lifetime, life insurance and the theory of the consumer. The Review of Economic Studies, 70, 83­101.