Consider a game-theoretic model of pollution control. There are 2 players join in the game, N = {1, 2}. Each player has an industrial production site. It is assumed that the production is proportional to the pollutions $u_i$. Thus, the strategy of a player is to choose the amount of pollutions emited to the atmosphere, $u_i \in [0; b_i]$. In this example the solution will be
considered in the class of open-loop strategies $u_i(t)$.
The dynamics of the total amount of pollution x(t) is described by
$\dot x=u_1+u_2, x(t_0)=x_0.$
The payoff of the i-th player is defined as
$\int_{0}^{T} (b_i −\frac{1}{2}u_i)u_i − d_ix)dt, i = 1, 2.$
We assume that the terminal cost is zero.
Each player is to maximize their total cost, then the optimization problem is as follows:
$\sum_{i=1}^{2}\int_{0}^{T}(b_i-\frac{1}{2}u_i)u_i - d_i x)dt \rightarrow \mathop{\max}\limits_{u_1,u_2 }$
Solution, first write down the Hamiltonion function: $H(x,\psi,u)=\sum_{i=1}^{2}[ (b_i −\frac{1}{2}u_i)u_i − dx+ \psi(u_i)] $
where $\psi$ is the adjoint variable and $d=d_1+d_2$
Taking the first derivative with respect to $u_i$, we get the expressions for the optimal controls:
$u_{i}^{*}=b_i+\psi,i=1,2.$
Since there have no terminal cost in this case, then $\psi(T)=0.$ Combined with $\dot \psi=-\frac{\partial H}{\partial x}=d$, denote $d=d_1+d_2, b=b_1+b_2,$ therefore, $\psi(t)=d(t-T).$ Correspondingly, the optimal control is
$u^{*}(t)=b_i-d(T-t)$, there still have one additional condition,
$d_i \in [0,\frac{min\left \{ b_1,b_2 \right \}}{T}-d]$,
The initial condition $x(0)=x_0$, naturally
$x^{*}(t)= \frac{2d}{2}(t^{2}-t_{0}^{2})+(b-2Td)(t-t_0)+x_0.$
Case 2, Consider a similar pollution control model. There are 2 players join in the game, N = {1, 2}. Each player has an industrial production site. It is assumed that the production is proportional to the pollutions $v_i$. Thus, the strategy of a player is to choose the amount of pollutions emited to the atmosphere, $v_i ∈ [0; b_i]$. In this example the solution will be considered in the class of open-loop strategies $v_i(t)$.
The dynamics of the total amount of pollution x(t) is described by
$\dot x=v_1+v_2-\delta x, x(t_0)=x_0.$
The payoff of the i-th player is defined as
$\int_{0}^{T} (p_i −\frac{1}{2}v_i)v_i − g_ix)dt, i = 1, 2.$
We assume that the terminal cost is zero.
Each player is to maximize their total cost, then the optimization problem is as follows:
$\sum_{i=1}^{2}\int_{0}^{T}(p_i-\frac{1}{2}v_i)v_i - g_i x)dt \rightarrow \mathop{\max}\limits_{v_1,v_2 }$
Similar to above, we use Pontrygin's maximal principle to get
$v_i^{*}(t)=p_i-\frac{g}{\delta}+\frac{g}{\delta}e^{\delta(t-T)},i=1,2$
Here, we assume that for the environment's self cleaning capacity $\delta$ the following condition hold: $\delta\geq \frac{d}{\ b_i}$. Then the optimal cooperative trajectory is
$x^{*}(t)= Ce^{-\delta t} + \frac{nd}{2\delta^2}e^{\delta(t-T)}+\frac{p}{\delta}-\frac{2g}{\delta^2}$
where $C=e^{\delta t_0}(x_0-\frac{p}{\delta}+\frac{2g}{\delta^2}-\frac{g}{\delta^2}e^{\delta(t_0-T)}), p=p_1+p_2, g=g_1+g_2$.
Question: If I combine two different model;
Dynamic system $\dot x=u_1+u_2, x(t_0)=x_0.$
$\dot y=v_1+v_2-\delta y, y(t_0)=y_0$.
Optimization problem as follows:
$\rightarrow \mathop{\max}\limits_{u_1,u_2,v_1,v_2 }\sum_{i=1}^{2}\int_{0}^{T}[(p_i-\frac{1}{2}v_i)v_i - g_i y+(b_i −\frac{1}{2}u_i)u_i − d_i x)]dt $
where $p_i,g_i, b_i, d_i$ are parameters
We assume the there doesn't have terminal cost, and how to solve this kind of problem, previous I guess it is similar to finite Horizon LQ Game, but detail was different, does anyone can be give me some hints and advice?