Controlled Markov chains with non-exponential discounting and distribution-dependent costs
ESAIM: Control, Optimisation and Calculus of Variations, Tome 27 (2021), article no. 5

This paper deals with a controlled Markov chain in continuous time with a non-exponential discounting and distribution-dependent cost functional. A definition of closed-loop equilibrium is given and its existence and uniqueness are established. Due to the time-inconsistency brought by the non-exponential discounting and distribution dependence, it is proved that the equilibrium is locally optimal in some appropriate sense. Moreover, it is shown that our problem is equivalent to a mean-field game for infinite-many symmetric players with a non-exponential discounting cost.

Reçu le :
Accepté le :
Première publication :
Publié le :
DOI : 10.1051/cocv/2021003
Classification : 93E20, 60J27, 90C40
Keywords: time inconsistency, distribution dependence, controlled Markov chain
@article{COCV_2021__27_1_A7_0,
     author = {Mei, Hongwei and Yin, George},
     title = {Controlled {Markov} chains with non-exponential discounting and distribution-dependent costs},
     journal = {ESAIM: Control, Optimisation and Calculus of Variations},
     year = {2021},
     publisher = {EDP-Sciences},
     volume = {27},
     doi = {10.1051/cocv/2021003},
     language = {en},
     url = {https://www.numdam.org/articles/10.1051/cocv/2021003/}
}
TY  - JOUR
AU  - Mei, Hongwei
AU  - Yin, George
TI  - Controlled Markov chains with non-exponential discounting and distribution-dependent costs
JO  - ESAIM: Control, Optimisation and Calculus of Variations
PY  - 2021
VL  - 27
PB  - EDP-Sciences
UR  - https://www.numdam.org/articles/10.1051/cocv/2021003/
DO  - 10.1051/cocv/2021003
LA  - en
ID  - COCV_2021__27_1_A7_0
ER  - 
%0 Journal Article
%A Mei, Hongwei
%A Yin, George
%T Controlled Markov chains with non-exponential discounting and distribution-dependent costs
%J ESAIM: Control, Optimisation and Calculus of Variations
%D 2021
%V 27
%I EDP-Sciences
%U https://www.numdam.org/articles/10.1051/cocv/2021003/
%R 10.1051/cocv/2021003
%G en
%F COCV_2021__27_1_A7_0
Mei, Hongwei; Yin, George. Controlled Markov chains with non-exponential discounting and distribution-dependent costs. ESAIM: Control, Optimisation and Calculus of Variations, Tome 27 (2021), article no. 5. doi: 10.1051/cocv/2021003

[1] T. Björk, M. Khapko and A. Murgoci, On time-inconsistent stochastic control in continuous time. Finance Stoch. 21 (2017) 331–360.

[2] F. Cherbonnier, Optimal insurance for time-inconsistent agents. Preprint (2016).

[3] M. Dodd, Obesity and time-inconsistent preferences. Obes. Res. Clin. Practice 2 (2008) 83–89.

[4] I. Ekeland and T. A. Pirvu, Investment and consumption without commitment. Math. Finan. Econ. 2 (2008) 57–86.

[5] I. Ekeland and A. Lazrak, The golden rule when preferences are time inconsistent. Math. Finan. Econ. 4 (2010) 29–55.

[6] I. Ekeland, O. Mbodji and T. A. Pirvu, Time-consistent portfolio management. SIAM J. Financ. Math. 3 (2012) 1–32.

[7] T. S. Findley and F. N. Caliendo, Short horizons, time inconsistency and optimal social security. Int. Tax & Public Finance 16 (2009) 487–513.

[8] T. S. Findley and J. A. Feigenbaum, Quasi-hyperbolic discounting and the existence of time-inconsistent retirement. Theor. Econ. Lett. 3 (2013) 119–123.

[9] S. L. Green, Time inconsistency, self-control, and remembrance. Faith Econ. 42 (2003) 51–60.

[10] S. R. Grenadier and N. Wang, Investment under uncertainty and time-inconsistent preferences. J. Financ. Econ. 84 (2007) 2–39.

[11] Y. Hu, H. Jin and X. Y. Zhou, Time-inconsistent stochastic linear–quadratic control. SIAM J. Control Optim. 50 (2012) 1548–1572.

[12] Y. Hu, H. Jin and X. Y. Zhou, Time-inconsistent stochastic linear–quadratic control: characterization and uniqueness of equilibrium. SIAM J. Control Optim. 55 (2017) 1261–1279.

[13] M. Huang, R. P. Malhamé and P. E. Caines, Large population stochastic dynamic games: closed-loop McKean-Vlasov systems and the Nash certainty equivalence principle. Commun. Inf. Syst. 6 (2006) 221–252.

[14] M. Huang, P. E. Caines and R. P. Malhamé, Large-population cost-coupled LQG problems with nonuniform agents: individual-mass behavior and decentralized-nash equilibria. IEEE Trans. Autom. Control 52 (2007) 1560–1571.

[15] M. Huang, P. E. Caines and R. P. Malhamé, The Nash Certainty Equivalence Principle and McKean-Vlasov Systems: an Invariance Principle and Entry Adaptation. 46th IEEE Conference on Decision and Control (2007) 121–123.

[16] M. Huang, P. E. Caines and R. P. Malhamé, An invariance principle in large population stochastic dynamic games. J. Syst. Sci. Complex. 20 (2007) 162–172.

[17] J.-M. Lasry and P.-L. Lions, Jeux champ moyen. I. Le cas stationnaire. C.R. Math. Acad. Sci. Paris 343 (2006) 619–625.

[18] J.-M. Lasry and P.-L. Lions, Jeux champ moyen. II. Horizon fini et contrle optimal. C.R. Math. Acad. Sci. Paris 343 (2006) 679–684.

[19] J.-M. Lasry and P.-L. Lions, Large investor trading impacts on volatility. Ann. Inst. Henri Poincaré Anal. Non Linaire 24 (2007) 311–323.

[20] J.-M. Lasry and P.-L. Lions, Mean field games. Jpn. J. Math. 2 (2007) 229–260.

[21] H. Mei and J. Yong, Equilibrium strategies for time-inconsistent stochastic switching systems. ESAIM: COCV 25 (2019) 64.

[22] Y. Ni, J.Zhang and M. Krstic, Time-inconsistent mean-field stochastic LQ problem: open-loop time-consistent control. IEEE Trans. Autom. Control 63 (2018) 2771–2786.

[23] S. L. Nguyen, D. T. Nguyen and G. Yin, A stochastic maximum principle for switching diffusions using conditional mean-fields with applications to control problems. ESAIM: COCV 26 (2020) 69.

[24] S. Nguyen, G. Yin and T. Hoang, On laws of large numbers for systems with mean-field interactions and Markovian switching. Stochastic Process. Appl. 130 (2020) 262–296.

[25] H. Pham and X. Wei, Bellman equation and viscosity solutions for mean-field stochastic control problem. ESAIM: COCV 24 (2018) 437–461.

[26] H. Pham and X. Wei, Dynamic programming for optimal control of Stochastic McKean–Vlasov dynamics. SIAM J. Control Optim. 55 (2017) 1069–1101.

[27] R. A. Pollak, Consistent planning. Rev. Econ. Stud. 35 (1968) 185–199.

[28] H. Wang, J. Sun and J. Yong, Recursive Utility Processes, Dynamic Risk Measures and Quadratic Backward Stochastic VolterraIntegral Equations. Appl. Math. Optim. (2019) 1–46.

[29] H. Wang and J. Yong, Time-inconsistent stochastic optimal control problems and back-ward stochastic volterra integral equations. Preprint (2019). | arXiv

[30] T. Wang, Characterizations of equilibrium controls in time inconsistent mean-field stochastic linear quadratic problems. I. Preprint (2018). | arXiv

[31] J. Wei, Time-inconsistent optimal control problems with regime switching. Math. Control Rel. Fields 7 (2017) 585–622.

[32] Q. Wei, J. Yong and Z. Yu, Time-inconsistent recrusive stochastic optimal control problems. SIAM J. Control Optim. 55 (2017) 4156–4201.

[33] W. Yan and J. Yong, Time-inconsistent optimal control problems and related issues. Modeling, Stochastic Control, Optimization, and Applications, edited by G. Yin., Q. Zhang. In Vol. 164 of The IMA Volumes in Mathematics and its Applications. Springer (2019) 533–569.

[34] G. Yin and Q. Zhang, Continuous-time Markov chains and applications: a singular perturbation approach. Vol. 37. Springer (2012).

[35] J. Yong, Time-inconsistent optimal control problems and the equilibrium HJB equation. Math. Control Relat. Fields 2 (2012) 271–329.

[36] J. Yong, Linear-quadratic optimal control problems for mean-field stochastic differential equations – time-consistent solutions. Trans. AMS 369 (2017) 5467–5523.

Cité par Sources :

The research of the second author was supported in part by the Air Force Office of Scientific Research under grant FA9550-18-1-0268.