Dynamic air ticket pricing using reinforcement learning method
RAIRO. Operations Research, Tome 56 (2022) no. 4, pp. 2475-2493

This paper studies a dynamic air ticket pricing problem in a strategic and myopic passengers co-existence market. The strategic or myopic passengers can be further divided into high-valuation and low-valuation groups according to how they evaluate their purchases. The strategic passengers have different strategic levels. When the airline sets a ticket price, every passenger makes his or her purchase decision according to his or her type and the strategic level, or might select “wait” or “leave (the market)”. The paper firstly proposes a dynamic pricing algorithm in which the utilities of both the airline and passengers are considered. The reinforcement learning (RL) is employed to deal with the progressive or dynamic decision-making framework, in which the dynamic pricing problem is formulated as a discrete finite Markov decision process (MDP) and the Q-learning is adopted to solve the problem. By using this method, the airline can adaptively decide the ticket price based on passengers strategic behaviors and the time-varying demand. The effects of the passenger type proportion and strategic level are analyzed. The computational results show the higher proportion of strategic passengers is, the smaller price increase the airline can adopt, and the higher proportion of high-valuation strategic passengers is, the larger price increase the airline can put to use under the same strategic level. If the proportion of low-valuation strategic passengers is higher, the price increase should be gentle and step by step when the price increase strategy is adopted. If the airline uses price-cut policy, the adjustment should be small. In addition, the high-valuation passenger mainly affects high-price periods and the low-valuation passenger mainly affects low-price periods. When the proportion of strategic passengers is fixed, the lower the passenger strategic level is, the larger the price slope is. These findings can provide some references for the airline to make more precise and flexible pricing decisions.

DOI : 10.1051/ro/2022103
Classification : 90B22, 90C40
Keywords: Dynamic pricing, strategic behavior, reinforcement learning, Q-learning, Markov decision process (MDP)
@article{RO_2022__56_4_2475_0,
     author = {Gao, Jinmin and Le, Meilong and Fang, Yuan},
     title = {Dynamic air ticket pricing using reinforcement learning method},
     journal = {RAIRO. Operations Research},
     pages = {2475--2493},
     year = {2022},
     publisher = {EDP-Sciences},
     volume = {56},
     number = {4},
     doi = {10.1051/ro/2022103},
     mrnumber = {4458841},
     language = {en},
     url = {https://www.numdam.org/articles/10.1051/ro/2022103/}
}
TY  - JOUR
AU  - Gao, Jinmin
AU  - Le, Meilong
AU  - Fang, Yuan
TI  - Dynamic air ticket pricing using reinforcement learning method
JO  - RAIRO. Operations Research
PY  - 2022
SP  - 2475
EP  - 2493
VL  - 56
IS  - 4
PB  - EDP-Sciences
UR  - https://www.numdam.org/articles/10.1051/ro/2022103/
DO  - 10.1051/ro/2022103
LA  - en
ID  - RO_2022__56_4_2475_0
ER  - 
%0 Journal Article
%A Gao, Jinmin
%A Le, Meilong
%A Fang, Yuan
%T Dynamic air ticket pricing using reinforcement learning method
%J RAIRO. Operations Research
%D 2022
%P 2475-2493
%V 56
%N 4
%I EDP-Sciences
%U https://www.numdam.org/articles/10.1051/ro/2022103/
%R 10.1051/ro/2022103
%G en
%F RO_2022__56_4_2475_0
Gao, Jinmin; Le, Meilong; Fang, Yuan. Dynamic air ticket pricing using reinforcement learning method. RAIRO. Operations Research, Tome 56 (2022) no. 4, pp. 2475-2493. doi: 10.1051/ro/2022103

[1] Z. Bo and L. Meilong, The route efficiency evaluation method based on DEA. Technol. Mark. 26 (2019) 10–12.

[2] C. Campbell, MCMC simulation for modelling airline passenger choice behavior. Dublin City University (2001).

[3] J. Cao, Z. Liu and Y. Wu, Learning Dynamic Pricing Rules for Flight Tickets. In: Knowledge Science, Engineering and Management. Springer Publishing (2020). | DOI

[4] Y. Chen and V. F. Farias, Robust Dynamic Pricing with Strategic Customers. 16th ACM Conference on Economics & Computation, ACM Publishing (2015). | DOI

[5] B. D. Chung, J. Li and T. Yao, Demand learning and dynamic pricing under competition in a state-space framework. IEEE Trans. Eng. Manag. 59 (2012) 240–249. | DOI

[6] A. Collins and L. Thomas, Comparing reinforcement learning approaches for solving game theoretic models: a dynamic airline pricing game example. J. Oper. Res. Soc. 63 (2012) 1165–1173. | DOI

[7] A. Collins and L. Thomas, Learning competitive dynamic airline pricing under different customer models. J. Revenue Pricing Manag. 12 (2013) 416–430. | DOI

[8] J. Correa, R. Montoya and C. Thraves, Contingent Preannounced Pricing Policies with Strategic Consumers. Oper. Res. 64 (2016) 251–272. | MR | DOI

[9] T. S. Cui, Y. Z. Wang and S. Nazarian, Profit maximization algorithms for utility companies in an oligopolistic energy market with dynamic prices and intelligent users. AIMS Energy 4 (2016) 119–135. | DOI

[10] P. Du and Q. Chen, Skimming or penetration: optimal pricing of new fashion products in the present of strategic consumers. Ann. Oper. Res. 257 (2017) 275–295. | MR | DOI

[11] I. Dogan and A. R. Güner, A reinforcement learning approach to competitive ordering and pricing problem. Expert Syst. 32 (2013) 39–48. | DOI

[12] J. Feng, L. Liu and X. Liu, An optimal policy for joint dynamic price and lead-time quotation. Oper. Res. 59 (2011) 1523–1527. | MR | DOI

[13] D. P. Gaver, P. A. W. Lewis and G. S. Shedler, Analysis of exception data in a staging hierarchy. IBM J. Res. Dev. 18 (1974) 423–435. | DOI

[14] J. Gonsch, R. Klein and M. Neugebauer, Dynamic pricing with strategic customers. J. Bus. Econ. 83 (2013) 505–549.

[15] Z. Guan and J. Ren, Optimal dynamic pricing policy in the presence of strategic consumers. Syst. Eng. Theory Pract. 34 (2014) 2018–2024.

[16] B. Josef and R. Paat, Dynamic pricing under a general parametric choice model. Oper. Res. 60 (2012) 965–980. | MR | DOI

[17] M. Khouja and X. Liu, A price adjustment policy for maximizing revenue and countering strategic consumer behavior. Int. J. Prod. Econ. 236 (2021) 108–116. | DOI

[18] M. Kremer, B. Mantin and A. Ovchinnikov, Dynamic pricing in the presence of myopic and strategic consumers: theory and experiment. Prod. Oper. Manag. 26 (2017) 116–133. | DOI

[19] M. Le and M. Lu, Network revenue management with dependent demand under overlapping segments, In: ICNC-FSKD. IEEE Publishing (2018).

[20] Y. Levin, J. Mcgill and M. Nediak, Dynamic pricing in the presence of strategic consumers and oligopolistic competition. Informs 55 (2009) 32–46.

[21] Y. Levin, J. Mcgill and M. Nediak, Optimal dynamic pricing of perishable items by a monopolist facing strategic consumers. Prod. Oper. Manag. 19 (2010) 40–60. | DOI

[22] T. Levina, Y. Levin and J. Mcgill, Dynamic pricing with online learning and strategic consumers: an application of the aggregating algorithm. Oper. Res. 57 (2009) 327–341. | DOI

[23] H. Li, Q. Peng and M. Tan, Dynamic pricing strategy for airline tickets through demand learning with strategic passengers. Oper. Res. Manag. Sci. 27 (2018) 118–125.

[24] C. Li, M. Chu and C. Zhou, Two-period discount pricing strategies for an e-commerce platform with strategic consumers. Comput. Ind. Eng. 147 (2020) 1–13.

[25] Q. Liu and D. Zhang, Dynamic pricing competition with strategic customers under vertical product differentiation. Manag. Sci. 59 (2013) 84–101. | DOI

[26] R. Lu, S. H. Hong and X. Zhang, A Dynamic pricing demand response algorithm for smart grid: Reinforcement learning approach. Appl. Energy 220 (2018) 220–230. | DOI

[27] Q. Ma, F. Meng and X. J. Zeng, Optimal dynamic pricing for smart grid having mixed customers with and without smart meters. J. Mod. Power Syst. Clean Energy 6 (2018) 1244–1254. | DOI

[28] A. J. Mersereau and D. Zhang, Markdown pricing with unknown fraction of strategic customers. Manuf. Serv. Oper. Manag. 14 (2012) 355–370. | DOI

[29] C. V. L. Raju, Y. Narahari and K. Ravikumar, Learning dynamic prices in electronic retail markets with customer segmentation. Ann. Oper. Res. 143 (2006) 59–75. | DOI

[30] R. Rana and F. S. Oliveira, Real-time dynamic pricing in a non-stationary environment using model-free reinforcement learning. Omega 47 (2014) 116–126. | DOI

[31] R. Rana and F. S. Oliveira, Dynamic pricing policies for interdependent perishable products or services using reinforcement learning. Expert Syst. Appl. 42 (2015) 426–436. | DOI

[32] Z. J. M. Shen and X. Su, Customer behavior modeling in revenue management and auctions: a review and new research opportunities. Prod. Oper. Manag. 16 (2010) 713–728. | DOI

[33] X. Su, Intertemporal Pricing with Strategic Customer Behavior. Manag. Sci. 53 (2007) 726–741. | DOI

[34] M. Tan, Optimal Pricing for Tickets with Myopic and Strategic Passengers. Ind. Eng. Manag. 23 (2018) 107–115.

[35] S. Yousefi, M. P. Moghaddam and V. J. Majd, Optimal real time pricing in an agent-based retail market using a comprehensive demand response model. Energy 36 (2011) 5716–5727. | DOI

[36] H. Zeng and Y. Zhang, Intertemporal Pricing of Substitutes under the Coexistence of Myopic and Strategic Consumers. Syst. Eng. 33 (2015) 33–39.

[37] Y. Zhang and J. Zhang, Strategic customer behavior with disappointment aversion customers and two alleviation policies. Int. J. Prod. Econ. 191 (2017) 170–177. | DOI

[38] J. Zhao, J. Qiu and X. Hu, Vertically Differential Product Introduction and Pricing in the Presence of Strategic Consumers. Syst. Eng. Theory Pract. 37 (2017) 3098–3108.

[39] B. Zhu and M. Le, The route efficiency evaluation method based on DEA. Technol. Mark. 26 (2019) 10–12.

Cité par Sources :