This paper studies a dynamic air ticket pricing problem in a strategic and myopic passengers co-existence market. The strategic or myopic passengers can be further divided into high-valuation and low-valuation groups according to how they evaluate their purchases. The strategic passengers have different strategic levels. When the airline sets a ticket price, every passenger makes his or her purchase decision according to his or her type and the strategic level, or might select “wait” or “leave (the market)”. The paper firstly proposes a dynamic pricing algorithm in which the utilities of both the airline and passengers are considered. The reinforcement learning (RL) is employed to deal with the progressive or dynamic decision-making framework, in which the dynamic pricing problem is formulated as a discrete finite Markov decision process (MDP) and the Q-learning is adopted to solve the problem. By using this method, the airline can adaptively decide the ticket price based on passengers strategic behaviors and the time-varying demand. The effects of the passenger type proportion and strategic level are analyzed. The computational results show the higher proportion of strategic passengers is, the smaller price increase the airline can adopt, and the higher proportion of high-valuation strategic passengers is, the larger price increase the airline can put to use under the same strategic level. If the proportion of low-valuation strategic passengers is higher, the price increase should be gentle and step by step when the price increase strategy is adopted. If the airline uses price-cut policy, the adjustment should be small. In addition, the high-valuation passenger mainly affects high-price periods and the low-valuation passenger mainly affects low-price periods. When the proportion of strategic passengers is fixed, the lower the passenger strategic level is, the larger the price slope is. These findings can provide some references for the airline to make more precise and flexible pricing decisions.
Keywords: Dynamic pricing, strategic behavior, reinforcement learning, Q-learning, Markov decision process (MDP)
@article{RO_2022__56_4_2475_0,
author = {Gao, Jinmin and Le, Meilong and Fang, Yuan},
title = {Dynamic air ticket pricing using reinforcement learning method},
journal = {RAIRO. Operations Research},
pages = {2475--2493},
year = {2022},
publisher = {EDP-Sciences},
volume = {56},
number = {4},
doi = {10.1051/ro/2022103},
mrnumber = {4458841},
language = {en},
url = {https://www.numdam.org/articles/10.1051/ro/2022103/}
}
TY - JOUR AU - Gao, Jinmin AU - Le, Meilong AU - Fang, Yuan TI - Dynamic air ticket pricing using reinforcement learning method JO - RAIRO. Operations Research PY - 2022 SP - 2475 EP - 2493 VL - 56 IS - 4 PB - EDP-Sciences UR - https://www.numdam.org/articles/10.1051/ro/2022103/ DO - 10.1051/ro/2022103 LA - en ID - RO_2022__56_4_2475_0 ER -
%0 Journal Article %A Gao, Jinmin %A Le, Meilong %A Fang, Yuan %T Dynamic air ticket pricing using reinforcement learning method %J RAIRO. Operations Research %D 2022 %P 2475-2493 %V 56 %N 4 %I EDP-Sciences %U https://www.numdam.org/articles/10.1051/ro/2022103/ %R 10.1051/ro/2022103 %G en %F RO_2022__56_4_2475_0
Gao, Jinmin; Le, Meilong; Fang, Yuan. Dynamic air ticket pricing using reinforcement learning method. RAIRO. Operations Research, Tome 56 (2022) no. 4, pp. 2475-2493. doi: 10.1051/ro/2022103
[1] and , The route efficiency evaluation method based on DEA. Technol. Mark. 26 (2019) 10–12.
[2] , MCMC simulation for modelling airline passenger choice behavior. Dublin City University (2001).
[3] , and , Learning Dynamic Pricing Rules for Flight Tickets. In: Knowledge Science, Engineering and Management. Springer Publishing (2020). | DOI
[4] and , Robust Dynamic Pricing with Strategic Customers. 16th ACM Conference on Economics & Computation, ACM Publishing (2015). | DOI
[5] , and , Demand learning and dynamic pricing under competition in a state-space framework. IEEE Trans. Eng. Manag. 59 (2012) 240–249. | DOI
[6] and , Comparing reinforcement learning approaches for solving game theoretic models: a dynamic airline pricing game example. J. Oper. Res. Soc. 63 (2012) 1165–1173. | DOI
[7] and , Learning competitive dynamic airline pricing under different customer models. J. Revenue Pricing Manag. 12 (2013) 416–430. | DOI
[8] , and , Contingent Preannounced Pricing Policies with Strategic Consumers. Oper. Res. 64 (2016) 251–272. | MR | DOI
[9] , and , Profit maximization algorithms for utility companies in an oligopolistic energy market with dynamic prices and intelligent users. AIMS Energy 4 (2016) 119–135. | DOI
[10] and , Skimming or penetration: optimal pricing of new fashion products in the present of strategic consumers. Ann. Oper. Res. 257 (2017) 275–295. | MR | DOI
[11] and , A reinforcement learning approach to competitive ordering and pricing problem. Expert Syst. 32 (2013) 39–48. | DOI
[12] , and , An optimal policy for joint dynamic price and lead-time quotation. Oper. Res. 59 (2011) 1523–1527. | MR | DOI
[13] , and , Analysis of exception data in a staging hierarchy. IBM J. Res. Dev. 18 (1974) 423–435. | DOI
[14] , and , Dynamic pricing with strategic customers. J. Bus. Econ. 83 (2013) 505–549.
[15] and , Optimal dynamic pricing policy in the presence of strategic consumers. Syst. Eng. Theory Pract. 34 (2014) 2018–2024.
[16] and , Dynamic pricing under a general parametric choice model. Oper. Res. 60 (2012) 965–980. | MR | DOI
[17] and , A price adjustment policy for maximizing revenue and countering strategic consumer behavior. Int. J. Prod. Econ. 236 (2021) 108–116. | DOI
[18] , and , Dynamic pricing in the presence of myopic and strategic consumers: theory and experiment. Prod. Oper. Manag. 26 (2017) 116–133. | DOI
[19] and , Network revenue management with dependent demand under overlapping segments, In: ICNC-FSKD. IEEE Publishing (2018).
[20] , and , Dynamic pricing in the presence of strategic consumers and oligopolistic competition. Informs 55 (2009) 32–46.
[21] , and , Optimal dynamic pricing of perishable items by a monopolist facing strategic consumers. Prod. Oper. Manag. 19 (2010) 40–60. | DOI
[22] , and , Dynamic pricing with online learning and strategic consumers: an application of the aggregating algorithm. Oper. Res. 57 (2009) 327–341. | DOI
[23] , and , Dynamic pricing strategy for airline tickets through demand learning with strategic passengers. Oper. Res. Manag. Sci. 27 (2018) 118–125.
[24] , and , Two-period discount pricing strategies for an e-commerce platform with strategic consumers. Comput. Ind. Eng. 147 (2020) 1–13.
[25] and , Dynamic pricing competition with strategic customers under vertical product differentiation. Manag. Sci. 59 (2013) 84–101. | DOI
[26] , and , A Dynamic pricing demand response algorithm for smart grid: Reinforcement learning approach. Appl. Energy 220 (2018) 220–230. | DOI
[27] , and , Optimal dynamic pricing for smart grid having mixed customers with and without smart meters. J. Mod. Power Syst. Clean Energy 6 (2018) 1244–1254. | DOI
[28] and , Markdown pricing with unknown fraction of strategic customers. Manuf. Serv. Oper. Manag. 14 (2012) 355–370. | DOI
[29] , and , Learning dynamic prices in electronic retail markets with customer segmentation. Ann. Oper. Res. 143 (2006) 59–75. | DOI
[30] and , Real-time dynamic pricing in a non-stationary environment using model-free reinforcement learning. Omega 47 (2014) 116–126. | DOI
[31] and , Dynamic pricing policies for interdependent perishable products or services using reinforcement learning. Expert Syst. Appl. 42 (2015) 426–436. | DOI
[32] and , Customer behavior modeling in revenue management and auctions: a review and new research opportunities. Prod. Oper. Manag. 16 (2010) 713–728. | DOI
[33] , Intertemporal Pricing with Strategic Customer Behavior. Manag. Sci. 53 (2007) 726–741. | DOI
[34] , Optimal Pricing for Tickets with Myopic and Strategic Passengers. Ind. Eng. Manag. 23 (2018) 107–115.
[35] , and , Optimal real time pricing in an agent-based retail market using a comprehensive demand response model. Energy 36 (2011) 5716–5727. | DOI
[36] and , Intertemporal Pricing of Substitutes under the Coexistence of Myopic and Strategic Consumers. Syst. Eng. 33 (2015) 33–39.
[37] and , Strategic customer behavior with disappointment aversion customers and two alleviation policies. Int. J. Prod. Econ. 191 (2017) 170–177. | DOI
[38] , and , Vertically Differential Product Introduction and Pricing in the Presence of Strategic Consumers. Syst. Eng. Theory Pract. 37 (2017) 3098–3108.
[39] and , The route efficiency evaluation method based on DEA. Technol. Mark. 26 (2019) 10–12.
Cité par Sources :





