×

Dynamic pricing in high-dimensions. (English) Zbl 1484.91202

Summary: We study the pricing problem faced by a firm that sells a large number of products, described via a wide range of features, to customers that arrive over time. Customers independently make purchasing decisions according to a general choice model that includes products features and customers’ characteristics, encoded as \(d\)-dimensional numerical vectors, as well as the price offered. The parameters of the choice model are a priori unknown to the firm, but can be learned as the (binary-valued) sales data accrues over time. The firm’s objective is to maximize its revenue. We benchmark the performance using the classic regret minimization framework where the regret is defined as the expected revenue loss against a clairvoyant policy that knows the parameters of the choice model in advance, and always offers the revenue-maximizing price. This setting is motivated in part by the prevalence of online marketplaces that allow for real-time pricing.
We assume a structured choice model, parameters of which dependon \(s_0\) out of the \(d\) product features. Assuming that the market noise distribution is known, we propose a dynamic policy, called regularized maximum likelihood pricing (RMLP) that leverages the (sparsity) structure of the high-dimensional model and obtains a logarithmic regret in \(T\). More specifically, the regret of our algorithm is of \(O(s_0 \log d \cdot \log T)\). Furthermore, we show that no policy can obtain regret better than \(O(s_0 (\log d + \log T))\). {In addition, we propose a generalization of our policy to a setting that the market noise distribution is unknown but belongs to a parametrized family of distributions. This policy obtains regret of \(O(\sqrt{(\log d)T})\). We further show that no policy can obtain regret better than \(\Omega(\sqrt{T})\) in such environments.}

MSC:

91B24 Microeconomic theory (price theory and economic markets)
62H12 Estimation in multivariate analysis
62P05 Applications of statistics to actuarial sciences and financial mathematics

References:

[1] Yasin Abbasi-Yadkori, David Pal, and Csaba Szepesvari. Online-to-confidence-set conversions and application to sparse stochastic bandits. InAISTATS, pages 1-9, 2012.
[2] Shipra Agrawal and Nikhil R. Devanur. Bandits with concave rewards and convex knapsacks. InProceedings of the Fifteenth ACM Conference on Economics and Computation, EC ’14, pages 989-1006, 2014.
[3] Airbnb Documentation. Smart pricing: Set prices based on demand.https://www.airbnb. com/help/article/1168/smart-pricing-set-prices-based-on-demand, 2015.
[4] Kareem Amin, Afshin Rostamizadeh, and Umar Syed. Repeated contextual auctions with strategic buyers. InAdvances in Neural Information Processing Systems, pages 622-630, 2014.
[5] Victor F Araman and Ren´e Caldentey. Dynamic pricing for nonperishable products with demand learning.Operations research, 57(5):1169-1188, 2009. · Zbl 1233.91099
[6] Moshe Babaioff, Shaddin Dughmi, Robert Kleinberg, and Aleksandrs Slivkins. Dynamic pricing with limited supply. InProceedings of the 13th ACM Conference on Electronic Commerce, EC ’12, pages 74-91, 2012. ISBN 978-1-4503-1415-2.
[7] Ashwinkumar Badanidiyuru, Robert Kleinberg, and Aleksandrs Slivkins.Bandits with knapsacks. InFoundations of Computer Science (FOCS), 2013 IEEE 54th Annual Symposium on, pages 207-216. IEEE, 2013. · Zbl 1425.68340
[8] Mark Bagnoli and Ted Bergstrom. Log-concave probability and its applications.Economic theory, 26(2):445-469, 2005. · Zbl 1077.60012
[9] Santiago R Balseiro, Jon Feldman, Vahab Mirrokni, and S Muthukrishnan. Yield optimization of display advertising with ad exchange.Management Science, 60(12):2886-2907, 2014.
[10] Hamsa Bastani and Mohsen Bayati. Online decision-making with high-dimensional covariates. Working Paper, 2016. · Zbl 1445.90042
[11] Omar Besbes and Assaf Zeevi. Dynamic pricing without knowing the demand function: risk bounds and near-optimal algorithms.Operations Research, 57:1407-1420, 2009. · Zbl 1233.90011
[12] Sonia A Bhaskar and Adel Javanmard.1-bit matrix completion under exact low-rank constraint. InInformation Sciences and Systems (CISS), 2015 49th Annual Conference on, pages 1-6. IEEE, 2015.
[13] Stephen Boyd and Lieven Vandenberghe.Convex optimization. Cambridge university press, 2004. · Zbl 1058.90049
[14] Stephen Boyd, Neal Parikh, Eric Chu, Borja Peleato, and Jonathan Eckstein. Distributed optimization and statistical learning via the alternating direction method of multipliers. Foundations and TrendsRin Machine Learning, 3(1):1-122, 2011. · Zbl 1229.90122
[15] Josef Broder and Paat Rusmevichientong. Dynamic pricing under a general parametric choice model.Operations Research, 60(4):965-980, 2012. · Zbl 1260.91094
[16] Peter B¨uhlmann and Sara van de Geer.Statistics for high-dimensional data. SpringerVerlag, 2011. · Zbl 1273.62015
[17] Emmanuel Candes and Terence Tao. The dantzig selector: Statistical estimation when p is much larger than n.The Annals of Statistics, pages 2313-2351, 2007. · Zbl 1139.62019
[18] Xi Chen, Zachary Owen, Clark Pixton, and David Simchi-Levi.A statistical learning approach to personalization in revenue management. Working Paper, 2015.
[19] Wang Chi Cheung, Will Ma, David Simchi-Levi, and Xinshang Wang. Inventory balancing with online learning.Working Paper, 2018.
[20] Maxime C Cohen, Ilan Lobel, and Renato Paes Leme. Feature-based dynamic pricing.ACM Conference on Economics and Computation, 2016.
[21] Varsha Dani, Thomas P Hayes, and Sham M Kakade. Stochastic linear optimization under bandit feedback. InCOLT, pages 355-366, 2008.
[22] A. V. den Boer and A. P. Zwart. Mean square convergence rates for maximum(quasi) likelihood estimation.Stochastic systems, 4:1 - 29, 2014. ISSN 1946-5238.
[23] Arnoud V den Boer. Dynamic pricing and learning: historical origins, current research, and new directions.Surveys in operations research and management science, 20(1):1-18, 2015.
[24] Arnoud V den Boer and Bert Zwart. Simultaneously learning and optimizing using controlled variance pricing.Management Science, 60(3):770-783, 2013.
[25] Benjamin Edelman, Michael Ostrovsky, and Michael Schwarz. Internet advertising and the generalized second-price auction: Selling billions of dollars worth of keywords.The American economic review, 97(1):242-259, 2007.
[26] Ossama Elshiewy, Daniel Guhl, and Yasemin Boztug.Multinomial logit models in marketing-from fundamentals to state-of-the-art.Marketing ZFP, 39(3):32-49, 2017.
[27] Vivek F Farias, Srikanth Jagabathula, and Devavrat Shah. A nonparametric approach to modeling choice with limited data.Management Science, 59(2):305-322, 2013.
[28] Kris Johnson Ferreira, David Simchi-Levi, and He Wang. Online network revenue management using thompson sampling.Operations research, 2018. · Zbl 1446.90095
[29] Dennis H Gensch and Wilfred W Recker. The multinomial, multiattribute logit choice model.Journal of Marketing Research, pages 124-132, 1979.
[30] Alexander Goldenshluger and Assaf Zeevi. A linear response bandit problem.Stochastic Systems, 3(1):230-261, 2013. · Zbl 1352.91009
[31] Negin Golrezaei, Hamid Nazerzadeh, and Paat Rusmevichientong. Real-time optimization of personalized assortments.Management Science, 60(6):1532-1551, 2014.
[32] Negin Golrezaei, Adel Javanmard, and Vahab Mirrokni. Dynamic incentive-aware learning: Robust pricing in contextual auctions.http://dx.doi.org/10.2139/ssrn.3144034, 2018. · Zbl 1470.91128
[33] Peter M Guadagni and John DC Little. A logit model of brand choice calibrated on scanner data.Marketing Science, 27(1):29-48, 2008.
[34] J Michael Harrison, Bora Keskin, and Assaf Zeevi. Bayesian dynamic pricing policies: Learning and earning under a binary prior distribution.Management Science, 58(3): 570-586, 2012.
[35] John R Hauser and Frank S Koppelman. Alternative perceptual mapping techniques: Relative accuracy and usefulness.Journal of marketing Research, pages 495-506, 1979.
[36] Adel Javanmard. Perishability of data: Dynamic pricing under varying-coefficient models.Journal of Machine Learning Research, 18(53):1-31, 2017. URLhttp://jmlr.org/ papers/v18/17-061.html. · Zbl 1440.62386
[37] Nathan Kallus and Madeleine Udell. Dynamic assortment personalization in high dimensions. Working Paper, 2016. · Zbl 1451.90077
[38] Godfrey Keller and Sven Rady. Optimal experimentation in a changing environment.The review of economic studies, 66(3):475-507, 1999. · Zbl 0946.91049
[39] Bora Keskin.Optimal dynamic pricing with demand model uncertainty: A squaredcoefficient-of-variation rule for learning and earning. Working Paper, 2014.
[40] Bora Keskin and Assaf Zeevi. Dynamic pricing with an unknown demand model: Asymptotically optimal semi-myopic policies.Operations Research, 62(5):1142-1167, 2014. · Zbl 1368.91103
[41] N Bora Keskin and Assaf Zeevi. Chasing demand: Learning and earning in a changing environment.Mathematics of Operations Research, 2016. · Zbl 1364.93886
[42] Robert Kleinberg and Tom Leighton. The value of knowing a demand curve: Bounds on regret for online posted-price auctions. InProceedings of 44th Annual IEEE Symposium on Foundations of Computer Science, pages 594-605. IEEE, 2003.
[43] S´ebastien Lahaie and David M Pennock. Revenue analysis of a family of ranking rules for keyword auctions. InProceedings of the 8th ACM conference on Electronic commerce, pages 50-56. ACM, 2007.
[44] Jordan J Louviere and George Woodworth. Design and analysis of simulated consumer choice or allocation experiments: an approach based on aggregate data.Journal of marketing research, pages 350-367, 1983.
[45] Daniel McFadden. Economic choices.American economic review, 91(3):351-378, 2001.
[46] Roger B. Myerson. Optimal auction design.Mathematics of Operations Research, 6(1): 58-73, 1981. · Zbl 0496.90099
[47] Yaniv Plan and Roman Vershynin. One-bit compressed sensing by linear programming. Communications on Pure and Applied Mathematics, 66(8):1275-1297, 2013. · Zbl 1335.94018
[48] Stanislav P Ponomarev. Submersions and preimages of sets of measure zero.Siberian Mathematical Journal, 28(1):153-163, 1987. · Zbl 0623.28002
[49] Sheng Qiang and Mohsen Bayati. Dynamic pricing with demand covariates. Working Paper, 2016.
[50] Garvesh Raskutti, Martin J Wainwright, and Bin Yu. Minimax rates of estimation for highdimensional linear regression over-balls.IEEE Transactions on Information Theory, 57 (10):6976-6994, 2011. · Zbl 1365.62276
[51] Michael Rothschild. A two-armed bandit theory of market pricing.Journal of Economic Theory, 9(2):185-202, 1974.
[52] Mark Rudelson and Shuheng Zhou. Reconstruction from anisotropic random measurements. IEEE Trans. on Inform. Theory, 59(6):3434-3447, 2013. · Zbl 1364.94158
[53] Kalyan T Talluri and Garrett J Van Ryzin.The theory and practice of revenue management, volume 68. Springer Science & Business Media, 2006. · Zbl 1083.90024
[54] A.B. Tsybakov.Introduction to Nonparametric Estimation. Springer Series in Statistics. Springer New York, 2008. ISBN 9780387790527. URLhttps://books.google.com/ books?id=mwB8rUBsbqoC. · Zbl 1176.62032
[55] Sara A Van de Geer. High-dimensional generalized linear models and the lasso.The Annals of Statistics, pages 614-645, 2008. · Zbl 1138.62323
[56] Roman Vershynin. Introduction to the non-asymptotic analysis of random matrices.arXiv preprint arXiv:1011.3027, 2010.
[57] Zizhuo Wang, Shiming Deng, and Yinyu Ye. Close the gaps: A learning-while-doing algorithm for single-product revenue management problems.Operations Research, 62(2): 318-331, 2014. · Zbl 1302.91100
[58] Baichun Xiao, Wei Yang, and Jun Li. Optimal reserve price for the generalized second-price auction in sponsored search advertising.Journal of Electronic Commerce Research, 10 (3):114, 2009.
[59] Yuhong Yang and Andrew Barron. Information-theoretic determination of minimax rates of convergence.Annals of Statistics, pages 1564-1599, 1999. · Zbl 0978.62008
[60] Bin Yu.
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.