Dual Mirror Descent for Online Allocation Problems

Abstract: We consider online allocation problems with concave revenue functions and resource constraints, which are central problems in revenue management and online advertising. In these settings, requests arrive sequentially during a finite horizon and, for each request, a decision maker needs to choose an action that consumes a certain amount of resources and generates revenue. The revenue function and resource consumption of each request are drawn independently and at random from a probability distribution that is unknown to the decision maker. The objective is to maximize cumulative revenues subject to a constraint on the total consumption of resources. We design a general class of algorithms that achieve sub-linear expected regret compared to the hindsight optimal allocation. Our algorithms operate in the Lagrangian dual space: they maintain a dual multiplier for each resource that is updated using online mirror descent. By choosing the reference function accordingly, we recover dual sub-gradient descent and dual exponential weights algorithm. The resulting algorithms are simple, efficient, and shown to attain the optimal order of regret when the length of the horizon and the initial number of resources are scaled proportionally. We discuss applications to online bidding in repeated auctions with budget constraints and online proportional matching with high entropy.

18/07/2021

Deep Learning, Generative Models, Applications, Computer Vision; Applications, Visual Scene Analysis and Interpretation; Deep Learning, Adversarial Network, Algorithms, Online Learning Algorithms

5:15

18/07/2021

Dual Mirror Descent for Online Allocation Problems

Haihao Lu, Santiago Balseiro, Vahab Mirrokni

Comments

Similar Papers

Regularized Online Allocation Problems: Fairness and Beyond

Santiago Balseiro, Haihao Lu, Vahab Mirrokni

Keywords Abstract Paper

Algorithms, Online Learning Algorithms

A Bandit Learning Algorithm and Applications to Auction Design

Keywords Abstract Paper

Online Continuous DR-Submodular Maximization with Long-Term Budget Constraints

Omid Sadeghi, Maryam Fazel

Keywords Abstract Paper

Parallelizing Thompson Sampling

Amin Karbasi, Vahab Mirrokni, Mohammad Shadravan

Keywords Abstract Paper

reinforcement learning and planning, bandits

A Primal-Dual Online Algorithm for Online Matching Problem in Dynamic Environments

Yu-Hang Zhou, Peng Hu, Chen Liang and Huan Xu, Guangda Huzhang, Yinfu Feng, Qing Da, Xinshang Wang, An-Xiang Zeng

Keywords Abstract Paper

Online Market Equilibrium with Application to Fair Division

Yuan Gao, Christian Kroer, Alex Peysakhovich

Keywords Abstract Paper

Joint Online Learning and Decision-making via Dual Mirror Descent

Alfonso Lobos Ruiz, Paul Grigas, Zheng Wen

Keywords Abstract Paper

Deep Learning, Generative Models, Applications, Computer Vision; Applications, Visual Scene Analysis and Interpretation; Deep Learning, Adversarial Network, Algorithms, Online Learning Algorithms

Non-Exponentially Weighted Aggregation: Regret Bounds for Unbounded Loss Functions

Keywords Abstract Paper

Probabilistic Methods, Bayesian Methods

Variational Bayesian Optimistic Sampling

Brendan O'Donoghue, Tor Lattimore

Keywords Abstract Paper

optimization, reinforcement learning and planning, generative model, bandits, online learning

Minimax Regret for Stochastic Shortest Path with Adversarial Costs and Known Transition

Liyu Chen, Haipeng Luo, Chen-Yu Wei

Keywords Abstract Paper

The Online Min-Sum Set Cover Problem

Dimitris Fotakis, Loukas Kavouras, Grigorios Koumoutsos and Stratis Skoulakis, Manolis Vardas

Keywords Abstract Paper

Online Algorithms, Competitive Analysis, Min-Sum Set Cover

Efficient Online Estimation of Causal Effects by Deciding What to Observe

Shantanu Gupta, Zachary Lipton, David Childers

Keywords Abstract Paper

reinforcement learning and planning, graph learning, causality

Learning-to-learn non-convex piecewise-Lipschitz functions

Maria-Florina Balcan, Mikhail Khodak, Dravyansh Sharma, Ameet S Talwalkar

Keywords Abstract Paper

optimization, machine learning, robustness, meta learning, online learning

$\texttt{LeadCache}$: Regret-Optimal Caching in Networks

Debjit Paria, Abhishek Sinha

Keywords Abstract Paper

graph learning, online learning

Policy Optimization as Online Learning with Mediator Feedback

Alberto Maria Metelli, Matteo Papini, Pierluca D'Oro, Marcello Restelli

Keywords Abstract Paper

Online Matching in Sparse Random Graphs: Non-Asymptotic Performances of Greedy Algorithm

Nathan Noiry, Vianney Perchet, Flore Sentenac

Keywords Abstract Paper

Learning to Make Decisions via Submodular Regularization

Ayya Alieva, Aiden Aceves, Jialin Song and Stephen Mayo, Yisong Yue, Yuxin Chen

Keywords Abstract Paper

Dynamic Regret of Convex and Smooth Functions

Peng Zhao, Yu-Jie Zhang, Lijun Zhang, Zhi-Hua Zhou

Keywords Abstract Paper

Distributed Online Optimization over a Heterogeneous Network

Nima Eshraghi, Ben Liang

Keywords Abstract Paper

Finite Regret and Cycles with Fixed Step-Size via Alternating Gradient Descent-Ascent

James P Bailey, Gauthier Gidel, Georgios Piliouras

Keywords Abstract Paper

Economics, game theory, and incentives, Online learning

Online Posted Pricing with Unknown Time-Discounted Valuations

Giulia Romano, Gianluca Tartaglia, Alberto Marchesi, Nicola Gatti

Keywords Abstract Paper

A Game-Theoretic Analysis of the Empirical Revenue Maximization Algorithm with Endogenous Sampling

Xiaotie Deng, Ron Lavi, Tao Lin and Qi Qi, Wenwei WANG, Xiang Yan

Keywords Abstract Paper

Tracking regret bounds for online submodular optimization

Tatsuya Matsuoka, Shinji Ito, Naoto Ohsaka

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Yu-Hang Zhou, Peng Hu, Chen Liang and
Huan Xu, Guangda Huzhang, Yinfu Feng, Qing Da, Xinshang Wang, An-Xiang Zeng

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Dimitris Fotakis, Loukas Kavouras, Grigorios Koumoutsos and
Stratis Skoulakis, Manolis Vardas

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Ayya Alieva, Aiden Aceves, Jialin Song and
Stephen Mayo, Yisong Yue, Yuxin Chen

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Xiaotie Deng, Ron Lavi, Tao Lin and
Qi Qi, Wenwei WANG, Xiang Yan

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Shinji Ito, Daisuke Hatano, Hanna Sumita and
Kei Takemura, Takuro Fukunaga, Naonori Kakimura, Ken-Ichi Kawarabayashi

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper