Combining Reinforcement Learning and Constraint Programming for Combinatorial Optimization

02/02/2021

Combining Reinforcement Learning and Constraint Programming for Combinatorial Optimization

Quentin Cappart, Thierry Moisan, Louis-Martin Rousseau, Isabeau Prémont-Schwarz, Andre A. Cire

Keywords:

Abstract Paper Similar Papers

Abstract: Combinatorial optimization has found applications in numerous fields, from aerospace to transportation planning and economics. The goal is to find an optimal solution among a finite set of possibilities. The well-known challenge one faces with combinatorial optimization is the state-space explosion problem: the number of possibilities grows exponentially with the problem size, which makes solving intractable for large problems. In the last years, deep reinforcement learning (DRL) has shown its promise for designing good heuristics dedicated to solve NP-hard combinatorial optimization problems. However, current approaches have an important shortcoming: they only provide an approximate solution with no systematic ways to improve it or to prove optimality. In another context, constraint programming (CP) is a generic tool to solve combinatorial optimization problems. Based on a complete search procedure, it will always find the optimal solution if we allow an execution time large enough. A critical design choice, that makes CP non-trivial to use in practice, is the branching decision, directing how the search space is explored. In this work, we propose a general and hybrid approach, based on DRL and CP, for solving combinatorial optimization problems. The core of our approach is based on a dynamic programming formulation, that acts as a bridge between both techniques. We experimentally show that our solver is efficient to solve three challenging problems: the traveling salesman problem with time windows, the 4-moments portfolio optimization problem, and the 0-1 knapsack problem. Results obtained show that the framework introduced outperforms the stand-alone RL and CP solutions, while being competitive with industrial solvers.

The video of this talk cannot be embedded. You can watch it here:

https://slideslive.com/38949198

(Link will open in new window)

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at AAAI 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

06/12/2021

Learning Hard Optimization Problems: A Data Generation Perspective

James Kotary, Ferdinando Fioretto, Pascal Van Hentenryck

Keywords Paper

optimization, machine learning

0

0

0

0

14:47

19/08/2021

Abstract Cores in Implicit Hitting Set MaxSat Solving (Extended Abstract)

Jeremias Berg, Fahiem Bacchus, Alex Poole

Keywords Paper

Constraints and SAT, MaxSAT, MinSAT, Constraint Optimization, SAT, Constraints

0

0

0

0

14:57

13/04/2021

Scalable constrained bayesian optimization

David Eriksson, Matthias Poloczek

Keywords Paper

0

0

0

0

2:58

06/12/2020

Faster Randomized Infeasible Interior Point Methods for Tall/Wide Linear Programs

Agniva Chowdhury, Palma London, Haim Avron, Petros Drineas

Keywords Paper

0

0

0

0

3:21

18/07/2021

Global Optimality Beyond Two Layers: Training Deep ReLU Networks via Convex Programs

Tolga Ergen, Mert Pilanci

Keywords Paper

Optimization, Convex Optimization

0

0

0

0

5:40

18/07/2021

Submodular Maximization subject to a Knapsack Constraint: Combinatorial Algorithms with Near-optimal Adaptive Complexity

Yorgos Amanatidis, Federico Fusco, Philip Lazos and
Stefano Leonardi, Alberto Marchetti-Spaccamela, Rebecca Reiffenhäuser

Keywords Paper

Optimization, Combinatorial Optimization

0

0

0

0

5:15

03/08/2020

Semi-bandit Optimization in the Dispersed Setting

Travis Dick, Wesley Pegden, Maria-Florina Balcan

Keywords Paper

0

0

0

0

8:04

26/10/2020

Through the Lens of Sequence Submodularity

Sara Bernardini, Fabio Fagnani, Chiara Piacentini

Keywords Paper

Greedy algorithms, Submodularity, Sequence functions, Search, Scheduling, Recommender Systems

0

0

0

0

10:31

06/12/2021

Faster Matchings via Learned Duals

Michael Dinitz, Sungjin Im, Thomas Lavastida and
Benjamin Moseley, Sergei Vassilvitskii

Keywords Paper

theory, optimization

0

0

0

0

20:11

16/11/2020

Multiagent Rollout and Policy Iteration for POMDP with Application to Multi-Robot Repair Problems

Sushmita Bhattacharya, Siva Kailas, Sahil Badyal and
Stephanie Gil, Dimitri Bertsekas

Keywords Paper

0

0

0

0

5:04

03/08/2020

Amortized Bayesian Optimization over Discrete Spaces

Kevin Swersky, Yulia Rubanova, David Dohan, Kevin Murphy

Keywords Paper

0

0

0

0

7:40

26/10/2020

A correctness result for synthesizing plans with loops in stochastic domains

Laszlo Treszkai, Vaishak Belle

Keywords Paper

Finite-state controllers, plans with loops, stochastic domains, soundness and completeness results

0

0

0

0

12:43

06/12/2021

Robust and Fully-Dynamic Coreset for Continuous-and-Bounded Learning (With Outliers) Problems

Zixiu Wang, Yiwen Guo, Hu Ding

Keywords Paper

optimization, machine learning, adversarial robustness and security, clustering

0

0

0

0

8:38

12/07/2020

Multi-Task Learning with User Preferences: Gradient Descent with Controlled Ascent in Pareto Optimization

Debabrata Mahapatra, Vaibhav Rajan

Keywords Paper

Transfer, Multitask and Meta-learning

0

0

0

0

15:35

02/02/2021

Deeplite NeutrinoTM: A BlackBox Framework for Constrained Deep Learning Model Optimization

Anush Sankaran, Olivier Mastropietro, Ehsan Saboori and
Yasser Idris, Davis Sawyer, MohammadHossein AskariHemmat, Ghouthi Boukli Hacene

Keywords Paper

0

0

0

0

18:29

19/01/2020

The Weak Call-By-Value λ-Calculus is Reasonable for Both Time and Space

Yannick Forster, Fabian Kunze, Marc Roth

Keywords Paper

lambda calculus, time and space complexity, abstract machines, invariance thesis, weak call-by-value reduction

0

0

0

0

22:05

14/07/2020

Approximation algorithms for scheduling with class constraints

Klaus Jansen, Alexandra Lassota, Marten Maack

Keywords Paper

class constraints, scheduling, PTAs, n-fold ILP

0

0

0

0

12:46

06/12/2020

Interior Point Solving for LP-based prediction+optimisation

Jayanta Mandi, Tias Guns

Keywords Paper

0

0

0

1

3:28

18/07/2021

Versatile Verification of Tree Ensembles

Laurens Devos, Wannes Meert, Jesse Davis

Keywords Paper

Algorithms, Boosting and Ensemble Methods

0

0

0

0

5:25

12/07/2020

Convergence of a Stochastic Gradient Method with Momentum for Non-Smooth Non-Convex Optimization

Vien Mai, Mikael Johansson

Keywords Paper

Optimization - Non-convex

0

0

0

0

15:49

13/04/2021

Adaptive sampling for fast constrained maximization of submodular functions

Francesco Quinzan, Vanja Doskoc, Andreas Göbel, Tobias Friedrich

Keywords Paper

0

0

0

0

2:54

12/07/2020

Quadratically Regularized Subgradient Methods for Weakly Convex Optimization with Weakly Convex Constraints

Runchao Ma, Qihang Lin, Tianbao Yang

Keywords Paper

Optimization - Non-convex

0

0

0

0

12:52

06/12/2020

Efficient Nonmyopic Bayesian Optimization via One-Shot Multi-Step Trees

Shali Jiang, Daniel Jiang, Max Balandat and
Brian Karrer, Jacob Gardner, Roman Garnett

Keywords Paper

Applications -> Hardware and Systems, Applications

0

0

0

0

3:23

23/08/2020

A block decomposition algorithm for sparse optimization

Ganzhao Yuan, Li Shen, Wei-Shi Zheng

Keywords Paper

NP-hard, nonconvex optimization, block coordinate descent, sparse optimization, convex optimization

0

0

0

0

18:12

06/12/2021

A Bi-Level Framework for Learning to Solve Combinatorial Optimization on Graphs

Runzhong Wang, Zhigang Hua, Gan Liu and
Jiayi Zhang, Junchi Yan, Feng Qi, Shuang Yang, Jun Zhou, Xiaokang Yang

Keywords Paper

deep learning, optimization, reinforcement learning and planning, machine learning, graph learning

0

0

0

0

11:19

06/12/2020

Approximate Heavily-Constrained Learning with Lagrange Multiplier Models

Harikrishna Narasimhan, Andy Cotter, Yichen Zhou and
Serena Wang, Wenshuo Guo

Keywords Paper

0

0

0

0

3:21

06/07/2020

Bounding boxes for weakly supervised segmentation: Global constraints get close to full supervision

Hoel Kervadec, Jose Dolz, Shanshan Wang and
Eric Granger, Ismail Ben Ayed

Keywords Paper

0

0

0

0

15:09

06/12/2021

Nonsmooth Implicit Differentiation for Machine-Learning and Optimization

Jérôme Bolte, Tam Le, Edouard Pauwels, Tony Silveti-Falls

Keywords Paper

deep learning, optimization, machine learning

0

0

0

0

12:32

06/12/2020

Follow the Perturbed Leader: Optimism and Fast Parallel Algorithms for Smooth Minimax Games

Arun Suggala, Praneeth Netrapalli

Keywords Paper

1

1

0

0

3:29

02/02/2021

GLISTER: Generalization based Data Subset Selection for Efficient and Robust Learning

Krishnateja Killamsetty, Durga Sivasubramanian, Ganesh Ramakrishnan, Rishabh Iyer

Keywords Paper

0

0

0

0

19:14

06/12/2021

Practical, Provably-Correct Interactive Learning in the Realizable Setting: The Power of True Believers

Julian Katz-Samuels, Blake Mason, Kevin Jamieson, Rob Nowak

Keywords Paper

theory, machine learning, bandits, kernel methods, active learning

0

0

0

0

7:41

03/05/2021

Learning to Make Decisions via Submodular Regularization

Ayya Alieva, Aiden Aceves, Jialin Song and
Stephen Mayo, Yisong Yue, Yuxin Chen

Keywords Paper

0

0

0

0

5:53

17/08/2020

NASOQ: Numerically accurate sparsity-oriented QP solver

Kazem Cheshmi, Danny M. Kaufman, Shoaib Kamil, Maryam Mehri Dehnavi

Keywords Paper

indefinite factorization, numerical optimization, contact simulation, sparse row modification, mesh deformation, quadratic programming, sparse linear algebra

0

0

0

0

15:27

06/12/2021

Learning to Schedule Heuristics in Branch and Bound

Antonia Chmiela, Elias Khalil, Ambros Gleixner and
Andrea Lodi, Sebastian Pokutta

Keywords Paper

0

0

0

0

15:05

06/12/2021

SUPER-ADAM: Faster and Universal Framework of Adaptive Gradients

Feihu Huang, Junyi Li, Heng Huang

Keywords Paper

deep learning, optimization, machine learning

0

0

0

0

13:13

19/08/2021

Improved Algorithms for Allen's Interval Algebra: a Dynamic Programming Approach

Leif Eriksson, Victor Lagerkvist

Keywords Paper

Knowledge Representation and Reasoning, Qualitative, Geometric, Spatial, Temporal Reasoning, Constraint Satisfaction

0

0

0

0

13:25

06/12/2021

Fast Training Method for Stochastic Compositional Optimization Problems

Hongchang Gao, Heng Huang

Keywords Paper

optimization, machine learning, meta learning

0

0

0

0

14:00

30/11/2020

Large-Scale Cross-Domain Few-Shot Learning

Jiechao Guan, Manli Zhang, Zhiwu Lu

Keywords Paper

0

0

0

0

7:26

18/07/2021

The Limits of Min-Max Optimization Algorithms: Convergence to Spurious Non-Critical Sets

Ya-Ping Hsieh, Panayotis Mertikopoulos, Volkan Cevher

Keywords Paper

Theory

0

0

0

0

16:38

13/04/2021

Convergence properties of stochastic hypergradients

Riccardo Grazzi, Massimiliano Pontil, Saverio Salzo

Keywords Paper

0

0

0

0

3:11