Deep Reinforcement Learning Approach to Solve Dynamic Vehicle Routing Problem with Stochastic Customers

Abstract: In real-world urban logistics operations, changes to the routes and tasks occur in response to dynamic events. To ensure customers’ demands are met, planners need to make these changes quickly (sometimes instantaneously). This paper proposes the formulation of a dynamic vehicle routing problem with time windows and both known and stochastic customers as a route-based Markov Decision Process. We propose a solution approach that combines Deep Reinforcement Learning (specifically neural networks-based Temporal-Difference learning with experience replay) to approximate the value function and a routing heuristic based on Simulated Annealing, called DRLSA. Our approach enables optimized re-routing decision to be generated almost instantaneously. Furthermore, to exploit the structure of this problem, we propose a state representation based on the total cost of the remaining routes of the vehicles. We show that the cost of the remaining routes of vehicles can serve as proxy to the sequence of the routes and time window requirements. DRLSA is evaluated against the commonly used Approximate Value Iteration (AVI) and Multiple Scenario Approach (MSA). Our experiment results show that DRLSA can achieve on average, 10% improvement over myopic, outperforming AVI and MSA even with small training episodes on problems with degree of dynamism above 0.5.

06/12/2020

Yi Ma, Xiaotian Hao, Jianye Hao and
Jiawen Lu, Xing Liu, Tong Xialiang, Mingxuan Yuan, Zhigang Li, Jie Tang, Zhaopeng Meng

autonomous driving, imitation learning, dagger, carla, adaptive sampling, online learning, mixture of distributions, curriculum learning

1:01

12/07/2020

Deep Reinforcement Learning Approach to Solve Dynamic Vehicle Routing Problem with Stochastic Customers

Waldy Joe, Hoong Chuin Lau

Comments

Similar Papers

Online Bayesian Goal Inference for Boundedly Rational Planning Agents

Tan Zhi-Xuan, Jordyn Mann, Tom Silver and Josh Tenenbaum, Vikash Mansinghka

Keywords Abstract Paper

Real-Time Pricing Optimization for Ride-Hailing Quality of Service

Enpeng Yuan, Pascal Van Hentenryck

Keywords Abstract Paper

Multidisciplinary Topics and Applications, Transportation, Real-Time Systems

Guidelines for Action Space Definition in Reinforcement Learning-Based Traffic Signal Control Systems

Maxime Treca, Julian Garbiso, Dominique Barth

Keywords Abstract Paper

Learning, Real-time scheduling, Traffic signal control, Reinforcement Learning, Action space, Mobility

A Hierarchical Reinforcement Learning Based Optimization Framework for Large-scale Dynamic Pickup and Delivery Problems

Yi Ma, Xiaotian Hao, Jianye Hao and Jiawen Lu, Xing Liu, Tong Xialiang, Mingxuan Yuan, Zhigang Li, Jie Tang, Zhaopeng Meng

Keywords Abstract Paper

optimization, reinforcement learning and planning

VectorNet: Encoding HD Maps and Agent Dynamics From Vectorized Representation

Jiyang Gao, Chen Sun, Hang Zhao and Yi Shen, Dragomir Anguelov, Congcong Li, Cordelia Schmid

Keywords Abstract Paper

autonomous driving, behavior prediction, motion forecasting, map representation

Dynamic Rebalancing Dockless Bike-Sharing System based on Station Community Discovery

Jingjing Li, Qiang Wang, Wenqi Zhang and Donghai Shi, Zhiwei Qin

Keywords Abstract Paper

Planning and Scheduling, Applications of Planning, Planning and Scheduling

Traffic Congestion Alleviation over Dynamic Road Networks: Continuous Optimal Route Combination for Trip Query Streams

Ke Li, Lisi Chen, Shuo Shang and Panos Kalnis, Bin Yao

Keywords Abstract Paper

Multidisciplinary Topics and Applications, Transportation

Learning to Simulate Self-driven Particles System with Coordinated Policy Optimization

Zhenghao Peng, Quanyi Li, Ka Ming Hui and Chunxiao Liu, Bolei Zhou

Keywords Abstract Paper

optimization, reinforcement learning and planning

Learning to delegate for large-scale vehicle routing

Sirui Li, Zhongxia Yan, Cathy Wu

Keywords Abstract Paper

optimization, machine learning, transformers

Multi-agent Trajectory Prediction with Fuzzy Query Attention

Nitin Kamra, Hao Zhu, Dweep Kumarbhai Trivedi and Ming Zhang, Yan Liu

Keywords Abstract Paper

Hierarchically and Cooperatively Learning Traffic Signal Control

Bingyu Xu, Yaowei Wang, Zhaozhi Wang and Huizhu Jia, Zongqing Lu

Keywords Abstract Paper

Preserving dynamic attention for long-term spatial-temporal prediction

Haoxing Lin, Rufan Bai, Weijia Jia and Xinyu Yang, Yongjian You

Keywords Abstract Paper

attention mechanism, long-term prediction, neural network, mining spatial-temporal information

Learning Model Parameters for Decentralized Schedule-Driven Traffic Control

Hsu-Chieh Hu, Stephen F. Smith

Keywords Abstract Paper

planning and learning, reinforcement learning, online planning, network-level coordination

A Multi-Criteria System for Recommending Taxi Routes with an Advance Reservation

Jie-Yu Fang, Fandel Lin, Hsun-Ping Hsieh

Keywords Abstract Paper

taxi service, heuristic search, spatial-temporal predictions, multi-criteria searching

Solving Large Real-Life Bus Driver Scheduling Problems with Complex Break Constraints

Lucas Kletzander, Nysret Musliu

Keywords Abstract Paper

Driver scheduling, Break scheduling, Simulated Annealing

Refining Process Descriptions from Execution Data in Hybrid Planning Domain Models

Alan Lindsay, Santiago Franco, Rubiya Reba, Thomas L. McCluskey

Keywords Abstract Paper

Automated Planning, Hybrid Planning, Domain Model Refinement, Urban Traffic Management

Synthesis of Search Heuristics for Temporal Planning via Reinforcement Learning

Andrea Micheli, Alessandro Valentini

Keywords Abstract Paper

Curb-GAN: Conditional urban traffic estimation through spatio-temporal generative adversarial networks

Yingxue Zhang, Yanhua Li, Xun Zhou and Xiangnan Kong, Jun Luo

Keywords Abstract Paper

generative adversarial networks, traffic estimation, spatial-temporal data, self-attention

Logarithmic Regret Bound in Partially Observable Linear Dynamical Systems

Sahin Lale, Kamyar Azizzadenesheli, Babak Hassibi, Anima Anandkumar

Keywords Abstract Paper

ConSTGAT: Contextual spatial-temporal graph attention network for travel time estimation at baidu maps

Xiaomin Fang, Jizhou Huang, Fan Wang and Lingke Zeng, Haijin Liang, Haifeng Wang

Keywords Abstract Paper

contextual information, attention mechanism, graph neural network, transportation, baidu maps, travel time estimation

Automated Dynamic Mechanism Design

Tan Zhi-Xuan, Jordyn Mann, Tom Silver and
Josh Tenenbaum, Vikash Mansinghka

Keywords Paper

Keywords Paper

Keywords Paper

Yi Ma, Xiaotian Hao, Jianye Hao and
Jiawen Lu, Xing Liu, Tong Xialiang, Mingxuan Yuan, Zhigang Li, Jie Tang, Zhaopeng Meng

Keywords Paper

Jiyang Gao, Chen Sun, Hang Zhao and
Yi Shen, Dragomir Anguelov, Congcong Li, Cordelia Schmid

Keywords Paper

Jingjing Li, Qiang Wang, Wenqi Zhang and
Donghai Shi, Zhiwei Qin

Keywords Paper

Ke Li, Lisi Chen, Shuo Shang and
Panos Kalnis, Bin Yao

Keywords Paper

Zhenghao Peng, Quanyi Li, Ka Ming Hui and
Chunxiao Liu, Bolei Zhou

Keywords Paper

Keywords Paper

Nitin Kamra, Hao Zhu, Dweep Kumarbhai Trivedi and
Ming Zhang, Yan Liu

Keywords Paper

Bingyu Xu, Yaowei Wang, Zhaozhi Wang and
Huizhu Jia, Zongqing Lu

Keywords Paper

Haoxing Lin, Rufan Bai, Weijia Jia and
Xinyu Yang, Yongjian You

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Yingxue Zhang, Yanhua Li, Xun Zhou and
Xiangnan Kong, Jun Luo

Keywords Paper

Keywords Paper

Xiaomin Fang, Jizhou Huang, Fan Wang and
Lingke Zeng, Haijin Liang, Haifeng Wang

Keywords Paper

Keywords Paper

Sandeep Singh Sandha, Luis Garcia, Bharathan Balaji and
Fatima Anwar, Mani Srivastava

Keywords Paper

Qiaoyu Tan, Jianwei Zhang, Ninghao Liu and
Xiao Huang, Hongxia Yang, Jingren Zhou, Xia Hu

Keywords Paper

Keywords Paper

Keywords Paper

Aditya Prakash, Aseem Behl, Eshed Ohn-Bar and
Kashyap Chitta, Andreas Geiger

Keywords Paper

Sofien Dhouib, Ievgen Redko, Tanguy Kerdoncuff and
Rémi Emonet, Marc Sebban

Keywords Paper

Keywords Paper

Kuan Xu, Chilin Fu, Xiaolu Zhang and
Cen Chen, Ya-Lin Zhang, Wenge Rong, Zujie Wen, Jun Zhou, Xiaolong Li, Yu Qiao

Keywords Paper

Younggyo Seo, Kimin Lee, Ignasi Clavera Gilaberte and
Thanard Kurutach, Jinwoo Shin, Pieter Abbeel

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Leilei Sun, Yansong Bai, Bowen Du and
Chuanren Liu, Hui Xiong, Weifeng Lv

Keywords Paper

Anshul Nasery, Soumyadeep Thakur, Vihari Piratla and
Abir De, Sunita Sarawagi

Keywords Paper

Christoph Dann, Yishay Mansour, Mehryar Mohri and
Ayush Sekhari, Karthik Sridharan

Keywords Paper