A Hybrid Reinforcement Learning-based Model for the Vehicle Routing Problem in Transportation Logistics
Document Title
A Hybrid Reinforcement Learning-based Model for the Vehicle Routing Problem in Transportation Logistics
Author
Phiboonbanakit T., Horanont T., Huynh V., Supnithi T.
Affiliations
School of Knowledge Science, Japan Advanced Institute of Science and Technology, Nomi, Ishikawa, 923-1292, Japan and School of Information, Computer, and Communication Technology, Sirindhorn International Institute of Technology, Thammasat University, Pathum Thani, 12000, Thailand. (e-mail: thananut@jaist.ac.jp); School of Information, Computer, and Communication Technology, Sirindhorn International Institute of Technology, Thammasat University, Pathum Thani, 12000, Thailand.; School of Knowledge Science, Japan Advanced Institute of Science and Technology, Nomi, Ishikawa, 923-1292, Japan.; National Electronics and Computer Technology Center (NECTEC), National Science and Technology Development Agency, Pathum Thani, 12000, Thailand.
Type
Article
Source Title
IEEE Access
ISSN
2169-3536
Year
2021
Volume
9
Page
163325-163347
Open Access
Gold
Publisher
Institute of Electrical and Electronics Engineers Inc.
DOI
10.1109/ACCESS.2021.3131799
Format
Abstract
Currently, the number of deliveries handled by transportation logistics is rapidly increasing because of the significant growth of the e-commerce industry, resulting in the need for improved functional vehicle routing measures for logistics companies. The effective management of vehicle routing helps companies reduce operational costs and increase their competitiveness. The vehicle routing problem (VRP) seeks to identify optimal routes for a fleet of vehicles to deliver goods to customers while simultaneously considering changing requirements and uncertainties in the transportation environment. Due to its combinatorial nature and complexity, conventional optimization approaches may not be practical for solving the VRP. In this paper, a new optimization model based on reinforcement learning (RL) and a complementary tree-based regression method is proposed. In our proposed model, when the RL agent performs vehicle routing optimization, its state and action are fed into the tree-based regression model to assess whether the current route is feasible according to the given environment, and the response received is used by the RL agent to adjust its actions for optimizing the vehicle routing task. The procedure repeats iteratively until the maximum number of iterations is reached; the optimal vehicle route is then returned and can be used to assist in decision making. Multiple logistics agency case studies are conducted to demonstrate the application and practicality of the proposed model. The experimental results indicate that the proposed technique significantly improves profit gains, by up to 37.63%, for logistics agencies compared with conventional approaches.
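The loop described in the abstract (an RL agent proposes a routing action, a tree-based regressor scores its feasibility, and that score shapes the agent's next update) can be illustrated with a small sketch. This is not the authors' implementation. It assumes tabular Q-learning as the RL method, a hand-written one-split regression stump standing in for the tree-based regression model, leg distance as the only feature, and a made-up five-node instance. The names `COORDS`, `PENALTY`, `fit_stump`, `train`, and `greedy_route` are hypothetical.

```python
import math
import random

# Hypothetical toy instance: depot at index 0 plus four customers (made-up coordinates).
COORDS = [(0, 0), (2, 1), (5, 4), (1, 6), (6, 1)]
PENALTY = 5.0  # assumed cost added to a leg the regressor scores as infeasible


def dist(i, j):
    (x1, y1), (x2, y2) = COORDS[i], COORDS[j]
    return math.hypot(x1 - x2, y1 - y2)


def fit_stump(xs, ys):
    """Fit a depth-1 regression tree (a single split) by minimising squared error."""
    best = None
    for t in sorted(set(xs)):
        left = [y for x, y in zip(xs, ys) if x <= t]
        right = [y for x, y in zip(xs, ys) if x > t]
        if not left or not right:
            continue
        ml, mr = sum(left) / len(left), sum(right) / len(right)
        sse = sum((y - ml) ** 2 for y in left) + sum((y - mr) ** 2 for y in right)
        if best is None or sse < best[0]:
            best = (sse, t, ml, mr)
    _, t, ml, mr = best
    return lambda x: ml if x <= t else mr


def train(feasible, episodes=2000, alpha=0.1, gamma=1.0, eps=0.2, seed=0):
    """Tabular Q-learning; the regressor's feasibility score shapes each reward."""
    rng = random.Random(seed)
    q = {}
    customers = frozenset(range(1, len(COORDS)))
    for _ in range(episodes):  # stop after a fixed number of iterations
        node, left = 0, customers
        while left:
            actions = sorted(left)
            if rng.random() < eps:  # explore
                a = rng.choice(actions)
            else:  # exploit the current value estimates
                a = max(actions, key=lambda x: q.get((node, left, x), 0.0))
            d = dist(node, a)
            # Feed the (state, action)-derived feature to the regressor.
            reward = -d - PENALTY * (1.0 - feasible(d))
            nxt = left - {a}
            if nxt:
                future = max(q.get((a, nxt, b), 0.0) for b in nxt)
            else:
                reward -= dist(a, 0)  # close the tour at the depot
                future = 0.0
            key = (node, left, a)
            old = q.get(key, 0.0)
            q[key] = old + alpha * (reward + gamma * future - old)
            node, left = a, nxt
    return q


def greedy_route(q):
    """Read out a route by following the highest-valued action in each state."""
    node, left, route = 0, frozenset(range(1, len(COORDS))), [0]
    while left:
        nxt = max(sorted(left), key=lambda x: q.get((node, left, x), 0.0))
        route.append(nxt)
        left = left - {nxt}
        node = nxt
    return route + [0]


if __name__ == "__main__":
    # Synthetic training data (assumption): legs longer than 4 units are infeasible.
    feasible = fit_stump([1, 2, 3, 4, 5, 6, 7, 8], [1, 1, 1, 1, 0, 0, 0, 0])
    print(greedy_route(train(feasible)))
```

In this sketch the feasibility model is fitted once and then only queried. A setup closer to the abstract would pass richer state and action features to a full regression tree and could retrain it as the environment changes.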
Keyword
Adaptation models | Costs | freight | intelligent transportation | logistics | Optimization | policy | reinforcement learning
License
CC BY or CC BY-NC-ND
Rights
Author
Publication Source
IEEE