Residential College | false |
Status | 已發表Published |
Expert System-Based Multiagent Deep Deterministic Policy Gradient for Swarm Robot Decision Making | |
Wang, Zhen1; Jin, Xiaoyue1; Zhang, Tao1; Li, Jiahao1; Yu, Dengxiu2![]() ![]() | |
2024-03 | |
Source Publication | IEEE Transactions on Cybernetics
![]() |
ABS Journal Level | 3 |
ISSN | 2168-2267 |
Volume | 54Issue:3Pages:1614-1624 |
Abstract | In this article, an expert system-based multiagent deep deterministic policy gradient (ESB-MADDPG) is proposed to realize the decision making for swarm robots. Multiagent deep deterministic policy gradient (MADDPG) is a multiagent reinforcement learning algorithm proposed to utilize a centralized critic within the actor-critic learning framework, which can reduce policy gradient variance. However, it is difficult to apply traditional MADDPG to swarm robots directly as it is time consuming during the path planning, rendering it necessary to propose a faster method to gather the trajectories. Besides, the trajectories obtained by the MADDPG are continuous by straight lines, which is not smooth and will be difficult for the swarm robots to track. This article aims to solve these problems by closing the above gaps. First, the ESB-MADDPG method is proposed to improve the training speed. The smooth processing of the trajectory is designed in the ESB-MADDPG. Furthermore, the expert system also provides us with many trained offline trajectories, which avoid the retraining each time we use the swarm robots. Considering the gathered trajectories, the model predictive control (MPC) algorithm is introduced to realize the optimal tracking of the offline trajectories. Simulation results show that combining ESB-MADDPG and MPC can realize swarm robot decision making efficiently. |
Keyword | Model Prediction Control Multiagent Deep Deterministic Policy Gradient (Maddpg) Swarm Robot Decision Making |
DOI | 10.1109/TCYB.2022.3228578 |
URL | View the original |
Indexed By | SCIE |
Language | 英語English |
WOS Research Area | Automation & Control Systems ; Computer Science |
WOS Subject | Automation & Control Systems ; Computer Science, Artificial Intelligence ; Computer Science, Cybernetics |
WOS ID | WOS:000906217700001 |
Publisher | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC, 445 HOES LANE, PISCATAWAY, NJ 08855-4141 |
Scopus ID | 2-s2.0-85146249164 |
Fulltext Access | |
Citation statistics | |
Document Type | Journal article |
Collection | Faculty of Science and Technology |
Corresponding Author | Yu, Dengxiu |
Affiliation | 1.Northwestern Polytechnical University, School of Mechanical Engineering, Xi'an, 710072, China 2.Unmanned System Research Institute, Northwestern Polytechnical University, Xi'an, 710072, China 3.Singapore University of Technology and Design, Science, Mathematics and Technology Cluster, Tampines, 487372, Singapore 4.South China University of Technology, School of Computer Science and Engineering, Guangzhou, 510641, China 5.University of Macau, Faculty of Science and Technology, Macao |
Recommended Citation GB/T 7714 | Wang, Zhen,Jin, Xiaoyue,Zhang, Tao,et al. Expert System-Based Multiagent Deep Deterministic Policy Gradient for Swarm Robot Decision Making[J]. IEEE Transactions on Cybernetics, 2024, 54(3), 1614-1624. |
APA | Wang, Zhen., Jin, Xiaoyue., Zhang, Tao., Li, Jiahao., Yu, Dengxiu., Cheong, Kang Hao., & Chen, C. L.Philip (2024). Expert System-Based Multiagent Deep Deterministic Policy Gradient for Swarm Robot Decision Making. IEEE Transactions on Cybernetics, 54(3), 1614-1624. |
MLA | Wang, Zhen,et al."Expert System-Based Multiagent Deep Deterministic Policy Gradient for Swarm Robot Decision Making".IEEE Transactions on Cybernetics 54.3(2024):1614-1624. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment