Optimal robust online tracking control for space manipulator in task space using off-policy reinforcement learning

This study addresses the demands for adaptability, uncertainty management, and high performance in the control of space manipulators, and the inadequacies in achieving optimal control and handling external uncertainty in task space in previous research. Based on off-policy reinforcement learning, a model-free and time-efficient method for online robust tracking control in task space is devised. To address the complexity of dynamic equations in task space, a mixed-variable approach is adopted to transform the multivariable coupled time-varying problem into a single-variable problem. Subsequently, the optimal control policy is derived with the disturbance convergence, stability, and optimality of the control method being demonstrated. This marks the first instance of achieving robust optimal tracking control in task space for space manipulators. The efficacy and superiority of the presented algorithm are validated through simulation. © 2024

Authors
Zhuang H. , Zhou H. , Shen Q. , Wu S. , Razoumny V.Y. , Razoumny Y.N.
Publisher
Elsevier Masson s.r.l.
Language
English
State
Published
Number
109446
Volume
153
Year
2024
Organizations
  • 1 Shanghai Jiao Tong University, Shanghai, 200240, China
  • 2 Peoples' Friendship University of Russia (RUDN University), Moscow, 117198, Russian Federation
Keywords
Online tracking control; Reinforcement learning; Task space
Share

Other records