International Standard Serial Number:

ISSN 1001-4551

Sponsor:

Zhejiang University;

Zhejiang Machinery and Electrical Group

Edited by:

Editorial of Journal of Mechanical & Electrical Engineering

Chief Editor:

ZHAO Qun

Vice Chief Editor:

TANG ren-zhong,

LUO Xiang-yang

Tel:

86-571-87041360,87239525

Fax:

86-571-87239571

Add:

No.9 Gaoguannong,Daxue Road,Hangzhou,China

P.C:

310009

E-mail:

meem_contribute@163.com

Deep reinforcement learning-PI control strategy of air servo system based on genetic algorithm optimization
Published:2023-09-20 author:HONG Zi-qi, XU Wen-bo, LV Chen, et al. Browse: 1071 Check PDF documents
Deep reinforcement learning-PI control strategy of air servo system based on 
genetic algorithm optimization

HONG Zi-qi1, XU Wen-bo2, LV Chen1, OUYANG Quan1, WANG Zhi-sheng1
(1.School of Automation Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China; 
2.Laboratory of Aerospace Servo Actuation and Transmission, Beijing Institute of Precision 
Mechatronics and Controls, Beijing 100076, China)

Abstract: Aiming at the problem that traditional proportional integral (PI) control was difficult to select parameters with better control performance, taking the air rudder servo system as the research object, a control method of reinforcement learning-PI based on genetic algorithm optimization was proposed. Firstly, the mathematical model of the air rudder servo system was established. Then, the initial parameters of PI controller were optimized by genetic algorithm. The current PI controller was adjusted in real time using the deep deterministic policy gradient(DDPG)algorithm to realize the position command control of the air rudder servo system. Finally, the effect of the method used in the air rudder servo system was verified in Simulink through simulation analysis. The results show that the improved algorithm has certain online stability when the parameters are perturbed. In the case of no load, the required adjustment time is less than that of genetic algorithm - PI, DDPG-PI and traditional PI algorithm, and it is increased by at least 20%. At the same time, in the case of load, the fluctuation amplitude of the improved algorithm is at least 15% better than that of the other three methods compared with the time to return to steady state after the end of load, which proves the effectiveness of the method used in the air rudder servo system.
Key words:  servo system; proportional integral(PI) controller; genetic algorithm; deep deterministic policy gradient(DDPG) algorithm; parameter optimization; Simulink

  • Chinese Core Periodicals
  • Chinese Sci-tech Core Periodicals
  • SA, INSPEC Indexed
  • CSA: T Indexed
  • UPD:Indexed


2010 Zhejiang Information Institute of Mechinery Industry

Technical Support:Hangzhou Bory science and technology

You are 1895221 visit this site