Reinforcement Learning based quadrotor control using Soft Actor-Critic (the reward is not converging)

Unmanned Aerial and Space Systems 2022-4-30

Edited: Unmanned Aerial and Space Systems 2022-5-1

Hi, I am trying to control a rotary-wing UAV (quadrotor) using the Soft Actor-Critic (SAC) method, but I have run into a problem: my reward keeps increasing continuously after the point shown in the following image. What is the main problem, and can you advise on this situation? I am sharing my files (the Simulink model and the m-file). The maximum reward should be zero, as defined in the reward function in the Simulink file. A reward of zero indicates that the difference between the desired trajectory and the actual trajectory is approximately zero.
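To make the reward definition concrete, here is a minimal sketch of the idea. It is written in Python rather than as the actual Simulink block, and it assumes the reward is simply the negative Euclidean distance between the desired and actual positions; the function names and sample trajectories are hypothetical, and the real model may use a different error norm or weighting.

```python
import math

def tracking_reward(desired, actual):
    """Hypothetical per-step reward: negative Euclidean tracking error.

    The value is always <= 0 and equals 0 only when the actual position
    matches the desired position exactly.
    """
    return -math.dist(desired, actual)

def episode_return(desired_traj, actual_traj):
    """Sum of per-step rewards over one episode (assumed undiscounted)."""
    return sum(tracking_reward(d, a) for d, a in zip(desired_traj, actual_traj))

# Illustrative (made-up) trajectories as (x, y, z) positions in metres.
desired = [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0), (2.0, 0.0, 1.0)]
perfect = list(desired)                                   # exact tracking
offset = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (2.0, 0.0, 0.0)]  # 1 m too low

perfect_return = episode_return(desired, perfect)  # upper bound of the return
offset_return = episode_return(desired, offset)    # strictly negative
```

Under this assumption the episode reward is bounded above by zero, so a training curve would be expected to rise toward zero and flatten there rather than exceed it.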

Release

R2021b

Answers (0)
