Train a DDPG agent to swing a pole with constraints


GCats 2021-12-16

Direct link to this question: https://ww2.mathworks.cn/matlabcentral/answers/1612320-train-a-ddpg-agent-to-swing-a-pole-with-constraints

Screen Shot 2021-12-16 at 12.09.56.png

Hello everyone!

I'm currently working with the pendulum environment and the DDPG agent described in this document: https://nl.mathworks.com/help/reinforcement-learning/ug/train-ddpg-agent-to-swing-up-and-balance-pendulum.html

Now I would like to add some constraints in the Simulink model between the observations and the agent (I believe this technique is called shielding). For example, I would like to constrain the pendulum's angular speed before the observations are fed to the agent.
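To make the idea concrete, here is a minimal sketch of the kind of saturation I have in mind. It is written in Python purely for illustration and is not taken from the linked example. The observation layout (sin θ, cos θ, dθ/dt), the function name `clip_angular_speed`, and the 4 rad/s limit are all assumptions of mine. In the Simulink model, the equivalent would presumably be a Saturation block or a MATLAB Function block on the angular-speed signal before it reaches the agent.

```python
def clip_angular_speed(obs, max_speed=4.0):
    """Saturate the angular-speed entry of a pendulum observation.

    Assumes obs = (sin(theta), cos(theta), theta_dot); both this layout
    and the default limit of 4.0 rad/s are illustrative assumptions.
    """
    sin_theta, cos_theta, theta_dot = obs
    # Clamp theta_dot to the interval [-max_speed, max_speed].
    theta_dot = max(-max_speed, min(max_speed, theta_dot))
    return (sin_theta, cos_theta, theta_dot)


# Example: an observation with an angular speed above the limit.
raw_obs = (0.0, 1.0, 10.0)
safe_obs = clip_angular_speed(raw_obs)
```

Note that this only changes what the agent sees; it does not limit the physical speed of the pendulum itself.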

I think one option could be to use the Constraint Enforcement block in Simulink, but I am not sure how to approach the implementation.

Could anyone help me get started on this problem? Thanks!!

Cheers :)

Release: R2021b

Answers (0)
