How to set the reinforcement learning block in Simulink to output 9 actions

Question

Aaron Amusan 2021-5-15

0
链接

此问题的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/830453-how-to-set-the-reinforcement-learning-block-in-simulink-to-output-9-actions

评论：轩 2024-1-4

I am trying to tune 3 PID controllers in matlab/simulink via reinforcement learning thus the reinforcemt learning toolbox. I am trying to follow "Tune PI Controller using Reinforcement Learning" (https://www.mathworks.com/help/reinforcement-learning/ug/tune-pi-controller-using-td3.html) as best I can and extrapolate it to create three PID controllers controlled by a DDPG agent. i can't really figure out how to make it account for three PID controllers, hence nine gain values.

I can't really share much about the model since I am bound to a NDA by my university, but I feel as though my question is nonspecifc enough to both not get me in trouble, but also allow potential helpers to get what I am trying to do. Please let me know if more info is needed.

1 个评论
显示 -1更早的评论隐藏 -1更早的评论

轩 2024-1-4

Hello Aaron, I am trying the same method now. Could you please leave a contact information for your advice?

请先登录，再进行评论。

请先登录，再回答此问题。

Answer 1

Yikai 2021-5-17

0
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/830453-how-to-set-the-reinforcement-learning-block-in-simulink-to-output-9-actions#answer_701513

the number of the actions is defined at actionInfo of your environment

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

Answer 2

Emmanouil Tzorakoleftherakis 2021-5-17

0
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/830453-how-to-set-the-reinforcement-learning-block-in-simulink-to-output-9-actions#answer_701918

Hello,

the example you are referring to does not output 3 values for the pid gains. The PID gains are "integrated" into the neural network architecture and the policy output is still the same as PID. If you want to follow the same setup, the output of the policy in your case would output 3 values and the neural network weights would match the weights of the PID controllers.

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

How to set the reinforcement learning block in Simulink to output 9 actions

1 个评论
显示 -1更早的评论隐藏 -1更早的评论

回答（2 个）

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

另请参阅

类别

标签

产品

版本

Community Treasure Hunt

How to set the reinforcement learning block in Simulink to output 9 actions

1 个评论 显示 -1更早的评论隐藏 -1更早的评论

回答（2 个）

0 个评论 显示 -2更早的评论隐藏 -2更早的评论

0 个评论 显示 -2更早的评论隐藏 -2更早的评论

另请参阅

类别

标签

产品

版本

Community Treasure Hunt

1 个评论
显示 -1更早的评论隐藏 -1更早的评论

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

0 个评论
显示 -2更早的评论隐藏 -2更早的评论