update reinforcement policy.m weights

Question

Victor Bayer 2021-6-15

0
链接

此问题的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/856810-update-reinforcement-policy-m-weights

回答： Emmanouil Tzorakoleftherakis 2021-6-22

Hello,

in order to run an RLAgent on a Raspberry i have generated a Policy.m file out of the saved Agent (see: https://www.mathworks.com/matlabcentral/answers/854085-run-reinforcement-learning-agent-on-raspberry?s_tid=srchtitle

&

https://www.mathworks.com/help/reinforcement-learning/ug/deploy-trained-reinforcement-learning-agents.html )

This file is attached to the question (evaluatePolicy.m).

In the Simulink-model running on the raspberry (Raspberry_USB_.slx) this file is called as replacement to the RLAgent Block, since that one can not be executed on the Raspberry hardware. Through this, an action can be calculated on the raspberry. However, since the Policy.m file does not consider any reward and does not update itself, no learning takes place on the raspberry (see....).

My question is, if there is any way to update the policy function if one considers a reward for the executed action?

The goal is to enable learning on a raspberry.

I am gratefull for any tip.

Thanks and best regards,

Victor Bayer

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

请先登录，再回答此问题。

Answer 1

Emmanouil Tzorakoleftherakis 2021-6-22

0
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/856810-update-reinforcement-policy-m-weights#answer_730735

Hello,

When you want to perform inference on an RL policy, there is no need to consider rewards. The trained policy already knows internally that the actions taken are the right ones.

If you are asking whether you can perform RL training on the raspberry pi, this is not currently supported.