Salvataggio Agente trainato per code generation

Question

francesco 2024-2-20

0
链接

此问题的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/2084503-salvataggio-agente-trainato-per-code-generation

回答： Aiswarya 2024-2-27

Salve, ho allenato un agente DDPG tramite il "ReinforcementLearningDesigner", successivamente ho ottenuto un blocco .mat con tutti i dati della sessione, vorrei sapere dove è salvato l'agente trainato, chiedo questo perche vorrei fare code generation, quindi vorrei generare la policy da questo agente trainato per poi inserirla dentro il "policy block".

Inoltre mi chiedo, com'è possibile che il blocco policy funzioni se non ha in input anche i dati sulla "reward"?

1 个评论
显示 -1更早的评论隐藏 -1更早的评论

Walter Roberson 2024-2-20

Approximate translation:

Hi, I trained a DDPG agent via the "ReinforcementLearningDesigner", subsequently I obtained a .mat block with all the session data, I would like to know where the trained agent is saved, I ask this because I would like to do code generation, so I would like to generate the policy driven by this agent and then inserting it into the "policy block".I also wonder, how is it possible for the policy block to work if it doesn't also have the "reward" data as input?

请先登录，再进行评论。

请先登录，再回答此问题。

Answer 1

Aiswarya 2024-2-27

0
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/2084503-salvataggio-agente-trainato-per-code-generation#answer_1417228

在 MATLAB Online 中打开

Hi,

Si prega di notare che risponderò alla domanda in inglese.

( Please note that I will be answering the question in English. )

In order to save your trained agent in the Reinforcement Learning Designer, you can export your agent to MATLAB workspace. To do this, navigate to Reinforcement Learning tab, and under Export select the trained agent. You may refer to this following documentation on how to export agent and save the session: https://www.mathworks.com/help/reinforcement-learning/ug/design-dqn-using-rl-designer.html#mw_abc1bb48-f0fc-400d-98a5-e222c80d131d

You can save your agent in a MAT file using the below command :

save("Agent.mat","agent")

Then you can directly create a Simulink "Policy" block using command line as follows:

load("Agent.mat","agent")
generatePolicyBlock(agent);

You may refer to the following link for more information on the "generatePolicyBlock" function: https://www.mathworks.com/help/reinforcement-learning/ref/generatepolicyblock.html

During training the agent uses the reward signal and updates its policy based on it. Once the policy is trained, you can deploy it to make decisions and hence the policy block only needs the observation as input to output an action. The "Policy" block is using the policy as a parameter, which has already been learned through the rewards received during training. Hence, it does not require the reward as input.

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

Salvataggio Agente trainato per code generation

1 个评论
显示 -1更早的评论隐藏 -1更早的评论

回答（1 个）

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

另请参阅

标签

产品

版本

Community Treasure Hunt

Salvataggio Agente trainato per code generation

1 个评论 显示 -1更早的评论隐藏 -1更早的评论

回答（1 个）

0 个评论 显示 -2更早的评论隐藏 -2更早的评论

另请参阅

标签

产品

版本

Community Treasure Hunt

1 个评论
显示 -1更早的评论隐藏 -1更早的评论

0 个评论
显示 -2更早的评论隐藏 -2更早的评论