How I can access the action output of the actor network in DDPG during training?

Question

Maha Mosalam 2021-12-2

0
链接

此问题的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/1601730-how-i-can-access-the-action-output-of-the-actor-network-in-ddpg-during-training

回答： Yash 2024-12-24

I want to access the action output of the actor network in DDPG during training since I want to change it by force function to other action optimized from sepeate function to accelerate training and improve learning effeciecncy for actor , if any help for that? I wil be thankful

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

请先登录，再回答此问题。

Answer 1

Yash 2024-12-24

0
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/1601730-how-i-can-access-the-action-output-of-the-actor-network-in-ddpg-during-training#answer_1556358

You can use the function getAction which returns action from agent, actor or policy object given environment observations. You can write a custom loss function that directly uses getAction and dlgradient within it, and then use dlfeval and dlaccelerate with your custom loss function. For an example, see Train Reinforcement Learning Policy Using Custom Training Loop and Custom Training Loop with Simulink Action Noise.

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

How I can access the action output of the actor network in DDPG during training?

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

回答（1 个）

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

另请参阅

类别

标签

Community Treasure Hunt

How I can access the action output of the actor network in DDPG during training?

0 个评论 显示 -2更早的评论隐藏 -2更早的评论

回答（1 个）

0 个评论 显示 -2更早的评论隐藏 -2更早的评论

另请参阅

类别

标签

Community Treasure Hunt

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

0 个评论
显示 -2更早的评论隐藏 -2更早的评论