I am using DDPG .If there are four network to algorithm (actor, target actor , critic , target critic) in algorithm, and if possible to use different learning rate to each?

Question

Maha Mosalam 2021-12-19

0
链接

此问题的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/1614395-i-am-using-ddpg-if-there-are-four-network-to-algorithm-actor-target-actor-critic-target-crit

回答： Yash 2024-12-23

for example online actor=10^-1 and target actor 10^-2...how I can do this in matlab?

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

请先登录，再回答此问题。

Answer 1

Yash 2024-12-23

0
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/1614395-i-am-using-ddpg-if-there-are-four-network-to-algorithm-actor-target-actor-critic-target-crit#answer_1556284

在 MATLAB Online 中打开

Yes, you can use different learning rates for Actor and Critic by specifying them individually when setting up your training options for DDPG agent. Here is a simple code snippet to achieve this:

actorOptimizerOptions = rlOptimizerOptions(LearnRate=1e-1)
criticOptimizerOptions = rlOptimizerOptions(LearnRate=1e-2)
opt = rlDDPGAgentOptions('ActorOptimizerOptions',actorOptimizerOptions,'CriticOptimizerOptions',criticOptimizerOptions)

Refer to this documentation page for more information on creating an object for DDPG agent: https://www.mathworks.com/help/reinforcement-learning/ref/rl.option.rlddpgagentoptions.html

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

I am using DDPG .If there are four network to algorithm (actor, target actor , critic , target critic) in algorithm, and if possible to use different learning rate to each?

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

回答（1 个）

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

另请参阅

类别

标签

Community Treasure Hunt

I am using DDPG .If there are four network to algorithm (actor, target actor , critic , target critic) in algorithm, and if possible to use different learning rate to each?

0 个评论 显示 -2更早的评论隐藏 -2更早的评论

回答（1 个）

0 个评论 显示 -2更早的评论隐藏 -2更早的评论

另请参阅

类别

标签

Community Treasure Hunt

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

0 个评论
显示 -2更早的评论隐藏 -2更早的评论