
I am using DDPG .If there are four network to algorithm (actor, target actor , critic , target critic) in algorithm, and if possible to use different learning rate to each?
5 次查看(过去 30 天)
显示 更早的评论
I am using DDPG .If there are four network to algorithm (actor, target actor , critic , target critic) in algorithm, and if possible to use different learning rate to each?
for example online actor=10^-1 and target actor 10^-2...how I can do this in matlab?
0 个评论
回答(1 个)
Yash
2024-12-23
Yes, you can use different learning rates for Actor and Critic by specifying them individually when setting up your training options for DDPG agent. Here is a simple code snippet to achieve this:
actorOptimizerOptions = rlOptimizerOptions(LearnRate=1e-1)
criticOptimizerOptions = rlOptimizerOptions(LearnRate=1e-2)
opt = rlDDPGAgentOptions('ActorOptimizerOptions',actorOptimizerOptions,'CriticOptimizerOptions',criticOptimizerOptions)

Refer to this documentation page for more information on creating an object for DDPG agent: https://www.mathworks.com/help/reinforcement-learning/ref/rl.option.rlddpgagentoptions.html
0 个评论
另请参阅
类别
在 Help Center 和 File Exchange 中查找有关 Policies and Value Functions 的更多信息
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!