The definition of the Target update frequency in Reinforcement Learning Designer.

Question

Xian Zheng Hong 2024-3-7

Direct link to this question:

https://ww2.mathworks.cn/matlabcentral/answers/2091631-the-definition-of-the-target-update-frequency-in-reinforcement-learning-designer

Commented: Xian Zheng Hong 2024-3-16

Accepted answer: UDAYA PEDDIRAJU

A DDPG agent has four networks: the online policy, the target policy, the online Q network, and the target Q network.

In Reinforcement Learning Designer, the [Target update frequency] setting applies to the target policy and the target Q network.

Is the update frequency of the online policy and the online Q network the same as the [Target update frequency]?


Answer 1

UDAYA PEDDIRAJU 2024-3-12

Direct link to this answer:

https://ww2.mathworks.cn/matlabcentral/answers/2091631-the-definition-of-the-target-update-frequency-in-reinforcement-learning-designer#answer_1424086

Hi Xian,

No, the update frequency of the online policy and online Q networks is not the same as the Target update frequency. The Target update frequency applies only to the target policy and target Q networks: it sets how many learning steps pass between updates of those two networks. The target networks are typically updated less often, or more gradually, than the online networks so that the learning targets stay stable.
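To make the distinction concrete, here is a minimal sketch in plain Python. It is not the toolbox's implementation: the function and parameter names (`train`, `target_update_frequency`, `smooth_factor`) are hypothetical, the network weights are replaced by single numbers, and it assumes the online networks take one update per learning step, which this thread does not confirm for Reinforcement Learning Designer.

```python
def train(num_steps, target_update_frequency, smooth_factor):
    """Toy DDPG-style update schedule using scalars in place of network weights.

    All names here are illustrative, not Reinforcement Learning Toolbox API.
    """
    online = 0.0          # stand-in for online policy / online Q parameters
    target = 0.0          # stand-in for target policy / target Q parameters
    target_updates = 0

    for step in range(1, num_steps + 1):
        # Assumption: the online networks are updated on every learning step.
        online += 1.0     # placeholder for a gradient update

        # The target networks change only every `target_update_frequency` steps.
        if step % target_update_frequency == 0:
            # Soft update: move the target part of the way toward the online value.
            # smooth_factor = 1.0 copies the online value outright.
            target = smooth_factor * online + (1.0 - smooth_factor) * target
            target_updates += 1

    return online, target, target_updates


if __name__ == "__main__":
    online, target, n = train(num_steps=8, target_update_frequency=4, smooth_factor=0.5)
    print(f"online={online}, target={target}, target updates={n}")
```

In this sketch the online value changes on all 8 steps while the target value changes only twice, which is the sense in which the two update schedules differ.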

1 comment

Xian Zheng Hong 2024-3-16

Thanks for answering. I have another question.

Are the online policy and online Q networks updated at every time step in Reinforcement Learning Designer?
