How is GAE calculated in Reinforcement Learning Toolbox (PPO)?

TigerSee 2021-2-14

Direct link to this question:

https://ww2.mathworks.cn/matlabcentral/answers/745072-how-gae-calculates-in-reinforement-learning-toolbox-ppo

Answered: Emmanouil Tzorakoleftherakis, 2021-2-16

Accepted Answer: Emmanouil Tzorakoleftherakis

There is a difference between the Help Center documentation and reference [3] regarding the TD error.

Why [equation not preserved in this copy] in the Generalized Advantage Estimator?

https://ww2.mathworks.cn/help/reinforcement-learning/ug/ppo-agents.html

Accepted Answer

Emmanouil Tzorakoleftherakis 2021-2-16

Direct link to this answer:

https://ww2.mathworks.cn/matlabcentral/answers/745072-how-gae-calculates-in-reinforement-learning-toolbox-ppo#answer_624942

Hello,

Thank you for catching this typo - it should be Gt = Dt+V. I have let the documentation team know.
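
For readers who want to see how the corrected relation fits into the estimator, here is a minimal Python sketch of the standard generalized advantage estimator, where the advantage Dt is a discounted sum of TD errors and the return is then Gt = Dt + V as stated above. This is not the toolbox's implementation: the function name `gae_returns`, its parameter names, and the way a terminal final state is handled (zeroing the bootstrap value) are assumptions made for illustration only.

```python
def gae_returns(rewards, values, bootstrap_value, gamma=0.99, lam=0.95, terminal=False):
    """Illustrative GAE sketch (hypothetical helper, not a toolbox function).

    rewards[k]      : reward received at step k of the horizon
    values[k]       : critic estimate V(S_k) for the state at step k
    bootstrap_value : critic estimate for the state after the last step
    terminal        : assumed flag; if True, the bootstrap value is treated as 0
    """
    n = len(rewards)
    # Value of the successor state for every step in the horizon.
    next_values = list(values[1:]) + [0.0 if terminal else bootstrap_value]

    advantages = [0.0] * n
    running = 0.0
    # Backward recursion: D_k = delta_k + gamma * lam * D_{k+1}
    for k in reversed(range(n)):
        delta = rewards[k] + gamma * next_values[k] - values[k]  # TD error
        running = delta + gamma * lam * running
        advantages[k] = running

    # Corrected relation from the answer: G_t = D_t + V(S_t)
    returns = [d + v for d, v in zip(advantages, values)]
    return advantages, returns
```

As a quick check, with gamma = lam = 1, three rewards of 1, and every value estimate equal to 0.5, the sketch gives D = [3, 2, 1] and G = [3.5, 2.5, 1.5]. The first return equals the sum of the three rewards plus the bootstrap value, which is what one would expect only if the value is added to the advantage rather than subtracted.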

Categories

Physical Modeling Simscape Electrical Specialized Power Systems


Products

Reinforcement Learning Toolbox

Release

R2020a
