How GAE calculates in Reinforement Learning Toolbox(PPO)?
4 次查看(过去 30 天)
显示 更早的评论
TigerSee
2021-2-14
回答: Emmanouil Tzorakoleftherakis
2021-2-16
A difference between help center and reference[3] about TD error.
Why
in Generalized Advantage Estimator?
in Generalized Advantage Estimator?https://ww2.mathworks.cn/help/reinforcement-learning/ug/ppo-agents.html

0 个评论
采纳的回答
Emmanouil Tzorakoleftherakis
2021-2-16
Hello,
Thank you for catching this typo - it should be Gt = Dt+V. I have let the documentation team know.
0 个评论
更多回答(0 个)
另请参阅
类别
在 Help Center 和 File Exchange 中查找有关 Specialized Power Systems 的更多信息
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!