How GAE calculates in Reinforement Learning Toolbox(PPO)?

4 次查看(过去 30 天)
A difference between help center and reference[3] about TD error.
Why in Generalized Advantage Estimator?
https://ww2.mathworks.cn/help/reinforcement-learning/ug/ppo-agents.html

采纳的回答

Emmanouil Tzorakoleftherakis
Hello,
Thank you for catching this typo - it should be Gt = Dt+V. I have let the documentation team know.

更多回答(0 个)

类别

Help CenterFile Exchange 中查找有关 Specialized Power Systems 的更多信息

标签

产品


版本

R2020a

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by