Reinforcement Learning Toolbox: Episode Q0 does not change ., DDPG agent

4 次查看(过去 30 天)
Hi :-)
I am training my ddpg agent with matlab.
I saw something weird with my trainig graph at reinforcement learning episode manager.
this is my plot, and as you can see,, Q0 value never follows or go near the average reward.
Does this mean that there is something wrong with my critic network? but I can see the traing procedure is working quite properly I guess,,,
Please help!!

回答(1 个)

Berk Agin
Berk Agin 2022-4-3
Hello,
I also see that this problem on my training data. I couldn't understand and wonder the solution. Have a nice day.

产品


版本

R2021a

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by