Feeds
提问
solve critic overestimate and how to explore specific action range
hello im using a ddpg agent to tune a robot controller.all of my rewards are negetive and my critic learning rate is 0.01 and m...
2 years 前 | 0 个回答 | 0
0
个回答提问
ddpg agent does not learn
hi im using a ddpg alghorithm to learn for tuning a pd like controller (transpose jacobian) for tuning its gains.my gains need t...
2 years 前 | 2 个回答 | 0