dani ansari
Followers: 0 Following: 0
Feeds
Question
Solving critic overestimation and exploring a specific action range
Hello, I'm using a DDPG agent to tune a robot controller. All of my rewards are negative and my critic learning rate is 0.01 and m...
1 year ago | 0 answers | 0
Question
DDPG agent does not learn
Hi, I'm using a DDPG algorithm to tune the gains of a PD-like controller (transpose Jacobian). My gains need t...
1 year ago | 2 answers | 0