Francisco Serra

Last seen: 6 months 前 | 自 2024 起处于活动状态

Followers: 0 Following: 0

统计学

Feeds

提问

Why is my DDPG agent converging to a state where it gets continuous penalization, while having a state it can go with 0 penalization?
I am training a Reinforcement Learning DDPG agent to drive a vehicle to a reference. The vehicle dynamics are: x_dot = v*cos(...

9 months 前 | 1 个回答 | 0

1

个回答