Jan Dewez
Followers: 0 Following: 0
Feeds
提问
How to pretrain a stochastic actor network for PPO training?
I want to create a stochastic actor network that outputs an action array of 10 values between 0 and 1 given an observation array...
3 years 前 | 2 个回答 | 0