Jan Dewez


Last seen: 3 years ago | Active since 2021

Followers: 0   Following: 0

Statistics

  • Thankful Level 1

Feeds

Question


How to pretrain a stochastic actor network for PPO training?
I want to create a stochastic actor network that outputs an action array of 10 values between 0 and 1 given an observation array...

3 years ago | 2 answers | 0
