Kundan Panta

Last seen: 11 months ago | Active since 2024

Followers: 0 Following: 0

Statistics

Feeds

Question

Confusion in agent and trainFromData options when using RNN/LSTM
My dataset contains numTraj trajectories, each containing numSteps time-steps. I filled the experience buffer with my data in a ...

1 year ago | 1 answer | 0

Question

Do MBPO agents not support recurrent neural networks for the environment model, the base off-policy agent, or both?
Since TD3, SAC, etc. agents support using recurrent layers by themselves, would using these recurrent base agents still not work...

1 year ago | 0 answers | 0
