Soft Actor Critic deploy mean path only
显示 更早的评论
Hi, I'm wondering if there's a way to only deploy the mean path of the SAC agent after it's been trained? This is useful to create more stable actions after the network has been trained.
Should I extract the network weights manually, create a network, then extract an output path for the mean network?
采纳的回答
更多回答(0 个)
类别
在 帮助中心 和 File Exchange 中查找有关 Reinforcement Learning 的更多信息
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!