Problems with Reinforcement Learning Toolbox Examples
For the "Stochastic Waterfall Grid World" example, what hyperparameter settings will cause it to converge? The defaults don't seem to work.
I ran the "Rocket Lander" example for the recommended 20,000 episodes with the default settings, and at the end of training the agent was still making violent crash landings. Why is this? What settings will work? The documentation says training takes 2 to 3 hours, yet it took 50 hours on my Dell mobile workstation (CPU only). I bought the computer two years ago, and I believe it has the second-fastest processor that was available at the time. Thank you for your assistance.