Hyperparameter optimization and saving the best agents for Reinforcement Learning

Question

0 个投票

I am trying to train my RL agent (ddpg) but it's performing quite poorly. I think it may be a problem with the hyperparameter values since I have not tuning. Now I have two questions--

If there is anything in MATLAB that may help solve this problem of hyperparameter optimization other than manual trial-and-error?
How do I save the best performing agent given I don't know the critical values (i.e. don't know the range of the reward)? Basically, I want to save the agent that provides maximum reward or, say, top-5 highest rewarding agents?

Thanks.

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

请先登录，再回答此问题。

Follow Question

Answer 1

Emmanouil Tzorakoleftherakis 2020-12-3

0 个投票

Hello,

You can use something like this. We do not have any examples with Reinforcement Learning Toolbox that show how to use this yet unfortunately.
If it's challenging to estimate what a good episode reward is, you can run a singe training session for a good number of episodes (e.g. 5k episodes) to get some idea how the agent is doing and then use that knowledge from the training plot to set the 'SaveAgent' option as needed. Most of the time you will need to run multiple training sessions either way to tweak parameters, rewards, etc, so just use the first one to get some intuition.

2 个评论
显示无隐藏无

laha_M 2020-12-4

Thanks, Emmanouil.

Francisco Serra 2024-1-23

Hey @laha_M, did you manage to to this with RL Toolbox?

请先登录，再进行评论。

Hyperparameter optimization and saving the best agents for Reinforcement Learning

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

采纳的回答

2 个评论
显示无隐藏无

更多回答（0 个）

类别

产品

版本

标签

Community Treasure Hunt

Hyperparameter optimization and saving the best agents for Reinforcement Learning

0 个评论 显示 -2更早的评论 隐藏 -2更早的评论

采纳的回答

2 个评论 显示 无 隐藏 无

更多回答（0 个）

类别

产品

版本

标签

另请参阅

Community Treasure Hunt

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

2 个评论
显示无隐藏无