Reinforcement Learning Toolbox - Experience Buffer Samples

Question

Hans-Joachim Steinort 2019-9-17

0
链接

此问题的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/480767-reinforcement-learning-toolbox-experience-buffer-samples

评论： Hans-Joachim Steinort 2019-9-20

The simulation I'm running has a fixed-step solver with a fixed-step-size of 5e-4. The sample time of my DQN-Agent (and the corresponding S-function for the reward-signal) is 0.25.

How is it possible that after a simulation time of 20 seconds I have a BufferLength of ~1600 samples? I hope you can enlighten me...

Bonus question:

Is it possible to look into the ExperienceBuffer? As impressed as I am by the RL-Toolbox, I would really prefer it not to be such a blackbox in most cases.

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

请先登录，再回答此问题。

Answer 1

Raunak Gupta 2019-9-20

0
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/480767-reinforcement-learning-toolbox-experience-buffer-samples#answer_392732

Hi,

Since it is mentioned that DQN Agent is used, I am assuming that rlDQNAgentOptions is used for setting up the agent properties. The ExperienceBufferLength can be specified for storing that many experiences from training the agent. Also, there is a parameter SaveExperienceBufferWithAgent which can be set to true for saving the Experience buffer while training. The experience upto the limit of ExperienceBufferLength will be stores in a rlDQNAgent Object.

You may look for other Training Options here.

1 个评论
显示 -1更早的评论隐藏 -1更早的评论

Hans-Joachim Steinort 2019-9-20

Thank you for your answer!

I found a bug in my simulation setup so that the Buffer was filled way to fast. I fixed this with a delay block inheriting a certain sample-time.

Yet it is not pissible to look inside the buffer to view what kind of (s,a,r,s')-touples are stored in there.

请先登录，再进行评论。

Reinforcement Learning Toolbox - Experience Buffer Samples

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

采纳的回答

1 个评论
显示 -1更早的评论隐藏 -1更早的评论

更多回答（0 个）

另请参阅

类别

标签

产品

版本

Community Treasure Hunt

Reinforcement Learning Toolbox - Experience Buffer Samples

0 个评论 显示 -2更早的评论隐藏 -2更早的评论

采纳的回答

1 个评论 显示 -1更早的评论隐藏 -1更早的评论

更多回答（0 个）

另请参阅

类别

标签

产品

版本

Community Treasure Hunt

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

1 个评论
显示 -1更早的评论隐藏 -1更早的评论