ExperienceBufferLength in Reinforcement Learning Toolbox
Hello, everyone,
I found a problem with the 'ExperienceBufferLength' property in 'rlDDPGAgentOptions' when specifying options for rl agents.
In this example, every episode has 600 (60/0.1) steps. Does the agent start to train only once the experience buffer has been filled with experiences (S, A, R, S')? If so, it would take at least 1667 (1000000/600) episodes before the agent starts to improve.
So I want to know how to determine this value.
Accepted Answer
Ari Biswas
2021-11-17
The agent does not start training until at least one mini-batch can be sampled from the buffer. If your mini-batch size is 64, the first learn step occurs once the buffer has stored 64 experiences. The experience buffer is circular, i.e., it discards the oldest experiences when full. The size of the buffer therefore matters: you may lose important experiences if it is too small.
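To make the timing concrete, here is a minimal base-MATLAB sketch that only counts stored experiences; it does not call the toolbox. The variable names are hypothetical, and the numbers (buffer length 1e6, mini-batch size 64, 600 steps per episode) are taken from the question and the answer above.

```matlab
% Counting sketch only: no Reinforcement Learning Toolbox calls are made.
bufferLength    = 1e6;             % ExperienceBufferLength from the question
miniBatchSize   = 64;              % mini-batch size used in the answer
stepsPerEpisode = round(60/0.1);   % 600 steps per episode

numStored      = 0;                % experiences currently held in the buffer
firstLearnStep = NaN;              % global step index of the first learn step

for step = 1:3*stepsPerEpisode
    % A circular buffer overwrites its oldest entry once full,
    % so the stored count saturates at bufferLength.
    numStored = min(numStored + 1, bufferLength);
    if isnan(firstLearnStep) && numStored >= miniBatchSize
        firstLearnStep = step;     % first step at which a mini-batch can be sampled
    end
end

firstLearnEpisode = ceil(firstLearnStep/stepsPerEpisode);
episodesToFill    = ceil(bufferLength/stepsPerEpisode);
fprintf('First learn step: step %d (episode %d)\n', firstLearnStep, firstLearnEpisode);
fprintf('Episodes needed to fill the buffer: %d\n', episodesToFill);
```

Under these assumptions the first learn step falls at step 64, inside the first episode, while filling the whole buffer would take 1667 episodes.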
4 Comments
Arman Ali
2022-9-27
What if we want to fill the buffer first and only then start sampling mini-batches? How can this be implemented in MATLAB?
Francisco Serra
2024-5-2
For that you can set:
agent.AgentOptions.NumWarmStartSteps=experience_buffer_length
By default, this is set to the mini-batch size, but setting it to the experience buffer length forces the algorithm to wait until the buffer is full.
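The same setting can also be sketched on the options object before the agent is created. This assumes a toolbox release in which NumWarmStartSteps is available and name=value syntax is supported (R2021a or later); the values are the ones used in this thread, and the variable name opts is hypothetical.

```matlab
% Sketch: options for a DDPG agent that waits for a full buffer before learning.
% Assumes a release where the NumWarmStartSteps option exists.
opts = rlDDPGAgentOptions( ...
    SampleTime=0.1, ...               % 0.1 s sample time from the question
    ExperienceBufferLength=1e6, ...   % buffer length from the question
    MiniBatchSize=64);                % mini-batch size from the answer

% Delay learning until the buffer holds ExperienceBufferLength experiences.
opts.NumWarmStartSteps = opts.ExperienceBufferLength;
```

With 600 steps per episode, this choice means roughly 1667 episodes pass before any learning happens, so a smaller warm-start value may be a more practical compromise.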