Custom Training Loop with configured DDPG Agent

10 次查看(过去 30 天)
Hello,
is there the possibility to do a custom training loop with a already configured DDPG agent?
Background: I want to check after each episode whether the average reward has reached a new maximum. When a new maximum is reached, the agent should be saved to a mat file, otherwise not in order to reduce the amount of data. In the training options, I can only set a limit for when the agent should be saved. But then all agents are saved as soon as the limit is exceeded.
Thanks!
Best regards,
allmo

采纳的回答

Poorna
Poorna 2023-8-30
Hi,
I understand that you would like to save the agent after every episode based on whether the new episode's reward is greater than the existing average reward. You can achieve this by using “rlDataLogger”.
Create a new “FileLogger” object as shown below:
fileLgr = rlDataLogger();
Then, you can do this:
fileLgr.EpisodeFinishedFcn = @myEpisodeFinishedFcn;
where myEpisodeFinishedFcn is your custom function which implements the logic to conditionally save the agent to disk.
Hope this helps!

更多回答(0 个)

类别

Help CenterFile Exchange 中查找有关 Training and Simulation 的更多信息

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by