Use current simulation data to initialize new simulation - RL training

Question

Federico Toso 2024-3-17

0
链接

此问题的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/2095406-use-current-simulation-data-to-initialize-new-simulation-rl-training

评论： Federico Toso 2024-4-8

In the context of PPO Agent training, I would like to use Welford algorithm to calculate the runninig average & and standard deviation of my observations, in order to standardize them and improve the convergence of actor & critic neural networks.

I implemented the algorithm, but I don't know how to keep track of the current running statistics (average and standard deviation) every time a new simulation starts, during the training. This is what I would like to do:

Whenever a simulation terminates (i.e. "isDone" flag is set to 1) , save the current value of runnig statistics in Matlab workspace
While initializing the new simulation, set the starting value of the running statistics to match the values just saved in Matlab workspace

Note that I'm using the standard "train" function to run the training, so the transition between one simulation and the next one is handled automatically and I don't have much flexibility in this sense.

I thought about using the "ResetFcn" function handle within my "SimulinkEnvWithAgent" object to accomplish the task, but I am still not able to programmatically save the last value of my signal to the Workspace at the end of a simulation, and then pass it to the ResetFcn as additional argument in order to initialize the next one

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

请先登录，再回答此问题。

Answer 1

Poorna 2024-3-31

0
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/2095406-use-current-simulation-data-to-initialize-new-simulation-rl-training#answer_1433801

在 MATLAB Online 中打开

Hi Federico Toso,

I see you want to save simulation data to workspace to later use it in your "ResetFcn". A suitable tool for this is the "rlDataLogger" object, which enables you to log simulation data at various points, such as after each step, episode and after each learn subroutine. You can craft a custom function for logging the specific statistics you're interested in and then assign this function to the appropriate callback property of the rlDataLogger. Although logging typically saves data to a folder after training concludes, your custom callback function can be used to immediately write the necessary statistics to the MATLAB workspace.

You can create a "rlDataLogger" object as below:

logger = rlDataLogger();

For instance, to log the ActorLoss value after every episode, your episode finish callback function could be structured like this:

function dataToLog = episodeFinish(data)
    assignin('base', 'actorLoss', data.ActorLoss);
    dataToLog = data.ActorLoss;
end

And then assign the function handle to the corresponding callback property of the data logger object as below:

logger.EpisodeFinishedFcn = @episodeFinish;

To learn more about the "rlDataLogger" function refer to the below documentation:

https://www.mathworks.com/help/reinforcement-learning/ref/rl.logging.filelogger.html

Hope this Helps!

1 个评论
显示 -1更早的评论隐藏 -1更早的评论

Federico Toso 2024-4-8

Thank you!

请先登录，再进行评论。

Use current simulation data to initialize new simulation - RL training

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

采纳的回答

1 个评论
显示 -1更早的评论隐藏 -1更早的评论

更多回答（0 个）

另请参阅

类别

标签

产品

版本

Community Treasure Hunt

Use current simulation data to initialize new simulation - RL training

0 个评论 显示 -2更早的评论隐藏 -2更早的评论

采纳的回答

1 个评论 显示 -1更早的评论隐藏 -1更早的评论

更多回答（0 个）

另请参阅

类别

标签

产品

版本

Community Treasure Hunt

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

1 个评论
显示 -1更早的评论隐藏 -1更早的评论