how to get continuous action data and store in Reinforcement learning

Question

raja sekhar 2021-10-25

0
链接

此问题的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/1571138-how-to-get-continuous-action-data-and-store-in-reinforcement-learning

回答： Alan 2024-5-31

I am working REINFORCEMENT LEARNING, , need to see data of action space , I can see reward, episodic Q0 value and average reward value for each episode, in the same awy I would like to see action space data for each episode?

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

请先登录，再回答此问题。

Answer 1

Alan 2024-5-31

1
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/1571138-how-to-get-continuous-action-data-and-store-in-reinforcement-learning#answer_1465836

在 MATLAB Online 中打开

Hi Raja,

I’m assuming that you are observing the episode reward, episode Q0, and average reward from the Reinforcement Learning Designer app. Unfortunately, there are no options to plot your any other custom data (in your case action data) within the app. So, you will have to create a custom training loop that logs and plots the data you wish to see.

To start off, you can export the training code by choosing the “Generate MATLAB function for training” option from the drop down as shown below:

After saving the training function, you could export your agent from a drop down in a similar way as show below:

The modifications required to be made to plot action data lie in the generated train function. You could use the MonitorLogger object along with a custom callback that logs the required data. The logger can use different callbacks to collect data. In your use case, you want to plot action data after each episode. So, we can assign a callback to the EpisodeFinishedFcn property of the logger which collects action data after each episode. The following snippet demonstrates the same:

monitor = trainingProgressMonitor(); 
logger = rlDataLogger(monitor); 
logger.EpisodeFinishedFcn = @episodeActionLogger; 

You can then define the custom callback (I named it episodeActionLogger) as follows:

function dataToLog = episodeActionLogger(data) 
if mod(data.AgentLearnCount, 2) == 0 
    dataToLog.ActionInfo  = data.ActionInfo; 
else 
    dataToLog = []; 
end 
end 

After defining the logger, pass it on to the training function in the following manner:

info = train(agent,slEnv,opts,Logger=logger);

More details on MonitorLogger and this above mentioned technique of logging and plotting custom data can be viewed in the following documentation page: https://www.mathworks.com/help/reinforcement-learning/ref/rl.logging.monitorlogger.html

The following documentation might also be useful to customize your call to the train function: https://www.mathworks.com/help/reinforcement-learning/ug/train-reinforcement-learning-agents.html

Do make sure you are using a release later than R2022b to use MonitorLogger.

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

how to get continuous action data and store in Reinforcement learning

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

回答（1 个）

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

另请参阅

类别

标签

Community Treasure Hunt

how to get continuous action data and store in Reinforcement learning

0 个评论 显示 -2更早的评论隐藏 -2更早的评论

回答（1 个）

0 个评论 显示 -2更早的评论隐藏 -2更早的评论

另请参阅

类别

标签

Community Treasure Hunt

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

0 个评论
显示 -2更早的评论隐藏 -2更早的评论