Live Monitoring of Critic Predictions in the RL Toolbox
Is it possible to monitor the Q-value predictions of any critic-based RL approach in the RL Toolbox? For example, with a multi-output DQN agent, the internal deep neural network has to be evaluated at every step to score all possible discrete actions given the current state sample. Hence, somewhere internally there must be a Q-value prediction for every available discrete action, which are then compared in order to find the optimal action.
However, after spending some time with the 2020a documentation, I was not able to find a way to access these internal Q-value predictions at each time step. In particular, it would be nice if the Simulink agent block could expose these predictions for further processing and monitoring purposes during both the training and deployment phases.
Does anybody have a hint on how to retrieve the Q-value estimates during learning?
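For reference, one way to query the Q-values outside the agent block is to extract the critic representation from the agent and evaluate it directly. The sketch below assumes a trained `rlDQNAgent` object named `agent` and a made-up observation vector; `getCritic` and `getValue` are part of the R2020a Reinforcement Learning Toolbox API, but the exact observation shape depends on your environment.

```matlab
% Sketch (untested): query a DQN critic's Q-value predictions for one state.
% 'agent' is assumed to be an existing rlDQNAgent; the observation values
% below are placeholders for illustration only.
critic  = getCritic(agent);            % extract the rlQValueRepresentation
obs     = {[0.1; -0.2; 0.05; 0.3]};    % observation as a cell array, matching obsInfo
qValues = getValue(critic, obs);       % one Q-value per discrete action
[~, bestIdx] = max(qValues);           % index of the greedy action
```

Calling this inside a custom logging loop (or from a MATLAB Function block that has access to the critic) would let you record the per-action Q-values at each step, though it duplicates the forward pass the agent already performs internally.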
Answers (0)