reinforcement learning toolbox - q table

Question

Xinpeng Wang 2019-7-10

https://ww2.mathworks.cn/matlabcentral/answers/471165-reinforcement-learning-toolbox-q-table

Answered: Tuong Nguyen 2022-10-7

I'm new to RL and the Reinforcement Learning Toolbox. I have been experimenting with a Q-learning agent and a model in Simulink. My question is: after training, how can I access the trained Q table? The qTable I used to create the agent is still all zeros. I cannot figure out where the trained Q values and the policy are stored. Thank you!

Answer 1

Emmanouil Tzorakoleftherakis 2019-7-23

Hi Xinpeng,

To see the trained table, all you have to do is extract the critic using ‘getCritic’. Try:

critic = getCritic(agent);

The variable ‘critic’ has a field that contains the Q table after training.

Answer 2

carlos pedreira 2020-1-13

OK, but after that, how can I see the table?

Answer 3

Shikhar Sharma 2020-1-24

It should appear under the Workspace tab.

Answer 4

Umut Can Akdag 2020-5-18

For those who are still looking for the Q table, I think this is the solution:

critic = getCritic(agent);

qtable = getLearnableParameters(critic);
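
A hedged sketch of how to view the result: getLearnableParameters returns a cell array, and for a table-based critic the first cell should hold the Q matrix, with one row per state and one column per action (an assumption to confirm with size() on your release; early releases reportedly named the function getLearnableParameterValues). The toolbox calls appear only as comments, and a small made-up matrix stands in for the trained table so the snippet runs on its own. The names Q, bestValue and greedyAction are illustrative.

```matlab
% With a trained agent, the matrix would be obtained like this (not run here):
%   critic = getCritic(agent);
%   params = getLearnableParameters(critic);
%   Q = params{1};

% Made-up stand-in for the trained table: 4 states (rows) x 3 actions (columns).
Q = [0.1 0.5 0.2;
     0.0 0.0 0.3;
     0.7 0.1 0.1;
     0.2 0.9 0.4];

% Print the whole table to the Command Window.
disp(Q)

% Greedy policy: for each state, the column (action) with the largest Q value.
[bestValue, greedyAction] = max(Q, [], 2);
```

Each entry of greedyAction is a column index into the agent's action set, so the learned policy can be read off state by state.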

Answer 5

RUBEN HERNANDEZ 2022-4-19

Hi everyone,

I want to simulate a Q-learning agent (with a Q table) that controls an inverted pendulum in Simulink, just as an illustrative example.

I've picked the rlSimplePendulumModel.slx model that ships with MATLAB.

This is my code:

mdl = 'rlSimplePendulumModel';

open_system(mdl)

obsInfo = rlNumericSpec([3 1]); % vector of 3 observations: sin(theta), cos(theta), d(theta)/dt

actInfo = rlFiniteSetSpec([-2 0 2]); % 3 possible values for torque: -2 Nm, 0 Nm and 2 Nm

obsInfo.Name = 'observations';

actInfo.Name = 'torque';

agentBlk = [mdl '/RL Agent'];

env = rlSimulinkEnv(mdl,agentBlk,obsInfo,actInfo);

env.ResetFcn = @(in)setVariable(in,'theta0',pi,'Workspace',mdl);

Ts = 0.05; % sample time

Tf = 20; % simulation time

% Fix the random generator seed for reproducibility

rng(0)

%% To create a Q-learning agent:

%% 1 Create a critic using an rlQValueRepresentation object.

qTable = rlTable(obsInfo, actInfo);

qRepresentation = rlQValueRepresentation(qTable, obsInfo, actInfo);

qRepresentation.Options.LearnRate = 0.99;

%% 2 Specify agent options using an rlQAgentOptions object.

agentOpts = rlQAgentOptions;

agentOpts.DiscountFactor = 0.99;

agentOpts.EpsilonGreedyExploration.Epsilon = 0.9;

agentOpts.EpsilonGreedyExploration.EpsilonDecay = 0.01;

%% 3 Create the agent using an rlQAgent object.

qAgent = rlQAgent(qRepresentation,agentOpts);

%% Training Algorithm

% Specify training options using an rlTrainingOptions object.

trainOpts = rlTrainingOptions;

trainOpts.MaxStepsPerEpisode = ceil(Tf/Ts);

trainOpts.MaxEpisodes = 2000;

trainOpts.StopTrainingCriteria = "AverageReward";

trainOpts.StopTrainingValue = -740;

trainOpts.ScoreAveragingWindowLength = 5;

trainingStats = train(qAgent,env,trainOpts);

And this is the error message:

Error using rlTable/validateInput (line 131)

Input must be a scalar rlFiniteSetSpec.

Error in rlTable (line 51)

validateInput(obj, ObservationInfo)

Error in qlearningpendulum (line 30)

qTable = rlTable(obsInfo, actInfo);

Any suggestions?

Answer 6

Tuong Nguyen 2022-10-7

I think that to use tabular Q-learning, your observations have to be discrete and finite. That means your obsInfo has to be rlFiniteSetSpec(allStates), where "allStates" lists all the possible observations. The error above comes from passing an rlNumericSpec (a continuous observation) to rlTable. See https://www.mathworks.com/help/reinforcement-learning/ref/rltable.html for rlTable and https://www.mathworks.com/help/reinforcement-learning/ref/rl.util.rlfinitesetspec.html for rlFiniteSetSpec.
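
One possible way to get a finite observation set for the pendulum is to bin the angle and angular velocity on a grid and use the resulting bin index as the state. The sketch below is only an illustration under several assumptions: the grid sizes, the +/-8 rad/s velocity range, and the names thetaEdges, thetaDotEdges and stateIdx are all made up, and it assumes the angle itself is available rather than its sine and cosine. The Simulink model would also have to be changed to output this index as the observation, which is not shown. The toolbox calls appear only as comments so the snippet runs in base MATLAB.

```matlab
% Hypothetical grid: 21 angle bins and 11 angular-velocity bins.
thetaEdges    = linspace(-pi, pi, 22);   % 22 edges -> 21 bins
thetaDotEdges = linspace(-8, 8, 12);     % 12 edges -> 11 bins (assumed range)
numThetaBins    = numel(thetaEdges) - 1;
numThetaDotBins = numel(thetaDotEdges) - 1;
numStates = numThetaBins * numThetaDotBins;

% Example measurement (made-up values).
theta    = 0.3;
thetaDot = -1.2;

% Clamp to the grid so discretize never returns NaN, then find each bin.
iTheta    = discretize(min(max(theta, -pi), pi), thetaEdges);
iThetaDot = discretize(min(max(thetaDot, -8), 8), thetaDotEdges);

% Collapse the two bin indices into one state index in 1..numStates.
stateIdx = sub2ind([numThetaBins, numThetaDotBins], iTheta, iThetaDot);

% With the toolbox, the specs for a tabular critic would then be (not run here):
%   obsInfo = rlFiniteSetSpec(1:numStates);
%   actInfo = rlFiniteSetSpec([-2 0 2]);
%   qTable  = rlTable(obsInfo, actInfo);
```

A finer grid gives a better approximation of the continuous dynamics but makes the table larger and slower to fill in, so the bin counts are a trade-off to tune.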
