Epsilon greedy policy for DQN

Question

Akash 2023-8-16


Direct link to this question:

https://ww2.mathworks.cn/matlabcentral/answers/2009102-epsilon-greedy-policy-for-dqn

Answered: Emmanouil Tzorakoleftherakis 2023-9-25

Accepted answer: Emmanouil Tzorakoleftherakis

Hello,

I have created a DQN agent with epsilon-greedy exploration. It has 4 discrete actions and 10 observations.

My current exploration settings are:

Epsilon = 0.9
EpsilonDecay = 1e-3
EpsilonMin = 0.01

I want to plot the Epsilon value over the episodes during training, or at least find the variable that holds the current Epsilon as training progresses. However, even after training has finished, I can only see the values listed above.

If you know how to plot or read out the epsilon value for particular episodes, please let me know.

Thanks


Answer 1

Emmanouil Tzorakoleftherakis 2023-9-25


Direct link to this answer:

https://ww2.mathworks.cn/matlabcentral/answers/2009102-epsilon-greedy-policy-for-dqn#answer_1317952

You can use the formula here to calculate the epsilon value.
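
The formula itself is not reproduced in this thread. As far as I recall, the rlDQNAgentOptions documentation describes the update as Epsilon = Epsilon*(1-EpsilonDecay), applied at the end of each training time step while Epsilon is greater than EpsilonMin. Treat that as an assumption and check it against the documentation for your release. Under that assumption, a rough sketch for reconstructing and plotting epsilon per episode could look like the following. The stepsPerEpisode vector is a made-up placeholder; you would substitute the step counts from your own run (for example the EpisodeSteps field of the statistics returned by train, if your release provides it).

```matlab
% Sketch: reconstruct the epsilon schedule offline from the agent options.
% Assumption (verify against your release's documentation): epsilon is
% updated once per training time step as
%   Epsilon = Epsilon*(1 - EpsilonDecay)   while Epsilon > EpsilonMin.
epsilon0     = 0.9;    % initial Epsilon (from the question)
epsilonDecay = 1e-3;   % EpsilonDecay (from the question)
epsilonMin   = 0.01;   % EpsilonMin (from the question)

% Hypothetical placeholder: number of agent steps taken in each episode.
% Replace with the per-episode step counts logged from your training run.
stepsPerEpisode = 200 * ones(1, 50);

numEpisodes      = numel(stepsPerEpisode);
epsilonAtEpisode = zeros(1, numEpisodes);  % epsilon at the start of each episode
epsilon          = epsilon0;

for ep = 1:numEpisodes
    epsilonAtEpisode(ep) = epsilon;
    for k = 1:stepsPerEpisode(ep)
        % Decay only while epsilon is still above the floor.
        if epsilon > epsilonMin
            epsilon = epsilon * (1 - epsilonDecay);
        end
    end
end

plot(1:numEpisodes, epsilonAtEpisode, '-o');
xlabel('Episode');
ylabel('Epsilon at start of episode');
grid on;
```

Because the decay is applied per step rather than per episode, the curve depends on how long each episode lasts, so the episode lengths from the actual run are needed to get a faithful plot.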
