DDPG Agent OU noise options to favour exploration


Abd Al-Rahman Al-Remal 2021-7-22

Direct link to this question:

https://ww2.mathworks.cn/matlabcentral/answers/883684-ddpg-agent-ou-noise-options-to-favour-exploration

Edited: Abd Al-Rahman Al-Remal 2021-7-22

Hi there,

I have seen similar posts here, but I haven't found one that explains how to actually tune the OU noise parameters to favour exploration. Currently my agent is stuck on the same reward value from the beginning and does not change, train, or learn.

Can anyone advise on how to tune the OU noise parameters within the code to favour exploration? Currently mine are:

agentOpts.NoiseOptions.StandardDeviation = 0.3;

agentOpts.NoiseOptions.StandardDeviationDecayRate = 1e-5;

agentOpts.NoiseOptions.MeanAttractionConstant = 2e-3;

This worked for a previous, similar model I made. I understand that the parameters must be modified per model, but I don't know how, and the literature all looks very dense and doesn't give a clear answer.
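For reference, here is a minimal sketch of what the three options above control, assuming the discrete-time OU update described in the Reinforcement Learning Toolbox documentation (pull toward the mean scaled by the sample time, plus Gaussian noise scaled by the square root of the sample time, with the standard deviation decaying geometrically every step). The sample time `Ts` and the step count `numSteps` are assumed values chosen for illustration, not taken from the model in question, and the minimum standard deviation is left out for brevity. It uses base MATLAB only, so it does not reproduce the toolbox's internal implementation.

```matlab
% Sketch of the OU noise update using the option values from the question.
rng(0);                          % fixed seed so runs are repeatable
Ts        = 0.1;                 % ASSUMPTION: agent sample time in seconds
numSteps  = 5000;                % ASSUMPTION: number of agent steps to simulate
mu        = 0;                   % noise mean (assumed 0)
theta     = 2e-3;                % MeanAttractionConstant
sigma     = 0.3;                 % StandardDeviation
decayRate = 1e-5;                % StandardDeviationDecayRate

x         = zeros(numSteps, 1);  % noise sample added to the action at each step
sigmaHist = zeros(numSteps, 1);  % standard deviation in effect at each step
xPrev     = 0;                   % initial noise value
for k = 1:numSteps
    % Pull toward the mean, plus a Gaussian kick scaled by sqrt(Ts)
    xPrev = xPrev + theta*(mu - xPrev)*Ts + sigma*randn*sqrt(Ts);
    x(k) = xPrev;
    sigmaHist(k) = sigma;
    sigma = sigma*(1 - decayRate);   % geometric decay applied once per step
end

% Number of steps needed for the standard deviation to halve at this decay rate
halfLifeSteps = log(0.5)/log(1 - decayRate);
fprintf('Std after %d steps: %.4f\n', numSteps, sigma);
fprintf('Steps to halve std: %.0f\n', halfLifeSteps);
```

Under these assumptions, the per-step noise scale is `StandardDeviation*sqrt(Ts)`, so it is worth comparing that product against the action range of the model: noise that is tiny relative to the range may leave the agent repeating nearly the same actions. As far as I recall, the toolbox documentation suggests keeping that product at roughly 1% to 10% of the action range as a rule of thumb. Plotting `x` and `sigmaHist` for a few candidate values is a quick way to see how each option changes the exploration signal before committing to a full training run.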

Thanks in advance!

Abd


Release

R2021a


Answers (0)
