reward error during training

Question

기범 2023-1-11

0
链接

此问题的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/1891820-reward-error-during-training

回答： Harsh 2025-2-28

Hello,

Im using reinforcemet designer to train my model,

and here is my problem.

Q. I dont not why my reward cannot go up to 0.1, why is this happen?? How can I fix this??

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

请先登录，再回答此问题。

Answer 1

Harsh 2025-2-28

0
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/1891820-reward-error-during-training#answer_1560852

Hi @기범

In Reinforcement Learning (RL), the reward is a signal that guides the agent’s learning by providing feedback on its actions. It changes dynamically based on the agent’s behavior and the environment’s response. A well-designed reward function encourages desired actions and discourages unwanted ones, leading to improved performance over time.

You can try the following to improve your performance:

Ensure that “cos(psi(t)) - cos(psi(t-1)) > 0” is met frequently by checking if psi(t) increases over time.
Verify that the initial conditions of the delay block are properly set to prevent incorrect first-step evaluations.
Modify the reward function threshold to make the positive reward condition more lenient.

For more information on how to craft a reward function please refer to the following MATLAB tech talks webinar - https://www.mathworks.com/videos/reinforcement-learning-part-2-understanding-the-environment-and-rewards-1551976590603.html

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

reward error during training

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

回答（1 个）

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

另请参阅

类别

标签

Community Treasure Hunt

reward error during training

0 个评论 显示 -2更早的评论隐藏 -2更早的评论

回答（1 个）

0 个评论 显示 -2更早的评论隐藏 -2更早的评论

另请参阅

类别

标签

Community Treasure Hunt

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

0 个评论
显示 -2更早的评论隐藏 -2更早的评论