how to change the reward parameter in RL toolbox every time step?
2 次查看(过去 30 天)
显示 更早的评论
Hi,
I am using RL toolbox for energy mangement of a energy storage. my reward is like follow:
C*P where C is a parameter ant P the action. As the problem is energy mangemnt, the C which is the price of the power is difffrent every time step (for this exampl 1 hour)
currently in my simulink model i use a ps counter to count evert time sample of simulink and give a value to C for each time sample, but its causing problems. Is there any way to acess the time step number of rl toolbox?
0 个评论
采纳的回答
Emmanouil Tzorakoleftherakis
2020-11-16
编辑:Emmanouil Tzorakoleftherakis
2020-11-16
One way is to implement your logic in your reward subsystem using a MATLAB Fcn block. Execute the MATLAB Fcn block every Ts time units (1hr in your problem) and add a persistent variable that keeps track of how many times the reward subsystem has been accessed.
One comment on your RL problem design - if C depends on time only, that should also be used as an observation (alternatively, you could use the time stamp itself).
3 个评论
Emmanouil Tzorakoleftherakis
2020-11-18
If you right click the MATLAB Fcn block, select Block Parameters and change the Sample Time.
You cannot assign dynamic epsilon values I believe, but you can change the decay rate and the minimum epsilon values to allow the agent to explore more (see here for example).
更多回答(0 个)
另请参阅
类别
在 Help Center 和 File Exchange 中查找有关 Simulink Functions 的更多信息
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!