RL Environment: get obs from last episode
Hi,
I defined a Reinforcement Learning environment based on the class template generated by rlCreateEnvTemplate.
My actions are voltages with a predefined range (-50 V < action < 150 V). How can I limit how much the action chosen by the agent changes from one step to the next?
I think about something like this:
abs(action(i-1)-action(i)) < 10
for step i, but I don't know how to access the action from the previous step (which would be action(i-1)).
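One way this might look, as a minimal sketch rather than a definitive implementation: since the template produces a handle class derived from rl.env.MATLABEnvironment, the previous action can be kept in a property of the environment object and read again on the next call to step. The class name VoltageEnv, the properties LastAction and MaxStep, the starting voltage of 0 V, and the placeholder dynamics and reward are all assumptions for illustration.

```matlab
classdef VoltageEnv < rl.env.MATLABEnvironment
    % Hypothetical environment: names, dynamics and reward are placeholders.
    properties
        MaxStep = 10      % assumed largest allowed voltage change per step (V)
        State = 0         % placeholder observation
    end
    properties (Access = protected)
        LastAction = 0    % voltage applied in the previous step
    end
    methods
        function this = VoltageEnv()
            obsInfo = rlNumericSpec([1 1]);
            actInfo = rlNumericSpec([1 1], 'LowerLimit', -50, 'UpperLimit', 150);
            this = this@rl.env.MATLABEnvironment(obsInfo, actInfo);
        end
        function [Observation, Reward, IsDone, LoggedSignals] = step(this, Action)
            % Allowed window around the previous voltage, cut to the overall range.
            lo = max(this.LastAction - this.MaxStep, -50);
            hi = min(this.LastAction + this.MaxStep, 150);
            applied = min(max(Action, lo), hi);
            this.LastAction = applied;    % remembered for the next step

            this.State = applied;         % placeholder dynamics
            Observation = this.State;
            Reward = 0;                   % placeholder reward
            IsDone = false;
            LoggedSignals = [];
        end
        function InitialObservation = reset(this)
            this.LastAction = 0;          % assumed starting voltage
            this.State = 0;
            InitialObservation = this.State;
        end
    end
end
```

With this sketch the agent still proposes an absolute voltage, and the environment silently clips it. Because the agent does not see the clipped value directly, it may help to include the previously applied voltage in the observation.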
Another approach would be to use the change in voltage as the action and add this change to the voltage from the previous step. Again, I would have to access a value from the previous step, and I don't know how to get it.
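A minimal sketch of this second idea, assuming the action is a voltage change limited to ±10 V and the running voltage is stored on an object between steps. VoltageIntegrator is a hypothetical stand-in for the bookkeeping that would sit inside the environment's step and reset methods; its names and the starting voltage of 0 V are assumptions.

```matlab
classdef VoltageIntegrator < handle
    % Hypothetical helper: accumulates bounded voltage changes.
    properties
        Voltage = 0       % current voltage, carried over between steps (V)
        MaxDelta = 10     % assumed limit on the change per step (V)
        VMin = -50        % lower end of the allowed voltage range (V)
        VMax = 150        % upper end of the allowed voltage range (V)
    end
    methods
        function v = apply(this, delta)
            % Clip the requested change, add it, then clip to the overall range.
            delta = min(max(delta, -this.MaxDelta), this.MaxDelta);
            this.Voltage = min(max(this.Voltage + delta, this.VMin), this.VMax);
            v = this.Voltage;
        end
        function reset(this)
            % Return to the assumed starting voltage at the start of an episode.
            this.Voltage = 0;
        end
    end
end
```

In an environment built from the template, the action specification could then be declared with limits of -10 and 10, so the step-size constraint is part of the action range itself and only the absolute range needs clipping.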
Thank you for any advice :)