RL Environment: get obs from last episode

Question

Katharina Schmidt 2021-8-24

0
链接

此问题的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/1439889-rl-environment-get-obs-from-last-episode

回答： Katharina Schmidt 2021-8-25

采纳的回答： Katharina Schmidt

在 MATLAB Online 中打开

Hi,

I defined a Reinforcement Learning Environment based on the rlCreateEnvTemplate.

How can I limit the change of the actions, which are choosen by the agent while having a predefined action range (-50V<action<150V) ? (in my case I have voltages as actions.)

I think about something like this:

abs(action(i-1)-action(i)) < 10

for step i. But I don't know how to access the action from the previous step (which would be action(i-1)).

Another approach would be to use the change in voltage as action and then add this change to the voltage from the previous step. Again I would have to access a value from the previous step and I don't know how to get this value.

Thank you for any advice :)