RL Environment: get obs from last episode

Hi,
I defined a Reinforcement Learning Environment based on the rlCreateEnvTemplate.
How can I limit how much the actions chosen by the agent change from one step to the next, while keeping a predefined action range (-50 V < action < 150 V)? In my case the actions are voltages.
I am thinking of something like this:
abs(action(i-1)-action(i)) < 10
for step i. But I don't know how to access the action from the previous step (which would be action(i-1)).
Another approach would be to use the change in voltage as action and then add this change to the voltage from the previous step. Again I would have to access a value from the previous step and I don't know how to get this value.
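To make the second idea concrete, here is a minimal sketch of the arithmetic only. The variable names (`vPrev`, `delta`, `maxStep`) and the example numbers are hypothetical, and `vPrev` is simply hard-coded here because where to store it between steps is exactly the open question.

```matlab
% Sketch under assumptions: all names and numbers are illustrative only.
vMin    = -50;    % lower voltage limit from the action range
vMax    = 150;    % upper voltage limit from the action range
maxStep = 10;     % largest allowed change per step

vPrev = 145;      % voltage applied in the previous step (hard-coded placeholder)
delta = 12;       % agent output, interpreted as a requested voltage change

% Limit the change per step, then keep the result inside the voltage range.
delta = min(max(delta, -maxStep), maxStep);
vNew  = min(max(vPrev + delta, vMin), vMax);
```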
Thank you for any advice :)

Accepted Answer

Katharina Schmidt 2021-8-25
I solved it by using LoggedSignals to store the action values from the last step.
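A minimal sketch of how this could look, assuming step and reset functions with the signatures that `rlFunctionEnv` expects, where `LoggedSignals` is passed from one step to the next. The function names `myStepFcn` and `myResetFcn`, the field name `PrevAction`, the 10 V step limit, and the placeholder observation and reward are all assumptions for illustration, not the original code. In a class-based environment the same value could presumably be kept in a class property instead.

```matlab
% Sketch only: call the functions by hand to show the value carrying over.
[~, logged] = myResetFcn();
[~, ~, ~, logged] = myStepFcn(30, logged);   % agent requests 30 V, starting from 0 V
v1 = logged.PrevAction;                      % change limited to +10 V
[~, ~, ~, logged] = myStepFcn(-5, logged);   % agent requests -5 V, starting from 10 V
v2 = logged.PrevAction;                      % change limited to -10 V

function [obs, reward, isDone, logged] = myStepFcn(action, logged)
    % Hypothetical limits: 10 V per step, voltage range -50 V to 150 V.
    maxStep = 10; vMin = -50; vMax = 150;

    % Compare the requested voltage with the one stored in the last step.
    delta   = action - logged.PrevAction;
    delta   = min(max(delta, -maxStep), maxStep);
    applied = min(max(logged.PrevAction + delta, vMin), vMax);

    % Store the applied voltage so the next step can read it.
    logged.PrevAction = applied;

    obs    = applied;   % placeholder observation
    reward = 0;         % placeholder reward
    isDone = false;
end

function [obs, logged] = myResetFcn()
    % Start every episode from an assumed initial voltage of 0 V.
    logged.PrevAction = 0;
    obs = 0;
end
```

Storing the applied voltage rather than the raw agent output means the limit is always measured against what the plant actually received.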

More Answers (0)

Release

R2021a
