Custom environment in Deep reinforcement learning

2 次查看(过去 30 天)
I am currently trying to buid to a custom environment for the implementation of deep reinforcement learning. My considered environment has 4 states low, med, high, severe represented by 1,2,3,4 respectively and the actions to be taken are 1,2,3 and rewards are decided on the basis of context like temperature, pressure,humidity which varies with time. So how i can define my reward that changes with time in mystepfunction?

回答(1 个)

Ari Biswas
Ari Biswas 2020-4-20
One way to solve this is by introducing a property to keep track of elapsed time in your custom MATLAB environment. You can use this property to compute rewards and increment this as needed in the step function.

类别

Help CenterFile Exchange 中查找有关 Deep Learning Toolbox 的更多信息

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by