How do I define a continuous reward function for an RL environment?

Prashanth Chivkula

2020-10-5

1 answer

Answer accepted

Updated: 2020-10-12

I am trying to follow the double integrator example to define a continuous reward function. When I used the custom environment template and defined the reward using the QR cost function, I got an error stating that the reward must be a scalar value. Where can I find the reward property and change it to accept vector values?

3 comments (1 earlier comment is not shown)

Prashanth Chivkula 2020-10-12

Yes, I did that, thank you. Just to confirm: the output of the cost function will always be a scalar value, right? So in the continuous double integrator example there are two states, but the reward output at each step is a scalar value, right?

Emmanouil Tzorakoleftherakis 2020-10-12

That's right
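
To see why the QR cost stays scalar even with two states, here is a minimal MATLAB sketch. The weights `Q` and `R` and the sample state and action are assumed values for illustration, not taken from this thread. The second half shows one hypothetical way a vector can slip in (an elementwise product instead of the quadratic form), which would trigger the scalar-reward error.

```matlab
% Assumed weights and sample values (illustrative only)
Q = diag([10 1]);   % 2x2 state weight matrix
R = 0.01;           % 1x1 action weight
x = [0.5; -0.2];    % state [position; velocity], 2x1
u = 1.5;            % action (force), scalar

% Quadratic form: (1x2)*(2x2)*(2x1) + (1x1)*(1x1)*(1x1) -> 1x1
cost   = x'*Q*x + u'*R*u;
reward = -cost;
assert(isscalar(reward))

% Hypothetical mistake: Q*x.^2 is (2x2)*(2x1), which returns a 2x1 vector
vectorReward = -(Q*x.^2);
assert(~isscalar(vectorReward))
```

If the reward really does have several components, they need to be combined into one number (for example with a weighted sum like the quadratic form above) before being returned from the environment.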

Accepted Answer

Priysha LNU 2020-10-8

Here is an excerpt from the documentation:

To guide the learning process, reinforcement learning uses a scalar reward signal generated from the environment.

For detailed information on defining reward signals, including discrete and continuous rewards, please refer to this documentation link.
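
As a rough illustration of a continuous, scalar reward inside an environment step, here is a sketch of a double-integrator step function. Everything in it is an assumption made for the example: the function name `doubleIntegratorStep`, the sample time `Ts`, the weights `Q` and `R`, the forward-Euler update, and the termination threshold. The signature follows the `[NextObs,Reward,IsDone,LoggedSignals] = stepFcn(Action,LoggedSignals)` form that, to my knowledge, `rlFunctionEnv` expects; in the class-based custom template the same reward line would go inside the `step` method.

```matlab
% Call the step function once by hand to check the reward size
Logged.State = [1; 0];                       % [position; velocity]
[obs, r, done, Logged] = doubleIntegratorStep(0.5, Logged);
assert(isscalar(r))                          % reward must be 1x1

function [NextObs, Reward, IsDone, LoggedSignals] = doubleIntegratorStep(Action, LoggedSignals)
    % Assumed parameters (illustrative only)
    Ts = 0.1;                % sample time
    Q  = diag([10 1]);       % state weight
    R  = 0.01;               % action weight
    A  = [0 1; 0 0];         % double integrator dynamics
    B  = [0; 1];

    x = LoggedSignals.State; % current state, 2x1
    u = Action;              % scalar force

    % Forward-Euler state update
    xNext = x + Ts*(A*x + B*u);
    LoggedSignals.State = xNext;
    NextObs = xNext;

    % Continuous reward: negative quadratic cost, always 1x1
    Reward = -(x'*Q*x + u'*R*u);

    % Hypothetical termination condition on position
    IsDone = abs(xNext(1)) > 5;
end
```

The reward varies smoothly with the state and action, which is what makes it continuous, while its size stays 1x1 regardless of how many states the environment has.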

Product

Reinforcement Learning Toolbox

Release

R2020a
