Different Sample time for RL environment

6 次查看(过去 30 天)
Hello Everyone,
I am trying to build a RL agent using DQN currently. My environment model is composed of Nonlinear equations which have faster dynamics around 1ms. I was wondering if it possible to have my RL agent to work at 10ms and let the environment run at 1ms. This will obviously mean that for 10 timesteps the environment will be feeding at a constant control action from the RL agent.
I know I can code this up if I create the agent and environment myself, but currently I'm taking advantage of MATLAB's Reinforcement Learning Toolbox in MATLAB editor and I would save a lot of time if this is possible in the toolbox itself.
Thanks.

采纳的回答

Kartik Saxena
Kartik Saxena 2024-6-13
Hi,
When creating a custom environment in MATLAB for use with the Reinforcement Learning Toolbox, you can define a 'step' function that advances the environment's state based on the agent's action. To simulate the environment at a faster rate than the agent operates, modify the 'step' function to perform multiple updates (10 updates of 1ms each) for each call to the 'step' function.
Refer to the following code snippet:
for i = 1:10
% Update the environment state based on the action
% Assume updateEnvironment is a function that updates the environment state
% for a 1ms timestep given the current state and action
loggedSignals = updateEnvironment(loggedSignals, action);
end
Hope it helps!
  1 个评论
Syed Adil Ahmed
Syed Adil Ahmed 2024-6-13
Thanks a lot. That is not the solution I was expecting, but sometimes its great to see small things (like a for loop) making the difference.

请先登录,再进行评论。

更多回答(0 个)

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by