Using Reinforcement Learning Agent with PX4 in ROS/Gazebo for Iris Drone - PID Gain Issue?

Question

Gaurav 2024-10-10

0
链接

此问题的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/2158770-using-reinforcement-learning-agent-with-px4-in-ros-gazebo-for-iris-drone-pid-gain-issue

回答： Naing Lin 2025-4-24

Hi everyone,

I have trained a reinforcement learning (RL) agent using the UAV Toolbox's multirotor model in MATLAB/Simulink, and the training was successful. The agent can effectively control the multirotor in the simulation environment.

Now, I am trying to deploy the same RL agent to control an Iris drone in ROS/Gazebo with PX4. The configuration of the Iris drone is set to default. However, when I attempt to control the drone using the RL agent, it fails to perform as expected.

I suspect that the issue might be related to the PID settings on the Iris drone in PX4. Do I need to tune the PID gains, or are there other factors that could be affecting the agent's performance in the new environment? Has anyone encountered a similar issue when transitioning from MATLAB to PX4/ROS?

Any guidance on adjusting the PID gains or other relevant tips would be greatly appreciated!

Thanks in advance for your help!

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

请先登录，再回答此问题。

Answer 1

Kothuri 2024-10-17

0
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/2158770-using-reinforcement-learning-agent-with-px4-in-ros-gazebo-for-iris-drone-pid-gain-issue#answer_1533465

Hi Gaurav,

I understand that you are facing an issue while deploying the RL agent developed using the UAV Toolbox's multirotor model in MATLAB/Simulink to control an Iris drone in ROS/Gazebo with PX4.

You can follow the below steps:

The default PID gains on the Iris drone in PX4 may not align with the dynamics learned by the RL agent in Simulink. You may need to manually tune the PID gains to better match the expected performance.
Adjust the PID gains incrementally by focusing on one axis at a time (roll, pitch, yaw, and throttle) to understand the impact of each parameter.
Ensure that the dynamics of the multirotor model in MATLAB/Simulink closely match those of the Iris drone in PX4. You might need to adjust the model parameters to better reflect the real drone’s behaviour.
Consider the impact of sensor noise and communication delays in the ROS/Gazebo environment, which might not be present in the Simulink model.
Retrain the RL agent with domain randomization techniques to make it more robust to variations in the environment.
Ensure that the control loop timing in ROS is consistent with what the RL agent expects. Latency or timing mismatches can degrade performance.
Use Gazebo to simulate various scenarios and validate the RL agent's performance before deploying it on the actual hardware.

You can refer the below documentation for more info

https://www.mathworks.com/help/uav/px4-hitl.html?s_tid=CRUX_lftnav