[Question] A question about the cost function of the p3o algorithm #358

Liqinyan821 · 2024-11-15T06:26:13Z

Required prerequisites

I have read the documentation https://omnisafe.readthedocs.io.
I have searched the Issue Tracker and Discussions that this hasn't already been reported. (+1 or comment there if it has.)
Consider asking first in a Discussion.

Questions

Hello Omnisafe team, thank you very much for your contribution.
When I was Learning the p3o algorithm, I found that the def _loss_pi_cost function was not clip, and loss_pi_cost in the P3O Optimization for Safe Reinforcement Learning used clip.

Gaiejj · 2024-11-25T06:50:15Z

You must be a very meticulous person! In fact, this is a trick we discovered while debugging the algorithm, which makes P3O more suitable for high-dimensional complex environments. Have you tried removing the clip? Do you have any experimental data? If it performs well without it, we will modify this implementation later.

Liqinyan821 added the question Further information is requested label Nov 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Question] A question about the cost function of the p3o algorithm #358

[Question] A question about the cost function of the p3o algorithm #358

Liqinyan821 commented Nov 15, 2024 •

edited

Loading

Gaiejj commented Nov 25, 2024

[Question] A question about the cost function of the p3o algorithm #358

[Question] A question about the cost function of the p3o algorithm #358

Comments

Liqinyan821 commented Nov 15, 2024 • edited Loading

Required prerequisites

Questions

Gaiejj commented Nov 25, 2024

Liqinyan821 commented Nov 15, 2024 •

edited

Loading