Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update Drone-Env with Random Target and Improved Documentation #358

Merged
merged 6 commits into from
Dec 27, 2024

Conversation

KafuuChikai
Copy link
Contributor

Demo:

video.mp4

Description:

This pull request updates the hover environment to include random target point generation and improves the documentation to reflect these changes. The drone is now trained to reach randomly generated target points, enhancing the training process and making it more dynamic.

Changes:

  1. Hover Environment Update (hover_env.py):

    • Updated the hover environment to generate random target points that the drone must reach.
    • Improved the reward functions to provide feedback based on the drone's performance in reaching the target points.
    • Enhanced the visualization.
  2. Training Script Update (hover_train.py):

    • Updated the training script to reflect the changes in the hover environment.
    • Ensured that the training process correctly initializes the environment and starts the training with the new random target point generation feature.
  3. Evaluation Script Update (hover_eval.py):

    • Updated the evaluation script to reflect the changes in the hover environment.
    • Add target point visualization and record videos.
  4. Documentation Update (README.md):

    • Updated the installation and training instructions to ensure clarity and completeness.
    • Added a reference to the inspiration for the reward design.

How to Test:

  1. Training:

    • Run the training script with the following command:
      python examples/drone/hover_train.py -e drone-hovering -B 8192 --max_iterations 500
    • The script will initialize the environment, start the training process, and save the trained policy and configurations in the drone-hovering directory.
  2. Evaluation:

    • Run the evaluation script with the following command:
      python examples/drone/hover_eval.py -e drone-hovering --ckpt 500 --record
    • The script will load the trained policy from the specified checkpoint, initialize the environment, and visualize the drone's performance.

And I'll soon write some tutorials on Genesis World Documentation if that's okay with you.

Please review the changes and provide feedback. Thank you!

@zhouxian zhouxian merged commit ad8317c into Genesis-Embodied-AI:main Dec 27, 2024
1 check failed
@zhouxian
Copy link
Collaborator

nice :)

@KafuuChikai KafuuChikai deleted the drone_rl branch December 28, 2024 15:04
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants