proximal-policy-optimization

Star

Here are 202 public repositories matching this topic...

mohith-sakthivel / sufficient-ppo

Star

Clean and flexible implementation of PPO (built on top of stable-baselines3)

reinforcement-learning pytorch proximal-policy-optimization ppo openai-baselines ppo2 stable-baselines3

Updated Jul 9, 2021
Python

sunoh-kim / deep-reinforcement-learning

Star

This repository contains my assignment solutions for the Deep Reinforcement Learning course (430.729_003) offered by Seoul National University (Spring 2020).

deep-reinforcement-learning imitation-learning deep-q-learning deep-deterministic-policy-gradient proximal-policy-optimization

Updated Apr 10, 2022
Jupyter Notebook

sarahalshareeda / Task-Offloading-PPO-DRL

Star

Cognitive Generative Intelligent Task Offloading for Digital Twins of Vehicular Networks This repository contains the code and resources for the implementation of cognitive generative intelligent task offloading in digital twins for vehicular networks.

deep-reinforcement-learning vehicular-networks proximal-policy-optimization task-offloading generative-ai cognitive-digital-twins

Updated Jul 9, 2024
Jupyter Notebook

blahBlahhhJ / ProjectProcgen

Star

A pytorch project to easily run experiments on OpenAI's Procgen Benchmark

reinforcement-learning pytorch proximal-policy-optimization

Updated May 20, 2021
Python

qiqinyi / GenAI-with-LLMs

Star

My lab work of “Generative AI with Large Language Models” course offered by DeepLearning.AI and Amazon Web Services on coursera.

python machine-learning kl-divergence proximal-policy-optimization llms generative-ai flan-t5 low-rank-adaptation parameter-efficient-fine-tuning peft-fine-tuning-llm

Updated Jul 31, 2024
Jupyter Notebook

ruchitapaithankar15 / marioAI-Gaming-Reinforcement-Learning

Star

Built and trained a model using OpenAI gym, NES emulator to play Super Mario. Optimized the model using preprocessing techniques and vectorization. The algorithm used is PPO (Proximal Policy Optimal) along with Reinforcement Learning.

python reinforcement-learning ai openai-gym nes-emulator openai proximal-policy-optimization

Updated Mar 21, 2023
Jupyter Notebook

pmistry9597 / Reinforcement-Learning-Algo-Demo

Star

A demonstration of some prominent reinforcement learning algorithms

reinforcement-learning openai-gym policy-gradient deep-q-network proximal-policy-optimization

Updated Mar 28, 2023
Python

ays-dev / lunarlander-pytorch

Star

Single file implementation of Deep Reinforcement Learning algorithm (PPO) based on LunarLander-v2 environment

python machine-learning deep-neural-networks reinforcement-learning deep-learning torch python3 pytorch gym proximal-policy-optimization ppo lunar-lander

Updated Jul 13, 2023
Python

Raiszo / ppo-4Quadrotor

Star

PPO implementation for the cable suspended load quadrotor

reinforcement-learning quadcopter tensorflow proximal-policy-optimization cable-suspended-load

Updated Jan 9, 2020
Python

1jsingh / rl_reacher

Star

Train double-jointed arms to reach target locations using Proximal Policy Optimization (PPO) in Pytorch

pytorch ddpg proximal-policy-optimization ppo unity-environment reacher-environment

Updated May 3, 2019
Jupyter Notebook

MichaelFish199 / SonicTheHedgehog2-ReinforcmentLearning

Star

This project implements an agent for playing the SonicTheHedgehog2 game from a ROM file using the Proximal Policy Optimization (PPO) algorithm from the stablebaselines3 library. The agent is trained to learn the optimal actions to take at each step in the game in order to complete the level and maximize the score.

game reinforcement-learning sonic-the-hedgehog proximal-policy-optimization rom-files stable-baselines3 rewards-and-scoring game-playing-agents

Updated Dec 12, 2022
Jupyter Notebook

farkoo / PG-PPO-OthelloSolver

Star

This repository provides an implementation of Othello game playing agents trained using reinforcement learning techniques.

python reinforcement-learning tensorflow othello policy-gradient proximal-policy-optimization

Updated Jul 7, 2023
Python

drakyanerlanggarizkiwardhana / MarioPPO

Star

MarioPPO implementation uses the TensorFlow machine learning platform

reinforcement-learning game-ai super-mario-bros proximal-policy-optimization machine-learning-framework

Updated Mar 27, 2023
Python

GiorgiaAuroraAdorni / CAT-optimal-hybrid-solver

Star

The CAT Optimal Hybrid Solver is a tool designed to tackle the cross array task (CAT) activity designed to assess algorithmic thinking skills in the context of K-12 education.

reinforcement-learning clustering problem-solving depth-first-search random-search computational-thinking proximal-policy-optimization hybrid-approach

Updated Oct 17, 2023
Python

nslyubaykin / relax_ppo_example

Star

Example PPO implementation with ReLAx

reinforcement-learning gae policy-gradient reinforcement-learning-algorithms continuous-control proximal-policy-optimization ppo generalized-advantage-estimation discrete-control

Updated Aug 29, 2022
Jupyter Notebook

GenerativeAIAffiliates / AskAboutSymptomsGPT

Star

Ask About Symptoms is an LLM that has an in-depth understanding of health. The creator of the original version known as DoctorGPT, Siraj Raval, says it works offline, it's cross-platform, & the health data is said to be kept private. We are learning how to build this in our community.

python ios cmake deep-learning cplusplus compiler xcode conda pip tensor source-code quantization submodule fine-tuning proximal-policy-optimization tvm hugging-face llama2

Updated Aug 12, 2023
Jupyter Notebook

arthur-x / SimplyPPO

Star

SimplyPPO replicates Proximal-Policy-Optimization with minimum (~250) lines of code in clean, readable PyTorch style, while trying to use as few additional tricks and hyper-parameters as possible (PyBullet benchmarks included).

proximal-policy-optimization pybullet-benchmarks