PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
-
Updated
Nov 15, 2021 - Python
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
An end-to-end (E2E) reinforcement learning model for autonomous vehicle collision avoidance in the CARLA simulator, using a recurrent PPO algorithm for dynamic control. The model processes RGB camera inputs to make real-time acceleration and steering decisions.
Solving the Atari Breakout environment using Stable Baselines
AI agents for the boardgame Splendor
Add a description, image, and links to the recurrent-ppo topic page so that developers can more easily learn about it.
To associate your repository with the recurrent-ppo topic, visit your repo's landing page and select "manage topics."