DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)
-
Updated
Mar 25, 2023 - Python
DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)
Open source implementation of the PAAC algorithm presented in Efficient Parallel Methods for Deep Reinforcement Learning
A PyTorch Library for Reinforcement Learning Research
Solvers for NP-hard and NP-complete problems with an emphasis on high-performance GPU computing.
Reinforcement learning models in ViZDoom environment
🍄Reinforcement Learning: Super Mario Bros with dueling dqn🍄
Gym environments and agents for autonomous driving.
source code for 'Improving automatic source code summarization via deep reinforcement learning'
Energym is an open source building simulation library designed to test climate control and energy management strategies on buildings in a systematic and reproducible way.
Reinforcement Workbench for FreeCAD
Autonomous Drone for Object Tracking
Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)
Lane keeping assistant using Reinforcement learning
Sidekick Policy Learning for Active Visual Exploration (ECCV 2018)
Worksheet and Utilities for AWS DeepRacer – one of the most exciting ways of building strong skills in reinforcement learning and through a hands-on approach. This repository offers: 1) Functionally-rich and flexible reward function 2) Utilities with Jupiter notes for Racing Line calculation and visualisation of track 3) Scripts to parse RoboMak…
A deep learning Crazyhouse chess program that uses a Monte Carlo Tree Search (MCTS) based evaluation system and reinforcement to enhance its play style.
Tic Tac Toe with Alpha Zero method - My first work
SEIKO is a novel reinforcement learning method to efficiently fine-tune diffusion models in an online setting. Our methods outperform all baselines (PPO, classifier-based guidance, direct reward backpropagation) for fine-tuning Stable Diffusion.
Flying Cavalry Project - Ucan Kavalye Projesi
Add a description, image, and links to the reinforcement topic page so that developers can more easily learn about it.
To associate your repository with the reinforcement topic, visit your repo's landing page and select "manage topics."