ppo-agent

This project is based on fine-tuning LLM models (FLAN-T5) for text summarisation task using PEFT approach. All evaluation metrics being computed on ROUGE scoring and LoRA optimisation techniques being used for fine-tuning.

lora ppo peft ppo-agent huggingface-transformers rlhf flan-t5 llm-training

Updated Aug 8, 2023
Jupyter Notebook

AnastasiaML / Computational-Intelligence-And-Deep-Learning-Techniques-In-Developing-Intelligent-Agents-For-Games

Star

The aim of this repository is the analysis and study of computer intelligence and in-depth learning techniques in the development of intelligent gaming agents.

python machine-learning reinforcement-learning opeani-gym deep-learning deep-reinforcement-learning pytorch atari spaceinvaders mspacman computational-intelligence qbert dqn-agents ppo-agent a2c-agent stable-baselines3 qrdqn

Updated Apr 21, 2023
Python

PetropoulakisPanagiotis / igae

Star

State Representations as Incentives for Reinforcement Learning Agents: A Sim2Real Analysis on Robotic Grasping

Updated Aug 15, 2024
Python

c2d08y / LearningBot

Star

A deep reinforcement learning Bot for https://kana.byha.top:444/

bot deep-neural-networks reinforcement-learning deep-learning neural-network deep-reinforcement-learning gamebot nueral-networks ppo2 ppo-agent ppo-pytorch ppo-algo

Updated Aug 29, 2022
Python

IvanBirkmaier / ppo_agent

Star

This repository contains the code for a project paper for a Master's module in the field of reinforcement learning. The aim of the project is to explore and implement Proximal Policy Optimization (PPO) agents to learn and play the 7x7 Hex game.

python torch neural-networks hex-game ppo-agent

Updated Jul 7, 2024
Python

jookie / jojostock1

Star

An adaptive Machine Reinforcement Learning (MRL) system is being developed to gather and analyze media data using web scraping, training models to predict outcomes in areas like stock market trends, sports events, and other performance domains. It continuously refines its strategies based on real-time data and evolving patterns.

sentiment-analysis cryptocurrency ddpg-algorithm ppo-agent a2c-agent td3-pytorch day-trader

Updated Nov 8, 2024
Python

RsGoksel / Snake-Game_PPO-Solution

Star

Snake game environment integrated with OpenAI Gym. Proximal Policy Optimization (PPO) implementation for training. Visualization of training progress and agent performance. Easy to understand code.

python pygame gym snake-game actor-critic proximal-policy-optimization ppo reinforcement-learning-agent a2c stable-baselines ppo-agent stable-baselines3

Updated May 9, 2024
Jupyter Notebook

dschori / Ackerbot

Star

Reinforcement Learning based navigation

reinforcement-learning navigation ros gazebo real-world ackermann ppo-agent

Updated Mar 18, 2021
Jupyter Notebook

fracapuano / Quinto

Star

Repository for the final project of the "Computational Intelligence" course @ PoliTo, 2022/2023

board-game reinforcement-learning deep-learning reinforcement-learning-agent ppo-agent

Updated Feb 3, 2023
Python

jookie / jojoBot

Star

Financial trading strategies using deep reinforcement learning (DRL). It offers a frameworks for quantitative finance, enabling practitioners to create, test, and implement investments strategies.

machine-learning stock-market sharpe-ratio bitcoin-wallet options-trading ppo-agent sharpe-ratios drl-algorithms

Updated Sep 30, 2024
TypeScript

strcoder4007 / Mario-Reinforcement-Learning

Star

Training a Mario reinforcement learning agent using Open AI Gym and Stable Baselines 3 PPO algorithm.

mario reinforcement-learning openai-gym pytorch ppo-agent stable-baselines3

Updated Jul 12, 2024
Python

ImSOLty / On-The-Waves

Star

🚤🏖️BOATS DO VZHHHHH BBBDROOM, BEEEEP, BEEEP, GNAA, HONK, VZHHHHHHHHHHHHHH🏖️🚤

relay reinforcement-learning racing unity artificial-intelligence lobby photon-pun imitation-learning sac ppo ml-agents netcode ppo-agent

Updated Jan 7, 2024
C#

7enTropy7 / Racer_AI

Sponsor

Star

Developed an highly customizable OpenAI gym environment and trained a stable_baselines3 PPO agent. Used the expert agent for Imitation Learning with DAgger

reinforcement-learning artificial-intelligence imitation-learning stable-baselines ppo-agent

Updated Aug 28, 2023
Python

JulianCatnip / atst-walker-agent

Star

Concept and development of a walking AT-ST Walker (Starwars) ML-agent.

star-wars machine-learning reinforcement-learning robot unity unity3d starwars walker ppo ml-agents unity-ml-agents ppo-agent at-st-walker atst-walker all-terrain-scout-transport

Updated Feb 24, 2022
C#

jfpettit / flare

Star

Modular Reinforcement Learning in PyTorch.

machine-learning reinforcement-learning deep-learning machine-learning-algorithms deep-reinforcement-learning pytorch gym neural-networks reinforcement-learning-algorithms a2c rew leng ppo-agent

Updated Feb 16, 2023
Python

Improve this page

Add a description, image, and links to the ppo-agent topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the ppo-agent topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ppo-agent

Here are 28 public repositories matching this topic...

ellkrauze / gc-ml

GerTheMessiah / Snake-AI

roeey777 / Splendor-AI

00Utkarsh00 / ML-DOOM

harikris001 / Super-Mario-Reinforcement_Learning

navneet1083 / textsum-tune

AnastasiaML / Computational-Intelligence-And-Deep-Learning-Techniques-In-Developing-Intelligent-Agents-For-Games

PetropoulakisPanagiotis / igae

c2d08y / LearningBot

IvanBirkmaier / ppo_agent

jookie / jojostock1

RsGoksel / Snake-Game_PPO-Solution

dschori / Ackerbot

fracapuano / Quinto

jookie / jojoBot

strcoder4007 / Mario-Reinforcement-Learning

ImSOLty / On-The-Waves

7enTropy7 / Racer_AI

JulianCatnip / atst-walker-agent

jfpettit / flare

Improve this page

Add this topic to your repo