Java Garbage Collector Performance Tuning with Reinforcement Learning Methods
Updated Apr 13, 2024 - Jupyter Notebook
A short custom implementation of the game Snake. In this project I am using the Ray library together with Ray Tune and a custom PPO model.
AI agents for the board game Splendor
Automated gaming using machine learning
Reinforcement Learning in Super Mario using PyTorch and PPO
This project fine-tunes an LLM (FLAN-T5) for a text summarisation task using a PEFT approach. LoRA is used for fine-tuning, and all evaluation metrics are computed with ROUGE scoring.
The aim of this repository is the analysis and study of computational intelligence and deep learning techniques in the development of intelligent gaming agents.
State Representations as Incentives for Reinforcement Learning Agents: A Sim2Real Analysis on Robotic Grasping
A deep reinforcement learning Bot for https://kana.byha.top:444/
This repository contains the code for a project paper for a Master's module in the field of reinforcement learning. The aim of the project is to explore and implement Proximal Policy Optimization (PPO) agents to learn and play the 7x7 Hex game.
An adaptive Machine Reinforcement Learning (MRL) system is being developed to gather and analyze media data using web scraping, training models to predict outcomes in areas like stock market trends, sports events, and other performance domains. It continuously refines its strategies based on real-time data and evolving patterns.
Snake game environment integrated with OpenAI Gym. Proximal Policy Optimization (PPO) implementation for training. Visualization of training progress and agent performance. Easy to understand code.
Reinforcement Learning based navigation
Repository for the final project of the "Computational Intelligence" course @ PoliTo, 2022/2023
Financial trading strategies using deep reinforcement learning (DRL). It offers a framework for quantitative finance, enabling practitioners to create, test, and implement investment strategies.
Training a Mario reinforcement learning agent using OpenAI Gym and the Stable Baselines3 PPO algorithm.
🚤🏖️BOATS DO VZHHHHH BBBDROOM, BEEEEP, BEEEP, GNAA, HONK, VZHHHHHHHHHHHHHH🏖️🚤
Developed a highly customizable OpenAI Gym environment and trained a stable_baselines3 PPO agent. Used the expert agent for Imitation Learning with DAgger.
Concept and development of a walking AT-ST Walker (Star Wars) ML-agent.
Modular Reinforcement Learning in PyTorch.