epsilon-greedy

Implementations of basic concepts dealt under the Reinforcement Learning umbrella. This project is collection of assignments in CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay

reinforcement-learning linear-programming thompson-sampling epsilon-greedy ucb policy-evaluation mdps multi-armed-bandits policy-iteration randomised-algorithms reinforcement-learning-excercises kl-divergence markovian-epidemic-processes reinforcement-learning-analysis multiarm-bandit ucb1 howards-pi batch-switching randomized-policy-iteration

Updated May 21, 2018
Python

haidarns / ml-based-lb-ryu

Star

Machine Learning based Load Balancing with RYU OpenFlow Controller

machine-learning load-balancer round-robin ryu epsilon-greedy sdn-controller flask-api iperf3 ip-hash d-itg

Updated Oct 16, 2018
Python

viswanath57 / Bandit-Algorithms

Star

algorithms epsilon-greedy multiarm-bandit softmax-algorithm ucb1

Updated Apr 5, 2021
Jupyter Notebook

ValentinaZangirolami / MADRQN

Star

Multi-Agent Deep Recurrent Q-Learning with Bayesian epsilon-greedy on AirSim simulator

reinforcement-learning deep-learning deep-reinforcement-learning epsilon-greedy self-driving-car multiagent-systems airsim multiagent-reinforcement-learning deep-recurrent-q-network drqn airsim-simulator

Updated Apr 1, 2022
Python

Amshra267 / Thompson-Greedy-Comparison-for-MultiArmed-Bandits

Star

Repository Containing Comparison of two methods for dealing with Exploration-Exploitation dilemma for MultiArmed Bandits

thompson-sampling epsilon-greedy exploration-exploitation optimistic-bayesian-sampling

Updated Jul 2, 2021
Python

ChaitanyaC22 / Deep-RL-Project---Maximize-total-profits-earned-by-cab-driver

Star

The goal of this project is to build an RL-based algorithm that can help cab drivers maximize their profits by improving their decision-making process on the field. Taking long-term profit as the goal, a method is proposed based on reinforcement learning to optimize taxi driving strategies for profit maximization. This optimization problem is fo…

Updated Jul 9, 2021
Jupyter Notebook

KaleabTessera / Multi-Armed-Bandit

Star

Implementation of greedy, E-greedy and Upper Confidence Bound (UCB) algorithm on the Multi-Armed-Bandit problem.

reinforcement-learning greedy epsilon-greedy upper-confidence-bounds multi-armed-bandit

Updated Dec 8, 2022
Python

thetawom / mabby

Star

A multi-armed bandit (MAB) simulation library in Python

python reinforcement-learning simulation probability artificial-intelligence thompson-sampling epsilon-greedy multi-armed-bandits agent-based-simulation

Updated Jul 15, 2024
Python

georgedeath / egreedy

Star

Greed is Good: Exploration and Exploitation Trade-offs in Bayesian Optimisation

optimization epsilon-greedy bayesian-optimization acquisition-functions

Updated May 20, 2021
C++

DimitrisPatiniotis / epsilon-greedy-Q-learning

Star

Epsilon-Greedy Q-Learning in a Multi-agent Environment

reinforcement-learning q-learning epsilon-greedy cooperative-environments

Updated Jun 24, 2023
Python

ValentinaZangirolami / DRL

Star

Deep Recurrent Q-Network with different exploration strategies for self-driving cars (using AirSim)

reinforcement-learning deep-learning tensorflow deep-reinforcement-learning epsilon-greedy self-driving-car softmax airsim deep-recurrent-q-network drqn exploration-strategy softmax-exploration max-boltzmann-exploration

Updated Sep 5, 2024
Python

1391819 / MA-seek

Star

A multi agent reinforcement learning environment where two agents controlled by DRQNs play a custom version of the pursuit-evasion game.

tensorflow epsilon-greedy pomdp drqn experience-replay marl

Updated Jun 16, 2023
Python

lkwbr / grid-qlearn

Star

See a program learn the best actions in a grid-world to get to the target cell, and even run through the grid in real-time! This is a Q-Learning implementation for 2-D grid world using both epsilon-greedy and Boltzmann exploration policies.

python machine-learning reinforcement-learning grid-world epsilon-greedy boltzmann-exploration

Updated Feb 4, 2023
Python

saminheydarian / Interactive_Learning_Course_2021

Star

Interactive Learning Course | Home Works & Quiz | Fall 2021 | Prof. Majid Nili

q-learning epsilon-greedy sarsa value-iteration tree-backup n-armed-bandit-problem regret-minimization multi-agent-multi-armed-bandits 2-step-tree-backup model-based-learning off-policy-monte-carlo social-bandit-learning reinforcement-comparison model-based-model-free-mixture

Updated Feb 24, 2022
Jupyter Notebook

ErfanFathi / RL_Cartpole

Star

Implementation of the Q-learning and SARSA algorithms to solve the CartPole-v1 environment. [Advance Machine Learning project - UniGe]

reinforcement-learning q-learning python3 epsilon-greedy sarsa cartpole-v1 q-learning-vs-sarsa

Updated Jun 9, 2023
Python

Improve this page

Add a description, image, and links to the epsilon-greedy topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the epsilon-greedy topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

epsilon-greedy

Here are 82 public repositories matching this topic...

iamjagdeesh / Artificial-Intelligence-Pac-Man

starkblaze01 / Artificial-Intelligence-Codes

Heewon-Hailey / multi-armed-bandits-for-recommendation-systems

kulinshah98 / Multi-Armed-Bandit-Algorithms

antoine-hochart / bandit_algo_evaluation

akshaykhadse / reinforcement-learning

haidarns / ml-based-lb-ryu

viswanath57 / Bandit-Algorithms

ValentinaZangirolami / MADRQN

Amshra267 / Thompson-Greedy-Comparison-for-MultiArmed-Bandits

ChaitanyaC22 / Deep-RL-Project---Maximize-total-profits-earned-by-cab-driver

KaleabTessera / Multi-Armed-Bandit

thetawom / mabby

georgedeath / egreedy

DimitrisPatiniotis / epsilon-greedy-Q-learning

ValentinaZangirolami / DRL

1391819 / MA-seek

lkwbr / grid-qlearn

saminheydarian / Interactive_Learning_Course_2021

ErfanFathi / RL_Cartpole

Improve this page

Add this topic to your repo