#

upper-confidence-bounds-policy

Here is 1 public repository matching this topic...

RezaSaadatyar / Reinforcement-Learning

The repository contains codes for RL (e.g., Q-Learning, Monte Carlo, …) in the form of Python files.

reinforcement-learning q-learning dynamic-programming multi-armed-bandit policy-iteration monte-carlo-methods greedy-policy e-greedy-policy upper-confidence-bounds-policy stochastic-gradient-ascent-policy iterative-policy-evaluation monte-carlo-exploring-starts state-action-reward-state-action first-visit-mc-prediction value-iteration-

Updated Oct 30, 2024
Jupyter Notebook

Improve this page

Add a description, image, and links to the upper-confidence-bounds-policy topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the upper-confidence-bounds-policy topic, visit your repo's landing page and select "manage topics."