Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math
-
Updated
Apr 1, 2021 - Jupyter Notebook
Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math
Source files of the course "Mathematics for Macroeconomics"
PyDiffGame is a Python implementation of a Nash Equilibrium solution to Differential Games, based on a reduction of Game Hamilton-Bellman-Jacobi (GHJB) equations to Game Algebraic and Differential Riccati equations, associated with Multi-Objective Dynamical Control Systems
Find the shortest route using A* algorithm and graphs (Route Planner application)
Algorithms for Policy Evaluation, Estimation of Action Values, Policy Improvement, Policy Iteration, Truncated Policy Evaluation, Truncated Policy Iteration, Value Iteration . From Udacity's Deep Reinforcement Learning Nanodegree program.
Infinite horizon policy optimization for drone navigation. Graded project for the ETH course "Dynamic Programming and Optimal Control".
A GPU-accelerated toolbox for hyperbolic PDEs in a weaker (viscosity) sense. It leverages the integral to the solution of the conservation of momentum problem (being equivalent to the derivative of Hamilton-Jacobi equations) in one spatial dimension. We resolve such hyperbolic differential equations using wave-front propagating schemes on a spat…
Reinforcement Learning applied to Autonomous Networking to issue scheduling and decision to drones.
Policy Iteration for Continuous Dynamics
Solving high dimensional HJB equation using tensor decomposition
Foundations Of Intelligent Learning Agents (FILA) Assignments
Q-Learning from scratch in Python
A visualization tool for policy iteration and value iteration
Reinforcement learning
calculate the optimum route in a warehouse using the Q-Learning algorithm (Bellman equation)
CSCI-561 AI Assignments.
Dynamic Optimization project working on an economic model
In this project we use Reinforcement Learning to extract features from an image.
Provides methods to efficiently set up and solve dynamic programming problems of one choice variable using Value Function Iteration and the Endogenous Grid Method.
Repository for the code of the "Dynamic Programming and Optimal Control" (DPOC) lecture at the "Institute for Dynamic Systems and Control" at ETH Zurich.
Add a description, image, and links to the bellman-equation topic page so that developers can more easily learn about it.
To associate your repository with the bellman-equation topic, visit your repo's landing page and select "manage topics."