This repository contains a DQN algorithm for pogema. The algorithm uses a logger for training on previous experiments and two neural networks: a target net and a policy net. The policy net is trained at every training step, and once every TARGET_UPDATE steps its weights are copied into the target net for stable learning. The file vis.py contains a script that renders the results into an .svg file.
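The interplay between the two networks can be illustrated with a minimal, framework-free sketch. It is not the repository's code: `TinyNet`, `sync_target`, `train`, the learning rate, and the value chosen for `TARGET_UPDATE` are all hypothetical stand-ins, and the "training step" is a placeholder rather than a real gradient update. The only thing the sketch is meant to show is the schedule: the policy net changes on every step, while the target net is overwritten only once every `TARGET_UPDATE` steps.

```python
import copy

TARGET_UPDATE = 4  # hypothetical value; the repository's actual setting is not stated here


class TinyNet:
    """Stand-in for a neural network: just a list of weights (assumption)."""

    def __init__(self, weights):
        self.weights = list(weights)

    def train_step(self, lr=0.1):
        # Placeholder for a gradient update: nudge every weight by lr.
        self.weights = [w + lr for w in self.weights]


def sync_target(policy_net, target_net):
    # Hard update: overwrite the target weights with a copy of the policy weights.
    target_net.weights = copy.deepcopy(policy_net.weights)


def train(num_steps):
    policy_net = TinyNet([0.0, 0.0])
    target_net = TinyNet([0.0, 0.0])
    sync_steps = []
    for step in range(1, num_steps + 1):
        policy_net.train_step()        # the policy net is trained on every step
        if step % TARGET_UPDATE == 0:  # the target net is refreshed periodically
            sync_target(policy_net, target_net)
            sync_steps.append(step)
    return policy_net, target_net, sync_steps


if __name__ == "__main__":
    policy, target, syncs = train(10)
    print("sync steps:", syncs)
    print("policy weights:", policy.weights)
    print("target weights:", target.weights)
```

Between synchronizations the target net lags behind the policy net, so the values used as learning targets stay fixed for a while, which is the usual motivation for keeping two networks in DQN.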
SuperCrabLover/DQN_For_Pogema
About
Deep Q-Learning algorithm for Partially-Observable Grid Environment for Multiple Agents