The objective of this project is to build a reasonable AI agent to play Mahjong with human players.
The enviroment of Mahjong is built by RLCard.
- PyTorch
- RLCard
Train PPO Agent
python main.py --log_dir 'experiments/mahjong_ppo_result/'
Play Demo with trained agent
python demo.py
- PPO Agent
- Demo
- Distributed PPO Agent
- MAPPO Agent
- SAC Agent
- Add other types of Mahjong games