RL-Travelling Salesman Problem 🤖

Our project is based on Reinforcement Learning (RL) for solving Travelling Salesman Problem (TSP). Our code and experiments around the paper https://arxiv.org/abs/1802.04240. Here you can find our presentation.

We consider solving TCP solving with RL based on Pointer Network.

As the dataset 20 uniform distributed points from 0 to 1 for each coordinates were used.

For each dataset 4 different types of embeddings were used:

🔒 Linear layer;

🔒 Simple node encoding throw dot product with a random vector;

🔒 Node2Vec;

🔒 DeepWalk.

Quick start

To train for 30 epochs and infer dataset containing 20 points, using simple embeddings with embedding size 128 you may just run:

python main_not.py

Different inference commands

model

-e or --epochs - number of epochs. Default: 30;

-embedding or --embedding_size - size of embeddings. Default: 128;

--embedding_type - type of embeddings. Default: simple. Other possible options: linear, other;

-b or --batch_size - batch size. Default: 1024;

dataset

-train_size - size of train dataset for linear and simple embeddings. Default: 100_000;

--val_size - size of val dataset for linear and simple embeddings. Default: 1_000;

--path_train - path for other saved train embedding. Default: OtherNode2Vec_train.csv;

--path_val - path for other saved val embeddings. Default: OtherNode2Vec_val.csv.

Your embeddings

If you want to use other embeddings, for each item of dataset you should save .npz file, that contains:

embeddings;
initial item;

Then you should make a .csv files for train and val datasets, that constain pathes to emeddings.

Results

Train and val losses for linear and simple embeddings.

Pink color for linear embeddings with Dot Attention;
Blue color for simple embeddings with Dot Attention;

Orange color for linear embeddings with Bahdanau Attention;
Pink color for simple embeddings with Bahdanau Attention;

Pink color for linear embeddings with N = 200_000 (number of samples);
Green color for linear embeddings with N = 400_000 (number of samples);

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
dataset		dataset
embeddings		embeddings
model		model
pictures		pictures
train_RL		train_RL
.gitignore		.gitignore
Config.py		Config.py
DOCUMENTATION.MD		DOCUMENTATION.MD
arguments.py		arguments.py
main.py		main.py
presenation.pdf		presenation.pdf
readme.md		readme.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RL-Travelling Salesman Problem 🤖