Multi-agent statistical discriminative sub-trajectory mining

This repository contains the code for Multi-agent Stat-DSM (MA-Stat-DSM), a multi-agent statistical discriminative sub-trajectory mining method for analyzing the trajectories of multiple agents.

The MA-Stat-DSM method is demonstrated on tracking data of five agents from NBA basketball, to determine sub-matrices that discriminate between effective and ineffective plays.

Requirements

Create Environment

The required environment can be created using the environment.yml file. If you are running on your local machine, you can use Anaconda, in which case it is recommended to install mamba:

conda install mamba -n base -c conda-forge

Then, you can create the environment using the environment.yml file:

mamba env create -n geoenv -f environment.yml

Create PostgreSQL Database

Install PostgreSQL and create a PostgreSQL database with: dbname = 'postgres', user = 'postgres', host = 'localhost', password = 1234, and port = 5432.

Once Postgres has been installed, create the PostGIS extension:

CREATE EXTENSION postgis;

Then, to create the required database tables, run

python create_postgresql_db.py

Data

NBA Dataset

Download the NBA dataset from Google Drive by running the following command:

gdown --id 19PZPfg-EfXzcsGO6kr2fo-IuwUw9M5-q -O dataset_as_a_file_600_games.pkl

(This dataset was created/preprocessed from https://github.com/rajshah4/BasketballData and considers the trajectories for five agents -- ball, shooter, shooter defender, last passer, last passer defender -- from 600 NBA games in the 2015/2016 season.)

nba_data_preprocess.py:

Use nba_data_preprocess.py to create the trajectory_ma.csv and point_ma.csv files, which are input to MA-Stat-DSM.

Argument	Type	Required	Default	Description
`-y`, `--label`	`str`	Optional	`effective`	Type of label (`effective`, `score`, or `attempt`)
`-a`, `--a_list`	`str`	Optional	`['ball', 'shooter', 'lastpasser', 'shooterdefender', 'lastpasserdefender']`	List of agents
`-i`, `--time_int`	`str`	Optional	`t2`	Time interval (`t1` or `t2`)
`-d`, `--downsampling`	`int`	Optional	`1`	Downsample trajectories
`-g`, `--game_id`	`int`	Optional	`0`	Specify a particular match id
`-t`, `--team`	`int`	Optional	`0`	Specify a particular team id

To create trajectory_ma.csv and point_ma.csv files that contain all Cleveland match trajectories (with all points):

python nba_data_preprocess.py -t 1610612739

To create trajectory_ma.csv and point_ma.csv files that contain all Golden State Warriors match trajectories (with every fourth point, i.e., a downsampling rate of 4):

python nba_data_preprocess.py -d 4 -t 1610612744

Other Dataset

Rather than using the NBA dataset, if you have another multi-agent trajectory dataset you'd like to apply MA-Stat-DSM to, you can create trajectory_ma.csv and point_ma.csv files with the same structure and use them instead as input to MA-Stat-DSM.

MA-Stat-DSM

ma_stat_dsm.py:

Argument	Type	Required	Default	Description
`-l`, `--min_l`	`int`	Required		Minimum trajectory length
`-d`, `--dist_threshold`	`float`	Required		Distance threshold
`-p`, `--pos_label`	`str`	Optional	`1`	Positive label
`-n`, `--neg_label`	`str`	Optional	`0`	Negative label
`-i`, `--max_it`	`int`	Optional	`1000`	Maximum number of iterations
`-a`, `--alph`	`float`	Optional	`0.05`	Statistical significance level (alpha)
`-b`, `--hausdorff_base_dist`	`str`	Optional	`euclidean`	Options for base distance are euclidean, manhattan, chebyshev, or cosine
`-s`, `--seed`	`int`	Optional	`0`	Random seed for labels

E.g., running MA-Stat-DSM with minimum length 5 and distance threshold 1.5:

python ma_stat_dsm.py -l 5 -d 1.5

Identifying statistically significant discriminative sub-matrices

significant_subtrajectories.py:

Argument	Type	Required	Default	Description
`-d`, `--delta`	`str`	Yes		The delta* value that was printed upon completion of ma_stat_dsm.py

E.g.,

python significant_subtrajectories.py -d 0.0344542453452345

Obtaining discriminative trajectory sub-matrices

run.sh:

In run.sh, we iterate over different downsampling factors and MA-Stat-DSM parameters and apply to Golden State and Cleveland match trajectory data.

To run as a batch process, where million2 is the name of the server where postgres is installed:

sbatch -w million2 ./run.sh

Stat-DSM

Code for the original Stat-DSM method provided by Le Duy et al. (2020). The Stat-DSM code is also available on his repository here

stat_dsm.py

Argument	Type	Required	Default	Description
`-p`, `--pos_label`	`str`	No	`1`	The label for positive examples
`-n`, `--neg_label`	`str`	No	`0`	The label for negative examples
`-i`, `--max_it`	`int`	No	`1000`	The maximum number of iterations
`-alpha`, `--alph`	`float`	No	`0.05`	The statistical significance level
`-l`, `--min_l`	`int`	Yes		The minimum trajectory length
`-d`, `--dist_threshold`	`float`	Yes		The distance threshold
`-tr`, `--trajectory_table_name`	`str`	No	`trajectory`	The name of the trajectory table
`-po`, `--point_table_name`	`str`	No	`point`	The name of the point table
`-s`, `--seed`	`int`	No	`0`	The random seed for labels

References

Bunker et al. (2023). Multi-agent statistical discriminative sub-trajectory mining and an application to NBA basketball. Preprint. https://www.researchgate.net/profile/Rory-Bunker/publication/370062987

Le Duy et al (2020). Stat-DSM: Statistically Discriminative Sub-Trajectory Mining With Multiple Testing Correction. IEEE Transactions on Knowledge and Data Engineering, 34(3), 1477-1488.

Name		Name	Last commit message	Last commit date
Latest commit History 405 Commits
achive		achive
ad-hoc		ad-hoc
archive		archive
figs		figs
hurricane		hurricane
meta_data		meta_data
runtimes		runtimes
testing		testing
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
args.txt		args.txt
args_msdsm.txt		args_msdsm.txt
average_traj_lengths.py		average_traj_lengths.py
candidates.csv		candidates.csv
combined_df_final.csv		combined_df_final.csv
court_class.py		court_class.py
create_postgresql_db.py		create_postgresql_db.py
custom_csv_data_preprocess.py		custom_csv_data_preprocess.py
dataset_as_a_file_600_games.pkl		dataset_as_a_file_600_games.pkl
environment.yml		environment.yml
index_mapping.csv		index_mapping.csv
list_min_p.txt		list_min_p.txt
ma_stat_dsm.code-workspace		ma_stat_dsm.code-workspace
ma_stat_dsm.py		ma_stat_dsm.py
max_euclidean.py		max_euclidean.py
nba_court_T.png		nba_court_T.png
nba_data_preprocess.py		nba_data_preprocess.py
plot_run_time.ipynb		plot_run_time.ipynb
plot_single_play.py		plot_single_play.py
plots.ipynb		plots.ipynb
point.csv		point.csv
point_ma.csv		point_ma.csv
run.sh		run.sh
run_time_analysis.py		run_time_analysis.py
significant_subtrajectories.py		significant_subtrajectories.py
slurm-5405024.out		slurm-5405024.out
slurm-5415834.out		slurm-5415834.out
slurm-5433488.out		slurm-5433488.out
slurm-5437922.out		slurm-5437922.out
slurm-5439930.out		slurm-5439930.out
slurm-5451221.out		slurm-5451221.out
slurm-5451225.out		slurm-5451225.out
slurm-5451865.out		slurm-5451865.out
slurm-5453831.out		slurm-5453831.out
slurm-5457253.out		slurm-5457253.out
slurm-5464021.out		slurm-5464021.out
slurm-5466815.out		slurm-5466815.out
slurm-5479781.out		slurm-5479781.out
stat_dsm.py		stat_dsm.py
trajectory.csv		trajectory.csv
trajectory_ma.csv		trajectory_ma.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Multi-agent statistical discriminative sub-trajectory mining

Requirements

Create Environment

Create PostgreSQL Database

Data

NBA Dataset

nba_data_preprocess.py:

Other Dataset

MA-Stat-DSM

ma_stat_dsm.py:

Identifying statistically significant discriminative sub-matrices

significant_subtrajectories.py:

Obtaining discriminative trajectory sub-matrices

run.sh:

Stat-DSM

stat_dsm.py

References

About

Languages

rorybunker/ma-stat-dsm

Folders and files

Latest commit

History

Repository files navigation

Multi-agent statistical discriminative sub-trajectory mining

Requirements

Create Environment

Create PostgreSQL Database

Data

NBA Dataset

nba_data_preprocess.py:

Other Dataset

MA-Stat-DSM

ma_stat_dsm.py:

Identifying statistically significant discriminative sub-matrices

significant_subtrajectories.py:

Obtaining discriminative trajectory sub-matrices

run.sh:

Stat-DSM

stat_dsm.py

References

About

Topics

Resources

Stars

Watchers

Forks

Languages