The data and code for our IJCAI-ECAI 2018 Paper: Constructing Narrative Event Evolutionary Graph for Script Event Prediction.
Data used in our paper can be found here. The codes here include PyTorch implementations of the PairLSTM baseline and our SGNN model. Code for EventComp model and how to extract the narrative event chains from raw NYT news corpus can be found here.
You need to download the data I used from google-drive. Besides, you need Python3.5 or 3.6, PyTorch 0.3.0, and Nvidia GPU, perhaps Titan XP or Tesla P100. You can run python3 evaluate.py to get the results reported in my paper, and run python3 event_chain.py to train a PairLSTM model, and run python3 main_with_args.py to train a SGNN model. I have writen some annotations in my codes, please read them and run!
It is a very time consuming extraction pipeline from the raw NYT/Gigaword raw corpus to get the preprocessed data. The good news is that you don't need to download the Gigaword corpus, because I have provided all the data you need to run the code.
Original Data:
encoding_with_args.csv and data2.csv
are the constructed NEEG in the paper. corpus_index_train0.txt, corpus_index_dev.txt and corpus_index_test.txt
are the original training, development and test sets I used to train the SGNN model. Just use the pickle
module to load them.
- Linux OS
- Python 3.5 or 3.6
- PyTorch 0.3.0
- GPU (Tesla P100 or Others)