Name		Name	Last commit message	Last commit date
parent directory ..
Helper		Helper
MakePredictions.py		MakePredictions.py
README.md		README.md
TrainModel.py		TrainModel.py
requirements.txt		requirements.txt

README.md

Code

This folder contains (a) the code to apply a model to our datasets and (b) the code to train a model on our dataset.

📜 Code to apply the model
📜 Code to train the model
📂 Helper
- 📜 Code for performing a grid search
- 📜 Implementation of a slanted triangular learning rate (Howard and Ruder, 2018) and a three class accuracy metric

We tested the code using Python 3.10.6. The required libraries are listed in the requirements.txt.

Code to apply the model

You can apply a model of your choice to a dataset of your choice by changing the confiugration in line 6-9:

# Configuration
model_file = r'../Model/analysis-model.h5'
data_file = r'../Data/Datasets/NLP.json'
output_file = r'NLP-Predictions.json'

Make sure to use the analysis-model.h5 file (~420MB) and not the pointer file with the same name (~1KB).

Code to train the model

You can train a model on a dataset of your choice by changing the train and dev datasets in line 14-16:

# File Configuration
train_file = r'../Data/Human Annotated Data/NLP.json'
dev_file = r'../Data/Human Annotated Data/ML.json'

You can also specify the hyperparameters to use by modifying the dict in line 19-24 and you can set the number of models to train for each combination of hyperparameters in line 25:

# Hyperparameter Configuration
hyperparameters = {
	'batch_size': [16],
	'epochs': [3],
	'learning_rate': [5e-5],
	'warmup_ratio': [0.06],
}
executions_per_trial = 10

The following keys in the hyperparameters dict can be set to a list of values to try:

Hyperparameter	Description
`batch_size`	number of samples per gradient update, passed to Model.fit(...)
`epochs`	number of epochs to train the model, passed to Model.fit(...)
`learning_rate`	maximum learning rate for the slanted triangular learning rate
`warmup_ratio`	fraction of iterations the slanted triangular learning rate increases
`weight_decay`	weight decay, passed to AdamW(...)
`adam_epsilon`	small constant for numerical stability, passed to AdamW(...)
`adam_beta_1`	exponential decay rate for the 1st moment estimates, passed to AdamW(...)
`adam_beta_2`	exponential decay rate for the 2nd moment estimates, passed to AdamW(...)

After running the script the following contents will be in a subfolder of the TrainModel folder:

File	Description
`best-model.h5`	weights of the best model based on the MSE on the dev set (you can use the weights with the code to apply the model)
`best-configuration.json`	hyperparameters of the best model
`log.json`	detailed log of hyperparameters and metrics for all epochs and trials
`predictions.json`	predictions of all models for the train and dev sets

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Code

Code

README.md

Code

Code to apply the model

Code to train the model

Files

Code

Directory actions

More options

Directory actions

More options

Latest commit

History

Code

Folders and files

parent directory

README.md

Code

Code to apply the model

Code to train the model