Running BIUTEE

Table of Contents BIUTEE Pipeline Running BIUTEE from the Shell Pre-trained Models More Options

BIUTEE Pipeline

BIUTEE runs in three main phases:

Preprocessing (LAP)
Training
Testing

Given two XML files in RTE format, one for train and one for test, each needs to be preprocessed separately before training/testing. This converts the original text into an annotated dependency tree, and stores it in an internal format. This is useful, since one may want to experiment with different configurations of BIUTEE without running the entire LAP over and over again.

BIUTEE can then train a model from the preprocessed training data. This model is then used by BIUTEE during inference (the testing phase). It is important to note that BIUTEE can be tested in a certain configuration only when using a model that was trained under that exact configuration (same knowledge resources, optimization, etc).

BIUTEE also has an additional running mode - the visual tracing tool.

Running BIUTEE from the Shell

In Linux/Unix, BIUTEE can be run from the command line using run_biutee.sh from $BIUTEE/workdir. Running with -h reveals the modes and options:

 run_biutee.sh [OPTIONS] MODE
 
 Available modes:  lap_train
                   train
                   lap_train,train
                   lap_test
                   test
                   lap_test,test
                   visual
 
 -i <filename>     raw input file (rte pairs format)
 -s <filename>     serialized input file (saves/loads the preprocessed pairs after LAP)
 -m <dir>          models directory
 -o <filename>     results file (only in test modes)
 -r <list>         list of resources, separated by commas (no spaces)
 -t <int>          number of threads
 -e <port>         easyfirst parser's port
 -f                optimize F1 (instead of accuracy)
 -g                gap mode

For first-time use, we recommend running lap_train,train followed by lap_test,test. See how to upgrade BIUTEE and BIUTEE's knowledge resources for more information on -r.

Pre-trained Models

BIUTEE comes along with a few pre-trained models. They are all located under $BIUTEE/workdir/models/. The following table describes how each one was trained.

Model	Training Data	Resources
`basic`	RTE3 Dev Set	`WORDNET,CATVAR,SYNTACTIC`
`basic_bap`	RTE3 Dev Set	`WORDNET,CATVAR,SYNTACTIC,REDIS_BAP`

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Running BIUTEE

Table of Contents

BIUTEE Pipeline

Running BIUTEE from the Shell

Pre-trained Models

More Options

Documentation

Get Involved

Clone this wiki locally