- For MLM: the data is now divided over the number of epochs. The model sees every instance of the train data just once, and the number of instances per epoch = total_instances/num_epochs.
- Added Pearson correlation metric
- Fixed sentence counts for sentence level datasets
- Log the stderr
- Don't report 0.0 scores for out-of-dataset metrics
- Multi-seq fixed for CPU use
- Progress bar is now correct
- Updated some documentation
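
The MLM change above can be illustrated with a minimal sketch. This is not the project's actual code: the function name `split_over_epochs` and the choice to give leftover instances to the first epochs are assumptions made only for illustration.

```python
def split_over_epochs(instances, num_epochs):
    """Split `instances` into `num_epochs` contiguous, near-equal chunks.

    Hypothetical helper: each instance lands in exactly one chunk, so a
    model trained on one chunk per epoch sees every instance once.
    """
    if num_epochs < 1:
        raise ValueError("num_epochs must be at least 1")
    base, extra = divmod(len(instances), num_epochs)
    chunks = []
    start = 0
    for epoch in range(num_epochs):
        # Assumption: the first `extra` epochs take one additional
        # instance so that no data is dropped when the split is uneven.
        size = base + (1 if epoch < extra else 0)
        chunks.append(instances[start:start + size])
        start += size
    return chunks


data = list(range(10))
chunks = split_over_epochs(data, 3)
```

With 10 instances and 3 epochs, each epoch gets roughly total_instances/num_epochs instances, and concatenating the chunks gives back the original data.
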
Known issues:
- For generative models, classification should probably use the last subword instead of the first
- Freezing the language model seems to be broken
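
For the first known issue, a possible direction is sketched below. In a left-to-right generative model only the last real subword has attended to the whole input, which is why it is the likelier candidate for classification. The function name `classification_index`, the `use_last` flag, and the assumption of right-padded sequences are all hypothetical and not taken from the codebase.

```python
def classification_index(attention_mask, use_last=True):
    """Return the position of the subword to classify from.

    attention_mask: list of 0/1 ints, 1 for real subwords and 0 for
    padding. Assumes the sequence is right-padded (hypothetical setup).
    """
    length = sum(attention_mask)
    if length == 0:
        raise ValueError("sequence has no real subwords")
    # Last real subword for generative models, first subword otherwise.
    return length - 1 if use_last else 0


mask = [1, 1, 1, 0, 0]
last_idx = classification_index(mask, use_last=True)
first_idx = classification_index(mask, use_last=False)
```
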