Skip to content

v0.4

Compare
Choose a tag to compare
@robvanderg robvanderg released this 27 Oct 08:40
· 162 commits to master since this release

This is a major update, there is no backwards compatibility, and performance is known to be different. This version is written from scratch and reduces the dependence on other python packages. New features include:

  • multiclas task type
  • regression task type
  • Log the losses of each task
  • Support a larger variety of language models (autoregressive models, models without special tokens)
  • Layer attention per task (and logging of its weights)
  • Plot the scores each epoch
  • Report dataset statistics
  • Better topn output support
  • Automatically detects size of language model
  • Code easier to debug

The main difference in normal usage is that validation_data_set is renamed to dev_data_set

Missing features (Some of these might be included in updates):

  • seq2seq task type
  • pearson correlation metric
  • dataset embeddings
  • --raw
  • label balancing

Please note that this version is tested less than the previous version, as it was already used for thousands of experiments. Please let us know if you find any bugs.