The usage of this task type is similar as seq. However, this task type assumes that
your data is in BIO format, and also guarantees that the output will be in valid BIO format.
Under the hood it is using a masked CRF, which disallows ill-formed output. Performance can be
expected to be higher for span labeling tasks (like NER) compared to the standard seq
decoder.