Is there a way to prevent the GPT-J model from generating excess tokens after it has produced the required answer? For example, the GPT-3 API accepts a stop sequence such as ### to cut off unnecessary tokens. Is there any way I can achieve the same without manually restricting the gen_len parameter?
Thanks in advance!
This is not implemented in the JAX codebase (and there are no plans to implement it in the foreseeable future); however, you can do this with the Hugging Face implementation (huggingface/transformers#13022).
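If you stay on the JAX codebase, a common workaround is to generate up to gen_len tokens as usual and then truncate the decoded text at the first occurrence of your chosen stop sequence. The sketch below is a minimal, framework-agnostic illustration of that idea; the helper name `truncate_at_stop` and the `###` marker are just examples, not part of any GPT-J API.

```python
def truncate_at_stop(text: str, stop: str = "###") -> str:
    """Return text up to (but not including) the first stop sequence.

    If the stop sequence never appears, the text is returned unchanged.
    The stop string "###" here mirrors the GPT-3-style convention from
    the question; any marker your prompts use consistently will work.
    """
    idx = text.find(stop)
    return text if idx == -1 else text[:idx]


# Example: strip everything after the model's answer.
generated = "The capital of France is Paris.### Q: What is the capital of"
print(truncate_at_stop(generated))  # -> "The capital of France is Paris."
```

This doesn't save any compute (the excess tokens are still generated), but it cleans up the output. To actually halt decoding early you need hooks in the generation loop, which is what the Hugging Face implementation provides (e.g. via its stopping-criteria machinery) and what the linked issue discusses.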