Allow Prefilling Assistant Response w/ Chat Templates #2248

haileyschoelkopf · 2024-08-23T19:27:08Z

Models that are open-source and/or used via local-completions, as well as Claude, allow one to "prefill" the start of the assistant's response to a given input: https://docs.anthropic.com/en/docs/build-with-claude/prompt-engineering/prefill-claudes-response

We don't currently support this with the current chat templating. We should consider supporting this via a new doc_to_text_response_prefill (name TBD...) field in the config file, for portions of input that are post-pended after applying a chat template.

One downside is that we would have to make models that can't accept such prefilled responses error out or ignore the prefill when evaluating on a task that uses this, and also this would further complicate the construction of contexts. So somewhat a tough decision. But Llama-3 uses this for tasks such as evaluation on MBPP so it's worth considering

The text was updated successfully, but these errors were encountered:

baberabb · 2024-08-24T15:14:30Z

They also use it for MMLU:
'gen_prefix': 'The best answer is '

haileyschoelkopf added the feature request A feature that isn't implemented yet. label Aug 23, 2024

baberabb linked a pull request Sep 2, 2024 that will close this issue

Gen Prefix #2274

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow Prefilling Assistant Response w/ Chat Templates #2248

Allow Prefilling Assistant Response w/ Chat Templates #2248

haileyschoelkopf commented Aug 23, 2024

baberabb commented Aug 24, 2024

Allow Prefilling Assistant Response w/ Chat Templates #2248

Allow Prefilling Assistant Response w/ Chat Templates #2248

Comments

haileyschoelkopf commented Aug 23, 2024

baberabb commented Aug 24, 2024