Conversation pipeline fixes #26795
Conversation
The documentation is not available anymore as the PR was closed or merged.
Also cc @lewtun - now you should actually be able to just use this pipeline in the docstrings instead of needing to do it manually.
Looks good to me. We could set a default max_length on the class rather than hardcoding it, wdyt?
```python
n = model_inputs["input_ids"].shape[1]
if max_length - minimum_tokens < n:
```
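For context, a hedged sketch of what this length check guards: if the prompt leaves less than `minimum_tokens` of room under `max_length`, the oldest prompt tokens get dropped. The helper name and trimming details below are assumptions for illustration, not the verbatim pipeline code.

```python
import torch

def truncate_for_generation(model_inputs, max_length, minimum_tokens):
    """Hypothetical helper: ensure at least `minimum_tokens` of generation
    room remain under `max_length`, trimming the oldest prompt tokens."""
    n = model_inputs["input_ids"].shape[1]
    if max_length - minimum_tokens < n:
        keep = max_length - minimum_tokens  # prompt tokens we can keep (assumed)
        model_inputs["input_ids"] = model_inputs["input_ids"][:, -keep:]
        if "attention_mask" in model_inputs:
            model_inputs["attention_mask"] = model_inputs["attention_mask"][:, -keep:]
    return model_inputs

# Example: a 30-token prompt with max_length=32 and minimum_tokens=10
# keeps only the last 22 tokens.
inputs = {"input_ids": torch.ones(1, 30, dtype=torch.long)}
truncate_for_generation(inputs, max_length=32, minimum_tokens=10)
```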
nice cleanup!
```diff
@@ -268,6 +268,10 @@ def __call__(self, conversations: Union[Conversation, List[Conversation]], num_w
+        # Otherwise the threads will require a Conversation copy.
+        # This will definitely hinder performance on GPU, but has to be opted
+        # in because of this BC change.
+        if isinstance(conversations, list) and isinstance(conversations[0], dict):
```
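For readers following the thread, a hedged sketch of what this branch enables — the wrapping into `Conversation` is an assumption drawn from the diff context, not the verbatim pipeline code:

```python
from transformers import Conversation

# Plain chat-format input, as a user might pass it to the pipeline directly.
conversations = [{"role": "user", "content": "Hi there!"}]

# Hedged sketch: wrap a naked list of message dicts into a Conversation,
# assuming the Conversation constructor accepts chat-format messages.
if isinstance(conversations, list) and isinstance(conversations[0], dict):
    conversations = Conversation(conversations)
```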
can we update the doc to mention the type of input we expect?
Done!
LGTM -- assuming the input documentation is updated :)
```diff
         conversation = model_inputs.pop("conversation")
-        generate_kwargs["max_length"] = max_length
+        if "max_length" not in generate_kwargs and "max_new_tokens" not in generate_kwargs:
+            generate_kwargs["max_new_tokens"] = 256
```
👍
It should be safe to add this default, as `generate` should throw a warning if the generation goes beyond what's supported by the model.
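As a usage note, a small sketch of how the fallback interacts with caller-supplied settings (model choice illustrative): generation kwargs passed at call time are forwarded to `generate`, so the 256-token default only applies when the caller sets neither `max_length` nor `max_new_tokens`.

```python
from transformers import Conversation, pipeline

chatbot = pipeline("conversational", model="facebook/blenderbot-400M-distill")
conversation = Conversation("What's the best way to learn Python?")

# An explicit kwarg wins over the pipeline's max_new_tokens=256 fallback.
result = chatbot(conversation, max_new_tokens=64)
print(result)
```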
Force-pushed from d9d9dea to 89e48f6:

* Adjust length limits and allow naked conversation list inputs
* Adjust length limits and allow naked conversation list inputs
* Maybe use a slightly more reasonable limit than 1024
* Skip tests for old models that never supported this anyway
* Cleanup input docstrings
* More docstring cleanup + skip failing TF test
* Make fixup
This PR makes a couple of fixes to `ConversationalPipeline` to make it a lot easier to use:

* The pipeline now accepts plain lists of message dicts as input, not just `Conversation` objects. The `Conversation` class is quite hard for users to discover, and this is a lot more intuitive.
* The pipeline no longer takes `max_length` from the model, because very few models set this parameter, and so it's almost always the default `PretrainedConfig` value of 20, which is very low. Before this change, most calls to `ConversationalPipeline` produced no output or unnecessarily truncated the input because this limit was hit. We change the pipeline to use `max_new_tokens` instead, which is more modern.

cc @ArthurZucker for pipeline review and @gante if he has any comments about setting the generation parameters properly!
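To illustrate the first fix, a hedged usage sketch (model name illustrative; assumes chat-format lists of role/content dicts are now accepted directly):

```python
from transformers import pipeline

chatbot = pipeline("conversational", model="facebook/blenderbot-400M-distill")

# No Conversation object needed: pass a chat-format list of dicts directly.
messages = [{"role": "user", "content": "Any recommendations for a good sci-fi novel?"}]
result = chatbot(messages)
print(result)
```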