Image pipelines spec compliance #33899

Rocketknight1 · 2024-10-02T18:03:48Z

This PR checks for Hub spec compliance for a lot of image pipelines that have very similar inputs. In most cases, no changes are required except deprecating the timeout argument, and renaming the images input to inputs.

HuggingFaceDocBuilderDev · 2024-10-02T19:38:04Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

LysandreJik

Looks good, but I wonder if we shouldn't mention in the deprecation/in the pipeline docs that the pipelines will now be 1-1 with the HF specs in general, to justify the change from images to inputs

LysandreJik · 2024-10-03T08:20:12Z

src/transformers/pipelines/depth_estimation.py

+            warnings.warn(
+                "The `images` argument has been renamed to `inputs`. In version 5 of Transformers, `images` will no longer be accepted",
+                FutureWarning,
+            )


I wonder if these warnings (and the docs) shouldn't point to the global definition of specs across the HF ecosystem to justify such a change

What's the right document to link users to as the "source of truth" there? Just the folder in the Hub repo here?

LysandreJik · 2024-10-03T08:20:38Z

src/transformers/pipelines/image_classification.py

@@ -1,3 +1,4 @@
+import warnings


This file is lacking the copyright!
Can you add it?

Wauplin · 2024-10-03T11:41:06Z

src/transformers/pipelines/depth_estimation.py

@@ -50,12 +51,12 @@ def __init__(self, *args, **kwargs):
        requires_backends(self, "vision")
        self.check_model_type(MODEL_FOR_DEPTH_ESTIMATION_MAPPING_NAMES)

-    def __call__(self, images: Union[str, List[str], "Image.Image", List["Image.Image"]], **kwargs):
+    def __call__(self, inputs: Union[str, List[str], "Image.Image", List["Image.Image"]] = None, **kwargs):


Note for myself: this change won't break Inference API since it's using positional argument and not keyword one (see here - private link). So all good to change it, same for all other pipelines :)
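A minimal sketch of why the rename is invisible to positional callers. `old_call` and `new_call` are hypothetical stand-ins for the pipeline signature before and after this PR, not transformers code:

```python
def old_call(images, **kwargs):
    """Stand-in for the signature before the rename."""
    return images


def new_call(inputs=None, **kwargs):
    """Stand-in for the signature after the rename."""
    return inputs


# Positional callers never spell the parameter name, so both versions agree.
assert old_call("cat.png") == new_call("cat.png") == "cat.png"

# Keyword callers are the only ones affected: `images=` now lands in **kwargs,
# so `inputs` keeps its default unless the body handles the old name.
assert new_call(images="cat.png") is None
```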

Wauplin · 2024-10-03T11:45:47Z

src/transformers/pipelines/image_classification.py

+        if "images" in kwargs:
+            warnings.warn(
+                "The `images` argument has been renamed to `inputs`. In version 5 of Transformers, `images` will no longer be accepted",
+                FutureWarning,
+            )


I wonder if this is working as expected. Say you have a user script like this:

    pipeline = ImageClassificationPipeline(...)
    output = pipeline(images=<my-image>, foo="bar")

with this PR, the pipeline call will fail with something like "requires 1 positional argument, got 0" because inputs is not provided.

what I would suggest is to have a decorator like this:

    import warnings
    from typing import Callable

    def _rename_positional_arg(old_name: str) -> Callable:
        def _decorator(fn: Callable) -> Callable:
            def _inner(*args, **kwargs):
                if old_name in kwargs:
                    # args[0] is `self` when decorating a method, so a second
                    # positional argument means `inputs` was already given
                    if len(args) > 1 or "inputs" in kwargs:
                        raise ValueError(f"Cannot pass 'inputs' and '{old_name}' at the same time.")
                    warnings.warn(f"You should use 'inputs' instead of '{old_name}'.", FutureWarning)
                    kwargs["inputs"] = kwargs.pop(old_name)
                return fn(*args, **kwargs)
            return _inner
        return _decorator

which you can use like this:

    ...
    @_rename_positional_arg(old_name="images")
    def __call__(self, inputs: Union[str, List[str], "Image.Image", List["Image.Image"]] = None, **kwargs):
        ...  # here you are guaranteed "inputs" is passed

This way, the pipeline will work in all three cases:

inputs are passed as positional args (as done in Inference API)

inputs are passed as inputs

inputs are passed as images

(I did not find a satisfying decorator name but you have the idea^^)
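A runnable variant of the decorator idea, offered as a sketch only. `rename_kwarg` and `ToyPipeline` are hypothetical names, and the toy class simply echoes its input instead of running a model:

```python
import functools
import warnings
from typing import Any, Callable


def rename_kwarg(old_name: str, new_name: str = "inputs") -> Callable:
    """Hypothetical helper: forward a deprecated keyword to its new name."""

    def decorator(fn: Callable) -> Callable:
        @functools.wraps(fn)
        def inner(self, *args: Any, **kwargs: Any):
            if old_name in kwargs:
                # `self` is bound separately, so any remaining positional
                # argument would already occupy the `inputs` slot.
                if args or new_name in kwargs:
                    raise ValueError(
                        f"Cannot pass '{new_name}' and '{old_name}' at the same time."
                    )
                warnings.warn(
                    f"`{old_name}` has been renamed to `{new_name}`.", FutureWarning
                )
                kwargs[new_name] = kwargs.pop(old_name)
            return fn(self, *args, **kwargs)

        return inner

    return decorator


class ToyPipeline:
    """Stand-in for an image pipeline; it just echoes its input."""

    @rename_kwarg("images")
    def __call__(self, inputs, **kwargs):
        return inputs


pipe = ToyPipeline()
with warnings.catch_warnings():
    warnings.simplefilter("ignore", FutureWarning)
    # Positional, new keyword, and old keyword all reach `inputs`.
    results = [pipe("cat.png"), pipe(inputs="cat.png"), pipe(images="cat.png")]
assert results == ["cat.png"] * 3
```

With this approach the wrapped method can keep `inputs` as a required parameter, since the old keyword is translated before the call reaches it.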

Actually, it should work! I set a new default argument value of None for inputs, so the pipeline can be called without that argument. However, it will raise an exception if neither inputs nor images are passed.

In other words:

Case 1: Users are passing "images" as a positional argument. The argument is renamed to inputs and they don't notice any change

Case 2: Users are passing "images" as a kwarg. inputs is now None but **kwargs contains an "images" key. The code detects this and sets inputs = kwargs.pop("images"), then prints the deprecation warning

Ooh, sorry I did not see the added = None. You are completely right, forget what I meant above :)
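The two cases above can be sketched as follows. This is an illustration under stated assumptions, not the PR's code: `ToyPipeline` is a hypothetical stand-in and the error message wording is made up.

```python
import warnings


class ToyPipeline:
    """Stand-in for an image pipeline; it just echoes its input."""

    def __call__(self, inputs=None, **kwargs):
        # Case 2: the old keyword arrives through **kwargs.
        if "images" in kwargs:
            warnings.warn(
                "The `images` argument has been renamed to `inputs`.",
                FutureWarning,
            )
            inputs = kwargs.pop("images")
        # The `None` default makes the argument optional in the signature,
        # so the "missing input" check has to happen in the body.
        if inputs is None:
            raise ValueError("Cannot call the pipeline without an `inputs` argument.")
        return inputs


pipe = ToyPipeline()

# Case 1: positional call, unaffected by the rename.
assert pipe("cat.png") == "cat.png"

# Case 2: old keyword still works, with a FutureWarning.
with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    assert pipe(images="cat.png") == "cat.png"
assert caught[0].category is FutureWarning
```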

LysandreJik · 2024-10-03T12:52:46Z

Actually, thinking more about it, and in line with having fewer and fewer warnings in transformers, I wonder if we want to have a deprecation warning for images vs inputs, or if we just want to enable both without warning

Rocketknight1 · 2024-10-03T12:56:39Z

@LysandreJik yes, I'm fine with that! Want me to just drop all the deprecation warnings? I can maybe throw an error if users specify both arguments, or something.
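One way the "accept both, error on conflict" idea could look, as a sketch only. `ToyPipeline` and both error messages are hypothetical, and nothing here is confirmed to match the merged code:

```python
class ToyPipeline:
    """Stand-in for an image pipeline; it just echoes its input."""

    def __call__(self, inputs=None, **kwargs):
        # Silently accept the legacy keyword as an alias for `inputs`.
        legacy = kwargs.pop("images", None)
        if inputs is not None and legacy is not None:
            raise ValueError("Pass either `inputs` or `images`, not both.")
        if inputs is None:
            inputs = legacy
        if inputs is None:
            raise ValueError("Cannot call the pipeline without an `inputs` argument.")
        return inputs


pipe = ToyPipeline()
assert pipe("cat.png") == "cat.png"
assert pipe(inputs="cat.png") == "cat.png"
assert pipe(images="cat.png") == "cat.png"
```

The trade-off is that the signature advertises `inputs` as optional even though a call without any input still fails, only later and with a custom message.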


Wauplin · 2024-10-03T13:35:55Z

If no deprecation warning is raised (i.e. accept both inputs and images in the long run), I would make sure the method signature is in line with the recommendation (i.e. passing inputs). In that sense, I don't think that having inputs: ... = None in the long run is desirable, as it can be misleading to users. If it was only for a few versions before settling on the definitive signature (i.e. no images at all), it would have been fine, but that's not the case, right?

Rocketknight1 · 2024-10-07T13:55:29Z

@Wauplin Unfortunately, if we don't set inputs = None, then users have to pass inputs, and we can't accept images as an alternative.

I think it's okay, though! There are a lot of classes in transformers that have None defaults for mandatory inputs - it just means that we handle the error when the argument is missing somewhere else.

LysandreJik · 2024-10-08T10:32:29Z

I'll let you decide the best course of action in terms of signature/arguments @Rocketknight1, but dropping the deprecation warnings seems to me like the right thing to do here

Rocketknight1 · 2024-10-08T12:34:04Z

Done! Warnings are removed, but I'll accept both keywords for now - I don't think the main input having a default of None should be too confusing, especially since the docstrings are clear.

* Update many similar visual pipelines
* Add input tests
* Add ImageToText as well
* Add output tests
* Add output tests
* Add output tests
* OutputElement -> Output
* Correctly test elements
* make fixup
* fix typo in the task list
* Fix VQA testing
* Add copyright to image_classification.py
* Revert changes to VQA pipeline because outputs have differences - will move to another PR
* make fixup
* Remove deprecation warnings

Rocketknight1 added 9 commits October 2, 2024 18:58

Update many similar visual pipelines

bb0624f

Add input tests

fc07a82

Add ImageToText as well

3035b03

Add output tests

4b25c86

Add output tests

c85a9a8

Add output tests

ca57cc9

OutputElement -> Output

6130447

Correctly test elements

ed2716a

make fixup

cc3abef

Rocketknight1 requested review from Wauplin and LysandreJik October 2, 2024 19:03

Rocketknight1 added 2 commits October 2, 2024 20:06

fix typo in the task list

379795e

Fix VQA testing

9605b08

LysandreJik approved these changes Oct 3, 2024


Wauplin reviewed Oct 3, 2024


Add copyright to image_classification.py

e422d96

Rocketknight1 added 2 commits October 3, 2024 14:29

Revert changes to VQA pipeline because outputs have differences - will move to another PR

910cbf9

make fixup

79741ea

Remove deprecation warnings

e99a818

Rocketknight1 merged commit 3b44d2f into main Oct 8, 2024
20 checks passed

Rocketknight1 deleted the image_pipelines_spec_compliance branch October 8, 2024 12:34
