OmDet Turbo processor standardization #34937

qubvel · 2024-11-26T12:51:59Z

What does this PR do?

Standardize post-processing for OmDet Turbo model (see #34926 for more info)

Rename score_threshold to threshold argument in post_process_grounded_object_detection
Rename classes to text_labels argument in post_process_grounded_object_detection and make it optional
Rename classes key to text_labels in post-processed output dictionary
Add labels (class indexes) key to post-processed output dictionary

The signature is going to be:

    def post_process_grounded_object_detection(
        self,
        outputs: "OmDetTurboObjectDetectionOutput",
        text_labels: Optional[Union[List[str], List[List[str]]]] = None,           # <--------- renamed: classes -> text_labels + Optional now
        threshold: float = 0.3,                                                    # <--------- renamed: score_threshold -> threshold
        nms_threshold: float = 0.5,
        target_sizes: Optional[Union[TensorType, List[Tuple]]] = None,
        max_num_det: Optional[int] = None,
    )

The output is going to be:

[
  {
      "boxes": torch.tensor of shape (N, 4),
      "labels": torch.tensor of shape (N,),                                       # <-------- new key, int indices, consistent with other object detection outputs
      "scores": torch.tensor of shape (N,),
      "text_labels": list of str names of len N (previously classes) or `None`    # <-------- renamed: classes -> text_labels
  },
  ...
]

"classes" key is also available, it will return text_labels and issue a warning

TODO:

Add deprecation for dict key

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

HuggingFaceDocBuilderDev · 2024-11-26T13:19:24Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

qubvel · 2024-11-26T17:22:52Z

cc @yonigozlan for review if you have bandwidth

yonigozlan

Looks much better thanks for refactoring! 🤗

src/transformers/models/omdet_turbo/processing_omdet_turbo.py

docs/source/en/model_doc/omdet-turbo.md

Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com>

qubvel · 2024-12-02T14:58:08Z

@ArthurZucker please review when you have bandwidth

ArthurZucker

Thanks 🤗 sorry for the delay

ArthurZucker · 2025-01-07T16:06:01Z

src/transformers/models/omdet_turbo/processing_omdet_turbo.py

@@ -55,11 +69,23 @@ class OmDetTurboProcessorKwargs(ProcessingKwargs, total=False):
    }


-if is_torch_available():
-    import torch
+class _dict_with_warning(dict):


let's use camel casing please!

Fixed in 2d7a6b2

ArthurZucker · 2025-01-07T16:09:25Z

src/transformers/models/omdet_turbo/processing_omdet_turbo.py

            )
+            result = _dict_with_warning({"boxes": boxes, "scores": scores, "labels": labels, "text_labels": None})


not 100% we need this (the class is a bit over engineered) but alright

* Fix docstring * Fix docstring * Add `classes_structure` to model output * Update omdet postprocessing * Adjust tests * Update code example in docs * Add deprecation to "classes" key in output * Types, docs * Fixing test * Fix missed clip_boxes * [run-slow] omdet_turbo * Apply suggestions from code review Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com> * Make CamelCase class --------- Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com>

qubvel added 6 commits November 21, 2024 16:13

Fix docstring

f560077

Fix docstring

27c429d

Add classes_structure to model output

1513f15

Update omdet postprocessing

63862f8

Adjust tests

5a18357

Update code example in docs

36138ed

qubvel marked this pull request as draft November 26, 2024 12:52

qubvel added Vision Processing labels Nov 26, 2024

qubvel added the run-slow label Nov 26, 2024

qubvel added 5 commits November 26, 2024 16:12

Add deprecation to "classes" key in output

b6f8615

Types, docs

c4065c3

Fixing test

decd7e8

Fix missed clip_boxes

78948e0

[run-slow] omdet_turbo

7455a89

qubvel marked this pull request as ready for review November 26, 2024 17:22

qubvel requested a review from yonigozlan November 26, 2024 17:22

yonigozlan reviewed Nov 27, 2024

View reviewed changes

src/transformers/models/omdet_turbo/processing_omdet_turbo.py Outdated Show resolved Hide resolved

src/transformers/models/omdet_turbo/processing_omdet_turbo.py Outdated Show resolved Hide resolved

yonigozlan reviewed Nov 27, 2024

View reviewed changes

docs/source/en/model_doc/omdet-turbo.md Outdated Show resolved Hide resolved

Apply suggestions from code review

c17a8c8

Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com>

qubvel requested a review from ArthurZucker December 2, 2024 14:57

ArthurZucker approved these changes Jan 7, 2025

View reviewed changes

Make CamelCase class

2d7a6b2

qubvel requested review from molbap, stevhliu and Rocketknight1 as code owners January 16, 2025 17:10

qubvel merged commit 42b2857 into huggingface:main Jan 17, 2025
16 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

OmDet Turbo processor standardization #34937

OmDet Turbo processor standardization #34937

qubvel commented Nov 26, 2024 •

edited

Loading

HuggingFaceDocBuilderDev commented Nov 26, 2024

qubvel commented Nov 26, 2024

yonigozlan left a comment

qubvel commented Dec 2, 2024

ArthurZucker left a comment

ArthurZucker Jan 7, 2025

qubvel Jan 17, 2025

ArthurZucker Jan 7, 2025

		)
		result = _dict_with_warning({"boxes": boxes, "scores": scores, "labels": labels, "text_labels": None})

OmDet Turbo processor standardization #34937

OmDet Turbo processor standardization #34937

Conversation

qubvel commented Nov 26, 2024 • edited Loading

What does this PR do?

TODO:

Before submitting

Who can review?

HuggingFaceDocBuilderDev commented Nov 26, 2024

qubvel commented Nov 26, 2024

yonigozlan left a comment

Choose a reason for hiding this comment

qubvel commented Dec 2, 2024

ArthurZucker left a comment

Choose a reason for hiding this comment

ArthurZucker Jan 7, 2025

Choose a reason for hiding this comment

qubvel Jan 17, 2025

Choose a reason for hiding this comment

ArthurZucker Jan 7, 2025

Choose a reason for hiding this comment

qubvel commented Nov 26, 2024 •

edited

Loading