Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention.
In this section, we present FasterViT object detection repository with DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection.
Please stay tuned for more pre-trained checkpoints !
Model | Backbone | Box AP | Download |
---|---|---|---|
DINO-4scale | FasterViT-4-21K-224 | 55.16 | model |
Copyright © 2023, NVIDIA Corporation. All rights reserved.
This work is made available under the NVIDIA Source Code License-NC. Click here to view a copy of this license.
For license information regarding the DINO repository, please refer to its repository.
This repository is built on top of the DINO repository repository. We thank the authors for their amazing work and releasing their code base.