FasterViT: Fast Vision Transformers with Hierarchical Attention

Object Detection with DINO

Please stay tuned for more pre-trained checkpoints !

Model	Backbone	Box AP	Download
DINO-4scale	FasterViT-4-21K-224	55.16	model

This work is made available under the NVIDIA Source Code License-NC. Click here to view a copy of this license.

For license information regarding the DINO repository, please refer to its repository.

This repository is built on top of the DINO repository repository. We thank the authors for their amazing work and releasing their code base.