Code for my dissertation project, based on ViT (Dosovitskiy A, Beyer L, Kolesnikov A, et al.). JART stands for Jiang Action Recognition Transformer, a Transformer for image classification tasks. JART has shown excellent performance on the UCF101 dataset.
Download the UCF-101jpg dataset (not the original UCF101) from here, then set train_list_path
in train.py to the path where you saved the dataset. Then just run train.py.
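
A wrong dataset path is the most likely reason for the first run to fail, so it can help to check the path before training starts. The snippet below is only a sketch: it assumes train_list_path is a plain string in train.py, and the helper check_dataset_path and the example path are hypothetical names that are not part of this repository.

```python
from pathlib import Path


def check_dataset_path(train_list_path: str) -> Path:
    """Return the dataset location as a Path, or raise if it does not exist.

    Hypothetical helper, not part of train.py.
    """
    path = Path(train_list_path).expanduser()
    if not path.exists():
        raise FileNotFoundError(
            f"train_list_path does not exist: {path}. "
            "Point it at the location where you saved the UCF-101jpg dataset."
        )
    return path


# Example usage (the path is a placeholder, replace it with your own):
# train_list_path = "/data/UCF-101jpg"
# check_dataset_path(train_list_path)
```

Calling the helper right after train_list_path is set makes a missing or mistyped path fail immediately with a readable message instead of partway through data loading.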