Zhiheng Li, Wenjia Geng, Muheng Li, Lei Chen, Yansong Tang, Jiwen Lu, Jie Zhou
python==3.8.17
Install other packages pip install -r requirements.txt
This code assumes CUDA support.
cd datasets/CrossTask_assets
wget https://www.di.ens.fr/~dzhukov/crosstask/crosstask_release.zip
wget https://www.di.ens.fr/~dzhukov/crosstask/crosstask_features.zip
wget https://vision.eecs.yorku.ca/WebShare/CrossTask_s3d.zip
unzip '*.zip'
Please download the pretrained models from Google Drive.
Arrange pretrained models into the path checkpoint/CrossTask_t3 or 4 or 5 or 6_best.pth.tar
T = 3:
python train_cont.py
T = 4:
python train_tower4.py
T = 5:
python train_tower5.py
T = 6:
python train_tower6.py
T = 3:
python test_cont.py
T = 4:
python test_tower4.py
T = 5:
python test_tower5.py
T = 6:
python test_tower6.py
If you find this code useful in your work then please cite:
@inproceedings{li2023skip,
title={Skip-Plan: Procedure Planning in Instructional Videos via Condensed Action Space Learning},
author={Li, Zhiheng and Geng, Wenjia and Li, Muheng and Chen, Lei and Tang, Yansong and Lu, Jiwen and Zhou, Jie},
booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision},
pages={10297--10306},
year={2023}
}
Please contact Zhiheng Li @ lizhihan21@mails.tsinghua.edu.cn if any issue.
This code is built on P3IV. We thank the authors for sharing their codes and extracted features.