Lightweight Transformer for Vision-and-Language Tasks

We have implemented transformer and lightweight transformer for a set of Vision-and-Language tasks.

For Visual Question Answering, you can refer to here to reproduce results in our paper.

For Referring Expressiong Comprehension, you can refer to here to reproduce results in our paper.

For Image Captioning, you can refer to here to reproduce results in our paper.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
ImageCaptioning		ImageCaptioning
REC		REC
VQA		VQA
README.md		README.md

Provide feedback