This is the official repo for the paper Geometry Attention Transformer with Position-aware LSTMs for Image Captioning.
- Install the dependancies in
requirements.txt - Install java for evaluation, e.g. CIDEr, BLEU.
train_GAT3.shis for training our GAT model.train_transformer.shis for training the vanilla transformer model.
The evaluation codes are from Microsoft COCO Caption Evaluation and Consensus-based Image Description Evaluation.