This repository contains data and code for the papers Relation Transformer Network and Scenes and Surroundings: Scene Graph Generation using Relation Transformer(ICML workshop,2020) . This repository can also be used as a scene graph generator for visual question answering(VQA), please see our work Graphhopper: Multi-Hop Scene Graph Reasoning for Visual Question Answering (ISWC,2021) for more details. If you like the paper, please cite our work:
@article{koner2020relation,
title={Relation transformer network},
author={Koner, Rajat and Sinhamahapatra, Poulami and Tresp, Volker},
journal={arXiv preprint arXiv:2004.06193},
year={2020}
}
@article{koner2021scenes,
title={Scenes and Surroundings: Scene Graph Generation using Relation Transformer},
author={Koner, Rajat and Sinhamahapatra, Poulami and Tresp, Volker},
journal={arXiv e-prints},
pages={arXiv--2107},
year={2021}
}
For visual genome please check the branch vg_0.4
For GQA and its associated scene graph generation for visual question answering please check the branch gqa_1.4
Feel free to open an issue if you encounter trouble getting it to work!