TensorFlow implementation of the paper "Attentive Semantic Video Generation using Captions" by Tanya Marwah*, Gaurav Mittal* and Vineeth N. Balasubramanian, accepted at the International Conference on Computer Vision 2017 (ICCV 2017) (*Equal Contribution).
Proposed network architecture for attentive semantic video generation with captions.

digit 6 is moving up and down | digit 3 is moving left and right |
person 4 is walking left to right |
Caption 1: digit 4 is moving up and down; Caption 2: digit 4 is moving left and right |
Caption 1: digit 4 is moving up and down; Caption 2: digit 9 is moving left and right | Caption 1: digit 5 is moving left and right; Caption 2: digit 9 is moving up and down |
Caption 1: person 10 is walking left to right; Caption 2: person 10 is walking right to left |