Source code for Video Question Answering via Hierarchical Spatio-Temporal Attention Networks based on arctic-captions and arctic-capgen-vid.
To run this code you will need:
- Python 2.7
- Theano
- NLTK and WordNet Synsets
"Video Question Answering via Hierarchical Spatio-Temporal Attention Networks."
Zhou Zhao, Qifan Yang*, Deng Cai, Xiaofei He, Yueting Zhuang. To appear IJCAI 2017.
The code is released under a revised (3-clause) BSD License.