Video
Recent Advances in Image Captioning, Image-Text Retrieval and Visual Question Answering using Scene Graph Parsing, What Next?
Creating appropriate representation of data is the key for many recent breakthroughs in both language and vision. In natural language, from the structured representations like parse trees to BERT and Transformers pretrained on large scale…
Publication