Knowing When to Look: Adaptive Attention via a Visual Sentinel for Image Captioning
Automatically generating captions for images has emerged as a prominent interdisciplinary research problem in both academia and industry. It can aid visually impaired users, and make it easy for users to organize and navigate through large amounts of typically unstructured visual data. In order to generate high quality captions, the model needs to incorporate fine-grained […]
Knowing When to Look: Adaptive Attention via a Visual Sentinel for Image Captioning Read More »









