Towards VQA Models That Can Read | Facebook AI Research
Studies have shown that a dominant class of questions asked by visually impaired users on images of their surroundings involves reading text in the image. But today’s VQA models can not read! Our paper takes a first step towards addressing this…
But today's VQA models can not read! Our paper takes a first step towards
addressing this problem. First, we introduce a new “TextVQA” dataset to facilitate
...