Towards VQA Models That Can Read - Facebook Research

Studies have shown that a dominant class of questions asked by visually impaired users on images of their surroundings involves reading text in the image. But today’s VQA models can not read! Our paper takes a first step towards addressing this problem.

16 Jun 2019 ... ... asked by visually impaired users on images of their surroundings involves reading text in the image. But today's VQA models can not read!

Lee mas