TextVQA
TextVQA is a dataset for benchmarking visual reasoning based on text in images. To answer its questions, a model must read the text that appears in an image and reason about it.
License: cc-by-4.0
Size: 10k<n<100k
Task: Visual Question Answering
Language: English
Total downloads: 1
Created: July 15, 2024
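
Because the card describes TextVQA as a benchmark, it may help to see how a predicted answer could be scored. The sketch below uses a simplified form of the soft-accuracy metric commonly used for VQA-style benchmarks, where a prediction earns min(matching human answers / 3, 1). Whether this copy of the dataset is evaluated exactly this way is an assumption, and the record layout (the `question` and `answers` fields) and its values are hypothetical, chosen only for illustration.

```python
from collections import Counter


def normalize(answer: str) -> str:
    """Lowercase and collapse whitespace (simpler than full VQA normalization)."""
    return " ".join(answer.lower().split())


def vqa_soft_accuracy(prediction: str, human_answers: list[str]) -> float:
    """Score a prediction as min(number of matching human answers / 3, 1)."""
    counts = Counter(normalize(a) for a in human_answers)
    return min(counts[normalize(prediction)] / 3.0, 1.0)


# Hypothetical record: field names and values are made up for illustration.
record = {
    "question": "What does the sign say?",
    "answers": ["stop"] * 7 + ["stop sign"] * 2 + ["unanswerable"],
}

print(vqa_soft_accuracy("STOP", record["answers"]))       # 7 matches, capped at 1.0
print(vqa_soft_accuracy("stop sign", record["answers"]))  # 2 matches, scores 2/3
```

An answer given by at least three annotators scores full credit, so the metric tolerates disagreement among annotators while still rewarding the majority reading of the text.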