DOCCI
The DOCCI dataset consists of comprehensive descriptions on 15k images specifically taken with the objective of evaluating T2I and I2T models. These cover a lot of key details in the images, as illustrated below.
cc-by-4.0
10k<n<100k
Text-to-Image
Image-to-Text
English
Total downloads
2
Created: July 12, 2024