Generate descriptive text captions or OCR output from images.