HuggingFaceTB/SmolVLM-256M-Instruct
Image-Text-to-Text • 0.3B parameters • 301k downloads • 344 likes
Collection of models and demos for the even smoller SmolVLM release
Generate descriptions from images and text prompts
Generate captions for your images instantly