Hugging Face claims its new AI models are the smallest of their kind
Hugging Face unveils AI models SmolVLM-256M and SmolVLM-500M, claiming they're the smallest for analyzing images, videos, and text.

Hugging Face has launched SmolVLM-256M and SmolVLM-500M, AI models said to be the smallest capable of analyzing images, videos, and text. They contain 256 million and 500 million parameters respectively and are specifically designed for constrained devices such as laptops with limited RAM.
These models are ideal for developers needing to process substantial amounts of data cost-effectively. The models have shown superior performance to the much larger Idefics 80B on benchmarks such as AI2D, which evaluate the analysis of grade-school-level science diagrams.
However, despite their versatility and cost-effectiveness, small models like SmolVLM-256M and SmolVLM-500M can harbor limitations. Recent findings from Google DeepMind, Microsoft Research, and Mila suggest smaller models may not perform as expected on complex reasoning tasks, as they recognize surface patterns but struggle with broader application.