Hugging Face Introduces SmolVLM, a Small Multimodal Model that Runs on End Devices
SmolVLM is a small multimodal model with 2 billion parameters that accepts any combination of image and text input and generates text output. After launching the SmolLM lightweight language model in July, AI app development platform Hugging Face ...