moondream
Description
Add-On Required: This feature requires the Computer Vision Toolbox Model for Moondream Vision Language Model add-on.
The moondream object configures a pretrained Moondream™ vision-language model (VLM).
Use the Moondream model to quickly understand image content by generating descriptive captions. Due to the lightweight design and speed of the model, you can use it for downstream low-latency tasks like alt-text generation, image-text retrieval, and basic scene description.
To generate image captions using Moondream, use the captionImage object function.
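As a minimal sketch of that workflow, the following code loads the model and captions one image. The creation syntax mdModel = moondream comes from this page. The argument order and return value of captionImage are assumptions, because this page names the function without giving its signature, and peppers.png is used only as a convenient sample image that ships with MATLAB.

```matlab
% Load the pretrained Moondream model (requires the add-on named above).
mdModel = moondream;

% Read a sample image that ships with MATLAB.
I = imread("peppers.png");

% Assumed call form: model first, image second, caption text returned.
% Check the captionImage reference page for the documented syntax.
caption = captionImage(mdModel,I);
disp(caption)
```

If the call form differs in your release, only the captionImage line should need to change.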
Creation
Syntax
Description
mdModel = moondream loads a pretrained Moondream vision-language model with 2 billion parameters.
Properties
Object Functions
captionImage | Caption images using Moondream vision-language model (VLM)
Examples
Tips
The quality of Moondream outputs can vary across different data domains. Validate its predictions using a data set from a domain similar to your intended application.
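One way to run such a check is to caption a small set of images from the target domain and review the results by hand. The sketch below assumes a hypothetical folder named myDomainImages and, as above, an assumed captionImage call form that takes the model and an image and returns caption text. imageDatastore, readimage, and table are standard MATLAB functions.

```matlab
% Hypothetical folder of images from the target domain.
imds = imageDatastore("myDomainImages");

% Load the pretrained Moondream model.
mdModel = moondream;

numImages = numel(imds.Files);
captions = strings(numImages,1);
for k = 1:numImages
    I = readimage(imds,k);
    % Assumed call form. string(...) accepts char or string output.
    captions(k) = string(captionImage(mdModel,I));
end

% Pair each file name with its caption for manual review.
results = table(string(imds.Files),captions, ...
    VariableNames=["File","Caption"]);
disp(results)
```

Reviewing the table against the images gives a quick, qualitative sense of whether the captions are accurate enough for the intended application. It does not replace a quantitative evaluation.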
References
[1] “Moondream.” Accessed September 2, 2025. https://moondream.ai/.
Version History
Introduced in R2026a
