marengo3.0 multimodal embedding model out of the box. Marengo embeds text, image, audio and video into a single shared latent space, which makes it a great fit for retrieval over video libraries. This model returns vectors with dimension 512.
Hereβs what you have to do to use it -
- Sign up for an account with TwelveLabs and grab an API key. Thereβs a generous free tier at twelvelabs.io.
-
Set the key in the environment variable
TWELVELABS_API_KEY(or pass it directly to the constructor).

