Llama 3.2 Instruct 11B (Vision)
15 evaluations
Meta
This is an 11B-parameter multimodal instruct model from Meta's Llama 3.2 series, optimized for vision tasks. It trades some capability for efficiency, running inference faster than the larger models in the series, and suits conversational and instruction-following scenarios that require image understanding.
Input / 1M tokens
$0.245
Output / 1M tokens
$0.245
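
The per-token prices above translate into a per-request cost with simple arithmetic. The sketch below is illustrative only: the function name `estimate_cost` and the constant names are made up for this example, and it assumes the token counts are already known. How an image is converted into input tokens depends on the provider and is not covered here.

```python
# Prices copied from the listing above, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.245
OUTPUT_PRICE_PER_M = 0.245


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes linear per-token billing at the listed rates, with no
    minimum charge, rounding, or separate image surcharge.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 2,000-token prompt with a 500-token reply.
    print(f"${estimate_cost(2_000, 500):.7f}")
```

Under these assumptions, a request with 2,000 input tokens and 500 output tokens costs roughly $0.0006, and a full 1M tokens in each direction costs about $0.49.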


