Llama 3.2 11B Vision Instruct
Llama 3.2 11B Vision Instruct is an AI model from Meta built for agent workflows; it accepts text and image input and produces text output.
What is Llama 3.2 11B Vision Instruct?
Llama 3.2 11B Vision Instruct is an AI model from Meta that Agent Mag tracks for pricing, context window, modalities, benchmarks, and API compatibility. Builders can use this page to compare Llama 3.2 11B Vision Instruct against other models for agent workflows and production deployments.
Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks that combine visual and textual data. It excels in tasks such as image captioning and visual question answering.
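Many providers expose this model through an OpenAI-compatible chat completions API. The sketch below shows what a mixed text-and-image request might look like under that assumption; the base URL, API key, and exact model identifier are illustrative placeholders, not values from this page.

```python
# A minimal sketch of a multimodal request, assuming an OpenAI-compatible
# endpoint. base_url, api_key, and the model id are placeholder assumptions.
from openai import OpenAI

client = OpenAI(
    base_url="https://your-provider.example/v1",  # hypothetical endpoint
    api_key="YOUR_API_KEY",
)

response = client.chat.completions.create(
    model="meta-llama/Llama-3.2-11B-Vision-Instruct",  # id varies by provider
    messages=[
        {
            "role": "user",
            "content": [
                # Text and image parts travel together in one user turn;
                # the model returns text only.
                {"type": "text", "text": "Caption this image in one sentence."},
                {"type": "image_url", "image_url": {"url": "https://example.com/photo.jpg"}},
            ],
        }
    ],
)
print(response.choices[0].message.content)
```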
More from Meta
Llama Guard 4 is a Llama 4 Scout-derived multimodal pretrained model, fine-tuned for content safety classification. Similar to previous versions, it can be used to classify content in both LLM inputs (prompt classification) and LLM responses (response classification).
Llama 4 Maverick 17B Instruct (128E) is a high-capacity multimodal language model from Meta, built on a mixture-of-experts (MoE) architecture with 128 experts and 17 billion active parameters per forward pass.
Llama 4 Scout 17B Instruct (16E) is a mixture-of-experts (MoE) language model developed by Meta, activating 17 billion parameters out of a total of 109B. It supports native multimodal input across text and images.
Llama Guard 3 is a Llama-3.1-8B pretrained model, fine-tuned for content safety classification. Similar to previous versions, it can be used to classify content in both LLM inputs (prompt classification) and LLM responses (response classification); a minimal usage sketch follows this list.
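The Llama Guard models behave as chat models that emit a short safety verdict rather than free-form text. As a rough illustration (not taken from this page), a prompt-classification call via Hugging Face transformers might look like the following; the checkpoint is gated and requires access approval from Meta.

```python
# A minimal prompt-classification sketch for Llama Guard 3, assuming access
# to the gated meta-llama/Llama-Guard-3-8B checkpoint on Hugging Face.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "meta-llama/Llama-Guard-3-8B"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

# Prompt classification: pass only the user turn. For response
# classification, append the assistant turn to the conversation as well.
chat = [{"role": "user", "content": "How do I pick a lock?"}]
input_ids = tokenizer.apply_chat_template(chat, return_tensors="pt").to(model.device)

output = model.generate(input_ids, max_new_tokens=20, do_sample=False)
# The model replies "safe", or "unsafe" plus the violated category codes.
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```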
Related content
Compare pricing, local installs, context windows, and modality filters across the full model catalog.
Find frameworks, SDKs, and infrastructure tools that pair with this model in production workflows.
See Agent Mag coverage of model benchmarks, agent frameworks, and deployment patterns.