Llama 3.2 Instruct 3B

MetaUnited States

A lightweight instruction-tuned model from the Llama 3.2 family, optimized for fast and efficient on-device or edge deployment. It offers low-cost inference with strong multilingual and conversational capabilities for its size.

FastCheapMultimodal

Input / 1M tokens

$0.15

Output / 1M tokens

$0.15

Output tokens/s

52.13

First-token seconds

0.64s

Supported plans

Benchmark history

Evaluations

TAU2

Measured May 29, 2026Source

Score

0.21

Lcr

Measured May 29, 2026Source

Score

0.02

Ifbench

Measured May 29, 2026Source

Score

0.26

Aime 25

Measured May 29, 2026Source

Score

0.03

Aime

Measured May 29, 2026Source

Score

0.07

Math 500

Measured May 29, 2026Source

Score

0.49

Scicode

Measured May 29, 2026Source

Score

0.05

Livecodebench

Measured May 29, 2026Source

Score

0.08

Hle

Measured May 29, 2026Source

Score

0.05

Gpqa

Measured May 29, 2026Source

Score

0.26

Mmlu Pro

Measured May 29, 2026Source

Score

0.35

Artificial Analysis Math Index

Measured May 29, 2026Source

Score

3.3

Artificial Analysis Intelligence Index

Measured May 29, 2026Source

Score

9.7

Terminalbench Hard

Measured May 29, 2026Source

Score

0.01

Artificial Analysis Coding Index

Measured May 29, 2026Source

Score

4.2

Plan availability

Products and plans that support this model

Synthetic

Synthetic is a platform that runs open-source AI models in private, secure datacenters. It offers subscription and usage-based pricing for accessing models like DeepSeek, Llama, MiniMax, Kimi, and others, with OpenAI-compatible API access for use in coding agents and tools.

User ratings

Loading ratings...

Discussion

Thinking... Make sure you are connected to GitHub server