Google

Gemma 3

Slug: gemma-3

Gemma 3 open-weight models in five parameter sizes (270M, 1B, 4B, 12B, 27B)

Notes

Open models for various use cases

Models in this family

5 models

Gemma 3 12B Instruct

15 evaluations

Google

Gemma 3 12B Instruct is a 12-billion parameter, open-weight model from Google designed for efficient on-device and edge deployment. It features enhanced reasoning and instruction-following capabilities, along with native multimodal support for processing both text and images. The model offers a strong balance of performance, speed, and accessibility for developers.

Input / 1M tokens

$0.09

Output / 1M tokens

$0.29

Gemma 3 1B Instruct

15 evaluations

Google

Gemma 3 1B Instruct is a lightweight, instruction-tuned language model from Google's Gemma 3 family. Designed for efficiency and speed, it is optimized for on-device and edge deployment scenarios. This model provides a strong balance of performance and low resource consumption for basic conversational and instruction-following tasks.

Input / 1M tokens

$0.00

Output / 1M tokens

$0.00

Gemma 3 270M

13 evaluations

Google

Gemma 3 270M is a lightweight, 270-million parameter language model from Google's Gemma 3 family. Designed for efficiency and speed, it is optimized for deployment on resource-constrained devices such as mobile phones or edge hardware, offering low-latency, cost-effective inference.

Input / 1M tokens

$0.00

Output / 1M tokens

$0.00

Gemma 3 27B Instruct

15 evaluations

Google

Gemma 3 27B Instruct is a 27-billion parameter, open-weight, instruction-tuned model from Google, built on the Gemma 3 architecture. It is designed for strong reasoning and instruction following, and suits a wide range of text generation tasks. The model supports a long context window (128K tokens), making it effective for processing lengthy documents.

Input / 1M tokens

$0.11

Output / 1M tokens

$0.25

Gemma 3 4B Instruct

15 evaluations

Google

Gemma 3 4B Instruct is a lightweight, efficient instruction-tuned model from Google's Gemma 3 family. It is designed for fast inference and deployment in resource-constrained environments such as edge devices or local hardware, while maintaining strong reasoning capabilities for its size.

Input / 1M tokens

$0.04

Output / 1M tokens

$0.08
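
The per-token prices listed for each model can be turned into a rough cost estimate for a given workload. The sketch below assumes billing is linear in token count and uses the input and output prices shown in this listing; the model keys and the `estimate_cost` function are hypothetical names chosen for illustration, not identifiers from any real API.

```python
# Prices in USD per 1M tokens (input, output), copied from the listing above.
# The dictionary keys are hypothetical identifiers chosen for this sketch.
PRICES_PER_MILLION = {
    "gemma-3-270m": (0.00, 0.00),
    "gemma-3-1b-instruct": (0.00, 0.00),
    "gemma-3-4b-instruct": (0.04, 0.08),
    "gemma-3-12b-instruct": (0.09, 0.29),
    "gemma-3-27b-instruct": (0.11, 0.25),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD, assuming linear per-token billing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_price, output_price = PRICES_PER_MILLION[model]
    total = input_tokens * input_price + output_tokens * output_price
    return total / 1_000_000


if __name__ == "__main__":
    # Example workload: 2M input tokens and 500K output tokens on the 27B model.
    cost = estimate_cost("gemma-3-27b-instruct", 2_000_000, 500_000)
    print(f"${cost:.4f}")  # prints "$0.3450"
```

For the same workload, the 4B model would come to $0.12 under the listed prices, which gives a quick way to compare sizes before running an evaluation.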
