All models

3 of 55 open-source models (filtered).

Sort:

Flagship Gemma 2 release. Uses logit-distillation from a larger teacher model, which is how Google delivers near-70B quality from a 27B student. A solid choice when the Llama community licence doesn't fit and you need quality at the 27B–40B size range.

Context: 8K
License: gemma
VRAM Q4: 16.2 GB

Gemma 2 9B

Mid-tier Gemma. Strong general-purpose chat model at small scale. The Gemma Terms of Use permit commercial use subject to Google's prohibited-use policy.

Context: 8K
License: gemma
VRAM Q4: 5.4 GB

Gemma 2 2B

2.6B

Compact Gemma variant designed for on-device inference. Trained with knowledge distillation from larger Gemma 2 teachers. Runs comfortably on a phone at Q4.

Context: 8K
License: gemma
VRAM Q4: 1.6 GB