All models
3 of 40 open-source models (filtered).
Gemma 2 27B
27B
Flagship Gemma 2 release. Uses logit-distillation from a larger teacher model, which is how Google delivers near-70B quality from a 27B student. A solid choice when the Llama community licence doesn't fit and you need quality at the 27B–40B size range.
- Context
- 8K
- License
- gemma
- VRAM Q4
- 16.2 GB
Gemma 2 9B
9B
Mid-tier Gemma. Strong general-purpose chat model at small scale. The Gemma Terms of Use permit commercial use subject to Google's prohibited-use policy.
- Context
- 8K
- License
- gemma
- VRAM Q4
- 5.4 GB
Gemma 2 2B
2.6B
Compact Gemma variant designed for on-device inference. Trained with knowledge distillation from larger Gemma 2 teachers. Runs comfortably on a phone at Q4.
- Context
- 8K
- License
- gemma
- VRAM Q4
- 1.6 GB