AI21 Labs

Jamba

Hybrid Mamba/Transformer architecture with very long native context windows.

History & context

AI21 Labs' Jamba (March 2024) was the first major hybrid Mamba/Transformer/MoE model from a frontier lab. The state-space-model layers give Jamba linear-time scaling with sequence length, which makes very-long-context workloads more practical than with pure-attention architectures.

Jamba 1.5 Large (August 2024) is the current flagship — 398B total parameters with 94B active, native 256K context (effective beyond 140K). Licensed under AI21's open model licence, which permits most commercial use.

Flagship model

Jamba 1.5 Large

398B

Hybrid Mamba-Transformer-MoE model with native 256K context (effective beyond 140K). 94B active parameters out of 398B total. The state-space-model layers give it linear-time scaling with sequence length, making it interesting for very long contexts. Licensed under AI21's open model licence, which permits most commercial use.

Context: 256K
License: jamba-open
VRAM Q4: 238.8 GB

1 model in this family

Jamba 1.5 Large

398B

Context: 256K
License: jamba-open
VRAM Q4: 238.8 GB

Comparing Jamba against another family? Try the side-by-side comparator or browse all leaderboards.