Model families

Browse open-source models grouped by the team that released them.

Command

Cohere's enterprise-oriented model family. Command R+ targets retrieval-augmented generation workflows.

DBRX

Databricks' 132B-parameter mixture-of-experts model (4 of 16 experts active per token, roughly 36B active parameters), built on the MegaBlocks stack. Released under the Databricks Open Model License.

DeepSeek

Hangzhou-based lab known for highly efficient MoE training. DeepSeek V3 and R1 set new bars for open reasoning and coding.

Falcon

TII

Technology Innovation Institute (UAE). Falcon Mamba 7B was an early open-weights release of a pure state-space (attention-free) language model.

Gemma

Google DeepMind

Google's open-weights model family derived from the same research as Gemini. Strong performance at small scales.

Grok

xAI

xAI's open-weights releases. Grok 1 weights were published under Apache 2.0; Grok 2 weights followed under xAI's own community licence.

Hermes

NousResearch

Nous Research's community-driven fine-tune series. Hermes 3 is fine-tuned from Llama 3.1 base weights, with an emphasis on tool use and steerable roleplay behaviour.

Jamba

AI21 Labs

AI21's hybrid Mamba/Transformer architecture with a 256K-token native context window.

Kimi

Moonshot AI

Beijing-based lab founded by Tsinghua alumni. Kimi K1.5 and K2 push very long context and reasoning; K2 is a trillion-parameter MoE (about 32B parameters active per token) with strong tool-use behaviour.

Llama