OSAIM
Open Source AI Models

Model families

Browse open-source models grouped by the team that released them.

Command
Cohere

Cohere's enterprise-oriented model family. Command R+ targets retrieval-augmented generation workflows.

DeepSeek
DeepSeek

Hangzhou-based lab known for highly efficient MoE training. DeepSeek V3 and R1 set new bars for open reasoning and coding.

Falcon
TII

Technology Innovation Institute (UAE). Falcon Mamba pioneered state-space-model open releases.

Gemma
Google DeepMind

Google's open-weights model family derived from the same research as Gemini. Strong performance at small scales.

Grok
xAI

xAI's open-weights releases. Grok 1 and Grok 2 weights have been published under Apache 2.0.

Jamba
AI21 Labs

Hybrid Mamba/Transformer architecture with very long native context windows.

Llama
Meta

Meta's open-weights LLM family. Llama 3 introduced 128K context and strong instruction tuning across 1B–405B parameter scales.

Mistral
Mistral AI

Paris-based lab whose dense and mixture-of-experts open-weights models popularised sliding-window attention and high efficiency.

Nemotron
NVIDIA

NVIDIA's research and instruction-tuning effort built on top of Llama base models, with strong RLHF and reward-modelling work.

OLMo
Allen Institute for AI

AI2's fully-open initiative: weights, data and training code all under permissive licences.

Phi
Microsoft

Microsoft's small-language-model line, focused on data-quality-driven training. Phi-4 punches well above its weight class.

Qwen
Alibaba

Alibaba's Qwen series. Qwen2.5 delivers competitive frontier-class performance across sizes from 0.5B to 72B with permissive licensing.

Stable LM
Stability AI

Stability AI's general-purpose text models.

Yi
01.AI

01.AI's bilingual (Chinese/English) open-weights series including vision variants.