OSAIM
Open Source AI Models

40 open-source models indexed

The open-source AI model directory.

Licenses, parameters, benchmarks, VRAM and inference pricing for Llama, Mistral, Qwen, DeepSeek, Gemma, Phi and more. Pick the right open-weights model for your use case.

Best models for…

All use cases →

Curated picks for the six workloads we get asked about most.

Featured models

See all →
DeepSeek R1
671B

Reasoning model trained with reinforcement learning on top of DeepSeek V3-Base. MIT licence — even the weights are unrestricted, making R1 the most permissively-licensed frontier reasoning model. Generates long internal chains-of-thought before answering, trading latency for accuracy on math, code, and reasoning benchmarks. Distilled variants (e.g. R1 Distill Llama 70B) recover most of the quality at much smaller scales.

Context
128K
License
mit
VRAM Q4
402.6 GB
DeepSeek V3
671B

671B-parameter MoE model with 37B active per token. Trained for roughly $5.6M of compute — a landmark in cost-efficient frontier training. Frontier-class quality at a fraction of the cost of the closed proprietary frontier. The DeepSeek licence permits commercial use with limited restrictions on military and unlawful applications. Running V3 yourself requires serious hardware (8× H100 at fp8); most teams will use it via the DeepSeek API or providers like Together.

Context
128K
License
deepseek
VRAM Q4
402.6 GB
Jamba 1.5 Large
398B

Hybrid Mamba-Transformer-MoE model with native 256K context (effective beyond 140K). 94B active parameters out of 398B total. The state-space-model layers give it linear-time scaling with sequence length, making it interesting for very long contexts. Licensed under AI21's open model licence, which permits most commercial use.

Context
256K
License
jamba-open
VRAM Q4
238.8 GB
Nemotron-4 340B Instruct
340B

NVIDIA's reward-modelling research vehicle. Trained primarily to be a synthetic-data-generation specialist rather than a chat-first model. Useful for teams building instruction-tuning datasets at scale.

Context
4K
License
llama-3
VRAM Q4
204 GB
Grok 1
314B

xAI's first open-weights release: a 314B-parameter mixture-of-experts model. Apache 2.0 licensed. Largely a research artefact at this size — most users will run smaller models for production — but useful as a permissively-licensed reference for MoE research.

Context
8K
License
apache-2-0
VRAM Q4
188.4 GB
DeepSeek Coder V2
236B

Coding-focused MoE model with 21B active parameters out of 236B total. Supports 338 programming languages with strong performance across mainstream stacks (Python, TypeScript, Go, Rust, Java, C++) and competent results on niche languages where most open models falter. The DeepSeek licence applies — commercial use permitted with some application restrictions.

Context
128K
License
deepseek
VRAM Q4
141.6 GB

Browse by family

Command
Cohere

Cohere's enterprise-oriented model family. Command R+ targets retrieval-augmented generation workflows.

DeepSeek
DeepSeek

Hangzhou-based lab known for highly efficient MoE training. DeepSeek V3 and R1 set new bars for open reasoning and coding.

Falcon
TII

Technology Innovation Institute (UAE). Falcon Mamba pioneered state-space-model open releases.

Gemma
Google DeepMind

Google's open-weights model family derived from the same research as Gemini. Strong performance at small scales.

Grok
xAI

xAI's open-weights releases. Grok 1 and Grok 2 weights have been published under Apache 2.0.

Jamba
AI21 Labs

Hybrid Mamba/Transformer architecture with very long native context windows.

Llama
Meta

Meta's open-weights LLM family. Llama 3 introduced 128K context and strong instruction tuning across 1B–405B parameter scales.

Mistral
Mistral AI

Paris-based lab whose dense and mixture-of-experts open-weights models popularised sliding-window attention and high efficiency.

Nemotron
NVIDIA

NVIDIA's research and instruction-tuning effort built on top of Llama base models, with strong RLHF and reward-modelling work.

OLMo
Allen Institute for AI

AI2's fully-open initiative: weights, data and training code all under permissive licences.

Phi
Microsoft

Microsoft's small-language-model line, focused on data-quality-driven training. Phi-4 punches well above its weight class.

Qwen
Alibaba

Alibaba's Qwen series. Qwen2.5 delivers competitive frontier-class performance across sizes from 0.5B to 72B with permissive licensing.

Stable LM
Stability AI

Stability AI's general-purpose text models.

Yi
01.AI

01.AI's bilingual (Chinese/English) open-weights series including vision variants.

Browse by task

Browse by size

MMLU leaderboard — top 5

Full leaderboards →
  1. 90.80
  2. 88.50
  3. 86.10
  4. 86.00
  5. 86.00

Why open-source models matter

Open-weights models let you run inference locally, fine-tune on private data, and avoid vendor lock-in. Open Source AI Models tracks every model with public weights so you can pick on facts, not marketing.