OSAIM
Open Source AI Models

Comparison

DeepSeek R1 vs QwQ 32B Preview

Side-by-side specs, benchmarks and hosted-inference pricing.

Side A
DeepSeek R1
DeepSeek · DeepSeek

Reasoning model trained with reinforcement learning on top of DeepSeek V3-Base. Released under the MIT licence, which covers the weights as well as the code, making R1 one of the most permissively licensed frontier reasoning models. It generates long internal chains of thought before answering, trading latency for accuracy on math, code, and reasoning benchmarks. Distilled variants (e.g. R1 Distill Llama 70B) recover most of the quality at much smaller scales.

Side B
QwQ 32B Preview
Alibaba · Qwen

Qwen's reasoning-focused 'thinking' model. It generates long chains of thought before answering, in the same vein as OpenAI's o1 and the DeepSeek R1 lineage. Optimised for math and competition-style problem solving. The Preview tag signals that Qwen is iterating quickly, so later versions may supersede this one. Useful today for math-heavy workloads where a slow, careful answer is preferable to a fast wrong one.

Specs

Spec              DeepSeek R1    QwQ 32B Preview
Parameters        671B           32B
Context length    128K           33K
Modality          text           text
Released          2025-01-20     2024-11-28
License           MIT            Apache 2.0
Commercial use    Yes            Yes
VRAM (fp16)       1342 GB        64 GB
VRAM (Q4)         402.6 GB       19.2 GB
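
The two VRAM rows above are consistent with a simple weights-only estimate: parameter count multiplied by bytes per parameter. The following is a minimal Python sketch of that arithmetic, assuming 2 bytes per parameter for fp16 and roughly 0.6 bytes per parameter for Q4. The 0.6 ratio is inferred from the table's own figures rather than stated by the source, and the function name `estimate_weight_vram_gb` is hypothetical. The estimate ignores KV cache, activations, and runtime overhead, so real deployments will likely need more.

```python
# Assumed bytes per parameter. The Q4 value is inferred from the table
# (402.6 GB / 671B parameters), not taken from any official specification.
FP16_BYTES_PER_PARAM = 2.0
Q4_BYTES_PER_PARAM = 0.6


def estimate_weight_vram_gb(params_billions: float, bytes_per_param: float) -> float:
    """Weights-only memory estimate in GB (1 GB taken as 1e9 bytes).

    Billions of parameters times bytes per parameter gives GB directly.
    KV cache and activation memory are deliberately left out.
    """
    return round(params_billions * bytes_per_param, 1)


if __name__ == "__main__":
    for name, params in [("DeepSeek R1", 671), ("QwQ 32B Preview", 32)]:
        fp16 = estimate_weight_vram_gb(params, FP16_BYTES_PER_PARAM)
        q4 = estimate_weight_vram_gb(params, Q4_BYTES_PER_PARAM)
        print(f"{name}: fp16 ~ {fp16} GB, Q4 ~ {q4} GB")
```

Under these assumptions the sketch reproduces the listed figures for both models, which suggests the page derives its VRAM rows from parameter count alone.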

Benchmarks

Benchmark    DeepSeek R1    QwQ 32B Preview
GPQA         71.5           65.2
MATH         97.3           90.6
MMLU         90.8           75.0
MMLU-Pro     84.0           n/a

Cheapest hosted pricing

DeepSeek R1
deepinfra: $0.55 in / $2.19 out per 1M tokens
QwQ 32B Preview
together: $1.20 in / $1.20 out per 1M tokens
Highlighted cells indicate the better value for that row (higher score, larger context, lower VRAM).
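
Because the two price lists are shaped differently (R1 is cheaper on input but more expensive on output), which model costs less depends on the input/output mix of the workload. The sketch below works through that arithmetic in Python using the per-1M-token prices listed above. The `Pricing` class, the `request_cost` helper, and the example token counts are hypothetical, and the sketch assumes chain-of-thought tokens are billed as output tokens, which may vary by provider.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """USD per 1M tokens, as listed in the pricing section."""
    input_per_million: float
    output_per_million: float


# Prices copied from the listing above.
DEEPSEEK_R1 = Pricing(input_per_million=0.55, output_per_million=2.19)
QWQ_32B_PREVIEW = Pricing(input_per_million=1.20, output_per_million=1.20)


def request_cost(pricing: Pricing, input_tokens: int, output_tokens: int) -> float:
    """Cost in USD of one request with the given token counts."""
    total = (input_tokens * pricing.input_per_million
             + output_tokens * pricing.output_per_million)
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: a short prompt followed by a long reasoning trace.
    input_tokens, output_tokens = 1_000, 8_000
    for name, pricing in [("DeepSeek R1", DEEPSEEK_R1),
                          ("QwQ 32B Preview", QWQ_32B_PREVIEW)]:
        cost = request_cost(pricing, input_tokens, output_tokens)
        print(f"{name}: ${cost:.5f} per request")
```

At these prices the break-even point is roughly 0.66 output tokens per input token: below that ratio R1 is the cheaper option, and above it QwQ 32B Preview is. Reasoning-heavy requests tend to be output-dominated, so the lower input price on R1 may matter less than it first appears.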