Comparison

DeepSeek R1 vs QwQ 32B Preview

Side-by-side specs, benchmarks and hosted-inference pricing.

Side A

DeepSeek · DeepSeek

Reasoning model trained with reinforcement learning on top of DeepSeek V3-Base. MIT licence — even the weights are unrestricted, making R1 the most permissively-licensed frontier reasoning model. Generates long internal chains-of-thought before answering, trading latency for accuracy on math, code, and reasoning benchmarks. Distilled variants (e.g. R1 Distill Llama 70B) recover most of the quality at much smaller scales.

Side B

QwQ 32B Preview

Alibaba · Qwen

Qwen's reasoning-focused 'thinking' model. Generates long chains-of-thought before answering, similar to OpenAI's o1 and DeepSeek R1 lineage. Optimised for math and competition-style problem solving. The Preview tag means Qwen is iterating quickly; later versions may obsolete this one. Useful today for math-heavy workloads where a slow, careful answer is preferred to a fast wrong one.

Specs

Parameters	671B	32B
Context length	128K	33K
Modality	text	text
Released	2025-01-20	2024-11-28
License	MIT	Apache 2.0
Commercial use	Yes	Yes
VRAM fp16	1342 GB	64 GB
VRAM Q4	402.6 GB	19.2 GB

Benchmarks

ArenaHard	92.3	—
GPQA	71.5	65.2
IFEval	83.3	—
LiveCodeBench	—	41.9
MATH	97.3	90.6
MMLU	90.8	75.0
MMLU-Pro	84.0	—
SWE-bench Verified	49.2	—

Cheapest hosted pricing

DeepSeek R1

deepinfra: $0.55 in / $2.19 out per 1M tokens

QwQ 32B Preview

together: $1.20 in / $1.20 out per 1M tokens

Highlighted cells indicate the better value for that row (higher score, larger context, lower VRAM).