Comparison
DeepSeek R1 vs QwQ 32B Preview
Side-by-side specs, benchmarks and hosted-inference pricing.
Reasoning model trained with reinforcement learning on top of DeepSeek V3-Base. MIT licence — even the weights are unrestricted, making R1 the most permissively-licensed frontier reasoning model. Generates long internal chains-of-thought before answering, trading latency for accuracy on math, code, and reasoning benchmarks. Distilled variants (e.g. R1 Distill Llama 70B) recover most of the quality at much smaller scales.
Qwen's reasoning-focused 'thinking' model. Generates long chains-of-thought before answering, similar to OpenAI's o1 and DeepSeek R1 lineage. Optimised for math and competition-style problem solving. The Preview tag means Qwen is iterating quickly; later versions may obsolete this one. Useful today for math-heavy workloads where a slow, careful answer is preferred to a fast wrong one.
Specs
| Parameters | 671B | 32B |
| Context length | 128K | 33K |
| Modality | text | text |
| Released | 2025-01-20 | 2024-11-28 |
| License | MIT | Apache 2.0 |
| Commercial use | Yes | Yes |
| VRAM fp16 | 1342 GB | 64 GB |
| VRAM Q4 | 402.6 GB | 19.2 GB |
Benchmarks
| GPQA | 71.5 | 65.2 |
| MATH | 97.3 | 90.6 |
| MMLU | 90.8 | 75.0 |
| MMLU-Pro | 84.0 | — |