All models
2 of 40 open-source models (filtered).
Nemotron-4 340B Instruct
340B
NVIDIA's reward-modelling research vehicle. Trained primarily to be a synthetic-data-generation specialist rather than a chat-first model. Useful for teams building instruction-tuning datasets at scale.
- Context
- 4K
- License
- llama-3
- VRAM Q4
- 204 GB
Llama 3.1 Nemotron 70B Instruct
70B
NVIDIA's RLHF-tuned Llama 3.1 70B. Tops several Arena-style human-preference leaderboards and shipped with NVIDIA's reward-model research. Inherits the Llama 3 community licence.
- Context
- 128K
- License
- llama-3
- VRAM Q4
- 42 GB