All models

2 of 55 open-source models (filtered).

Filters: Nemotron, Llama 3 Community License

NVIDIA's reward-modelling research vehicle. Trained primarily to be a synthetic-data-generation specialist rather than a chat-first model. Useful for teams building instruction-tuning datasets at scale.

Context: 4K
License: llama-3
VRAM Q4: 204 GB

Llama 3.1 Nemotron 70B Instruct

70B

NVIDIA's RLHF-tuned Llama 3.1 70B. Has topped several Arena-style human-preference leaderboards and was released alongside NVIDIA's reward-model research. Inherits the Llama 3 community licence.

Context: 128K
License: llama-3
VRAM Q4: 42 GB
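
The listing does not say how the "VRAM Q4" figure is derived. A minimal sketch, assuming a common rule of thumb (4 bits per weight plus roughly 20% overhead for KV cache and runtime buffers), happens to reproduce the 42 GB shown for the 70B model. The function name, the `overhead` factor, and the formula itself are assumptions for illustration, not the catalog's stated method.

```python
def estimate_q4_vram_gb(params_billion: float,
                        bits_per_weight: float = 4.0,
                        overhead: float = 1.2) -> float:
    """Rough VRAM estimate in GB for a quantized model.

    Assumed rule of thumb (not taken from the listing):
    weight bytes = parameters * bits_per_weight / 8,
    then multiplied by an overhead factor for cache and buffers.
    """
    # Billions of parameters * bytes per parameter gives GB of weights.
    weight_gb = params_billion * bits_per_weight / 8
    return weight_gb * overhead


# 70B parameters at 4 bits: 35 GB of weights, 42 GB with 20% overhead.
print(round(estimate_q4_vram_gb(70), 1))  # 42.0
```

Actual usage depends on the quantization format, context length, and inference runtime, so treat the result as a sizing estimate rather than a guarantee.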