bartowski/Llama-3.1-Nemotron-70B-Instruct-HF-GGUF • Text Generation • 71B • Updated Oct 16, 2024 • 4.43k • 98
lmstudio-community/Llama-3.1-Nemotron-70B-Instruct-HF-GGUF • Text Generation • 71B • Updated Oct 15, 2024 • 173 • 38
mlx-community/nvidia_Llama-3.1-Nemotron-70B-Instruct-HF_4bit • Text Generation • 11B • Updated Oct 16, 2024 • 1.44k • 12
mlx-community/Llama-3.1-Nemotron-70B-Instruct-HF-8bit • Text Generation • 20B • Updated Oct 17, 2024 • 32 • 1
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-dynamic • Text Generation • 71B • Updated 10 days ago • 840 • 14
unsloth/Llama-3.1-Nemotron-70B-Instruct-bnb-4bit • Text Generation • 37B • Updated Oct 17, 2024 • 20 • 20
DevQuasar/nvidia.Llama-3.1-Nemotron-70B-Instruct-HF-GGUF • Text Generation • 71B • Updated Feb 1 • 14 • 1
mlx-community/Llama-3.1-Nemotron-70B-Instruct-HF-4bit • Text Generation • 11B • Updated Oct 17, 2024 • 27 • 2
second-state/Llama-3.1-Nemotron-70B-Instruct-HF-GGUF • Text Generation • 71B • Updated Oct 18, 2024 • 1.05k
ibnzterrell/Nvidia-Llama-3.1-Nemotron-70B-Instruct-HF-AWQ-INT4 • Text Generation • 11B • Updated Dec 7, 2024 • 139 • 6
joshmiller656/Llama-3.1-Nemotron-70B-Instruct-AWQ-INT4 • Text Generation • 11B • Updated Nov 5, 2024 • 918 • 3
RohitPoreddy/Llama-3.1-Nemotron-70B-Instruct-HF-Q4-mlx • Text Generation • 11B • Updated Nov 7, 2024 • 7