Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
QuantFactory
/
Sparse-Llama-3.1-8B-2of4-GGUF
like
4
Follow
Quant Factory
502
Text Generation
GGUF
vllm
sparsity
arxiv:
2301.00774
arxiv:
2310.06927
License:
llama3.1
Model card
Files
Files and versions
Community
Deploy
Use this model
6f2fa5b
Sparse-Llama-3.1-8B-2of4-GGUF
Ctrl+K
Ctrl+K
1 contributor
History:
2 commits
aashish1904
Upload README.md with huggingface_hub
6f2fa5b
verified
6 months ago
.gitattributes
Safe
1.52 kB
initial commit
6 months ago
README.md
Safe
6.06 kB
Upload README.md with huggingface_hub
6 months ago