RichardErkhov
/

AntNLP_-_TinyLlama-NoPE-HeadScale8k-awq

4-bit precision

Model card Files Files and versions

RichardErkhov commited on Dec 20, 2024

Commit

104cb0f

·

verified ·

1 Parent(s): 9af73c6

uploaded readme

Files changed (1) hide show

README.md +36 -0

README.md ADDED Viewed

	@@ -0,0 +1,36 @@

+Quantization made by Richard Erkhov.
+[Github](https://github.com/RichardErkhov)
+[Discord](https://discord.gg/pvy7H8DZMG)
+[Request more models](https://github.com/RichardErkhov/quant_request)
+TinyLlama-NoPE-HeadScale8k - AWQ
+- Model creator: https://huggingface.co/AntNLP/
+- Original model: https://huggingface.co/AntNLP/TinyLlama-NoPE-HeadScale8k/
+Original model description:
+---
+license: mit
+---
+# TinyLlama-NoPE-HeadScale8k
+## Citation
+```
+@misc{wang2024length,
+      title={Length Generalization of Causal Transformers without Position Encoding},
+      author={Jie Wang and Tao Ji and Yuanbin Wu and Hang Yan and Tao Gui and Qi Zhang and Xuanjing Huang and Xiaoling Wang},
+      year={2024},
+      eprint={2404.12224},
+      archivePrefix={arXiv},
+      primaryClass={cs.CL}
+}
+```