Update README.md
Browse files
README.md
CHANGED
@@ -28,7 +28,7 @@ library_name: transformers
|
|
28 |
<h1 style="margin-top: 0rem;">🌙 Kimi K2 Usage Guidelines</h1>
|
29 |
</div>
|
30 |
|
31 |
-
-
|
32 |
- For complete detailed instructions, see our guide: [docs.unsloth.ai/basics/kimi-k2](https://docs.unsloth.ai/basics/kimi-k2)
|
33 |
|
34 |
It is recommended to have at least 128GB unified RAM memory to run the small quants. With 16GB VRAM and 256 RAM, expect 5+ tokens/sec.
|
|
|
28 |
<h1 style="margin-top: 0rem;">🌙 Kimi K2 Usage Guidelines</h1>
|
29 |
</div>
|
30 |
|
31 |
+
- You can now use the latest update of [llama.cpp](https://github.com/ggml-org/llama.cpp) to run the model.
|
32 |
- For complete detailed instructions, see our guide: [docs.unsloth.ai/basics/kimi-k2](https://docs.unsloth.ai/basics/kimi-k2)
|
33 |
|
34 |
It is recommended to have at least 128GB unified RAM memory to run the small quants. With 16GB VRAM and 256 RAM, expect 5+ tokens/sec.
|