feat: Add sponsorship and website section
README.md CHANGED
```diff
@@ -25,6 +25,17 @@ This model builds upon the powerful Qwen2.5-0.5B base model, which features:
 - 490M parameters (360M non-embedding parameters)
 - 24 transformer layers
 - 14 attention heads for queries and 2 for key/values (GQA architecture)
+
+---
+### 🌐 Website
+You can find more of my models, projects, and information on my official website:
+- **[artificialguy.com](https://artificialguy.com/)**
+
+### 💖 Support My Work
+If you find this model useful, please consider supporting my work. It helps me cover server costs and dedicate more time to new open-source projects.
+- **Patreon:** [Support on Patreon](https://www.patreon.com/user?u=81570187)
+- **Ko-fi:** [Buy me a Ko-fi](https://ko-fi.com/artificialguybr)
+- **Buy Me a Coffee:** [Buy me a Coffee](https://buymeacoffee.com/jvkape)
 - Support for 32,768 context length
 - Advanced features like RoPE positional embeddings, SwiGLU activations, and RMSNorm
 
```
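The unchanged context lines above list the base model's architecture (24 layers, 14 query heads and 2 key/value heads under GQA, 32,768 context length, SwiGLU activations). A minimal sketch of checking those values against the published configuration, assuming the Hugging Face `transformers` library and the `Qwen/Qwen2.5-0.5B` hub ID (neither is named in this commit):

```python
# Sketch: load the base model's config and print the fields that correspond
# to the README bullets. The hub ID "Qwen/Qwen2.5-0.5B" is an assumption.
from transformers import AutoConfig

config = AutoConfig.from_pretrained("Qwen/Qwen2.5-0.5B")

print(config.num_hidden_layers)        # 24 transformer layers
print(config.num_attention_heads)      # 14 attention heads for queries
print(config.num_key_value_heads)      # 2 key/value heads (GQA)
print(config.max_position_embeddings)  # 32768 context length
print(config.hidden_act)               # "silu", used in the SwiGLU MLP
```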