Blackroot's picture
Update README.md
c6db579 verified
|
raw
history blame
641 Bytes
---
license: mit
---
[![Discord](https://img.shields.io/discord/232596713892872193?logo=discord)](https://discord.gg/2JhHVh7CGu)
A semi custom network based on the follow paper [Simpler Diffusion (SiD2)](https://arxiv.org/abs/2410.19324v1)
This network uses the optimal transport flow matching objective outlined [Flow Matching for Generative Modeling](https://arxiv.org/abs/2210.02747)
xATGLU Layers are used instead of linears for entry into the transformer MLP layer [Expanded Gating Ranges
Improve Activation Functions](https://arxiv.org/pdf/2405.20768)
![samples](./1.png)
![samples](./2.png)
![samples](./3.png)
![samples](./4.png)