README.md · Delta-Vector/Hamanasu-4B-PT at main

metadata

thumbnail: >-
  https://cdn-uploads.huggingface.co/production/uploads/66c26b6fb01b19d8c3c2467b/jg2NWmCUfPyzizm2USjMt.jpeg
datasets:
  - Mielikki/Erebus-87k
  - NewEden/Orion-Asstr-Stories-16K
  - NewEden/Orion-LIT
  - NewEden/Fujin-Cleaned-Final
base_model:
  - IntervitensInc/Llama-3.1-Minitron-4B-Width-Base-chatml
tags:
  - llama
  - roleplay
  - finetune
  - storywriting

Hamanasu 4B

🌌 Overview

This model is a finetune of Llama-3.1-Minitron-4B-Width-Base-chatml on 1B tokens of stories and books.

This model is not usable for chat.
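
Since the model is trained on stories and books rather than chat data, the natural way to use it is plain text continuation. Below is a minimal sketch of that, assuming the Hugging Face `transformers`, `torch`, and `accelerate` packages are installed and that the repo ID is `Delta-Vector/Hamanasu-4B-PT`. The helper names, the title-plus-opening prompt layout, and the sampling settings are illustrative assumptions, not something this card specifies.

```python
MODEL_ID = "Delta-Vector/Hamanasu-4B-PT"  # assumed repo ID for this model


def build_story_prompt(title: str, opening: str) -> str:
    """Return a plain-text prompt to continue (hypothetical layout, no chat template)."""
    return f"{title.strip()}\n\n{opening.strip()}"


def continue_story(prompt: str, max_new_tokens: int = 200) -> str:
    """Load the model and sample a continuation of `prompt`."""
    # Imported lazily so the prompt helper works without these packages installed.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID,
        torch_dtype=torch.bfloat16,
        device_map="auto",  # needs the accelerate package
    )
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(
        **inputs,
        max_new_tokens=max_new_tokens,
        do_sample=True,
        temperature=0.8,  # assumed sampling settings, not from the model card
        top_p=0.95,
    )
    # Drop the prompt tokens and decode only the newly generated text.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

A call might look like `continue_story(build_story_prompt("The Lighthouse", "The storm arrived before the keeper did."))`. The prompt is passed as raw text on purpose: no chat template is applied, in line with the note that the model is not meant for chat.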

All thanks to Tav for funding the train.

⚔️ Hardware

GPUs: 8x H100s
Epochs: 1
Base: IntervitensInc/Llama-3.1-Minitron-4B-Width-Base-chatml

Axolotl Config ꒰(˶• ᴗ •˶)꒱

https://wandb.ai/new-eden/tavbussy/artifacts/axolotl-config/config-jpgzpr2g/v0/files/axolotl_config_3y5zkvbz.yml

⚡ Credits

Made by

Delta-Vector