Hamanasu-4B-PT / README.md
Delta-Vector's picture
Create README.md
03c862b verified
metadata
thumbnail: >-
  https://cdn-uploads.huggingface.co/production/uploads/66c26b6fb01b19d8c3c2467b/jg2NWmCUfPyzizm2USjMt.jpeg
datasets:
  - Mielikki/Erebus-87k
  - NewEden/Orion-Asstr-Stories-16K
  - NewEden/Orion-LIT
  - NewEden/Fujin-Cleaned-Final
base_model:
  - IntervitensInc/Llama-3.1-Minitron-4B-Width-Base-chatml
tags:
  - llama
  - roleplay
  - finetune
  - storywriting
Model Visualization

Hamanasu 4B

🌌 Overview

This model is a finetune of Llama-3.1-Minitron-4B-Width-Base-chatml on 1B tokens of Stories & Books

This model is not usable for Chat.

All thanks to Tav for funding the train.

⚔️ Hardware

  • 8x H100s
  • Epochs: 1
  • Base: IntervitensInc/Llama-3.1-Minitron-4B-Width-Base-chatml

Axolotl Config ꒰(˶• ᴗ •˶)꒱

https://wandb.ai/new-eden/tavbussy/artifacts/axolotl-config/config-jpgzpr2g/v0/files/axolotl_config_3y5zkvbz.yml

⚡ Credits


Made by
Delta-Vector