š§"raw" pretrained smol_llama checkpoints - WIP š§
-
BEE-spoke-data/smol_llama-101M-GQA
Text Generation ⢠Updated ⢠816 ⢠28 -
BEE-spoke-data/smol_llama-81M-tied
Text Generation ⢠Updated ⢠39 ⢠6 -
BEE-spoke-data/smol_llama-220M-GQA
Text Generation ⢠Updated ⢠628 ⢠12 -
BEE-spoke-data/verysmol_llama-v11-KIx2
Text Generation ⢠Updated ⢠40 ⢠4