Running 3.23k 3.23k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
HIT-TMG/KaLM-embedding-multilingual-mini-instruct-v1.5 Sentence Similarity • 0.5B • Updated Mar 13 • 1.81k • • 61
Running 35 35 Transformer Calculator 📊 Calculate memory, parameters, and FLOPs for transformer models