Qwen/Qwen3-VL-235B-A22B-Instruct Image-Text-to-Text • 236B • Updated 10 days ago • 144k • 247
Running • 3.27k • The Ultra-Scale Playbook • The ultimate guide to training LLMs on large GPU clusters
Running on CPU Upgrade • 980 • Model Memory Utility • Calculate vRAM needed for model training and inference
Running • 11 • Transformers Modular Refactor • Interactive analyzer for modular models in Transformers lib