vllm-inference / runner.sh

Commit History

fix(runner.sh): --enforce-eager not support values
cb15911

yusufs commited on

fix(runner.sh): explicitly disabling enforce_eager
266e7dd

yusufs commited on

fix(runner.sh): disable eager-loading so it using cuda graph (in order for parallel and faster processing)
6bb48e9

yusufs commited on

feat(runner.sh): add specific task and code revision
dc19c1d

yusufs commited on

feat(runner.sh): using MODEL_ID only
490e6a3

yusufs commited on

feat(runner.sh): using runner.sh to select llm in the run time
69c6372

yusufs commited on