VLLM 0.7.2 can start the model normally, but there is no output when simulating a request using Curl, it blocks!
3
#2 opened 2 months ago
by
JZMALi
sglang inference issue
7
#1 opened 2 months ago
by
su400