Having trouble running this model with vLLM, not sure why

#1
by zacksiri - opened

I'm seeing the same error as mentioned here https://github.com/vllm-project/vllm/issues/15965

Could it be due to a missing tekken.json file? I'm using the vllm-openai Docker image to run the model.

Red Hat AI org

Hi @zacksiri. Although this is a Mistral model, it is derived from the HuggingFace definition (in contrast to the original Mistral definition). Hence, you SHOULD NOT use these arguments: `--tokenizer_mode mistral`, `--config_format mistral`, `--load_format mistral`, `--tool-call-parser mistral`
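For illustration, a launch command without those Mistral-specific flags might look like the sketch below. The image tag and the `<org>/<model-name>` placeholder are assumptions; substitute this repository's actual model ID:

```shell
# Sketch: serve the model with the vllm-openai Docker image, relying on the
# default HuggingFace tokenizer/config/load formats (i.e. simply omitting
# --tokenizer_mode mistral, --config_format mistral, --load_format mistral).
docker run --gpus all -p 8000:8000 \
  vllm/vllm-openai:latest \
  --model <org>/<model-name>
```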

Also, I noticed that it works on `vllm==0.8.3` but fails on `0.8.4`. I'll notify the vLLM team about this.
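Until the regression is resolved, one possible workaround (assuming a pip-based install rather than the Docker image) is pinning the version reported to work:

```shell
# Pin the last vLLM release reported to work with this model
pip install vllm==0.8.3
```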

Thank you!

zacksiri changed discussion status to closed