Skip to content

Regarding issue with output format for Vicuna (lmsys/vicuna-7b-v1.5) using fastchat/serve/huggingface_api.py #3795

@sensai99

Description

@sensai99

Hi!

I was using the huggingface_api for inference on lmsys/vicuna-7b-v1.5 using the following command:

python3 -m fastchat.serve.huggingface_api --model lmsys/vicuna-7b-v1.5

The ASSISTANT's output looks like (with the special characters "▁" and additional spaces):

USER: Hello! Who are you?
ASSISTANT: ▁I ' m ▁a ▁language ▁model ▁called ▁Vic una , ▁and ▁I ▁was ▁trained ▁by ▁Lar ge ▁Model ▁Systems ▁Organ ization ▁( L MS YS ) ▁research ers .

However, I was expecting the output to be clean:

USER: Hello! Who are you?
ASSISTANT: I'm a language model called Vicuna , and I was trained by Large Model Systems Organization (LMSYS) researchers.

I need to have clean output because I am performing multi-turn generation (i.e. pass the first response of the assistant back to the assistant as context for generating next response).

Sorry if I am missing something fundamental here but any help would be much appreciated!

Image

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions