Skip to content

support additional special tokens #29

@Derekglk

Description

@Derekglk

Hello @wangkuiyi ,

It seems this tokenizer only supports one special token "<|endoftext|>".
Does it support other additional special tokens? For instatnce the ones we added in special_tokens_map.json,
like
"<|user|>", "<|assistant|>", "<s>", "</s>" and "<unk>"?

Thanks!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions