Skip to content

SmolLM2 minor issuesΒ #211

@ecntu

Description

@ecntu

Hello,

On a fresh conda env install following the README, I get two minor annoyances when trying to run SFT on SmolLM2 with

ACCELERATE_LOG_LEVEL=info accelerate launch --config_file recipes/accelerate_configs/deepspeed_zero3.yaml scripts/run_sft.py recipes/smollm2/sft/config_smol.yaml
  1. I get an unused parameter exception for use_flash_attention_2 from H4ArgumentParser. Deleting the corresponding line from the config solves the problem (I think).
  2. I get a ValueError: Chat template is already added to the tokenizer. If you want to overwrite it, please set it to None from the setup_chat_format [here](url). Adding
if hasattr(tokenizer, "chat_template") and tokenizer.chat_template is not None:
    tokenizer.chat_template = None  # Reset the chat template

solves the problem (I think).

I don't know if I am doing something wrong or if the solutions I mentioned are actual solutions.

Thanks

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions