@BenjaminBossan

This PR fixes a few issues with the examples/sft example.

  1. Argument parsing failed because trl renamed an argument to max_length, which now conflicted with another argument name already in use.
  2. To select 8bit bnb quantization, a user also had to pass use_4bit_quantization=True, because of incorrect indentation.
  3. Documented that 8bit quantization does not work with FSDP (the aforementioned bug may have masked this).
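
As a rough illustration of the first item, here is a minimal sketch of how a duplicated argument name can break parsing. It uses plain argparse rather than the actual parser setup of the example script, and the defaults are made up for illustration; only the name max_length comes from the description above.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Register the same option name twice, as two independent argument groups might."""
    parser = argparse.ArgumentParser()
    # Hypothetical: an option the script already defined under this name.
    parser.add_argument("--max_length", type=int, default=512)
    # Hypothetical: a second definition arriving after an upstream rename.
    parser.add_argument("--max_length", type=int, default=1024)
    return parser


try:
    build_parser()
    conflict_message = None
except argparse.ArgumentError as err:
    # argparse rejects the duplicate while the parser is being built,
    # before any command-line input is read.
    conflict_message = str(err)

print(conflict_message)
```

Renaming one of the two options, as this PR does on the example side, is the simplest way to avoid the clash.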

I verified locally that 4bit bnb works with FSDP but 8bit raises an error (see #2833).
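
The indentation issue from the second item can be pictured with a small sketch. The function names and the string return values are hypothetical stand-ins, not the code from examples/sft; the point is only that nesting the 8bit check under the 4bit check makes it unreachable unless both flags are set.

```python
from typing import Optional


def pick_quantization_buggy(use_4bit: bool, use_8bit: bool) -> Optional[str]:
    """Hypothetical pre-fix shape: the 8bit check is indented one level too deep."""
    mode = None
    if use_4bit:
        mode = "4bit"
        # Only reached when use_4bit is also True.
        if use_8bit:
            mode = "8bit"
    return mode


def pick_quantization_fixed(use_4bit: bool, use_8bit: bool) -> Optional[str]:
    """Same statements, with the 8bit check dedented to the top level."""
    mode = None
    if use_4bit:
        mode = "4bit"
    if use_8bit:
        mode = "8bit"
    return mode


# Requesting 8bit alone is silently ignored by the buggy variant.
print(pick_quantization_buggy(use_4bit=False, use_8bit=True))
print(pick_quantization_fixed(use_4bit=False, use_8bit=True))
```

With the nested version, a run that asks only for 8bit proceeds without quantization, which is consistent with the FSDP incompatibility going unnoticed.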
