sftrainer #757
thistleknot started this conversation in General
Replies: 1 comment
-
Axolotl is similar to SFTTrainer in that it is a wrapper around the HF Trainer. As for NEFTune, YMMV. We've done some experiments using the neft-v3 branch, and while it performs better than the MT-Bench score that TRL reported, it still performed worse than a basic fine-tune.
-
I'm reading that SFTTrainer can be used to train an LLM, rather than simply the base Trainer:
https://huggingface.co/docs/trl/main/en/sft_trainer
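For context, a minimal sketch of what that looks like with TRL's SFTTrainer (argument names have shifted between TRL releases, and the checkpoint and dataset below are placeholders, not anything from this thread):

```python
# Hedged sketch: supervised fine-tuning of a causal LM with TRL's SFTTrainer.
# Exact argument names vary by TRL version; model/dataset are placeholders.
from datasets import load_dataset
from trl import SFTConfig, SFTTrainer

# Any dataset with a plain "text" column works for this style of SFT.
dataset = load_dataset("timdettmers/openassistant-guanaco", split="train")

trainer = SFTTrainer(
    model="facebook/opt-350m",      # swap in your causal LM checkpoint
    train_dataset=dataset,
    args=SFTConfig(
        output_dir="sft-output",
        max_seq_length=512,
        per_device_train_batch_size=4,
        num_train_epochs=1,
    ),
)
trainer.train()
```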
There is also this concept of NEFTune, which sounds like masking and/or LoRA dropout.
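As I understand it, NEFTune actually adds uniform noise to the token embeddings during training rather than masking tokens. In recent TRL/transformers releases it can be switched on with a single argument; a rough sketch, assuming a version that exposes `neftune_noise_alpha` (checkpoint, dataset, and alpha value are placeholders):

```python
# Hedged sketch: enabling NEFTune (embedding noise) in an SFTTrainer run.
from datasets import load_dataset
from trl import SFTConfig, SFTTrainer

dataset = load_dataset("timdettmers/openassistant-guanaco", split="train")

trainer = SFTTrainer(
    model="facebook/opt-350m",
    train_dataset=dataset,
    args=SFTConfig(
        output_dir="sft-neftune",
        max_seq_length=512,
        neftune_noise_alpha=5.0,   # None/0 disables NEFTune; ~5 is a commonly cited value
    ),
)
trainer.train()
```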
However, I also wanted to ask: does this setup support masked models? I know that when training manually with my own code, I could specify a masked LM for LLaMA.
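As far as I know, SFTTrainer is aimed at causal-LM style fine-tuning; a masked-LM objective is usually set up with the plain Trainer and DataCollatorForLanguageModeling(mlm=True). A rough sketch with placeholder checkpoint and dataset, not specific to LLaMA:

```python
# Hedged sketch: a masked-LM objective with the plain HF Trainer, for contrast
# with SFTTrainer's causal-LM setup. Checkpoint and dataset are placeholders.
from datasets import load_dataset
from transformers import (
    AutoModelForMaskedLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

dataset = load_dataset("timdettmers/openassistant-guanaco", split="train")

tokenizer = AutoTokenizer.from_pretrained("roberta-base")
model = AutoModelForMaskedLM.from_pretrained("roberta-base")

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=512)

tokenized = dataset.map(tokenize, batched=True, remove_columns=dataset.column_names)

# mlm=True makes the collator randomly mask tokens and build MLM labels.
collator = DataCollatorForLanguageModeling(
    tokenizer=tokenizer, mlm=True, mlm_probability=0.15
)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="mlm-output", per_device_train_batch_size=8),
    train_dataset=tokenized,
    data_collator=collator,
)
trainer.train()
```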