Update dependency accelerate to v1.10.0 #97
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This PR contains the following updates:
==1.8.1
->==1.10.0
Release Notes
huggingface/accelerate (accelerate)
v1.10.0
: : N-D ParallelismCompare Source
N-D Parallelism
Training large models across multiple GPUs can be complex, especially when combining different parallelism strategies (e.g TP, CP, DP). To simplify this process, we've collaborated with Axolotl to introduce an easy-to-use integration that allows you to apply any combination of parallelism strategies directly in your training script. Just pass a
ParallelismConfig
specifying the size of each parallelism type—it's that simple.Learn more about how it works in our latest blogpost.
ParallelismConfig
fromPartialState
by @SunMarc in https://github.com/huggingface/accelerate/pull/3720FSDP improvements
We've fixed ignored modules attribute. With this, it is now possible to train PEFT model that moe layers that contrains
q_proj
andv_proj
parameters. This is especially important for fine-tuninggpt-oss
model.Minor improvements
New Contributors
Full Changelog: huggingface/accelerate@v1.9.0...v1.10.0
v1.9.0
: : Trackio support, Model loading speedup, Minor distributed improvementsCompare Source
Trackio tracker support
We've added support for a trackio, lightweight, 💯 free experiment tracking Python library built on top of 🤗 Datasets and Spaces.
Main features are:
space_id
.To use it with accelerate, you need to set
log_with
and initialize the trackersThanks @pcuenca for the integration !
Model loading speedup when relying
set_module_tensor_to_device
Setting tensor while clearing cache is very slow, so we added
clear_device
option to disable it.Another small optimization is using
non_blocking
everywhere and syncing just before returning control to the user. This makes the loading slightly faster.FDSP, Deepspeed, FP8 minor improvements
Accelerator()
configuring by @pstjohn in https://github.com/huggingface/accelerate/pull/3677🚨🚨🚨 Breaking changes 🚨🚨🚨
find_executable_batch_size()
will no longer halves the batch after every OOM. Instead, we will multiply the batch size by 0.9. This should help user not waste gpu capacity.What's Changed
New Contributors
Full Changelog: huggingface/accelerate@v1.8.1...v1.9.0
Configuration
📅 Schedule: Branch creation - At any time (no schedule defined), Automerge - At any time (no schedule defined).
🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.
♻ Rebasing: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.
🔕 Ignore: Close this PR and you won't be reminded about this update again.
This PR was generated by Mend Renovate. View the repository job log.