Conversation
bfb705b to
6f6a312
Compare
6f6a312 to
86fba44
Compare
|
solves #462 |
|
@Mergifyio rebase |
✅ Branch has been successfully rebased |
86fba44 to
d40e922
Compare
|
|
||
| MINIMAL_TRAINING_ARGS = { | ||
| "max_seq_len": 140, # this config fits nicely on 4xL40s and may need modification for other setups | ||
| "max_batch_len": 15000, |
There was a problem hiding this comment.
are these parameters' changes intentional? why?
|
Is the idea behind the param changes that this could run on a smaller ec2 instance? Should we also update the workflow file in this pr? |
|
@Mergifyio rebase |
runs through Liger w/ and w/o CPUOffload parameterizes LoRA but doesn't enable it because of memory usage bug removes `smoketest.sh` from `tests` directory- all tests should use pytest in the future. Signed-off-by: James Kunstle <jkunstle@redhat.com>
Let's see if there's any issue without these changes. If not, we can cancel / postpone changes to a separate PR. Signed-off-by: Ihar Hrachyshka <ihar.hrachyshka@gmail.com>
✅ Branch has been successfully rebased |
8aad472 to
a3e0fde
Compare
|
This pull request has been automatically marked as stale because it has not had activity within 90 days. It will be automatically closed if no further activity occurs within 30 days. |
|
This pull request has been automatically marked as stale because it has not had activity within 90 days. It will be automatically closed if no further activity occurs within 30 days. |
|
This pull request has been automatically closed due to inactivity. Please feel free to reopen if you intend to continue working on it! |
☑️ Command disallowed due to command restrictions in the Mergify configuration.Details
|
1 similar comment
☑️ Command disallowed due to command restrictions in the Mergify configuration.Details
|
requires #590 to be merged first- this PR is based on that one
Closes #462