Adding script for processing many intermediate checkpoints at once for offline evals#731
Adding script for processing many intermediate checkpoints at once for offline evals#731IanMagnusson wants to merge 93 commits intomainfrom
Conversation
.gitignore
Outdated
| @@ -1,3 +1,6 @@ | |||
| # beaker yaml | |||
| guided-trout-2f805b9.yaml | |||
log.txt
Outdated
| @@ -0,0 +1,10 @@ | |||
|
|
|||
hf_olmo/convert_olmo_to_hf.py
Outdated
| upload_local_checkpoint(local_checkpoint_dir, args.destination_dir) | ||
|
|
||
| print(f"Converted checkpoint saved to {args.destination_dir}") | ||
| if args.cleanup_local_dir: |
There was a problem hiding this comment.
Is there ever a reason not to do this?
There was a problem hiding this comment.
I removed the if statement & the flag.
requirements.txt
Outdated
| @@ -0,0 +1,7 @@ | |||
| torch | |||
There was a problem hiding this comment.
We don't use requirements.txt in OLMo. We use pyproject.toml.
There was a problem hiding this comment.
Removed --- it was created to troubleshoot.
He does. What this one does is very similar but the focus is on batch conversion and wildcard acceptance. And it was written for oe-eval consistent ranking project with expediting its pipeline in mind. |
|
@jenahwang would it be possible to merge in your changes to the other script? or consolidate the two? it's confusing to have two conversion scripts, and it doubles maintenance. |
Making a draft PR for this so we can consider merging this in to main. It would be nice if we could do this so we don't run into version issues if we train models in the future that are not compatible with the version of the code forked here.