Skip to content

Releases: NousResearch/atropos

v0.4.0

10 Mar 04:20
1d78069

Choose a tag to compare

Highlights

New example trainer

Weights are shared between vLLM and the trainer, no comms needed to sync weights, and memory saved by using only one copy of the weights!

On Policy/Self Distillation Support

Now support logprobs from a teacher/prompted endpoint, fully supporting on policy distillation/self distillation!

OpenAI Endpoint for managed server

Launch an openai endpoint and collect rollouts from any program that takes in an openai endpoint!

What's Changed

Read more

v0.3.0

16 Jul 20:39
8284a0b

Choose a tag to compare

What's Changed

Read more

v0.2.1

18 May 14:58
c189fc3

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.2.0...v0.2.1

v0.2.0

13 May 22:57
4f0c464

Choose a tag to compare

What's Changed

  • Update README.md by @sukrucildirr in #2
  • [README] Add offline SFT data gen docs by @hjc-puro in #4
  • Add rejection sampling description to offline SFT docs. Also add atropos-dpo-gen to the pyproject.toml. by @hjc-puro in #5
  • Add process subcommand by @hjc-puro in #9
  • Add full wandb train/eval acc metrics, expand rollouts table with more information to Finance Prediction Environment by @teknium1 in #15
  • Fix PR Template for GitHub Web by @teknium1 in #17
  • Quick hotfix for better PR template by @teknium1 in #18
  • Removed mentions of NousResearch/DeepHermes-3-Llama-3-1B-Preview and … by @edmundman in #20
  • Update base env README with design philosophy by @hjc-puro in #25
  • 24 keyerror on self state in base register env fail by @dmahan93 in #27
  • fix multimodal envs. add view_run_multimodal by @sumo43 in #22
  • Support args in process cli in serve subcommand by @hjc-puro in #14
  • fix olympiadbench due to upstream changes by @dmahan93 in #31
  • run pre-commit on all files by @dmahan93 in #32
  • add pre-commit workflow and readme.md changes to point to debugging tools by @dmahan93 in #33
  • ⚡️ Speed up function grab_exact_from_heterogeneous_queue by 1,680% by @aseembits93 in #7
  • fix pre-commit by @dmahan93 in #37
  • Improve error logging for HTTP requests by @hjc-puro in #13
  • add gym taxi env by @dmahan93 in #36
  • Remove dependency on torch for default installation by @dmahan93 in #40
  • Add n kwarg being ignored workaround by @dmahan93 in #41
  • add custom server support by @dmahan93 in #28
  • Create upload_to_pypi.yml for releases by @dmahan93 in #42

New Contributors

Full Changelog: https://github.com/NousResearch/atropos/commits/v0.2.0