29 Jan 19:59

kaisopos

f26f1b6

v0.7 Latest

Latest

Oumi 0.7 Release

✨ Highlights

This release brings major platform upgrades (Python 3.14, PyTorch 2.9), new inference engines, rule-based evaluation judges, and significant CLI/documentation improvements.

🚀 New Features

Inference

Fireworks inference engine - New backend for Fireworks AI (#2158)

# Fireworks example (set FIREWORKS_API_KEY env var)
model:
  model_name: "accounts/fireworks/models/llama4-maverick-instruct-basic"
engine: FIREWORKS

oumi infer -i -c configs/recipes/llama4/inference/maverick_instruct_fireworks_infer.yaml

OpenRouter inference engine - New backend for OpenRouter (#2168)

# OpenRouter example (set OPENROUTER_API_KEY env var)
model:
  model_name: "anthropic/claude-sonnet-4.5"
engine: OPENROUTER

# Use via cli
oumi infer -i -c configs/apis/openrouter/infer_claude_4_5_sonnet.yaml

Loading spinner - Visual feedback during inference operations (#2085)
Pre-trained custom model support - Load your own pre-trained models (#2044)

Evaluation

Rule-based judges - Deterministic evaluation judges with CLI integration and examples (#2119, #2171)

# configs/projects/judges/rule_based/regex_match_phone.yaml
judge_params:
  prompt_template: "{response}"

rule_judge_params:
  rule_type: "regex"
  input_fields:
    - "response"
  rule_config:
    pattern: "\\d{3}-\\d{4}"
    input_field: "response"
    match_mode: "search"
    inverse: false

oumi judge dataset -c regex-match-phone --input data/judge_input.jsonl

Training

Metrics logging callback - Log training metrics to disk (#2140)
Per-reward function configuration - New reward_function_kwargs support (#2143)

trainer_type: TRL_GRPO

reward_functions:
  - rubric_reward
  - gsm8k

reward_function_kwargs:
  rubric_reward:
    judge_panel_path: "configs/projects/judges/rubric_panel.yaml"
  gsm8k:
    strict: true

Data & Synthesis

XLSX and DOCX support - New formats for synthesis and datasets (#2148)
Few-shot sampling - Sample few-shot examples from sources during synthesis (#2151)
Batch AttributeSynthesizer - Batch processing support (#2181)
RaR datasets - New datasets and base rubric dataset classes (#2144)

Infrastructure

Nebius cloud provider - New cloud option (#2179)
Kubernetes Skypilot support - Added k8s dependency (#2124)
ARM Docker support - Enabled ARM builds with useful utilities (#2141)
One-line installer - New install.sh script (#2155)

# Basic installation
curl -LsSf https://oumi.ai/install.sh | bash

# With GPU support
curl -LsSf https://oumi.ai/install.sh | bash -s -- --gpu

# With specific Python version
curl -LsSf https://oumi.ai/install.sh | bash -s -- --python 3.12

Telemetry - Optional usage analytics via PostHog (#2145)

📈 Improvements

Performance

Lazy CLI imports - Faster startup times (#2110)

CLI

List aliases, auto-complete, help, and common args improvements (#2122)
Judge command UX improvements (#2129)
Version and system info utilities (#2142)

oumi env  # Show Oumi version, Python version, installed packages, GPU info

Documentation

Complete docs refresh with new custom theme (#2133, #2167)
Added CLI reference sections for analyze, tune, and quantize (#2126)
Updated installation instructions (#2169)

Configs

Added Gemma-2-IT chat template and example config (#2159)
Updated Gemma3-4B-IT SFT training config (#2156)

⚠️ Breaking Changes

Dropped Python 3.9 support - Minimum supported version is now Python 3.10 (#2107)
Deprecated alpaca_eval integration (#2108)
Deprecated protobuf conversation definitions (#2127)

🐛 Bug Fixes

Fixed synthesis rounding errors (#2104)
Fixed logging of distributed training CLI commands (#2165)
Fixed FSDP transformer_wrap_class parsing for fully qualified names (#2164)
Fixed deprecated torch_dtype usage (#2123)
Fixed Oumi Tour notebook output (#2157)
Cleaned up errant print statements (#2121)

📦 Dependency Updates

PyTorch 2.9 and Python 3.14 support (#2109)
Updated: peft, uvicorn, bitsandbytes, click, pillow, typer, torchao, pycares, wandb

👋 New Contributors

Welcome to our new contributors!

Full Changelog: v0.6.0...v0.7

Contributors

brian-nguyen, lefft, and 3 other contributors

Assets 2

17 Dec 21:26

rlehman221

v0.6.0

078d9a3

v0.6.0

Oumi v0.6.0 Changelog

We’re excited to announce Oumi v0.6.0! This release brings Python 3.13 support, a powerful new CLI for dataset analysis, the TRL GOLD trainer for preference learning, and first-class Kubernetes deployment support.

Highlights

Python 3.13 Support

Oumi now officially supports Python 3.13, letting you take advantage of the latest Python performance improvements and features.
(#2092)

New `oumi analyze` CLI Command

Understanding your training data just got easier. The new oumi analyze command lets you inspect and analyze datasets directly from the command line—no code required.

# Analyze a local dataset
oumi analyze -c configs/examples/analyze/analyze.yaml

# Export results in different formats
oumi analyze -c configs/examples/analyze/analyze.yaml --format parquet --output ./my_results

Create a simple config to analyze any HuggingFace dataset:

# hf_analyze.yaml
dataset_name: argilla/databricks-dolly-15k-curated-en
split: train
sample_count: 1000
analyzers:
  - id: length

Check out the analyze documentation for more details.
(#2069, #2071)

TRL GOLD Trainer

We’ve added support for the GOLD (Generalized Online Learning from Demonstrations trainer from TRL. GOLD is an online preference learning algorithm that improves upon DPO by generating responses on-the-fly during training, leading to better alignment with less distribution shift.

# Run GOLD training with the example config
oumi train -c configs/examples/gold/train.yaml

Or configure it in your own training config:

training:
  trainer_type: "TRL_GOLD"
  gold:
    teacher_model_name_or_path: "HuggingFaceTB/SmolLM2-360M-Instruct"
    temperature: 0.9
    max_completion_length: 512
    lmbda: 0.5  # 50% on-policy, 50% off-policy

This requires TRL 0.26+, which is now the default.
(#2095, #2097)

Code Evaluation Judges

New LLM-as-judge evaluators specifically designed for assessing code quality. These judges can evaluate generated code for correctness, style, security, and other software engineering best practices—perfect for evaluating coding assistants and code generation models.

Thanks to @N-45div for this contribution!
(#2087)

Kubernetes Deployment

You can now deploy Oumi training jobs on Kubernetes clusters.

Option 1: Using SkyPilot (new in this release)

# k8s_job.yaml
name: my-training-job
resources:
  cloud: k8s
  accelerators: "A100:1"
run: |
  oumi train -c configs/recipes/llama3_1/sft/8b_lora/train.yaml

oumi launch up -c k8s_job.yaml --cluster my-k8s-cluster

Option 2: Direct kubectl deployment

For existing K8s clusters, you can deploy Oumi directly using kubectl. See the Kubernetes deployment guide for detailed instructions including platform-specific examples for EKS, GKE, and AKS.

Thanks to @min-oumi!
(#2054, #2068)

Custom Master Port for Distributed Training

Running multiple distributed training jobs on the same node? You can now specify a custom master port to avoid conflicts.

Thanks to @monnetb!
(#2021)

ARM Docker Images for Mac

Apple Silicon users rejoice! We now publish ARM64 Docker images, so you can run Oumi containers natively on M1/M2/M3 Macs without emulation overhead.
(#2049)

Bug Fixes

Fix Docker release action (#2023)
Fix length analyzer column naming and add comprehensive message summary tests (#2057)
Fix "too many files open" error when processing large datasets (#2060)
Fix lm_eval multi-GPU integration for distributed evaluation (#2064)
Fix mutable default argument in conversation handling (#2048)

Documentation

Add news item on OpenEnv notebook (#2022)
Add docs for missing inference params and how to serve LoRA adapters (#2047)
Add local Docker guide (#2058)

Deprecations

Cambrian model: The experimental Cambrian model has been deprecated (#2034)
target_col: Removed deprecated target_col field mentions (#2056)

Dependencies

TRL upgraded to 0.26 (#2097)
datasets library upgraded (#2091)
wandb >=0.21,<0.24 (#2032)
safetensors >=0.6,<0.8 (#2031)
bitsandbytes >=0.47,<0.49 (#2038)
torchao >=0.12,<0.15 (#2079)
deepspeed >=0.17.0,<0.19.0 (#2080)
pydantic >=2.11,<2.13 (#2081)
skypilot >=0.10.2,<0.12 (#2089)
torchdata is now optional (#2066)

New Contributors

@monnetb made their first contribution in #2021
@dependabot[bot] made their first contribution in #2029
@min-oumi made their first contribution in #2054
@N-45div made their first contribution in #2087

Full Changelog:
v0.5.0...v0.6.0

Contributors

monnetb, dependabot, and 2 other contributors

Assets 2

18 Nov 21:14

min-oumi

v0.5.0

e8ce6f6

v0.5.0

Oumi v0.5.0 Release Notes

We're excited to announce Oumi v0.5.0, featuring hyperparameter tuning capabilities, expanded inference options, and enhanced launcher functionality.

🚀 Major Features

Data Synthesis Module

Introducing oumi synth - a powerful data synthesis module for automatically generating high-quality training datasets using LLMs (#1965)
Template-based Generation: Control attributes like difficulty, style, and domain for diverse dataset creation
Domain-specific Datasets: Generate data for specialized fields (legal, medical, technical, etc.)
Data Augmentation: Expand existing small datasets by generating variations
Multiple Formats: Support for instruction-following, QA, and conversational datasets

Hyperparameter Tuning Module

Introducing oumi tune - a new hyperparameter search and optimization module for efficient model tuning (#1998, #1991). Thank you @gbladislau-aumo!

Inference & Training Enhancements

Bedrock Integration: Added AWS Bedrock Inference Engine support for scalable model deployment (#1983) - Thank you @aniruddh-alt!
GKD Trainer Support: New Generalized Knowledge Distillation trainer for model compression workflows (#2000)
OpenEnv RL Training: Demo notebook showcasing reinforcement learning training with reward visualization (#1996, #2012)

HPC & Launcher Improvements

NERSC Perlmutter Support: Oumi launcher now supports the NERSC Perlmutter HPC cluster (#1959)
Enhanced Logging: Added job log trailing and dedicated logs command for better debugging (#1951, #1964)
Lazy Cloud Initialization: Improved launcher startup performance (#1985)

✨ Improvements

Model Configuration

Added Qwen3 VL 4B model configurations (#1992, #1993)
Exposed chat_template_kwargs parameter in ModelParams for fine-grained control (#1997)

Developer Experience

Updated BaseConfig to support non-primitive field types (#1684)
Optional stdout_file parameter in SLURM client (#1974)

🐛 Bug Fixes

Fixed NaN values in dataset analyzer for single-conversation datasets (#1961)
Resolved SLURM environment variable issues (PMI_RANK → SLURM_PROCID) (#2010) (Thank you @AliliRayane !)
Fixed non-primitive field saving in base config (#2005)
Updated uv pip install commands to include --system flag (#1979)
Unique inference scratch filenames via hashing (#1986)

📦 Dependency Updates

Upgraded transformers: 4.56 → 4.57 (#1966, #1990)
Upgraded TRL: 0.24.0 → 0.25 (#1995, #2011)
Pinned uvicorn version for SkyPilot compatibility (#1978)

🎉 New Contributors

Welcome to our new contributors!

📖 Full Changelog

For a complete list of changes, see the full changelog

Contributors

gbladislau, aniruddh-alt, and 3 other contributors

Assets 2

20 Oct 15:38

oelachqar

v0.4.2

5e9f231

v0.4.2

Release Notes - v0.4.2

🚀 New Features

Model Support: Added support Qwen3-VL ([#1992](#1992))
HPC Cluster Support: Added support for NERSC Perlmutter HPC cluster in Oumi launcher ([#1959](#1959))
Enhanced Logging:
- Added ability to trail logs for launcher jobs ([#1951](#1951))
- Added launcher logs command for easier log access ([#1964](#1964))

🐛 Bug Fixes

Fixed Sky Pilot unit tests ([#1967](#1967))
Fixed GPU test issues ([#1970](#1970))
Pinned uvicorn version to resolve SkyPilot compatibility ([#1978](#1978))
Updated inference to always hash for unique scratch filenames ([#1986](#1986))
Improved error handling for document processing issues ([#1989](#1989))

🔧 Improvements

Performance: Lazy initialization of clouds in Oumi launcher for faster startup ([#1985](#1985))
Code Quality:
- Refactored dataset analysis utilities ([#1962](#1962), [#1982](#1982))
- Extracted conversation_turns to top-level for better data structure ([#1969](#1969))
- Made stdout_file optional in Slurm client ([#1974](#1974))

📚 Documentation

Updated README with latest information ([#1968](#1968))
Added synthesis documentation and example configurations ([#1965](#1965))

Full Changelog: v0.4.0...v0.4.2

Assets 2

14 Oct 16:35

shanghongsim

v0.4.1

e09cc6c

v0.4.1

What's Changed

Fix NaN values in dataset analyzer statistics for single conversations by @ryan-arman in #1961
[tiny] Add __init__.py to some test dirs by @wizeng23 in #1963
Add The Ability To Trail Logs For Launcher Jobs by @rlehman221 in #1951
Move compute statistics to analysis_utils by @ryan-arman in #1962
Fixed Sky Pilot Unit Tests Failing by @rlehman221 in #1967
extract conversation_turns from conversation_level_summary to top level by @ryan-arman in #1969
Update README.md by @wizeng23 in #1968
[tiny] Upgrade transformers to 4.56 by @wizeng23 in #1966
Quick fix for our GPU tests by @taenin in #1970
Support NERSC Perlmutter HPC cluster in Oumi launcher by @wizeng23 in #1959
Added Launcher Logs Command by @rlehman221 in #1964
Make stdout_file optional in slurm client by @rlehman221 in #1974
Add synth documentation and example configs by @jgreer013 in #1965
Add cuda launch blocking arg to e2e tests for debugging by @jgreer013 in #1976
Pin uvicorn version to fix skypilot by @jgreer013 in #1978
Update inference to always hash for unique inference scratch filenames by @jgreer013 in #1986
Lazy init clouds in oumi launcher by @oelachqar in #1985
Unblock oumi dataset support in synthesis by @jgreer013 in #1988

Full Changelog: v0.4.0...v0.4.1

Contributors

oelachqar, taenin, and 4 other contributors

Assets 2

02 Sep 20:38

wizeng23

v0.4.0

bb493eb

v0.4.0

Oumi v0.4 Changelog

✨ gpt-oss Training and Inference

OpenAI released two highly-anticipated open-weight models in August, gpt-oss-20b and gpt-oss-120b. They’re mixture-of-experts (MoE) reasoning models with strong tool-use performance, and are optimized with native 4-bit quantization for memory-efficient training and inference. You can now run training and inference on these models in Oumi!

Usage Example:

# Train gpt-oss-20b with LoRA on a single GPU
oumi train -c oumi://configs/recipes/gpt_oss/sft/20b_lora_single_gpu_train.yaml

# Run local inference on gpt-oss-120b using vLLM
oumi infer -i -c oumi://configs/recipes/gpt_oss/inference/120b_vllm_infer.yaml

⚡ DeepSpeed Support

DeepSpeed is a powerful and configurable optimization library that allows you to train large models efficiently using techniques like distributed training and memory optimization. Oumi now supports DeepSpeed in addition to PyTorch’s native Fully Sharded Data Parallel (FSDP) for distributed training!

Usage Example:

# Train Llama 3.1 8B using DeepSpeed’s ZeRO-3 optimization strategy
oumi train -c oumi://configs/examples/deepspeed/llama3_1_8b_deepspeed_z3_train.yaml

# Combine DeepSpeed with YARN RoPE scaling to enable training on longer contexts!
# Train Qwen2.5 7B with 128k token context length using YARN and DeepSpeed
oumi train -c oumi://configs/projects/limo/qwen2.5_7b_fft_yarn_deepspeed.yaml

🗄️ CLI Tool for Hugging Face Cache Management

When using datasets and models from Hugging Face Hub, over time it becomes hard to track what’s been cached, how much space it’s taking up, etc. In #1897, @aniruddh-alt has added a oumi cache utility to the Oumi CLI. This lets you view, add to, and delete from the Hugging Face Hub local cache, in addition to getting more information about entries in the cache.

Usage Example:

# View what’s in the cache
oumi cache ls

# Filter for items containing the substring “llama”, and sort by name
oumi cache ls -f *llama* --sort name

# Download a model to cache
oumi cache get Qwen/Qwen3-0.6B

# View information about the cached model
oumi cache card Qwen/Qwen3-0.6B

# Remove a model from cache
oumi cache rm Qwen/Qwen3-0.6B

🎯 Vision DPO and KTO Support

We have added support for two new training methods: Direct Preference Optimization (DPO) on Vision-Language Models and Kahneman-Tversky Optimization (KTO). Special thanks to @efsiatras for implementing KTO support in #1538!

Usage Example:

# Vision DPO on Qwen2.5-VL 3B
oumi train -c oumi://configs/recipes/vision/qwen2_5_vl_3b/dpo/train.yaml

# KTO on Phi-3
oumi train -c oumi://configs/recipes/phi3/kto/train.yaml

🛠️ Developer Experience

Upgrade several package dependencies to latest versions
Additional GGUF, MacOS LlamaCPP, and remote frontier model inference configs by @penfever in #1923 and #1947
Add Pre-Populated GitHub Issue Link On Failures by @rlehman221 in #1936
Add Verbose Flag to Oumi Train by @rlehman221 in #1940
Enable users to log data samples during training for debugging by @shanghongsim in #1943

New Contributors

@efsiatras made their first contribution in #1538
@rlehman221 made their first contribution in #1936

All Contributors

@aniruddh-alt, @efsiatras, @jgreer013, @kaisopos, @oelachqar, @penfever, @rlehman221, @ryan-arman, @shanghongsim, @stefanwebb, @taenin, @wizeng23

Full Changelog: v0.3.0...v0.4.0

Contributors

oelachqar, stefanwebb, and 10 other contributors

Assets 2

05 Aug 01:48

wizeng23

v0.3.0

32393df

v0.3.0

Oumi v0.3 Changelog

🔧 Model Quantization (NEW)

Quantization is a crucially important family of methods for reducing model size, for example, prior to deployment. Oumi now supports applying Activation-aware Weight Quantization (AWQ) to all models. See how in our notebook.

Usage Example:

# Quick start - quantize TinyLlama to 4-bit
oumi quantize --method awq_q4_0 --model "TinyLlama/TinyLlama-1.1B-Chat-v1.0" --output quantized_model

# With configuration file
oumi quantize --config quantization_config.yaml

⚖️ Judge API V2 (MAJOR UPDATE)

LLM-as-a-Judge is a method for using foundation models to reliably evaluate other foundation models. We’ve overhauled Oumi’s LLM-as-Judge interface for ease-of-use and flexibility. Check out our notebook here.

Usage Example:

from oumi.judges.simple_judge import SimpleJudge

# Built-in truthfulness judge
simple_judge = SimpleJudge(judge_config="oumi://configs/projects/judges/generic/truthfulness.yaml")

dataset = [{"request": "What is the capital of France?", "response": "Rome"}]
outputs = simple_judge.judge(dataset)

🎯 Adaptive Inference (NEW)

💪 Adaptive Inference, as we term it, refers to new features in Oumi for resuming training (or any task) when a job has crashed, as well as optimizing inference parallelization to maximize bandwidth. Learn more in our notebook.

🛠️ Developer Experience

Updated contributing guidelines
Enhanced documentation
Tutorial notebook fixes
Improved error handling and testing
MLflow integration improvements
Multi-node verl Slurm job support
Rich logging handler option

New Contributors

@amarpal made their first contribution in #1831
@42Shawn made their first contribution in #1837

Full Changelog: v0.2.1...v0.3.0

Contributors

amarpal and 42Shawn

Assets 2

11 Jul 18:00

kaisopos

v0.2.1

7740723

v0.2.1

What's Changed

Set infer_online and infer_from_file to private by @jgreer013 in #1745
Update launch.md by @shanghongsim in #1781
Add adaptive semaphore to enable future adaptive throughput scenarios by @jgreer013 in #1780
Fix a pyright regression by @taenin in #1783
Judge API V2 | Fix judge config from repo path by @kaisopos in #1782
Add permutable attributes and combination sampling for data synthesis by @jgreer013 in #1773
Removed collator in finetuning tutorial notebook by @shanghongsim in #1788
Update our contributing guidelines. by @taenin in #1789
Add adaptive concurrency controller in preparation for adaptive inference by @jgreer013 in #1784
Fixed issue with final conversations not consistently being saved by @jgreer013 in #1795
Add support for ingesting datasets for synthesis by @jgreer013 in #1790
Add support for adaptive inference by @jgreer013 in #1791
Add support for Example Sources in Synthesis by @jgreer013 in #1797
Webinar announcement and other news by @stefanwebb in #1800
Added utm_source parameters by @stefanwebb in #1802
Add code to handle document ingestion by @jgreer013 in #1796
Add code for handling basic document segmentation by @jgreer013 in #1803
Update mflow support in oumi trainer by @oelachqar in #1804
Add multi-node verl SLURM job by @wizeng23 in #1798
Fixed various tutorial notebooks by @shanghongsim in #1792
Add parameter logging to oumi trainer by @oelachqar in #1807
Judge API V2 | Enable prompt variable replacement by YAML by @kaisopos in #1805
[tiny] Update train config comment header by @wizeng23 in #1809
Add experimental option to use the rich logging handler by @oelachqar in #1810

New Contributors

@shanghongsim made their first contribution in #1781

Full Changelog: v0.2.0...v0.2.1

Contributors

oelachqar, stefanwebb, and 5 other contributors

Assets 2

23 Jun 19:04

wizeng23

v0.2.0

5ced77a

v0.2.0

Highlights

GRPO support for trl and verl trainers

Oumi now supports GRPO training for both the trl and verl libraries! This allows you to run GRPO training with no/low code using Oumi's configs. You can also benefit from other features of the Oumi platform, such as custom evaluation and launching remote jobs.

Running GRPO training in Oumi is as simple as:

Create a reward function, and register it to Oumi's reward function registry using @register("<my_reward_fn>", RegistryType.REWARD_FUNCTION).
Create a dataset class to process your HF dataset into the format needed for your target framework, and register it to Oumi's dataset registry using @register_dataset("@hf-org-name/my-dataset-name").
Create an Oumi training config with your model, dataset, reward function, and hyperparameters. For specific details on setting up the config for GRPO, see our documentation.
Launch the training job locally using the oumi train CLI, or launch a remote job using the oumi launch CLI.

For an end-to-end example using Oumi + trl, check out our notebook walkthrough. For verl, check out our multi-modal Geometry3K config. Finally, check out our blog post for more information.

Models built with Oumi: HallOumi and CoALM

We’re proud to announce the release of two models built with Oumi: HallOumi and CoALM! Both of these were trained on Oumi, and we provide recipes to reproduce their training from scratch.

🧀 HallOumi: A truly open-source claim verification (hallucination detection) model developed by Oumi, outperforming Claude Sonnet, OpenAI o1, DeepSeek R1, Llama 405B, and Gemini Pro at only 8B parameters. Check out the Oumi recipe to train the model here.
🤖 CoALM: Conversational Agentic Language Model (CoALM) is a a unified approach that integrates both conversational and agentic capabilities. It includes an instruction tuning dataset and three trained models (8B, 70B, 405B). The project was a partnership between the ConvAI Lab at UIUC and Oumi, and the paper was accepted to ACL. Check out the Oumi recipes to train the models here.

New model support: Llama 4, Qwen3, Falcon H1, and more

We’ve added support for many recent models to Oumi, with tested recipes that work out-of-the-box!

Vision Language Models
Text-to-text LLMs
- Falcon-H1 and Falcon-E
- Qwen3
- Phi-4-reasoning

Support for Slurm and Frontier clusters

At Oumi, we want unify and simplify the processes for running jobs on remote clusters. We have now added support for launching jobs on Slurm clusters, and on Frontier, a supercomputer at the Oak Ridge Leadership Computing Facility.

What's Changed

[bugfix] Allow prerelease when building docker image by @oelachqar in #1753
Update link to Oumi banner image in README by @wizeng23 in #1752
docs: add a badge and link to the social network Twitter by @Radovenchyk in #1751
Support OLCF (Oak Ridge Leadership Computing Facility) Frontier HPC cluster in Oumi launcher by @nikg4 in #1721
Judge API V2 | Core Functionality by @kaisopos in #1717
Update oumi distributed torchrun to fallback to oumi train -c cfg.yaml .... on a single-node with 1 GPU by @nikg4 in #1755
deps: Upgrade verl to 0.4.0 by @wizeng23 in #1749
add DCVLR logo to readme by @penfever in #1754
Judge API V2 | Few-Shots by @kaisopos in #1746
Update infer.md to fix a broken link by @ryan-arman in #1756
Judge API V2 | minor nit by @kaisopos in #1757
[Evaluation] Disabling flaky MMMU test by @kaisopos in #1758
Automatically tail SkyPilot logs by @wizeng23 in #1761
Enable vLLM for trl GRPO jobs by @wizeng23 in #1760
Judge API V2 | Implement CLI by @kaisopos in #1759
Updates to Oumi news for May, June by @stefanwebb in #1763
Additional news items by @stefanwebb in #1764
Judge API V2 | Support for built-in judges by @kaisopos in #1762
[bug] safetensors v0.6.0rc0 is causing a regression, prevent upgrading by @oelachqar in #1772
[verl] Support resuming from checkpoint by @wizeng23 in #1766
Upgrade accelerate and peft by @wizeng23 in #1774
[tiny] Pin flash-attn version by @wizeng23 in #1775
Pin the version of lm_eval to prevent a breaking change in the 4.9 release by @taenin in #1777
Update inference to resume from temporary result file when possible by @jgreer013 in #1734
[tiny] Fix gradient checkpointing for Oumi trainer by @wizeng23 in #1778
[tiny] Remove use_liger argument by @wizeng23 in #1779
Judge API V2 | Merge Judge and Inference configs by @kaisopos in #1776

Full Changelog: v0.1.14...v0.2.0

Contributors

oelachqar, stefanwebb, and 8 other contributors

Assets 2

10 Jun 20:55

wizeng23

v0.1.14

ca102c3

v0.1.14

What's Changed

Record latency histograms in base inference engine by @nikg4 in #1702
Feat: add falcon-e integration by @younesbelkada in #1705
[tiny] Minor update to fix the failing pre-commit checks by @oelachqar in #1707
Add collator kwargs field to DataParams by @oelachqar in #1708
[vision] Add option to process images individually by @oelachqar in #1706
Update dev_setup.md to correct the order of steps by @ryan-arman in #1709
Add configs for molmo support by @oelachqar in #1710
[tiny] fix pre-commits checks on a fresh install by @oelachqar in #1711
Add config for the Molmo O variant by @oelachqar in #1712
Add experimental molmo grpo config and train aliases by @oelachqar in #1713
Update installation.md to fix subversion handling by adding required … by @ryan-arman in #1715
Frontier: Fix -n param in launcher script by @nikg4 in #1720
Fix Falcon H1 dependency setup by @wizeng23 in #1723
letter count notebook improvements by @penfever in #1697
[vision] Update vision feature generator to support training on completions only by @oelachqar in #1722
[tiny] fix bug with vl collator by @oelachqar in #1725
Add data synthesis config, params, and unit tests by @jgreer013 in #1700
Add support for additional exception types for remote inference engine, as well as fast failing for non-retryable status codes. by @jgreer013 in #1704
Adds DPO + QLoRA example for Falcon-H1 by @stefanwebb in #1719
Update inference to always write intermediate results to file. by @jgreer013 in #1724
Added doc for new QLoRA param by @stefanwebb in #1727
Readme for Falcon-E and note on extra dependencies required by @stefanwebb in #1729
Add generic vision dataset by @oelachqar in #1726
[tiny][bug] make git cmd optional by @oelachqar in #1730
[tiny][bug] Add missing molmo feature by @oelachqar in #1731
[tiny] Update phi3-vision configs to use oumi trainer by @oelachqar in #1733
Minor bugfixes for 2 clouds in launcher code by @nikg4 in #1728
Update dev_setup.md to add additional instructions by @ryan-arman in #1736
Update trl to 0.18 by @wizeng23 in #1693
Update Verl trainer to export models in HF format by @nikg4 in #1714
Add lmms-lab/multimodal-open-r1-8k-verified dataset by @oelachqar in #1732
Add placeholders for DCVLR by @oelachqar in #1738
add debug logging capabilities to collators by @aniruddh-alt in #1678
[bug] update trainer to save processor when training with fsdp by @oelachqar in #1742
Add model revision param by @oelachqar in #1740
Add ability to customize HF model config via model.model_kwargs by @oelachqar in #1741
Add docker release workflow by @oelachqar in #1743
[bug] fix rank/local rank parsing for docker env by @oelachqar in #1747
deps: Update vLLM to 0.8.3 by @wizeng23 in #1739
[docs] update dcvlr readme by @oelachqar in #1748
Dcvlr by @penfever in #1750

New Contributors

@younesbelkada made their first contribution in #1705
@ryan-arman made their first contribution in #1709
@stefanwebb made their first contribution in #1719
@aniruddh-alt made their first contribution in #1678

Full Changelog: v0.1.13...v0.1.14

Contributors

oelachqar, stefanwebb, and 7 other contributors

Assets 2

Releases: oumi-ai/oumi

v0.7

Oumi 0.7 Release

✨ Highlights

🚀 New Features

Inference

Evaluation

Training

Data & Synthesis

Infrastructure

📈 Improvements

Performance

CLI

Documentation

Configs

⚠️ Breaking Changes

🐛 Bug Fixes

📦 Dependency Updates

👋 New Contributors

Contributors

Uh oh!

v0.6.0

Oumi v0.6.0 Changelog

Highlights

Python 3.13 Support

New oumi analyze CLI Command

TRL GOLD Trainer

Code Evaluation Judges

Kubernetes Deployment

Option 1: Using SkyPilot (new in this release)

Option 2: Direct kubectl deployment

Custom Master Port for Distributed Training

ARM Docker Images for Mac

Bug Fixes

Documentation

Deprecations

Dependencies

New Contributors

Contributors

Uh oh!

v0.5.0

Oumi v0.5.0 Release Notes

🚀 Major Features

Data Synthesis Module

Hyperparameter Tuning Module

Inference & Training Enhancements

HPC & Launcher Improvements

✨ Improvements

🐛 Bug Fixes

📦 Dependency Updates

🎉 New Contributors

📖 Full Changelog

Contributors

Uh oh!

v0.4.2

Release Notes - v0.4.2

🚀 New Features

🐛 Bug Fixes

🔧 Improvements

📚 Documentation

Uh oh!

v0.4.1

What's Changed

Contributors

Uh oh!

v0.4.0

Oumi v0.4 Changelog

✨ gpt-oss Training and Inference

⚡ DeepSpeed Support

🗄️ CLI Tool for Hugging Face Cache Management

🎯 Vision DPO and KTO Support

🛠️ Developer Experience

New Contributors

All Contributors

Contributors

Uh oh!

v0.3.0

Oumi v0.3 Changelog

🔧 Model Quantization (NEW)

⚖️ Judge API V2 (MAJOR UPDATE)

New `oumi analyze` CLI Command