
Conversation

h-guo18
Contributor

@h-guo18 h-guo18 commented Sep 24, 2025

What does this PR do?

Type of change: new example

Overview:

  • Add an example of offline training to README.md;
  • Refactor README.md: move optional advanced usage to the end of the file for better clarity and readability.

Usage

# Add a code snippet demonstrating how to use this
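The template snippet above was left unfilled. As a purely hypothetical sketch of the two-step offline workflow this PR documents (only the `--offline-data` flag, the hidden-state dump step, and the Daring-Anteater dataset come from the PR text; the script names `dump_hidden_states.py` and `train.py` are assumptions, not verified against the repository):

```shell
# Hypothetical sketch: dump_hidden_states.py and train.py are assumed names.
# The --offline-data flag is the one this PR's README changes introduce.
HIDDEN_STATES_DIR=./hidden_states

# Step 1 (offline prep): run the base model once and cache its hidden states.
DUMP_CMD="python dump_hidden_states.py --data Daring-Anteater/train.jsonl --output $HIDDEN_STATES_DIR"

# Step 2: train the draft model against the cached states instead of a live base model.
TRAIN_CMD="accelerate launch train.py --data Daring-Anteater/train.jsonl --offline-data $HIDDEN_STATES_DIR"

echo "$DUMP_CMD"
echo "$TRAIN_CMD"
```

The echoed commands are a dry-run illustration only; consult the updated README in `examples/speculative_decoding` for the real invocations.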

Testing

Before your PR is "Ready for review"

  • Make sure you read and follow Contributor guidelines and your commits are signed.
  • Is this change backward compatible?: Yes/No
  • Did you write any new necessary tests?: Yes/No
  • Did you add or update any necessary documentation?: Yes/No
  • Did you update Changelog?: Yes/No

Additional Information

Summary by CodeRabbit

  • Documentation
    • Reorganized README with two workflows: Training Draft Model with Online Base Model and Offline Base Model.
    • Expanded Table of Contents (Online/Offline Training, After Training, Advanced Usage) and updated Simplified Workflow to reference EAGLE.
    • Added concrete command examples for training (including Accelerate, ModelOpt, PTQ, QAT), validation (e.g., MT-Bench), and export.
    • Introduced deployment guidance for TRT-LLM and SGLang with configuration examples and links.
    • Applied minor wording and heading refinements for clarity.

Signed-off-by: h-guo18 <[email protected]>

copy-pr-bot bot commented Sep 24, 2025

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.


coderabbitai bot commented Sep 24, 2025

Walkthrough

The README for examples/speculative_decoding was reorganized: “Complete Workflow” was replaced by separate online/offline draft-model training workflows, with new commands for training, validation, export, and deployment. It adds an expanded table of contents, concrete Accelerate examples, hidden-state dumping for offline use, and deployment notes for TRT-LLM and SGLang.

Changes

Cohort / File(s) Summary of Changes
Docs: Speculative Decoding README
examples/speculative_decoding/README.md
Replaced “Complete Workflow” with Online/Offline training workflows; added commands for training, validation (MT-Bench), and export; documented hidden-state dumping and --offline-data; expanded contents; added deployment sections (TRT-LLM, SGLang) with examples; minor wording/heading updates and link adjustments.
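The hidden-state dumping step described above can be sketched in a few lines. This is a hedged, stdlib-only illustration: the real trainer's file naming and serialization format are not specified here, and `forward_fn` stands in for the actual base-model forward pass.

```python
import json
import pickle
from pathlib import Path


def dump_hidden_states(data_path, out_dir, forward_fn):
    """Sketch of the offline-prep step: run the base model over each sample
    and cache its hidden states, one file per sample, in dataset order.

    `forward_fn` is a stand-in for the real base-model forward pass, and the
    zero-padded .pkl naming is an assumed convention, not the trainer's actual
    layout.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    with open(data_path) as f:
        nonempty = (line for line in f if line.strip())
        for idx, line in enumerate(nonempty):
            sample = json.loads(line)
            states = forward_fn(sample)  # real code: base-model forward pass
            with open(out / f"{idx:08d}.pkl", "wb") as g:
                pickle.dump(states, g)
    return out
```

Writing one file per sample in dataset order is what later lets an offline trainer pair each JSONL example with its cached activations by index.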

Sequence Diagram(s)

```mermaid
sequenceDiagram
  autonumber
  actor User
  participant DraftTrainer as Draft Trainer
  participant BaseModel as Base Model (Online)
  participant Validator as Validator (MT-Bench)
  participant Exporter as Checkpoint Export

  rect rgba(200,235,255,0.25)
    note over User,DraftTrainer: Online Training Flow
    User->>DraftTrainer: Launch accelerate training
    DraftTrainer->>BaseModel: Forward pass for hidden states
    DraftTrainer-->>User: Trained draft model checkpoint
  end

  rect rgba(220,255,220,0.25)
    note over User,Validator: Validation
    User->>Validator: Evaluate trained checkpoint
    Validator-->>User: MT-Bench metrics
  end

  rect rgba(255,240,200,0.25)
    note over User,Exporter: Export
    User->>Exporter: Export trained checkpoint
    Exporter-->>User: Deployment-ready artifacts
  end
```
```mermaid
sequenceDiagram
  autonumber
  actor User
  participant Dumper as Hidden State Dumper
  participant Storage as Offline Store
  participant DraftTrainer as Draft Trainer (Offline)
  participant Validator as Validator
  participant Exporter as Export

  rect rgba(200,235,255,0.25)
    note over User,Dumper: Offline Preparation
    User->>Dumper: Run base model to dump hidden states
    Dumper->>Storage: Save hidden states
  end

  rect rgba(220,255,220,0.25)
    note over User,DraftTrainer: Offline Training Flow
    User->>DraftTrainer: Train with --offline-data
    DraftTrainer->>Storage: Load hidden states
    DraftTrainer-->>User: Trained draft model checkpoint
  end

  rect rgba(255,240,200,0.25)
    note over User,Validator: Validation and Export
    User->>Validator: Evaluate checkpoint
    Validator-->>User: Results
    User->>Exporter: Export checkpoint
    Exporter-->>User: Artifacts for TRT-LLM / SGLang
  end
```

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~10 minutes

Poem

I thump my paws on pages new,
Two trails diverge: online, offline too.
I nibble commands, carrots of code,
Hidden states packed for the warren’s load.
Benchmarks hop, exports flow—
To TRT and SGLang we go!
— a cheerful doc-savvy bunny 🐇

Pre-merge checks and finishing touches

✅ Passed checks (3 passed)
Check name Status Explanation
Description Check ✅ Passed Check skipped - CodeRabbit’s high-level summary is enabled.
Title Check ✅ Passed The title accurately summarizes the primary change by indicating that an example of offline EAGLE training commands has been added to the README, which matches the PR’s main content and intent.
Docstring Coverage ✅ Passed No functions found in the changes. Docstring coverage check skipped.

@h-guo18 h-guo18 changed the title from “Haoguo/offlin example” to “Example: add offline eagle training commands to README” Sep 24, 2025
@h-guo18 h-guo18 self-assigned this Sep 24, 2025
@h-guo18 h-guo18 marked this pull request as ready for review September 24, 2025 22:51
@h-guo18 h-guo18 requested a review from a team as a code owner September 24, 2025 22:51
@h-guo18 h-guo18 marked this pull request as draft September 24, 2025 22:54

@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 0

🧹 Nitpick comments (1)
examples/speculative_decoding/README.md (1)

98-109: Clarify dataset alignment when using cached hidden states

When we pass cached hidden states, the trainer still streams the original JSONL so it can pair each example with the corresponding hidden-state file. If --data points at a different file (or even a differently ordered clone), training dies with shape/length assertions. Adding a one-line reminder here will save readers a painful debugging roundtrip.

Apply this diff to add the clarification:

```diff
 Then, train draft model with `--offline-data` argument:
@@
             --offline-data $HIDDEN_STATES_DIR

+> Note: Ensure the path supplied via --data matches the dataset you used when dumping hidden states (for example, Daring-Anteater/train.jsonl). A mismatch will cause ordering/length errors when the trainer pairs samples with cached activations.
```
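The alignment concern raised in this nitpick can be made mechanical. Below is a hedged, stdlib-only sketch of a pre-flight check that fails fast before training; the one-`.pkl`-file-per-sample convention is an assumption for illustration, not the trainer's actual on-disk layout.

```python
import json
from pathlib import Path


def check_offline_alignment(data_path, hidden_states_dir):
    """Fail fast if the JSONL passed via --data does not line up one-to-one
    with the cached hidden-state files (assumed: one .pkl file per sample)."""
    with open(data_path) as f:
        num_samples = sum(1 for line in f if line.strip())
    num_dumped = len(list(Path(hidden_states_dir).glob("*.pkl")))
    if num_dumped != num_samples:
        raise ValueError(
            f"{num_samples} samples in {data_path} but {num_dumped} "
            f"hidden-state files in {hidden_states_dir}; was a different "
            "or reordered dataset used when dumping?"
        )
    return num_samples
```

A count comparison only catches the length half of the problem; detecting a reordered clone of the same dataset would additionally require checking per-sample identifiers or content hashes.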



📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits
Reviewing files that changed from the base of the PR and between 26c203abde6ea430dbb84e1f13e5673cd86a15bd and 4eda3e240cceae20356093ad4462491f1d102a45.

📒 Files selected for processing (1)
  • examples/speculative_decoding/README.md (6 hunks)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (2)
  • GitHub Check: linux
  • GitHub Check: code-quality


@h-guo18 h-guo18 marked this pull request as ready for review September 24, 2025 22:59

codecov bot commented Sep 24, 2025

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 73.46%. Comparing base (26c203a) to head (4eda3e2).
⚠️ Report is 1 commits behind head on main.

Additional details and impacted files
```
@@           Coverage Diff           @@
##             main     #366   +/-   ##
=======================================
  Coverage   73.46%   73.46%
=======================================
  Files         172      172
  Lines       17640    17640
=======================================
  Hits        12959    12959
  Misses       4681     4681
```

☔ View full report in Codecov by Sentry.

@h-guo18 h-guo18 marked this pull request as draft September 24, 2025 23:03
@h-guo18 h-guo18 marked this pull request as ready for review September 24, 2025 23:14
@ChenhanYu ChenhanYu merged commit 4ff8fc9 into main Sep 24, 2025
27 checks passed
@ChenhanYu ChenhanYu deleted the haoguo/offlin-example branch September 24, 2025 23:41
yeyu-nvidia pushed a commit that referenced this pull request Oct 1, 2025