[hybrid dev] HF GDN and Kimi Linear layers for vllm benchmarking #390

oleksost · 2025-11-17T19:31:24Z

✨ Description

Added Kimi linear and gated DeltaNet layers in order to create checkpoints for vllm throughout benchmarking. This required updating transformers to a newer version (4.57.1).

Inference with the newly added layers using transformers backend has not been tested.

🔍 Type of change

Select all that apply:

🐛 Bug fix (non-breaking change that addresses a specific issue)
🚀 New feature (non-breaking change that adds functionality)
⚠️ Breaking change (a change that could affect existing functionality)
📈 Performance improvement/optimization (improves speed, memory usage, or efficiency)
🛠️ Code refactor (non-functional changes that improve code readability, structure, etc.)
📦 Dependency bump (updates dependencies, including Dockerfile or package changes)
📝 Documentation change (updates documentation, including new content or typo fixes)
🔧 Infrastructure/Build change (affects build process, CI/CD, or dependencies)

📝 Changes

List the key changes introduced in this PR:

Change A
Change B

✅ Checklist

Make sure the following tasks are completed before submitting the PR:

General

📜 I have read and followed the contributing guidelines.
🏷️ I am using a clear and descriptive PR title that summarizes the key change or feature introduced.
🎉 The functionality is complete, and I have tested the changes.
📝 I have updated the documentation if needed.
⚠️ The change does not introduce any new issues (e.g., runtime warnings, type checker errors, linting problems, unhandled edge cases).
🧩 I have commented my code, especially in hard-to-understand areas.

Dependencies and Configuration

🐋 I have updated the Docker configuration or dependencies, if applicable.
🔄 I have ensured compatibility with the existing setup after dependency changes.

Testing

🧪 I have added or updated tests to cover my changes.
✔️ New and existing tests pass locally with my changes.
🚦 I have tested these changes on GPUs and verified training stability.
🏋️ I have tested the changes on realistic training workloads, if applicable.

Performance Impact

📊 I have run benchmarks where applicable to evaluate the performance impact.
✅ The benchmarks show no performance regression.
🚀 The benchmarks indicate a potential performance improvement.
⚠️ The benchmarks indicate a potential performance degradation.
📈 I have provided benchmark results and detailed any performance impact below, if applicable.

📊 Performance Impact Details

If there is any impact on performance, describe it and provide benchmark results, if applicable:

🗒️ Additional Notes

Include any additional context, information, or considerations here, such as known issues, follow-up tasks, or backward compatibility concerns.

…nto hybrid_dev

…for vllm

oleksost added 3 commits September 16, 2025 09:02

wip

00111f0

Merge branch 'hybrid_dev' of https://github.com/ServiceNow/Fast-LLM i…

abecb2d

…nto hybrid_dev

enable creation of chewckpoints with gated delta net and kimi linear …

3d81a7b

…for vllm

oleksost changed the title ~~Hybrid dev: HF GDN and Kimi Linear layers for vllm benchmarking~~ [hybrid dev] HF GDN and Kimi Linear layers for vllm benchmarking Nov 17, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[hybrid dev] HF GDN and Kimi Linear layers for vllm benchmarking #390

[hybrid dev] HF GDN and Kimi Linear layers for vllm benchmarking #390

Uh oh!

oleksost commented Nov 17, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[hybrid dev] HF GDN and Kimi Linear layers for vllm benchmarking #390

Are you sure you want to change the base?

[hybrid dev] HF GDN and Kimi Linear layers for vllm benchmarking #390

Uh oh!

Conversation

oleksost commented Nov 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✨ Description

🔍 Type of change

📝 Changes

✅ Checklist

General

Dependencies and Configuration

Testing

Performance Impact

📊 Performance Impact Details

🗒️ Additional Notes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

oleksost commented Nov 17, 2025 •

edited

Loading