@dependabot dependabot bot commented on behalf of github Jul 15, 2025

Bumps sglang[all] from 0.4.3.post3 to 0.4.9.post2.

Release notes

Sourced from sglang[all]'s releases.

Release v0.4.8

Highlights

OpenAI-Compatible Server Refactor

Re-structured the OpenAI-compatible server to support production and enterprise environments. Key improvements include:

  • Consistent metrics and logging for better observability and debugging.

  • Unified error handling, request validation, and processing logic for improved reliability and maintainability.

  • Improved request tracking across sessions and components.

  • Fixed bugs in embedding requests and reasoning parsers.

This work was a collaborative effort involving engineers from academic and industry institutions. Special thanks to the Oracle Cloud team and the SGLang team and community — including @​slin1237, @​CatherineSue, @​key4ng, @​JustinTong0323, @​jhinpan, @​yhyang201, @​woodx9 and @​whybeyoung — for their invaluable contributions.

DeepSeek R1 FP4 on Blackwell GPU

Added support for DeepSeek R1 with FP4 and MTP on NVIDIA Blackwell GPU.

  • Integrated FlashInfer NVFP4 MoE, supporting TP, EP, and DP.

  • Supported 2-stream shared expert execution.

  • Achieved up to 90 TPS per user at isl/osl/bs = 1k/1k/16 on B200.

Further optimization in progress. Special thanks to the FlashInfer, NVIDIA Enterprise Products, Novita AI, DataCrunch, Google Cloud, and SGLang teams — especially @​Alcanderian and @​pyc96 — for their critical contributions.

Breaking Change: OpenAI-Compatible API Module Moved

The sglang/srt/openai_api directory has been removed and replaced with sglang/srt/entrypoints/openai.

Update your imports to the new module path. For example:

- from sglang.srt.openai_api.protocol import Tool
+ from sglang.srt.entrypoints.openai.protocol import Tool
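
For code that has to run against both the old and the new sglang layout during the upgrade, a fallback import is one option. The sketch below is illustrative only: `import_first_available` is a hypothetical helper, not part of sglang, and the two module paths are the ones named in the release note above.

```python
import importlib

# Module paths taken from the release note: new location first, old as fallback.
NEW_PROTOCOL = "sglang.srt.entrypoints.openai.protocol"
OLD_PROTOCOL = "sglang.srt.openai_api.protocol"


def import_first_available(*module_paths):
    """Return the first module in module_paths that can be imported.

    Hypothetical helper for illustration; raises ImportError if none import.
    """
    errors = []
    for path in module_paths:
        try:
            return importlib.import_module(path)
        except ImportError as exc:  # also covers ModuleNotFoundError
            errors.append(f"{path}: {exc}")
    raise ImportError(
        "none of the candidate modules could be imported:\n" + "\n".join(errors)
    )


# Usage (requires sglang to be installed, so it is left commented out here):
# protocol = import_first_available(NEW_PROTOCOL, OLD_PROTOCOL)
# Tool = protocol.Tool
```

Once every environment is on a release that ships the new path, the fallback can be dropped in favour of the plain import shown in the diff.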

What's Changed

... (truncated)

Commits
  • eb118d8 chore: bump v0.4.9.post2 (#7963)
  • 732fc8e chore: upgrade sgl-kernel 0.2.5 (#7971)
  • f2d5c49 [router] add worker abstraction (#7960)
  • 2a2d347 Fix wrong gemm branch cause 250us slower (#7969)
  • aa20560 delete uselese code caused by fuse allreduce+add_rmsnorm pr (#7970)
  • 61bb285 chore: upgrade xgrammar 0.1.21 (#7962)
  • 880221b Revert "[PD Disaggregation] replace transfer with batch transfer for better p...
  • 8f3173d chore: bump sgl-kernel v0.2.5 (#7964)
  • 26118a1 [fix]Update unitest for fp8_blockwise_scaled_grouped_mm kernel (#7932)
  • 475a249 temporarily disable deepep-8-gpu and activate two small tests (#7961)
  • Additional commits viewable in compare view

Dependabot compatibility score

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

  • @dependabot rebase will rebase this PR
  • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
  • @dependabot merge will merge this PR after your CI passes on it
  • @dependabot squash and merge will squash and merge this PR after your CI passes on it
  • @dependabot cancel merge will cancel a previously requested merge and block automerging
  • @dependabot reopen will reopen this PR if it is closed
  • @dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
  • @dependabot show <dependency name> ignore conditions will show all of the ignore conditions of the specified dependency
  • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Bumps [sglang[all]](https://github.com/sgl-project/sglang) from 0.4.3.post3 to 0.4.9.post2.
- [Release notes](https://github.com/sgl-project/sglang/releases)
- [Commits](sgl-project/sglang@v0.4.3.post3...v0.4.9.post2)

---
updated-dependencies:
- dependency-name: sglang[all]
  dependency-version: 0.4.9.post2
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
@dependabot dependabot bot added the dependencies and python labels Jul 15, 2025
dependabot bot commented on behalf of github Aug 5, 2025

Superseded by #13.

@dependabot dependabot bot closed this Aug 5, 2025
@dependabot dependabot bot deleted the dependabot/pip/sglang-all--0.4.9.post2 branch August 5, 2025 02:34