
Commit e049c47

Update deterministic blog (#204)
1 parent 30d0b9b commit e049c47

1 file changed (+3, −2 lines)


blog/2025-09-22-sglang-deterministic.md

Lines changed: 3 additions & 2 deletions
@@ -141,7 +141,7 @@ The standard chunking strategy operates on a "best-effort" principle. In this ex
 Attention kernel is an important part of determinism. For different attention backends, we modified them in different ways to satisfy their usage requirements.
 - For Flashinfer backend, we utilize the `fixed_split_size` and `disable_kv_split` arguments from [batch invariant FA2 kernels](https://github.com/flashinfer-ai/flashinfer/pull/1675) to fix split sizes during kernel planning. Truncation of chunked prefill is aligned to the prefill split size. ([PR link](https://github.com/sgl-project/sglang/pull/10645))
 - For FlashAttention-3 backend, num-splits of flash attention kernel are fixed to 1 to ensure determinism. ([PR link](https://github.com/sgl-project/sglang/pull/10651))
-- For Triton backend, we fix the split size of decoding, and manually set the alignment size of chunked prefill. ([PR link](https://github.com/sgl-project/sglang/pull/10694))
+- For Triton backend, we fix the split size of decoding, and manually set the alignment size of chunked prefill. Deterministic inference can also run on **AMD** hardware with the extensibility of Triton backend. ([PR link](https://github.com/sgl-project/sglang/pull/10694)).

 ### Reproducible Non-Greedy Sampling
@@ -173,5 +173,6 @@ Our future efforts will focus on enhancing deterministic inference by addressing
 We would like to extend our heartfelt gratitude to the following teams and collaborators:
 - **SGLang team and community**: Baizhou Zhang, Biao He, Qiaolin Yu, Xinyuan Tong, Ke Bao, Yineng Zhang, Chi Zhang, Ying Sheng, Lianmin Zheng and many others
 - **Flashinfer team and community**: Wenxuan Tan, Yilong Zhao, Zihao Ye
-- **slime team and community**: Yusheng Su, Zilin Zhu
+- **slime team and community**: Zilin Zhu
+- **AMD**: Yusheng Su
 - **Thinking Machines Lab**: for their awesome [blog](https://thinkingmachines.ai/blog/defeating-nondeterminism-in-llm-inference/) and [batch_invariant_ops library](https://github.com/thinking-machines-lab/batch_invariant_ops)
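For readers skimming the diff, below is a minimal sketch of the principle behind the fixed-split-size changes, in plain NumPy rather than SGLang's or FlashInfer's actual kernel code (the `attn_decode` helper and every shape in it are illustrative assumptions). Floating-point addition is non-associative, so if the kernel planner picks a different KV split schedule at a different batch size, the reduction order changes and the output bits change with it; pinning the split size pins the schedule.

```python
# Illustrative sketch only: not SGLang/FlashInfer kernel code. Shows why a
# fixed KV split size makes single-query decode attention batch-invariant.
import numpy as np

def attn_decode(q, k, v, split_size):
    """Single-query decode attention, reduced over the KV cache in chunks of
    `split_size`, merging partials with the usual online-softmax update."""
    m = -np.inf                   # running max of attention logits
    denom = np.float32(0.0)       # running softmax denominator
    out = np.zeros_like(v[0])     # running weighted sum of values
    for s in range(0, k.shape[0], split_size):
        logits = k[s:s + split_size] @ q
        m_new = max(m, logits.max())
        scale = np.exp(np.float32(m - m_new))  # rescale old accumulators
        p = np.exp(logits - m_new)
        denom = denom * scale + p.sum()
        out = out * scale + p @ v[s:s + split_size]
        m = m_new
    return out / denom

rng = np.random.default_rng(0)
q = rng.standard_normal(64).astype(np.float32)
k = rng.standard_normal((4096, 64)).astype(np.float32)
v = rng.standard_normal((4096, 64)).astype(np.float32)

a = attn_decode(q, k, v, split_size=256)  # schedule a planner might pick at batch size 1
b = attn_decode(q, k, v, split_size=512)  # schedule it might pick at a larger batch size
c = attn_decode(q, k, v, split_size=256)  # fixed split size: same schedule every run

print(np.array_equal(a, b))  # typically False: different reduction order, different bits
print(np.array_equal(a, c))  # True: identical schedule gives bitwise-identical output
```

Fixing num-splits to 1, as the FlashAttention-3 change does, is the limiting case of the same idea: a single split admits only one reduction order, trading some parallelism for invariance.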
