Skip to content

Commit da2ce40

Browse files
PatrykWomichalkuligowskimhelf-intel
authored
0.16.0 release docs -fix (#1111)
Signed-off-by: PatrykWo <patryk.wolsza@intel.com> Signed-off-by: mhelf-intel <monika.helfer@intel.com> Signed-off-by: Patryk Wolsza <patryk.wolsza@intel.com> Co-authored-by: Michał Kuligowski <michal.kuligowski@intel.com> Co-authored-by: Monika Helfer <monika.helfer@intel.com>
1 parent 27a10d4 commit da2ce40

File tree

1 file changed

+0
-4
lines changed

1 file changed

+0
-4
lines changed

docs/release_notes_v0.16.0.md

Lines changed: 0 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -75,10 +75,6 @@ This release is based on [vLLM v0.16.0](https://github.com/vllm-project/vllm/rel
7575
- Fix sampler & TP>1 recompilations ([#935](https://github.com/vllm-project/vllm-gaudi/pull/935))
7676
- Restore default `temperature=0` for the server after #32723 ([#1037](https://github.com/vllm-project/vllm-gaudi/pull/1037))
7777

78-
## Security
79-
80-
When using non-default values for `presence_penalty`, `frequency_penalty`, or `repetition_penalty`, vLLM-Gaudi on `releases/v0.15.1` (or older) crashes during sampling because the internal tensor padding utility relies on NumPy, which produces tensors incompatible with `torch.compile` on HPU. The fix (PR #893) replaces the NumPy-based padding with a pure PyTorch implementation and is already included in `releases/v0.16.0` and `main`.
81-
8278
## Full Changelog
8379

8480
| PR | Title | Author |

0 commit comments

Comments
 (0)