Skip to content

Conversation

ggerganov
Copy link
Member

@ggerganov ggerganov commented Aug 21, 2025

target #15472

With the adoption of ggml_set_rows() it is no longer necessary to do defragmentation of the KV cache - the batch data is now placed non-contiguously into the KV buffers.

@ggerganov ggerganov requested a review from ngxson as a code owner August 21, 2025 13:07
@github-actions github-actions bot added script Script related examples python python script changes server labels Aug 21, 2025
@ggerganov ggerganov force-pushed the gg/kv-self-remove-api branch from 767616f to 3a3a93d Compare August 21, 2025 15:51
Base automatically changed from gg/kv-self-remove-api to master August 21, 2025 16:13
@ggerganov ggerganov merged commit 9ebebef into master Aug 22, 2025
100 of 109 checks passed
@ggerganov ggerganov deleted the gg/remove-defrag branch August 22, 2025 09:22
qnixsynapse pushed a commit to menloresearch/llama.cpp that referenced this pull request Aug 25, 2025
Nexesenex added a commit to Nexesenex/croco.cpp that referenced this pull request Oct 6, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

examples python python script changes script Script related server

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant