@talentJay-ux
Contributor

  1. Fix a bug caused by the KV cache and the GPT model's ptr pointer not being reset when window_size > context_length
  2. Fix a bug caused by the KV cache and the GPT model's ptr pointer not being reset
  3. Fix the KV cache import issue in the unit test for gpt_with_kv_cache_optimized
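
The kind of reset described in items 1 and 2 can be illustrated with a minimal, dependency-free sketch. It is not the code from this PR: `ToyKVCache`, `ToyModel`, `step`, `current_pos`, and `reset_kv_cache` are hypothetical names chosen for illustration, and clamping `window_size` to `context_length` is only one plausible guard, assumed here rather than taken from the actual fix.

```python
class ToyKVCache:
    """Hypothetical sliding-window cache; not the repository's implementation."""

    def __init__(self, window_size, context_length):
        # Assumed guard: the window may never exceed the context length.
        self.window_size = min(window_size, context_length)
        self.keys = []
        self.ptr = 0  # counts tokens written since the last reset

    def append(self, key):
        self.keys.append(key)
        # Keep only the most recent window_size entries.
        self.keys = self.keys[-self.window_size:]
        self.ptr += 1

    def reset(self):
        # Clear the stored entries and the pointer together.
        self.keys = []
        self.ptr = 0


class ToyModel:
    """Hypothetical model wrapper that tracks its own position offset."""

    def __init__(self, window_size, context_length):
        self.cache = ToyKVCache(window_size, context_length)
        self.current_pos = 0

    def step(self, token):
        # Record the token at the current position, then advance.
        pos = self.current_pos
        self.cache.append((pos, token))
        self.current_pos += 1
        return pos

    def reset_kv_cache(self):
        # Reset both the cache and the model-side position counter.
        self.cache.reset()
        self.current_pos = 0


model = ToyModel(window_size=8, context_length=4)
first = [model.step(t) for t in "abc"]
model.reset_kv_cache()  # without this, the second run would start at position 3
second = [model.step(t) for t in "xy"]
print(first, second)
```

In this sketch, skipping `reset_kv_cache()` between the two runs would leave stale entries in the cache and make the second sequence start from a nonzero position, which is the class of problem the items above describe.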

1. Fix a bug caused by the KV cache and the GPT model's ptr pointer not being reset when window_size > context_length
2. Fix a bug caused by the KV cache and the GPT model's ptr pointer not being reset
3. Fix the KV cache import issue for gpt_with_kv_cache_optimized
@rasbt
Owner

rasbt commented Dec 15, 2025

Thanks for the PR, and thanks also for the unit test, that's very helpful!

@rasbt rasbt left a comment

This is a very carefully crafted PR. Thank you for this!

@rasbt rasbt merged commit d7f178d into rasbt:main Dec 16, 2025
13 checks passed
SethHWeidman pushed a commit to SethHWeidman/LLMs-from-scratch that referenced this pull request Dec 21, 2025