fix: fix wrong param in SentenceChunker #370
base: dev
Conversation
Please pull the MemTensor:dev branch and resolve the conflicts, thank you. @Linorman

OK, I have already resolved the conflicts and merged them into my main branch.
Pull Request Overview
This PR updates the codebase to support both legacy and new API versions of the DynamicCache class from the transformers library. The new API uses a layers attribute with .keys and .values properties, while the legacy API uses key_cache and value_cache list attributes.
- Removes version-based branching (previously using `packaging.version` checks) in favor of runtime attribute detection
- Updates cache concatenation logic to handle both APIs through `hasattr()` checks
- Fixes parameter naming in Chonkie `SentenceChunker` initialization
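The runtime attribute detection described above can be sketched as follows. This is a minimal illustration with stand-in objects, not the PR's actual code; `uses_new_cache_api` is a hypothetical helper name, and the attribute names (`layers` vs. `key_cache`/`value_cache`) are the two `DynamicCache` API shapes the overview describes.

```python
# Sketch of runtime attribute detection for the two DynamicCache API shapes.
# Stand-in objects are used so no transformers/torch install is required.
from types import SimpleNamespace

def uses_new_cache_api(cache) -> bool:
    """True when the cache exposes the new `.layers` API,
    False when it exposes the legacy `.key_cache`/`.value_cache` lists."""
    return hasattr(cache, "layers")

# Minimal mocks of both API shapes:
new_style = SimpleNamespace(layers=[SimpleNamespace(keys=None, values=None)])
old_style = SimpleNamespace(key_cache=[], value_cache=[])

assert uses_new_cache_api(new_style)
assert not uses_new_cache_api(old_style)
```

Detecting the attribute at runtime avoids pinning the check to a specific `transformers` version string, which is the motivation for dropping the `packaging.version` branching.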
Reviewed Changes
Copilot reviewed 4 out of 4 changed files in this pull request and generated 2 comments.
| File | Description |
|---|---|
| src/memos/memories/activation/kv.py | Replaced version-based API detection with runtime attribute checking; removed unused imports; updated _concat_caches to mutate first cache in-place for new API |
| tests/memories/activation/test_kv.py | Added compatibility layer in test helper make_filled_cache() and assertions to support both old and new DynamicCache APIs |
| src/memos/mem_os/utils/format_utils.py | Updated serialization functions to detect and handle both DynamicCache API versions when extracting layer counts, device info, dtype, and tensor shapes |
| src/memos/chunkers/sentence_chunker.py | Changed parameter name from tokenizer_or_token_counter to tokenizer to match the expected API |
```diff
  base = caches[0]
  for layer in range(num_layers):
      # gather all K and V for this layer
      keys = [c.layers[layer].keys for c in caches]
      vals = [c.layers[layer].values for c in caches]
      # single concat per layer
-     merged.layers[layer].keys = torch.cat(keys, dim=-2)
-     merged.layers[layer].values = torch.cat(vals, dim=-2)
+     base.layers[layer].keys = torch.cat(keys, dim=-2)
+     base.layers[layer].values = torch.cat(vals, dim=-2)
  return base
```
Copilot AI · Nov 4, 2025
Mutating the first cache object in-place (caches[0]) is problematic because it modifies the original cache that may still be referenced elsewhere. This could lead to unexpected side effects if the caller expects the original caches to remain unchanged. Consider creating a new DynamicCache() object and populating its layers similar to the legacy API path.
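The non-mutating merge Copilot suggests can be sketched like this. Python lists stand in for tensors and list concatenation stands in for `torch.cat(..., dim=-2)`, so the example runs without torch; `merge_caches` is a hypothetical name, and the real code would populate a fresh `DynamicCache()` instead of a `SimpleNamespace`.

```python
# Torch-free sketch of the suggestion: build a fresh container rather than
# mutating caches[0], so the caller's original caches stay unchanged.
from types import SimpleNamespace

def merge_caches(caches):
    merged = SimpleNamespace(layers=[])  # fresh object; inputs untouched
    for layer in range(len(caches[0].layers)):
        keys = [c.layers[layer].keys for c in caches]
        vals = [c.layers[layer].values for c in caches]
        merged.layers.append(SimpleNamespace(
            keys=sum(keys, []),    # stand-in for torch.cat(keys, dim=-2)
            values=sum(vals, []),  # stand-in for torch.cat(vals, dim=-2)
        ))
    return merged

a = SimpleNamespace(layers=[SimpleNamespace(keys=[1], values=[2])])
b = SimpleNamespace(layers=[SimpleNamespace(keys=[3], values=[4])])
m = merge_caches([a, b])
assert m.layers[0].keys == [1, 3]
assert a.layers[0].keys == [1]  # input cache not mutated
```

The trade-off is an extra allocation per merge in exchange for referential safety: callers holding references to the input caches never observe them change underneath.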
```python
t = getattr(lyr, "keys", None)
if t is None:
    t = getattr(lyr, "values", None)
if t is not None:
    device_str = str(t.device)
    dtype_str = str(t.dtype)
```
Copilot AI · Nov 4, 2025
[nitpick] The variable name t is unclear and could be improved for readability. Consider renaming it to tensor or sample_tensor to better convey its purpose as a tensor used to determine device and dtype information.
```diff
- t = getattr(lyr, "keys", None)
- if t is None:
-     t = getattr(lyr, "values", None)
- if t is not None:
-     device_str = str(t.device)
-     dtype_str = str(t.dtype)
+ tensor = getattr(lyr, "keys", None)
+ if tensor is None:
+     tensor = getattr(lyr, "values", None)
+ if tensor is not None:
+     device_str = str(tensor.device)
+     dtype_str = str(tensor.dtype)
```
Description
Summary: Fix the parameter name passed to SentenceChunker to avoid a runtime error.
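Why a wrong keyword name is a runtime bug rather than a cosmetic one: Python raises `TypeError` at construction time for an unexpected keyword argument. A minimal illustration with a hypothetical stub class (not chonkie's real `SentenceChunker`, whose accepted parameters depend on the installed version):

```python
# Hypothetical stub showing the failure mode of a mismatched keyword name.
class SentenceChunkerStub:
    def __init__(self, tokenizer=None, chunk_size=512):
        self.tokenizer = tokenizer
        self.chunk_size = chunk_size

ok = SentenceChunkerStub(tokenizer="gpt2")  # matches the signature

raised = False
try:
    SentenceChunkerStub(tokenizer_or_token_counter="gpt2")
except TypeError:
    raised = True  # unexpected keyword argument fails immediately
assert raised
```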
Fix: #(issue)
Docs Issue/PR: (docs-issue-or-pr-link)
Reviewer: @(reviewer)
Checklist: