feat(eval): support memos api mode #116

Merged

CaralHsi merged 38 commits into MemTensor:dev from

Jul 24, 2025

Contributor

Duguce commented Jul 17, 2025 (edited)

Description

Summary: Fixes several evaluation bugs, including the search top-k handling.

Fix: #(issue)

Reviewer: @hush-cd

Checklist:

I have performed a self-review of my own code
I have commented my code in hard-to-understand areas
I have added tests that prove my fix is effective or that my feature works
I have added necessary documentation (if applicable)
I have linked the issue to this PR (if applicable)
I have mentioned the person who will review this PR

Duguce and others added 29 commits

July 7, 2025 22:43


          feat(eval): add eval dependencies

4ef7418


          feat(eval): add configs example

ddad8c1


          docs(eval): update README.md

d623791


          Merge branch 'MemTensor:dev' into dev

3365de4


          feat(eval): remove the dependency (pydantic)

ed68e36


          Merge branch 'MemTensor:dev' into dev

4e99031


          feat(eval): add run locomo eval script

41368b9


          Merge branch 'MemTensor:dev' into dev


          fix(eval): delete about memos redundant search branches

8cd9361


          chore: fix format

a900910


          Merge branch 'MemTensor:dev' into dev

204bd27


          Merge branch 'MemTensor:dev' into dev

5d68ed9


          feat(eval): add openai memory on locomo - eval guide

42e9366


          Merge branch 'dev' into dev

f3d1d5d


          Merge branch 'MemTensor:dev' into dev

9bada64


          docs(eval): modify openai memory on locomo - eval guide


          Merge branch 'MemTensor:dev' into dev

dd2b2c5


          Merge branch 'MemTensor:dev' into dev

7a60c33


          Merge branch 'MemTensor:dev' into dev

ed86648


          Merge branch 'MemTensor:dev' into dev

aaab1ce


          Merge branch 'MemTensor:dev' into dev

710d4db


           feat(eval): add longmemeval evaluation pipeline

c98ded4


          chore(eval): formatter

79a5bce


          chore: update

37e2933


          feat(eval): add configs example

445c855


          Merge branch 'MemTensor:dev' into dev

82e60b5


          fix(eval): bugs about longmemeval

09b5a72


          Merge branch 'MemTensor:dev' into dev

efd2c0d


          fix(eval): search top k

fc0005a

Copilot AI review requested due to automatic review settings

July 17, 2025 10:40

This comment was marked as outdated.


Duguce and others added 3 commits

July 17, 2025 18:42


          chore(eval): update

bf11ea7


          Merge branch 'MemTensor:dev' into dev

715b399


          feat(eval): support memos api mode

0d0d037

Duguce changed the title from "fix(eval): fix search top k" to "feat(eval): support memos api mode"

Contributor Author

Duguce commented Jul 21, 2025

@hush-cd could you help review this PR? Thanks!

Duguce requested a review from Copilot

July 21, 2025 02:11

Copilot AI reviewed


Contributor

Copilot AI left a comment

Pull Request Overview

This PR adds support for a new "memos-api" mode to the evaluation framework, enabling testing of MemOS functionality through API calls instead of just local mode. The changes expand the evaluation capabilities to include both local and API-based testing scenarios.

Introduces new MemOSAPI client for HTTP-based communication with MemOS services
Updates all evaluation scripts to support "memos-api" mode alongside existing modes
Fixes search functionality by implementing proper top_k parameter handling

Reviewed Changes

Copilot reviewed 13 out of 13 changed files in this pull request and generated 6 comments.


| File | Description |
| --- | --- |
| evaluation/scripts/utils/memos_api.py | New MemOSAPI client implementation for HTTP-based MemOS operations |
| evaluation/scripts/utils/client.py | Adds API mode support to memos_client function |
| evaluation/scripts/run_lme_eval.sh | Updates evaluation script configuration and adds new pipeline steps |
| evaluation/scripts/longmemeval/*.py | Updates all LongMemEval scripts to support memos-api mode |
| evaluation/scripts/locomo/*.py | Updates all LoCoMo scripts to support memos-api mode |

evaluation/scripts/utils/memos_api.py

+                      """Register a user."""
+                      url = f"{self.base_url}/users/register"
+                      payload = json.dumps({"user_id": user_id})
+                      response = requests.request("POST", url, data=payload, headers=self.headers)

Copilot AI Jul 21, 2025

[nitpick] Use requests.post() instead of requests.request("POST", ...) for better readability and consistency with HTTP method naming conventions.

Suggested change

      
-                      response = requests.request("POST", url, data=payload, headers=self.headers)
+                      response = requests.post(url, data=payload, headers=self.headers)

Copilot uses AI. Check for mistakes.

evaluation/scripts/utils/memos_api.py

+                      url = f"{self.base_url}/add"
+                      payload = json.dumps({"messages": messages, "user_id": user_id, "mem_cube_id": cube_id})
+                      response = requests.request("POST", url, data=payload, headers=self.headers)

Copilot AI Jul 21, 2025

[nitpick] Use requests.post() instead of requests.request("POST", ...) for better readability and consistency with HTTP method naming conventions.

Suggested change

      
-                      response = requests.request("POST", url, data=payload, headers=self.headers)
+                      response = requests.post(url, data=payload, headers=self.headers)


evaluation/scripts/utils/memos_api.py

+                          }
+                      )
+                      response = requests.request("POST", url, data=payload, headers=self.headers)

Copilot AI Jul 21, 2025

[nitpick] Use requests.post() instead of requests.request("POST", ...) for better readability and consistency with HTTP method naming conventions.

Suggested change

      
-                      response = requests.request("POST", url, data=payload, headers=self.headers)
+                      response = requests.post(url, data=payload, headers=self.headers)


evaluation/scripts/utils/memos_api.py

+                      response = requests.request("POST", url, data=payload, headers=self.headers)
+                      return response.text
+                  def search(self, query: str, user_id: str | None = None, top_k: int = 10):

Copilot AI Jul 21, 2025

The top_k parameter is not included in the API request payload, but it's used to slice the results locally. This may cause inconsistent behavior if the API returns fewer than top_k results or if server-side filtering would be more efficient.
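
One way to address this is to forward `top_k` to the server and keep the local slice only as a safeguard. The sketch below is a minimal illustration, not the PR's implementation: the payload keys (`query`, `user_id`, `top_k`) are assumptions, since the excerpt does not show what the MemOS search endpoint accepts, and `build_search_payload` and `clip_memories` are hypothetical helper names.

```python
import json


def build_search_payload(query: str, user_id: str, top_k: int = 10) -> str:
    """Serialize a search request body.

    All three keys are assumptions; the real endpoint may use different
    names or ignore "top_k" entirely.
    """
    if top_k < 1:
        raise ValueError("top_k must be a positive integer")
    return json.dumps({"query": query, "user_id": user_id, "top_k": top_k})


def clip_memories(memories: list[str], top_k: int) -> list[str]:
    """Cap the result list locally in case the server returns more than top_k."""
    return memories[:top_k]
```

Slicing a list that is shorter than `top_k` simply returns the whole list, so the local cap is harmless when the server already honors the limit.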


evaluation/scripts/utils/memos_api.py

Comment on lines +20 to +21

		register_res = json.loads(self.user_register(user_id))
		cube_id = register_res["data"]["mem_cube_id"]

Copilot AI Jul 21, 2025

No error handling for JSON parsing failures. If user_register returns invalid JSON, this will raise an unhandled exception without a helpful error message.

Suggested change

      
-                      register_res = json.loads(self.user_register(user_id))
-                      cube_id = register_res["data"]["mem_cube_id"]
+                      try:
+                          register_res = json.loads(self.user_register(user_id))
+                          cube_id = register_res["data"]["mem_cube_id"]
+                      except json.JSONDecodeError as e:
+                          raise ValueError(f"Failed to parse JSON response from user_register: {e}")
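
Note that this suggestion catches only `json.JSONDecodeError`; a well-formed response that lacks `data` or `mem_cube_id` would still surface as a bare `KeyError`. A minimal sketch that covers both cases follows. `extract_cube_id` is a hypothetical helper name, and the response shape is assumed from the two lines under review.

```python
import json


def extract_cube_id(raw_response: str) -> str:
    """Pull data.mem_cube_id out of a registration response body."""
    try:
        parsed = json.loads(raw_response)
        return parsed["data"]["mem_cube_id"]
    except json.JSONDecodeError as exc:
        raise ValueError(f"user_register returned invalid JSON: {exc}") from exc
    except (KeyError, TypeError) as exc:
        # KeyError: a field is missing; TypeError: "data" is not a mapping.
        raise ValueError(
            f"user_register response lacks data.mem_cube_id: {raw_response!r}"
        ) from exc
```

Chaining with `from exc` keeps the original traceback available while giving the caller a single exception type to handle.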


evaluation/scripts/utils/memos_api.py

Comment on lines +42 to +44

+                          result = json.loads(response.text)["data"]["text_mem"][0]["memories"]
+                          text_memories = [item["memory"] for item in result][:top_k]
+                          return text_memories

Copilot AI Jul 21, 2025

No error handling for JSON parsing or key access failures. This could raise KeyError or json.JSONDecodeError without helpful context about the API response structure.

Suggested change

      
-                          result = json.loads(response.text)["data"]["text_mem"][0]["memories"]
-                          text_memories = [item["memory"] for item in result][:top_k]
-                          return text_memories
+                          try:
+                              response_json = json.loads(response.text)
+                              data = response_json.get("data", {})
+                              text_mem = data.get("text_mem", [])
+                              if not text_mem or not isinstance(text_mem, list) or "memories" not in text_mem[0]:
+                                  raise KeyError("Expected 'text_mem' to be a non-empty list with 'memories' key.")
+                              result = text_mem[0]["memories"]
+                              text_memories = [item.get("memory", "") for item in result][:top_k]
+                              return text_memories
+                          except json.JSONDecodeError as e:
+                              raise ValueError(f"Failed to parse JSON response: {e}")
+                          except (KeyError, IndexError) as e:
+                              raise ValueError(f"Unexpected response structure: {e}")


bestwyj approved these changes


Duguce added 5 commits

July 21, 2025 22:16


          Merge branch 'MemTensor:dev' into dev

54521c6


          Merge branch 'MemTensor:dev' into dev


          Merge branch 'MemTensor:dev' into dev

638973a


          Merge branch 'MemTensor:dev' into dev

284f8cc


          Merge branch 'dev' into dev

29c88aa

hush-cd reviewed


evaluation/scripts/locomo/locomo_eval.py Outdated

    
                       type=str,
-                      choices=["zep", "memos", "mem0", "mem0_graph", "langmem", "openai"],
+                      choices=["zep", "memos", "mem0", "mem0_graph", "openai", "memos-api"],
                       help="Specify the memory framework (zep or memos or mem0 or mem0_graph)",

Contributor

hush-cd Jul 23, 2025

Update the help string to include "openai" and the newly added "memos-api" for consistency with the choices list.
Ensure corresponding updates are made in other relevant files where this argument is defined or documented to maintain coherence across the codebase.
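
One way to keep the help string and the choices list from drifting apart is to build both from a single list. A minimal sketch, assuming a hypothetical `--lib` flag name (the diff excerpt does not show the actual flag) and a hypothetical `build_parser` helper:

```python
import argparse

# Framework names copied from the updated choices list in this diff.
FRAMEWORKS = ["zep", "memos", "mem0", "mem0_graph", "openai", "memos-api"]


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser()
    parser.add_argument(
        "--lib",  # hypothetical flag name; the diff excerpt does not show it
        type=str,
        choices=FRAMEWORKS,
        # Build the help text from the same list so the two cannot drift apart.
        help=f"Specify the memory framework ({' or '.join(FRAMEWORKS)})",
    )
    return parser
```

If the list lived in a shared module, the other scripts that define this argument could import it instead of repeating the names.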

evaluation/scripts/locomo/locomo_search.py

+                      speaker_2_memories=speaker_b_context,
+                  )
+                  print(query, context)

Contributor

hush-cd Jul 23, 2025

Are the print statements in multiple search functions necessary? Also, note that the zep_search function does not have a corresponding print statement.
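
If the output is only needed for debugging, one option is to route it through the standard `logging` module, so that every search function (including `zep_search`) can share one call and the output can be silenced by log level. A minimal sketch, where `report_search` and the logger name are hypothetical:

```python
import logging

logger = logging.getLogger("locomo_search")  # hypothetical logger name


def report_search(query: str, context: str) -> None:
    """Emit the query/context pair at DEBUG level instead of printing it."""
    logger.debug("query=%r context=%r", query, context)
```

Calling `logging.basicConfig(level=logging.DEBUG)` at script start would make these messages visible; at the default level they are suppressed.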


          feat(eval): add memobase; fix bugs about share db

b391c90

tangg555 requested review from bestwyj and hush-cd

July 24, 2025 03:23

CaralHsi merged commit 3d90823 into MemTensor:dev

20 checks passed

tangg555 pushed a commit to tangg555/MemOS that referenced this pull request


          feat(eval): support memos api mode (MemTensor#116)

26cac6c

* feat(eval): add eval dependencies

* feat(eval): add configs example

* docs(eval): update README.md

* feat(eval): remove the dependency (pydantic)

* feat(eval): add run locomo eval script

* fix(eval): delete about memos redundant search branches

* chore: fix format

* feat(eval): add openai memory on locomo - eval guide

* docs(eval): modify openai memory on locomo - eval guide

* feat(eval): add longmemeval evaluation pipeline

* chore(eval): formatter

* chore: update

* feat(eval): add configs example

* fix(eval): bugs about longmemeval

* fix(eval): search top k

* chore(eval): update

* feat(eval): support memos api mode

* feat(eval): add memobase; fix bugs about share db

tianxing02 pushed a commit to tianxing02/MemOS that referenced this pull request


          feat(eval): support memos api mode (MemTensor#116)

81106a6

