Thanks for your project! After reading your paper, I find that MemOS can store KV cache, hidden states and attention weights. I get that KV cache can be used to reload the conversation history. However, I cannot understand the usage of storing hidden states and attention weights. Can you help explain that? Thanks!