Memory plays a foundational role in augmenting the reasoning, adaptability, and contextual fidelity of modern Large Language Models (LLMs) and Multi-Modal LLMs (MLLMs). As these models transition from static predictors to interactive systems capable of continual learning and personalized inference, the incorporation of memory mechanisms has emerged as a central theme in their architectural and functional evolution. This survey presents a comprehensive and structured synthesis of memory in LLMs and MLLMs, organizing the literature into a cohesive taxonomy comprising implicit, explicit, and agentic memory paradigms.
The architecture of memory in modern (M)LLMs is increasingly analogous to the synergistic relationship between different human brain systems, particularly the neocortex, the hippocampus, and the prefrontal cortex. This brain-inspired framework, which echoes the principles of Complementary Learning Systems theory, provides a powerful lens through which to understand the different memory paradigms evolving in AI.
- **Implicit Memory: The Neocortex.** We conceptualize the model's internal parameters as its digital neocortex. In the brain, the neocortex is the primary repository for long-term semantic knowledge, skills, and consolidated memories, which are learned slowly and stored in a distributed manner. Similarly, a transformer's weights embody the implicit memory of the model, the foundational "world knowledge" acquired during pre-training. This parametric knowledge represents the model's stable, generalized understanding of language, patterns, and facts.
- **Explicit Memory: The Hippocampal System.** To access specific, real-time, or episodic information, an AI system requires a mechanism analogous to the hippocampus. The hippocampus is critical for the rapid encoding of new episodic memories (i.e., specific events and their context) and acts as an index that binds together disparate elements of an experience stored across the neocortex. Explicit memory systems in AI, such as Retrieval-Augmented Generation (RAG), mimic this function. They serve as an "AI Hippocampus" by providing an on-demand, queryable index to external information (vector embeddings, knowledge graphs). This allows the model to ground its responses in specific, up-to-date facts without the need for slow, resource-intensive retraining of its entire parametric base (the "neocortex").
- **Agentic Memory: The Prefrontal Cortex.** The functionality of agentic memory is best analogized to the prefrontal cortex (PFC), the brain's executive control center. The PFC is responsible for working memory, goal-directed planning, and integrating information from both long-term stores (neocortex) and recent episodic memories (hippocampus) to guide behavior. Agentic memory systems similarly maintain a persistent state across interactions, manage working memory (e.g., a scratchpad), and orchestrate the strategic retrieval and use of both implicit and explicit memory to formulate plans and execute complex tasks. Furthermore, as we explore later in this survey, this executive function extends to integrating information from specialized memory modules for spatial, temporal, and embodied intelligence, akin to how the PFC coordinates inputs from various sensory cortices.
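To make the implicit-memory analogy concrete, the key-value reading of a transformer feed-forward layer (the view taken in "Transformer Feed-Forward Layers Are Key-Value Memories", listed below) can be sketched in a few lines of NumPy. This is a toy with random weights rather than a trained model; the dimensions and matrices are assumptions made purely for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
d_model, d_ff = 16, 64  # toy dimensions

# In the key-value view of an FFN, the first weight matrix holds "keys"
# (input-pattern detectors) and the second holds "values" (vectors the
# layer writes back to the residual stream).
W_keys = rng.normal(size=(d_ff, d_model))    # one key per memory slot
W_values = rng.normal(size=(d_model, d_ff))  # one value per memory slot

def ffn(x):
    """A standard transformer FFN, read as an associative memory lookup."""
    slot_activations = np.maximum(W_keys @ x, 0.0)  # ReLU: how well each key matches x
    return W_values @ slot_activations              # weighted sum of value vectors

x = rng.normal(size=d_model)  # a residual-stream hidden state
out = ffn(x)

# Only a subset of slots fires for a given input, so each slot behaves
# like one distributed "memory" consulted during the forward pass.
active = int((np.maximum(W_keys @ x, 0.0) > 0).sum())
print(f"{active}/{d_ff} memory slots active, output dim {out.shape[0]}")
```

Roughly this reading is what the interpretability and knowledge-editing work listed below builds on: locating a fact amounts to finding which key-value slots encode it.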
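The explicit and agentic paradigms can likewise be sketched together: an external store queried by similarity (the "hippocampus") plus a per-task working-memory scratchpad that orchestrates retrieval (the "PFC"). The bag-of-words embedding, the `AgentMemory` class, and the example memories are hypothetical stand-ins; real systems use learned embeddings, a vector database, and an actual LLM in the loop.

```python
from collections import Counter
import math

def embed(text):
    """Toy bag-of-words 'embedding'; real systems use learned encoders."""
    return Counter(text.lower().split())

def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

class AgentMemory:
    """Minimal sketch: explicit long-term store + working-memory scratchpad."""
    def __init__(self):
        self.store = []        # long-term explicit memory: (text, embedding)
        self.scratchpad = []   # working memory for the current task

    def write(self, text):
        self.store.append((text, embed(text)))

    def retrieve(self, query, k=1):
        q = embed(query)
        ranked = sorted(self.store, key=lambda m: cosine(q, m[1]), reverse=True)
        return [text for text, _ in ranked[:k]]

    def step(self, query):
        # Executive control: pull relevant episodes into working memory,
        # then hand the combined context to the LLM (omitted in this sketch).
        self.scratchpad.extend(self.retrieve(query))
        return self.scratchpad

mem = AgentMemory()
mem.write("User prefers concise answers.")
mem.write("The deployment uses region eu-west-1.")
print(mem.step("which region is the deployment in?"))
```

Grounding the response in the retrieved episode, rather than in retrained weights, is exactly the division of labor the neocortex/hippocampus analogy above describes.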
🌟🌟🌟 Please feel free to make a PR if I missed something. 🌟🌟🌟
Table of Contents
- 1 Implicit Memory: Unveiling Knowledge Inside Transformers
- 2 Explicit Memory: When (M)LLMs Meet Retrieval
- 3 Agentic Memory: Consolidating Memories into Human-like Agents
- 4 Memory-Augmented Multi-Modal Large Language Models
- Dissecting recall of factual associations in auto-regressive language models
- Knowledge circuits in pretrained transformers
- Interpreting key mechanisms of factual recall in transformer-based language models
- Confidence regulation neurons in language models
- Revealing the dark secrets of BERT
- Transformer Feed-Forward Layers Are Key-Value Memories
- Transformer feed-forward layers build predictions by promoting concepts in the vocabulary space
- Knowledge Neurons in Pretrained Transformers
- What does BERT look at? An analysis of BERT’s attention
- exBERT: A Visual Analysis Tool to Explore Learned Representations in Transformer Models
- Characterizing mechanisms for factual recall in language models
- Transformer Feed-Forward Layers Are Key-Value Memories
- Transformer feed-forward layers build predictions by promoting concepts in the vocabulary space
- Empirical study on updating key-value memories in transformer feed-forward layers
- Understanding transformer from the perspective of associative memory
- Knowledge Neurons in Pretrained Transformers
- MoEfication: Transformer feed-forward layers are mixtures of experts
- Emergent modularity in pre-trained transformers
- Depn: Detecting and editing privacy neurons in pretrained language models
- Confidence regulation neurons in language models
- What does BERT look at? An analysis of BERT’s attention
- Characterizing mechanisms for factual recall in language models
- Inference-time intervention: Eliciting truthful answers from a language model
- Stanford alpaca: An instruction-following llama model
- Do LLMs dream of elephants (when told not to)? Latent concept association and associative memory in transformers
- Knowledge circuits in pretrained transformers
- Dissecting recall of factual associations in auto-regressive language models
- Interpreting key mechanisms of factual recall in transformer-based language models
- Scaling Laws for Neural Language Models
- Scaling Laws for Fact Memorization of Large Language Models
- Physics of Language Models: Part 3.3, Knowledge Capacity Scaling Laws
- Physics of language models: Part 3.1, knowledge storage and extraction
- Neural networks and physical systems with emergent collective computational abilities
- Hopfield networks is all you need
- Birth of a transformer: a memory viewpoint
- Do LLMs dream of elephants (when told not to)? Latent concept association and associative memory in transformers
- Scaling Laws for Associative Memories
- Beyond Scaling Laws: Understanding Transformer Performance with Associative Memory
- CAMELoT: Towards Large Language Models with Training-Free Consolidated Associative Memory
- Hierarchical Associative Memory
- Universal Hopfield Networks: A General Framework for Single-Shot Associative Memory Models
- Modifying memories in transformer models
- Propagating Knowledge Updates to LMs Through Distillation
- Training Language Models with Memory Augmentation
- RECKONING: reasoning through dynamic knowledge encoding
- LoRA: Low-Rank Adaptation of Large Language Models
- K-Adapter: Infusing Knowledge into Pre-Trained Models with Adapters
- DEMix Layers: Disentangling Domains for Modular Language Modeling
- Locating and editing factual associations in GPT
- Mass-editing memory in a transformer
- Calibrating Factual Knowledge in Pretrained Language Models
- Editing Factual Knowledge in Language Models
- Methods for measuring, updating, and visualizing factual beliefs in language models
- Large Scale Knowledge Washing
- A comprehensive study of knowledge editing for large language models
- To Forget or Not? Towards Practical Knowledge Unlearning for Large Language Models
- Towards Safer Large Language Models through Machine Unlearning
- LLM Surgery: Efficient Knowledge Unlearning and Editing in Large Language Models
- Unlearn What You Want to Forget: Efficient Unlearning for LLMs
- Knowledge editing for large language models: A survey
- Evaluating the Ripple Effects of Knowledge Editing in Language Models
- Cross-Lingual Multi-Hop Knowledge Editing--Benchmarks, Analysis and a Simple Contrastive Learning based Approach
- A comprehensive study of knowledge editing for large language models
- MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions
- Eva-kellm: A new benchmark for evaluating knowledge editing of llms
- PRCA: Fitting Black-Box Large Language Models for Retrieval Question Answering via Pluggable Reward-Driven Contextual Adapter
- PaperQA: Retrieval-Augmented Generative Agent for Scientific Research
- Memory-assisted prompt editing to improve GPT-3 after deployment
- Prompt-Guided Retrieval Augmentation for Non-Knowledge-Intensive Tasks
- Enhancing LLM Intelligence with ARM-RAG: Auxiliary Rationale Memory for Retrieval Augmented Generation
- Understanding Retrieval Augmentation for Long-Form Question Answering
- Chain-of-Note: Enhancing Robustness in Retrieval-Augmented Language Models
- IAG: Induction-Augmented Generation Framework for Answering Reasoning Questions
- Retrieval meets Long Context Large Language Models
- RA-DIT: Retrieval-Augmented Dual Instruction Tuning
- RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
- ISEEQ: Information Seeking Question Generation using Dynamic Meta-Information Retrieval and Knowledge Graphs
- G-Retriever: Retrieval-augmented generation for textual graph understanding and question answering
- KnowledGPT: Enhancing Large Language Models with Retrieval and Storage Access on Knowledge Bases
- FABULA: Intelligence Report Generation Using Retrieval-Augmented Narrative Construction
- HyKGE: A Hypothesis Knowledge Graph Enhanced Framework for Accurate and Reliable Medical LLMs Responses
- Knowledge-Augmented Language Model Verification
- PIP-KAG: Mitigating Knowledge Conflicts in Knowledge-Augmented Generation via Parametric Pruning
- From RAG to Memory: Non-Parametric Continual Learning for Large Language Models
- Reducing Hallucinations in Language Model-based SPARQL Query Generation Using Post-Generation Memory Retrieval
- Reasoning on Graphs: Faithful and Interpretable Large Language Model Reasoning
- ProAI: Proactive Multi-Agent Conversational AI with Structured Knowledge Base for Psychiatric Diagnosis
- LlamaIndex
- LangChain
- Large Language Models Prompting With Episodic Memory
- Enhancing LLM Intelligence with ARM-RAG: Auxiliary Rationale Memory for Retrieval Augmented Generation
- Retrieve-and-Sample: Document-level Event Argument Extraction via Hybrid Retrieval Augmentation
- Evaluating Very Long-Term Conversational Memory of LLM Agents
- REALM: Retrieval-Augmented Language Model Pre-Training
- Improving language models by retrieving from trillions of tokens
- Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study
- InstructRetro: Instruction Tuning post Retrieval-Augmented Pretraining
- Memory^3: Language Modeling with Explicit Memory
- Atlas: Few-shot Learning with Retrieval Augmented Language Models
- Structure-Aware Language Model Pretraining Improves Dense Retrieval on Structured Data
- Copy Is All You Need
- Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks
- Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
- Leveraging Passage Retrieval with Generative Models for Open Domain Question Answering
- REPLUG: Retrieval-Augmented Black-Box Language Models
- Knowledge Graph-Augmented Language Models for Knowledge-Grounded Dialogue Generation
- UPRISE: Universal Prompt Retrieval for Improving Zero-Shot Evaluation
- Self-Memory Alignment: Mitigating Factual Hallucinations with Generalized Improvement
- Interact, Instruct to Improve: A LLM-Driven Parallel Actor-Reasoner Framework for Enhancing Autonomous Vehicle Interactions
- Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy
- Memorizing Transformers
- Unlimiformer: Long-Range Transformers with Unlimited Length Input
- Focused Transformer: Contrastive Training for Context Scaling
- EpMAN: Episodic Memory AttentioN for Generalizing to Longer Contexts
- R^3Mem: Bridging Memory Retention and Retrieval via Reversible Compression
- Augmenting Language Models with Long-Term Memory
- Training language models with memory augmentation
- Lift Yourself Up: Retrieval-augmented Text Generation with Self Memory
- Faithful reasoning using large language models
- Efficient Multivariate Time Series Forecasting via Calibrated Language Models with Privileged Knowledge Distillation
- Towards Cross-Modality Modeling for Time Series Analytics: A Survey in the LLM Era
- Foundation Models for Spatio-Temporal Data Science: A Tutorial and Survey
- Human memory: A proposed system and its control processes
- A survey on the memory mechanism of large language model based agents
- LLM as a robotic brain: Unifying egocentric memory and control
- Evaluating very long-term conversational memory of LLM agents
- Memory^3: Language modeling with explicit memory
- Chain-of-thought prompting elicits reasoning in large language models
- Self-consistency improves chain of thought reasoning in language models
- Tree of thoughts: Deliberate problem solving with large language models
- Graph of thoughts: Solving elaborate problems with large language models
- ReAct: Synergizing reasoning and acting in language models
- Reflexion: Language agents with verbal reinforcement learning
- ReflectEvo: Improving meta introspection of small LLMs by learning self-reflection
- Mars: Situated inductive reasoning in an open-world environment
- RAM: Towards an ever-improving memory system by learning from communications
- Absolute zero: Reinforced self-play reasoning with zero data
- Metareflection: Learning instructions for language agents using past reflections
- Learn beyond the answer: Training language models with reflection for mathematical reasoning
- RoT: Enhancing large language models with reflection on search trees
- Mirror: A multiple-perspective self-reflection method for knowledge-rich reasoning
- Textgrad: Automatic differentiation via text
- Langsuite: Planning, controlling and interacting with large language models in embodied text environments
- Benchmarking the spectrum of agent capabilities
- The arcade learning environment: An evaluation platform for general agents
- Understanding the planning of LLM agents: A survey
- Generative agents: Interactive simulacra of human behavior
- RecMind: Large language model powered agent for recommendation
- Large language models are semi-parametric reinforcement learning agents
- How do large language models capture the ever-changing world knowledge? a review of recent advances
- From static to dynamic: A continual learning framework for large language models
- Ret-LLM: Towards a general read-write memory for large language models
- Empowering working memory for large language model agents
- Think-in-memory: Recalling and post-thinking enable LLMs with long-term memory
- RAP: Retrieval-augmented planning with contextual memory for multimodal LLM agents
- Learn-by-interact: A data-centric framework for self-adaptive agents in realistic environments
- Memp: Exploring agent procedural memory
- Contextual experience replay for self-improvement of language agents
- Sage: Self-evolving agents with reflective and memory-augmented abilities
- ReasoningBank: Scaling agent self-evolving with reasoning memory
- Memory-assisted prompt editing to improve GPT-3 after deployment
- Learning to repair: Repairing model output errors after deployment using a dynamic memory of feedback
- Towards teachable reasoning systems: Using a dynamic memory of user feedback for continual system improvement
- Compress to impress: Unleashing the potential of compressive memory in real-world long-term conversations
- KnowledGPT: Enhancing large language models with retrieval and storage access on knowledge bases
- LLM-based Medical Assistant Personalization with Short- and Long-Term Memory Coordination
- MemoryBank: Enhancing large language models with long-term memory
- A-Mem: Agentic memory for LLM agents
- Internet of agents: Weaving a web of heterogeneous agents for collaborative intelligence
- LLM-Powered Decentralized Generative Agents with Adaptive Hierarchical Knowledge Graph for Cooperative Planning
- Memory Sharing for Large Language Model based Agents
- Zep: A Temporal Knowledge Graph Architecture for Agent Memory
- The NarrativeQA Reading Comprehension Challenge
- Quality: Question answering with long input texts, yes!
- LooGle: Can long-context language models understand long contexts?
- RetrievalQA: Assessing adaptive retrieval-augmented generation for short-form open-domain question answering
- Needle In A Haystack
- ALFWorld: Aligning text and embodied environments for interactive learning
- WebArena: A realistic web environment for building autonomous agents
- Gentopia: A collaborative platform for tool-augmented LLMs
- AgentBench: Evaluating LLMs as agents
- LongMemEval: Benchmarking chat assistants on long-term interactive memory
- The Llama 3 Herd of Models
- Generative agent-based modeling with actions grounded in physical, social, or digital space using Concordia
- A Scalable Communication Protocol for Networks of Large Language Models
- A survey on the memory mechanism of large language model based agents
- DroidSpeak: Enhancing Cross-LLM Communication
- IBGP: Imperfect Byzantine Generals Problem for Zero-Shot Robustness in Communicative Multi-Agent Systems
- RAM: Towards an Ever-Improving Memory System by Learning from Communications
- Rethinking memory in AI: Taxonomy, operations, topics, and future directions
- Memory-augmented conformer for improved end-to-end long-form ASR
- Loop Copilot: Conducting AI Ensembles for Music Generation and Iterative Editing
- MR-MT3: Memory Retaining Multi-Track Music Transcription to Mitigate Instrument Leakage
- Memory Consolidation Enables Long-Context Video Understanding
- MovieChat: From Dense Token to Sparse Memory for Long Video Understanding
- MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding
- Streaming Long Video Understanding with Large Language Models
- Flash-VStream: Memory-Based Real-Time Understanding for Long Video Streams
- VideoLLaMB: Long-context video understanding with recurrent memory bridges
- OmniDrive: A Holistic LLM-Agent Framework for Autonomous Driving with 3D Perception, Reasoning and Planning
- A Simple LLM Framework for Long-Range Video Question-Answering
- LifelongMemory: Leveraging LLMs for Answering Queries in Egocentric Videos
- VideoAgent: Long-form Video Understanding with Large Language Model as Agent
- VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding
- ChatVideo: A Tracklet-centric Multimodal and Versatile Video Understanding System
- DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models
- Language Repository for Long Video Understanding
- DrVideo: Document Retrieval Based Long Video Understanding
- Attend to You: Personalized Image Captioning with Context Sequence Memory Networks
- Divide-and-Assemble: Learning Block-wise Memory for Unsupervised Anomaly Detection
- Memory-Guided Unsupervised Image-to-Image Translation
- Coloring With Limited Data: Few-Shot Colorization via Memory Augmented Networks
- Universal Face Restoration With Memorized Modulation
- Learning Meta-class Memory for Few-Shot Semantic Segmentation
- Remember the Difference: Cross-Domain Few-Shot Semantic Segmentation via Meta-Memory Transfer
- Memory Oriented Transfer Learning for Semi-Supervised Image Deraining
- TransformerFusion: Monocular RGB Scene Reconstruction using Transformers
- Learning Video Object Segmentation with Visual Memory
- Recurrent Filter Learning for Visual Tracking
- Learning Recurrent Memory Activation Networks for Visual Tracking
- Learning Dynamic Memory Networks for Object Tracking
- Video Object Segmentation Using Space-Time Memory Networks
- Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation
- STMTrack: Template-Free Visual Tracking With Space-Time Memory Networks
- Video Object Segmentation with Episodic Graph Memory Networks
- Dual Temporal Memory Network for Efficient Video Object Segmentation
- Alignment Before Aggregation: Trajectory Memory Retrieval Network for Video Object Segmentation
- Associating Objects with Transformers for Video Object Segmentation
- Local Memory Attention for Fast Video Semantic Segmentation
- TrackFormer: Multi-Object Tracking with Transformers
- XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
- Streaming Video Model
- Multiscale Memory Comparator Transformer for Few-Shot Video Segmentation
- Reading Relevant Feature from Global Representation Memory for Visual Object Tracking
- RMem: Restricted Memory Banks Improve Video Object Segmentation
- Efficient Video Object Segmentation via Modulated Cross-Attention Memory
- SAM 2: Segment Anything in Images and Videos
- Long Short-Term Transformer for Online Action Detection
- MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition
- Recurring the Transformer for Video Action Recognition
- Memory-and-Anticipation Transformer for Online Action Understanding
- Online Temporal Action Localization with Memory-Augmented Transformer
- Video Summarization with Long Short-Term Memory
- Extractive Video Summarizer with Memory Augmented Neural Networks
- Stacked Memory Network for Video Summarization
- M3: Multimodal Memory Modelling for Video Captioning
- Memory-Attended Recurrent Network for Video Captioning
- MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
- AAP-MIT: Attentive Atrous Pyramid Network and Memory Incorporated Transformer for Multisentence Video Description
- Heterogeneous Memory Enhanced Multimodal Attention Model for Video Question Answering
- A Read-Write Memory Network for Movie Story Understanding
- Glance and Focus: Memory Prompting for Multi-Event Video Question Answering
- MemFlow: Optical Flow Estimation and Prediction with Memory
- MAMo: Leveraging Memory and Attention for Monocular Video Depth Estimation
- Multi-Scale Memory-Based Video Deblurring
- MENet: A Memory-Based Network with Dual-Branch for Efficient Event Stream Processing
- CroMM-VSR: Cross-Modal Memory Augmented Visual Speech Recognition
- Distinguishing Homophenes Using Multi-Head Visual-Audio Memory for Lip Reading
- Video Prediction Recalling Long-Term Motion Context via Memory Alignment Learning
- MV-TON: Memory-based Video Virtual Try-on network
- Audio-driven Talking Face Video Generation with Natural Head Pose
- SyncTalkFace: Talking Face Generation with Precise Lip-Syncing via Audio-Lip Memory
- EMMN: Emotional Motion Memory Network for Audio-driven Emotional Talking Face Generation
- STAM: A SpatioTemporal Attention Based Memory for Video Prediction
- Memories are One-to-Many Mapping Alleviators in Talking Face Generation
- JARVIS-1: Open-World Multi-task Agents with Memory-Augmented Multimodal Language Models
- GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation
- Explore, Select, Derive, and Recall: Augmenting LLM with Human-like Memory for Mobile Task Automation
- AppAgent: Multimodal Agents as Smartphone Users
- OS-Copilot: Towards Generalist Computer Agents with Self-Improvement
- Multimodal Embodied Interactive Agent for Cafe Scene
- Visual Memory for Robust Path Following
- Structured Scene Memory for Vision-Language Navigation
- Visual Graph Memory with Unsupervised Representation for Visual Navigation
- MemoNav: Working Memory Model for Visual Navigation
- Scene Memory Transformer for Embodied Agents in Long-Horizon Tasks
- MultiON: Benchmarking Semantic Map Memory using Multi-Object Navigation
- Deep Visual Odometry With Adaptive Memory
- Out of Sight, Still in Mind: Reasoning and Planning about Unobserved Objects with Video Tracking Enabled Memory Models
