Letta vs LangChain Memory: Agent Memory Compared (2026)

Two of the most talked-about approaches to agent memory in 2026 are Letta (formerly MemGPT) and LangMem (LangChain's memory library for LangGraph). They both give AI agents the ability to remember things across sessions — but they couldn't be more different in scope.

Letta is a full agent runtime. Your agents don't just use Letta for memory — they run inside it. LangMem is a lightweight memory add-on that plugs into LangGraph's existing agent framework. One is a platform; the other is a component.

Both are open source. Both are Python-only. But the architectural gap between a complete agent operating system and a key-value memory store is enormous, and it drives every practical difference you'll encounter. This guide breaks down the tradeoffs so you can make the right call for your use case.


Letta vs LangChain Memory: Quick Comparison

| | Letta (MemGPT) | LangMem (LangChain) |
|---|---|---|
| What it is | Full agent runtime with memory | Memory add-on for LangGraph |
| License | Apache 2.0 | MIT |
| GitHub stars | ~21K | ~1.3K (part of the ~48K-star LangChain ecosystem) |
| Memory approach | OS-inspired tiers; agent self-edits memory | Flat JSON key-value + vector search |
| Memory tiers | Core (RAM) · Recall (cache) · Archival (cold) | Single namespace/key store |
| Retrieval | Agentic tool calls against memory tiers | Vector similarity search |
| Knowledge graph | No | No |
| Temporal retrieval | No | No |
| Entity resolution | No | No |
| Prompt optimization | No | Yes |
| SDKs | Python | Python |
| Framework dependency | High (agents run inside Letta) | High (requires LangGraph) |
| LongMemEval score | Not published | Not published |
| Managed cloud | Yes ($20–200/mo) | No (self-host via LangGraph) |
| Self-hosted | Yes (free, Apache 2.0) | Yes (free, MIT); LangGraph infra required |

Agent Memory Architecture: Letta vs LangChain

This is the defining difference, and everything else follows from it.

Letta is a complete agent platform. It grew out of the MemGPT research project, which proposed treating LLM context like an operating system manages virtual memory. Today, Letta provides the agent loop, tool execution, memory management, state persistence, and a visual Agent Development Environment (ADE). When you build with Letta, you're adopting it as your agent framework — not just adding memory to an existing stack. It's backed by a $10M seed round from Felicis, and the project has strong momentum with roughly 21K GitHub stars.

LangMem takes the opposite approach. It's a memory library — a component you wire into LangGraph agents. LangGraph handles the agent loop, orchestration, and tool execution. LangMem handles remembering. It stores memories as JSON items in LangGraph's built-in structured store and provides a background memory manager that can extract user preferences from conversations automatically.

The scope difference is enormous. Letta is trying to be the operating system for agents. LangMem is trying to be one good library inside an existing ecosystem. Neither approach is inherently wrong — they're solving different-sized problems.


Memory Models: Self-Editing Tiers vs Flat Key-Value

The way each system structures and manages memory reflects its architectural philosophy.

Letta's OS-Inspired Memory

Letta organizes memory into three tiers, inspired by a computer's memory hierarchy:

  • Core Memory — a small block that lives permanently in the agent's context window, like RAM. Contains the agent's persona, user details, and critical context. The agent reads and writes to it directly.
  • Recall Memory — conversation history stored outside the context window, like a disk cache. The agent searches it when it needs to look back at prior interactions.
  • Archival Memory — long-term storage for large volumes of information, like cold storage. The agent inserts and queries it through tool calls.

The key design choice: the agent self-edits its own memory. When the agent decides something is important, it calls a function to write it to the appropriate tier. When it needs to recall something, it calls a function to search. This is sophisticated — the agent uses its own reasoning to decide what matters. But every memory operation costs inference tokens, and memory quality depends entirely on the model's judgment.
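To make the tiering concrete, here is a minimal plain-Python sketch of the Core/Recall/Archival split. This is an illustration of the concept only, not Letta's actual API; the class and method names (`TieredMemory`, `core_append`, and so on) are hypothetical, and real Letta agents perform these operations through LLM tool calls rather than direct method calls.

```python
# Hypothetical sketch of OS-inspired tiered memory (not Letta's real API).
class TieredMemory:
    def __init__(self, core_limit: int = 5):
        self.core = []       # always in the context window, like RAM
        self.recall = []     # prior conversation turns, like a disk cache
        self.archival = []   # long-term cold storage
        self.core_limit = core_limit

    def core_append(self, fact: str) -> None:
        """The agent calls something like this when it judges a fact
        critical. When core is full, the oldest fact is demoted."""
        self.core.append(fact)
        if len(self.core) > self.core_limit:
            self.archival.append(self.core.pop(0))

    def log_turn(self, turn: str) -> None:
        """Every conversation turn lands in recall automatically."""
        self.recall.append(turn)

    def search(self, tier: str, query: str) -> list[str]:
        """Naive substring match standing in for the agent's
        tool-call retrieval against a chosen tier."""
        items = {"recall": self.recall, "archival": self.archival}[tier]
        return [i for i in items if query.lower() in i.lower()]


mem = TieredMemory(core_limit=2)
mem.core_append("persona: helpful assistant")
mem.core_append("user prefers concise answers")
mem.core_append("user timezone is UTC+2")  # demotes the oldest fact
mem.log_turn("user: what's the weather?")
```

Note what the demotion step implies: because the core tier is capped, something has to decide what gets evicted. In Letta that decision is made by the model itself, which is exactly where the "memory quality depends on the model's judgment" tradeoff comes from.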

LangMem's Flat Store

LangMem stores memories as flat JSON key-value items in LangGraph's structured store. Each memory gets a namespace and a key. Retrieval happens through vector similarity search against those items.

This is deliberately simple. It works well for personalization use cases — storing that a user prefers concise answers, tracking project names, remembering tool preferences. The background memory manager handles extraction without the agent needing to think about what to store.

LangMem's simplicity is both its strength and its limitation. There are no relationships between memories, no hierarchy, no way to model how information evolves over time. For straightforward personalization, that's fine. For complex institutional knowledge or multi-entity reasoning, the flat model runs out of road quickly.
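The flat model is easy to picture in code. The toy below mimics the shape of it: JSON items addressed by (namespace, key), retrieved by vector similarity. The bag-of-words "embedding" is a stand-in for a real embedding model, and the `FlatStore` class is illustrative rather than LangMem's or LangGraph's actual store API.

```python
# Toy flat key-value memory with vector-similarity search
# (illustrative only; not the LangMem/LangGraph store API).
import json
import math
import re
from collections import Counter

def embed(text: str) -> Counter:
    """Bag-of-words stand-in for a real embedding model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0

class FlatStore:
    def __init__(self):
        self.items = {}  # (namespace, key) -> JSON-serializable value

    def put(self, namespace: str, key: str, value: dict) -> None:
        self.items[(namespace, key)] = value

    def search(self, namespace: str, query: str, limit: int = 3) -> list[dict]:
        scored = [
            (cosine(embed(query), embed(json.dumps(v))), v)
            for (ns, _), v in self.items.items() if ns == namespace
        ]
        scored.sort(key=lambda pair: pair[0], reverse=True)
        return [v for _, v in scored[:limit]]


store = FlatStore()
store.put("user-123", "style", {"preference": "concise answers"})
store.put("user-123", "project", {"name": "atlas"})
results = store.search("user-123", "concise answers preference", limit=1)
```

Notice everything the model above cannot express: there is no edge between the "style" item and the "project" item, no timestamp on either, and no way to say one memory supersedes another. That is the flat store's ceiling.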


Retrieval Quality: Letta vs LangChain Memory

Retrieval is where the philosophical differences become practical.

Letta's retrieval is agentic. The agent decides when to search memory, which tier to query, and what search terms to use. It can chain multiple memory lookups, reason about results, and refine its searches. This is powerful for complex scenarios where the agent needs to actively hunt through its memory to assemble an answer. The downside is latency and token cost — every memory lookup involves model inference.
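The cost structure of agentic retrieval is easiest to see in a stripped-down loop. In the sketch below the model is stubbed as a scripted function; in a real Letta agent each decision is an LLM inference call, which is exactly where the extra latency and tokens go. All names here are hypothetical illustrations, not Letta's API.

```python
# Minimal agentic-retrieval loop: the "model" chains two memory
# lookups before answering. Each loop iteration = one model call.
MEMORY = {
    "recall":   ["user asked about invoice #4417 last week"],
    "archival": ["invoice #4417 was refunded on 2026-01-12"],
}

def search(tier: str, query: str) -> list[str]:
    return [m for m in MEMORY[tier] if query in m]

def scripted_model(observations: list[str]) -> dict:
    """Stand-in for LLM inference deciding the next tool call."""
    if not observations:
        return {"tool": "search", "tier": "recall", "query": "invoice"}
    if len(observations) == 1:
        # chain a second lookup into cold storage
        return {"tool": "search", "tier": "archival", "query": "#4417"}
    return {"tool": "answer", "text": "Invoice #4417 was refunded."}

def agent_loop():
    observations, model_calls = [], 0
    while True:
        action = scripted_model(observations)
        model_calls += 1
        if action["tool"] == "answer":
            return action["text"], model_calls
        observations += search(action["tier"], action["query"])

answer, calls = agent_loop()
```

Three model calls to produce one answer. A vector-only system like LangMem would do this with zero extra inference, but it also could not have chained the second lookup.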

LangMem's retrieval is vector similarity only. You query the store with a text string, and it returns the most semantically similar items. There's no graph traversal, no temporal filtering, no entity resolution, and no keyword search. For many personalization queries ("what does this user prefer?"), vector similarity is sufficient.

One area where LangMem genuinely stands out: prompt optimization. LangMem can analyze agent performance over time and suggest improvements to system prompts based on accumulated interactions. This is a unique feature that neither Letta nor most other agent memory systems offer. If you're actively iterating on prompt engineering, it's valuable.
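Conceptually, prompt optimization closes a loop: mine recurring signals from past interactions and fold them back into the system prompt. LangMem's real optimizer uses an LLM for that step; the frequency heuristic below is only a stand-in to show the shape of the idea, and the function name is hypothetical.

```python
# Conceptual sketch of prompt optimization (not LangMem's API):
# feedback that recurs across sessions gets promoted into the prompt.
from collections import Counter

def optimize_prompt(system_prompt: str,
                    feedback: list[str],
                    min_count: int = 2) -> str:
    """Append any piece of feedback seen at least `min_count` times."""
    recurring = [f for f, n in Counter(feedback).items() if n >= min_count]
    if not recurring:
        return system_prompt
    lines = "\n".join(f"- {f}" for f in recurring)
    return system_prompt + "\nGuidelines learned from past sessions:\n" + lines


prompt = optimize_prompt(
    "You are a support agent.",
    ["be more concise", "be more concise", "cite sources"],
)
```

One-off feedback ("cite sources", seen once) is ignored; only the repeated signal is promoted. Swapping the counting heuristic for an LLM that rewrites the prompt wholesale gets you much closer to what LangMem actually ships.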

Neither system publishes results on the LongMemEval benchmark, so there's no standardized way to compare retrieval quality head-to-head. Neither offers graph-based retrieval, temporal querying, or entity resolution — capabilities that go beyond traditional RAG and that the survey paper "Memory in the Age of AI Agents" identifies as important for production agent memory.


Lock-In and Portability

Both systems come with significant lock-in, just different flavors of it.

Letta's lock-in is architectural. Your agents run inside the Letta runtime. The agent loop, tool execution, and state management are all Letta's. If you later decide to switch to a different agent framework, you're not just swapping out a memory layer — you're rebuilding your agents from scratch. Your memory data is also structured around Letta's three-tier model, so migration means re-architecting how memory is organized.

LangMem's lock-in is framework-level. LangMem requires LangGraph. Your memories live in LangGraph's structured store. If you move away from the LangChain ecosystem, your memory system comes with you only if you rebuild the storage and retrieval layer. LangMem also doesn't provide SDKs outside of Python, so any non-Python services in your stack can't access agent memory natively.

Neither system is framework-agnostic. If portability matters to your architecture — especially in a polyglot environment — both options require careful consideration of what you're committing to.


Developer Experience

Letta offers a more powerful but steeper experience. The ADE (Agent Development Environment) gives you visual debugging, memory inspection, and agent monitoring in a web interface. You can watch your agent's memory operations in real time, inspect the contents of each memory tier, and step through agent reasoning. But the setup is heavier — you're configuring a full runtime, not just importing a library. Expect to spend meaningful time on initial configuration.

LangMem offers a simpler but narrower experience. If you're already building in LangGraph, adding LangMem is straightforward — install the package, configure the memory manager, and wire it into your graph. The learning curve is gentle because LangMem doesn't introduce new concepts beyond what LangGraph already uses. But if you're not in the LangGraph ecosystem, LangMem isn't an option at all.


Pricing and Self-Hosting: Letta vs LangChain Memory

Both projects are open source and free to self-host.

Letta offers a managed cloud service priced at $20–200/month depending on usage. Self-hosting is free under Apache 2.0, but you'll need to run the Letta server and manage the infrastructure for persistence, which includes PostgreSQL and the ADE.

LangMem is MIT-licensed and free. There's no separate managed service for LangMem — it runs inside your LangGraph deployment. That means your cost is whatever you're paying to host LangGraph (either self-hosted or through LangSmith/LangGraph Cloud). LangGraph Cloud pricing varies, but the memory component itself adds no additional cost.

For either system, the real expense in production is LLM inference. Letta's agentic memory operations consume more tokens per interaction than LangMem's passive approach, which can add up at scale.


When to Choose Letta

Letta is the stronger choice when you:

  • Want an integrated agent platform — you're starting fresh and want agent orchestration, memory, tools, and observability in one package
  • Need autonomous agents with sophisticated memory — agents that actively manage what they remember, with tiered storage and agentic retrieval
  • Value visual debugging — the ADE is a genuine advantage for understanding agent behavior in development
  • Are comfortable with Python-only — your entire stack is Python and you don't need multi-language SDK support

For a broader look at options, see our Letta alternatives guide.


When to Choose LangChain Memory

LangMem is the stronger choice when you:

  • Are already invested in LangGraph — LangMem integrates seamlessly and doesn't require learning a new framework
  • Have personalization-only memory needs — user preferences, conversation summaries, and simple context carry-over
  • Want minimal complexity — a flat key-value store with vector search is easier to reason about and debug than tiered, agentic memory
  • Need prompt optimization — LangMem's ability to improve system prompts based on interaction history is unique and valuable

For a broader look at options, see our LangChain memory alternatives guide.


Worth Considering: Hindsight

Both Letta and LangMem solve real problems, but both come with meaningful lock-in — either to a full runtime or to a specific framework. If that tradeoff gives you pause, Hindsight is worth evaluating.

Hindsight is a framework-agnostic memory layer. It doesn't care whether your agents run in LangGraph, CrewAI, a custom loop, or something else entirely. It provides multi-strategy retrieval — semantic search, keyword search, knowledge graph traversal, and temporal filtering with cross-encoder reranking — without requiring the agent to manage its own memory. It scores 91.4% on the LongMemEval benchmark, which neither Letta nor LangMem has published results for. And it ships SDKs for Python, TypeScript, and Go — something neither competitor offers.

Hindsight is also MCP-first, so any MCP-compatible client can use it as a memory tool immediately. If you want persistent agent memory without committing to a specific runtime or framework, it's a different kind of bet. See the detailed comparisons: Hindsight vs Letta and Hindsight vs LangChain Memory.


Verdict: Letta vs LangChain Memory for Agent Memory

Letta and LangMem are solving the same problem at very different altitudes. Letta gives you a complete agent operating system with sophisticated, tiered memory that the agent controls. LangMem gives you a simple, effective memory component for agents you're already building in LangGraph.

If you want power and integration and you're willing to adopt a new runtime, Letta is the more capable system. If you want simplicity and you're already in the LangChain ecosystem, LangMem gets you persistent memory with minimal effort.

The honest assessment: neither is a universal answer. Letta's scope means you're buying into a platform. LangMem's simplicity means you'll outgrow it if your memory needs get complex. Both are Python-only, and both lock you into their respective ecosystems. Choose based on where you are today and where your agent architecture is headed — and consider whether framework independence matters to your roadmap.


Further reading: