Mem0 vs Letta (MemGPT): AI Agent Memory Compared (2026)

If you're evaluating agent memory solutions, Mem0 and Letta (formerly MemGPT) probably both showed up in your search. They solve the same fundamental problem — giving AI agents persistent memory across sessions — but they represent two very different philosophies about how to do it.

Mem0 is a memory layer. You bolt it onto whatever agent framework you're already using and it handles storage and retrieval. Letta is an agent runtime. It manages memory as part of a full operating-system-inspired platform where your agents live and execute. That distinction shapes everything: architecture, developer experience, lock-in, and what kinds of agents each system is best suited for.

This guide compares them honestly so you can pick the right tool without having to prototype both. For a broader survey, see our best AI agent memory systems roundup.


Mem0 vs Letta: Quick Comparison

| | Mem0 | Letta (MemGPT) |
|---|---|---|
| What it is | Pluggable memory layer | Full agent runtime |
| GitHub stars | ~48K | ~21K |
| License | Apache 2.0 | Apache 2.0 |
| Backed by | Y Combinator ($24M Series A) | Felicis ($10M seed) |
| Memory approach | Passive extraction + semantic search | Agent self-edits tiered memory blocks |
| Memory tiers | Flat store (vector + graph on Pro) | Core (RAM) · Recall (cache) · Archival (cold) |
| Retrieval | Semantic search (+ graph on Pro) | Agentic tool calls against memory tiers |
| Benchmark (LongMemEval) | 49.0% (independent eval) | Not published |
| SDKs | Python, JavaScript | Python |
| Integration model | Library — drop into any stack | Platform — agents run inside Letta |
| Visual tooling | Dashboard | ADE (Agent Development Environment) |
| Compliance | SOC 2, HIPAA | Via self-hosting |
| Managed pricing | Free → $19/mo → $249/mo Pro | $20–200/mo |
| Self-hosted | Free (Apache 2.0) | Free (Apache 2.0) |

Agent Memory Architecture: Mem0 vs Letta

The core difference between Mem0 and Letta isn't a feature list — it's a design philosophy.

Mem0: A Pluggable Memory Layer

Mem0 is a memory service with a simple API. You call add() to store memories and search() to retrieve them. Under the hood, Mem0 embeds your content into a vector database for semantic retrieval. On the Pro tier ($249/month), it also builds a knowledge graph that extracts entities and relationships, enabling multi-hop queries.

The key architectural choice: Mem0 is framework-agnostic. It doesn't care whether your agent runs in LangChain, CrewAI, AutoGen, or a custom loop. You import the SDK, point it at Mem0, and your agent has persistent memory. Your orchestration, tool management, and agent logic stay exactly where they are.
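The shape of that integration can be sketched with a minimal stand-in class. This is not the Mem0 SDK; `MemoryLayer`, its naive keyword scoring, and the method signatures are illustrative only, showing how an `add()`/`search()` boundary slots into any agent loop:

```python
# Illustrative stand-in for a pluggable memory layer (NOT the real Mem0 SDK).
# The agent framework stays untouched; it only ever calls add() and search().

from dataclasses import dataclass, field


@dataclass
class MemoryLayer:
    """Minimal add/search store; a real layer embeds text into a vector DB."""
    memories: list[str] = field(default_factory=list)

    def add(self, text: str) -> None:
        self.memories.append(text)

    def search(self, query: str, top_k: int = 3) -> list[str]:
        # Naive keyword-overlap scoring as a placeholder for semantic search.
        q = set(query.lower().split())
        scored = sorted(
            self.memories,
            key=lambda m: len(q & set(m.lower().split())),
            reverse=True,
        )
        return scored[:top_k]


mem = MemoryLayer()
mem.add("Alice prefers concise answers")
mem.add("Bob works on the billing service")
print(mem.search("alice", top_k=1))
```

Because the whole contract is two calls, swapping the toy store for a hosted service changes the class body, not the agent.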

Letta: An Operating System for Agents

Letta takes the MemGPT research paper's core idea — treating LLM context like virtual memory — and builds a full runtime around it. Agents don't just use Letta for memory; they run inside Letta. The framework manages the agent loop, tool execution, state persistence, and memory across three tiers inspired by computer architecture:

  • Core Memory — a small block that lives in the context window (like RAM). The agent reads and writes it directly.
  • Recall Memory — searchable conversation history stored outside context (like a disk cache).
  • Archival Memory — long-term storage the agent queries via tool calls (like cold storage).
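The three tiers above can be sketched as a single data structure. Names and methods here are illustrative, not Letta's actual API; the point is only that core memory is read directly while recall and archival are searched on demand:

```python
# Toy sketch of MemGPT-style tiered memory (names are illustrative, not the
# Letta API): core lives in the context window, the other tiers are queried.

from dataclasses import dataclass, field


@dataclass
class TieredMemory:
    core: dict[str, str] = field(default_factory=dict)  # in-context "RAM"
    recall: list[str] = field(default_factory=list)     # conversation cache
    archival: list[str] = field(default_factory=list)   # cold storage

    def core_write(self, key: str, value: str) -> None:
        self.core[key] = value

    def recall_search(self, term: str) -> list[str]:
        return [m for m in self.recall if term.lower() in m.lower()]

    def archival_search(self, term: str) -> list[str]:
        return [m for m in self.archival if term.lower() in m.lower()]


mem = TieredMemory()
mem.core_write("persona", "helpful support agent")
mem.recall.append("User asked about refund policy yesterday")
mem.archival.append("Refund window is 30 days")
print(mem.recall_search("refund"))
```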

This is a fundamentally different scope. Letta is not a memory layer you add to an existing stack — it is the stack. If you're evaluating it purely as a memory solution, you need to understand that adopting Letta means adopting an entire agent platform.


Memory Management: Who Decides What to Remember?

This is where the philosophical divide becomes practical.

Mem0 extracts memories passively. When you call add() with a conversation or document, Mem0's extraction pipeline decides what facts to store. The developer controls what inputs go in, but the system handles decomposition and storage. This is predictable — the same input produces the same memories — and it keeps memory operations out of the agent's inference budget.
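The predictability claim is easiest to see with a deterministic extractor. Mem0's real pipeline uses an LLM for fact decomposition; this keyword-pattern version is a hypothetical simplification that only illustrates why passive extraction gives repeatable results:

```python
# Deterministic toy extractor: the same input always yields the same memories.
# A real extraction pipeline uses an LLM, but the determinism point holds.

import re

FACT_PATTERNS = [r"\bmy name is\b", r"\bi (like|prefer|work)\b"]


def extract_memories(conversation: list[str]) -> list[str]:
    facts = []
    for turn in conversation:
        if any(re.search(p, turn.lower()) for p in FACT_PATTERNS):
            facts.append(turn)
    return facts


turns = ["Hi there!", "My name is Alice", "I prefer short answers", "Thanks"]
print(extract_memories(turns))
```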

Letta agents self-edit their memory. The agent decides what's worth remembering by calling memory functions during its reasoning loop. When something seems important, the agent writes it to core, recall, or archival memory. When it needs context, it searches its own memory tiers.

The tradeoff is predictability vs intelligence. Mem0's passive extraction is consistent and token-efficient, but it can't make nuanced judgments about what matters in context. Letta's self-editing approach is more adaptive — the agent uses its own reasoning to curate memory — but memory quality depends entirely on the model's judgment. If the model fails to save something, it's gone. Every memory operation also costs inference tokens, since the agent has to reason about what to store and how.
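The self-editing loop reduces to a tool-dispatch pattern. In the sketch below the model's decisions are scripted by hand, and the tool names are illustrative rather than Letta's actual function names; it shows only the mechanic of the agent writing to and reading from its own memory via tool calls:

```python
# Sketch of agentic self-editing memory: the model emits memory tool calls
# and the runtime dispatches them. Tool names are illustrative, not Letta's.

archival: list[str] = []


def archival_insert(text: str) -> str:
    archival.append(text)
    return "stored"


def archival_search(query: str) -> list[str]:
    return [m for m in archival if query.lower() in m.lower()]


TOOLS = {"archival_insert": archival_insert, "archival_search": archival_search}


def dispatch(tool_call: dict) -> object:
    # In a real agent every call here costs inference tokens, because the
    # model must reason about what to store before emitting the call.
    return TOOLS[tool_call["name"]](**tool_call["args"])


# Simulated model decisions: judge a fact worth keeping, then retrieve it.
dispatch({"name": "archival_insert", "args": {"text": "Customer is on the Pro plan"}})
print(dispatch({"name": "archival_search", "args": {"query": "pro plan"}}))
```

If the simulated model never emits the `archival_insert` call, the fact is simply never stored, which is the failure mode described above.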

As the survey paper "Memory in the Age of AI Agents" notes, both passive extraction and agentic self-editing are valid strategies — and both go well beyond traditional RAG architectures. The right choice depends on how much autonomy your agents need.


Retrieval Quality: Mem0 vs Letta Benchmarks

Retrieval is where the rubber meets the road for any memory system.

Mem0 uses semantic search as its primary retrieval mechanism. You embed a query, find the nearest vectors, and return the top matches. On the Pro tier, graph-based retrieval adds the ability to traverse entity relationships, which helps with multi-hop questions ("What projects has Alice worked on with Bob?"). On the free and standard tiers, semantic similarity is the only retrieval path.
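Stripped to its core, that retrieval path is embed, compare, return nearest. The toy 3-dimensional "embeddings" below are hard-coded for illustration; a real system gets them from an embedding model:

```python
# Semantic retrieval in miniature: compare a query vector to stored vectors
# by cosine similarity and return the nearest memory. Vectors are toy values.

import math


def cosine(a: list[float], b: list[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)


store = {
    "Alice leads the billing project": [0.9, 0.1, 0.0],
    "The deploy pipeline runs nightly": [0.1, 0.8, 0.3],
}
query_vec = [0.85, 0.15, 0.05]  # pretend embedding of "who owns billing?"

best = max(store, key=lambda text: cosine(store[text], query_vec))
print(best)
```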

Letta takes an agentic approach: the agent itself decides how to retrieve memories by calling tool functions against each memory tier. This is flexible — the agent can search recall memory for recent conversations, query archival for domain knowledge, or read core memory for persistent context — but retrieval quality depends on the agent making good tool-call decisions.

On benchmarks: An independent evaluation measured Mem0 at 49.0% on LongMemEval, which tests long-term memory retrieval across temporal, multi-hop, and knowledge-update scenarios. Letta has not published LongMemEval results. The absence of published benchmarks makes it hard to do a direct quantitative comparison, but Letta's agentic retrieval approach means results will vary significantly based on the underlying model and prompt engineering. Neither system currently employs multi-strategy retrieval with cross-encoder reranking, which recent research identifies as critical for robust performance across diverse query types.


Developer Experience

Mem0 optimizes for time-to-first-memory. Install the SDK, get an API key, call add() and search(). You can have working memory in an existing agent in minutes. Both Python and JavaScript SDKs are available, and the API surface is small enough to learn in an afternoon. The managed dashboard provides basic monitoring and memory inspection. Mem0 also has the largest community in the agent memory space (~48K GitHub stars), which means more examples, integrations, and Stack Overflow answers.

Letta has a steeper learning curve but more powerful tooling. The Agent Development Environment (ADE) is a visual interface for debugging agent behavior, inspecting memory state across all three tiers, and monitoring tool calls in real time. If you're building complex autonomous agents, the ADE is genuinely useful for understanding why an agent made a particular memory decision. The tradeoff: Letta is Python-only, setup requires configuring the runtime and ADE, and the concepts (core/recall/archival memory, agent-as-OS) take more time to internalize.

For teams that want to add memory to an existing agent quickly, Mem0's simplicity wins. For teams building agent-native applications from scratch who value visual debugging, Letta's integrated tooling is worth the setup cost.


Self-Hosting Agent Memory: Mem0 vs Letta

Both projects are open source under Apache 2.0, so you can self-host either without licensing concerns.

The lock-in story is different, though.

Mem0's lock-in is minimal. It's a memory service with a clean API boundary. If you outgrow it or want to switch, you swap out the memory calls. Your agent framework, orchestration, and tool logic are untouched. Mem0 doesn't own your agent — it just stores memories for it.
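That clean API boundary is worth making concrete. A hypothetical sketch: if your agent code depends on a small interface rather than a vendor SDK, switching providers means writing one adapter, not rewriting the agent. All names here are illustrative:

```python
# Why a memory-layer boundary limits lock-in: the agent depends only on an
# interface, so swapping providers touches one adapter class. Names are
# illustrative, not any vendor's API.

from typing import Protocol


class MemoryBackend(Protocol):
    def add(self, text: str) -> None: ...
    def search(self, query: str) -> list[str]: ...


class InMemoryBackend:
    """Trivial backend; a Mem0 (or other) adapter would match this shape."""

    def __init__(self) -> None:
        self._items: list[str] = []

    def add(self, text: str) -> None:
        self._items.append(text)

    def search(self, query: str) -> list[str]:
        return [m for m in self._items if query.lower() in m.lower()]


def agent_step(memory: MemoryBackend, user_msg: str) -> list[str]:
    # Agent logic never names a concrete provider.
    memory.add(user_msg)
    return memory.search("deadline")


backend = InMemoryBackend()  # swap this one line to change providers
print(agent_step(backend, "The deadline is Friday"))
```

A runtime like Letta has no equivalent seam: the loop, tools, and memory access all live inside the platform, which is exactly the architectural lock-in described above.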

Letta's lock-in is architectural. Because agents run inside the Letta runtime, switching away means rewriting not just your memory layer but your entire agent infrastructure — the loop, tool execution, state management, and memory access patterns. This isn't a criticism of Letta's design (tight integration is what makes it powerful), but it's a real consideration. Adopting Letta is a bigger commitment than adopting Mem0, and unwinding it is proportionally harder.

For more on alternatives to each, see our guides on Mem0 alternatives and Letta alternatives.


Agent Memory Pricing: Mem0 vs Letta

Mem0 offers three managed tiers:

  • Free — 10K memories, basic features
  • Standard ($19/month) — higher limits, priority support
  • Pro ($249/month) — graph memory, advanced retrieval, SOC 2/HIPAA compliance

Self-hosted Mem0 is free with no feature gating on the open-source version, though the graph implementation differs from the managed Pro tier.

Letta offers managed cloud at $20–200/month depending on usage, with the ADE included. Self-hosted Letta is free and includes all features.

Both projects are viable at zero cost if you self-host. The managed pricing comparison depends heavily on your scale and which Mem0 tier you need — if you need graph features, Mem0 Pro at $249/month is significantly more expensive than Letta's managed offering.


When to Choose Mem0

Mem0 is the right choice when you want a memory layer, not a runtime. Specifically:

  • You have an existing agent framework and just need to add persistent memory
  • You need multi-language SDK support (Python + JavaScript)
  • Your use case is personalization — user preferences, conversation history, behavioral patterns
  • You want the largest community and most third-party integrations in the agent memory space
  • You need SOC 2 or HIPAA compliance on managed infrastructure
  • You want the simplest possible API with the fastest time to integration

When to Choose Letta

Letta is the right choice when you want an integrated agent platform with memory built in. Specifically:

  • You're building agents from scratch and want runtime + memory + tooling in one package
  • You value self-editing memory where the agent curates what it remembers
  • Your agents are highly autonomous and need to manage their own context intelligently
  • You want visual debugging via the ADE to inspect and tune agent memory behavior
  • You're comfortable with Python-only development
  • You prefer a tighter integration between agent logic and memory management over modularity

Worth Considering: Hindsight

If you're comparing Mem0 and Letta, it's worth also evaluating Hindsight. Hindsight shares Mem0's "library not platform" philosophy — it's a standalone memory layer you plug into any agent framework — but it addresses the retrieval depth limitations that both Mem0 and Letta face.

Where Mem0 relies on semantic search (plus graph on Pro) and Letta delegates retrieval to agent tool calls, Hindsight runs four retrieval strategies in parallel — semantic, keyword (BM25), knowledge graph traversal, and temporal reasoning — with cross-encoder reranking. This multi-strategy approach is why it scores 91.4% on LongMemEval, compared to Mem0's 49.0%. SDKs are available in Python, TypeScript, and Go, and it's MCP-native for drop-in compatibility with any MCP client. It ships as a single Docker container backed by embedded PostgreSQL, with an MIT license.
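To make "run strategies in parallel, then merge" concrete: one common way to fuse ranked lists is reciprocal rank fusion (RRF). This sketch is generic, not Hindsight's implementation, and it uses RRF as a simple stand-in where a production system might apply a learned cross-encoder reranker; the per-strategy rankings are hard-coded:

```python
# Reciprocal rank fusion (RRF): merge ranked lists from parallel retrieval
# strategies into one ranking. A generic sketch, not any product's internals.


def rrf(rankings: list[list[str]], k: int = 60) -> list[str]:
    scores: dict[str, float] = {}
    for ranking in rankings:
        for rank, doc in enumerate(ranking):
            # Documents ranked highly by several strategies accumulate score.
            scores[doc] = scores.get(doc, 0.0) + 1.0 / (k + rank + 1)
    return sorted(scores, key=scores.get, reverse=True)


semantic = ["doc_a", "doc_b", "doc_c"]   # vector similarity order
keyword = ["doc_b", "doc_d"]             # BM25-style order
graph = ["doc_b", "doc_a"]               # graph-traversal order

print(rrf([semantic, keyword, graph]))
```

`doc_b` wins because all three strategies surface it, even though no single strategy needed to rank it perfectly; that robustness across query types is the argument for multi-strategy retrieval.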

For detailed comparisons, see Hindsight vs Mem0 and Hindsight vs Letta.


Verdict: Mem0 vs Letta for Agent Memory

Mem0 and Letta are both solid projects, but they're solving different problems. Mem0 gives you a clean, pluggable memory API that fits into any stack — ideal for teams that want persistent agent memory without changing their architecture. Letta gives you a complete agent runtime with sophisticated, self-editing memory built in — ideal for teams building agent-first applications from the ground up.

If you need a memory layer: start with Mem0. If you need an agent platform: evaluate Letta. And if retrieval accuracy on diverse query types is your top priority, take a look at Hindsight before you commit.

