Most agent memory systems are just search. Hindsight gives your agents real memory: the ability to retain what they learn, recall it when relevant, and reflect on experience to form new understanding. Not within a single session, but across weeks and months.

Most “agent memory” systems chop conversations into embedding vectors and push them into a database. The information is in there somewhere, as a floating point array, with no concept of when it happened or what it means.
RAG (Retrieval-Augmented Generation)
Agent Memory (Hindsight)
Storing things is trivial. The hard part is structuring experience into knowledge the agent can use to behave differently. Hindsight organizes memory around three core operations.
from hindsight_client import Hindsight
client = Hindsight(base_url="http://localhost:8888")
# Retain: store a memory
client.retain(bank_id="user-123", content="Alice prefers email and works at Google.")
# Recall: retrieve relevant memories
memories = client.recall(bank_id="user-123", query="How should I contact Alice?")
# Reflect: reason about what the agent knows
insight = client.reflect(bank_id="user-123", query="What do I know about Alice?")A single retrieval approach isn't enough. Vectors can't answer “what happened last Tuesday?” and keyword search misses paraphrases. Hindsight runs four strategies in parallel, fuses the results, and reranks against a token budget.
Conceptual similarity, paraphrasing
Names, exact terms, identifiers
Related entities, indirect connections
"Last Tuesday," "Q3," time-range queries
The agent gets the right context for the current task, fit to a predictable token budget. No more dumping everything vaguely related into the context window.
LongMemEval benchmark, independently verified by Virginia Tech and The Washington Post. Read the paper →
Lumping all memories together means the agent can't distinguish facts from synthesized knowledge. Hindsight uses a hierarchy: mental models first, then observations, then raw facts, so the agent reasons from the most refined knowledge available.
When someone asks “why did you suggest that?” the agent traces through specific world facts and experiences, points to the observations that informed its thinking, and references the mental model it consulted. That's a fundamentally different answer from “here are the five most similar text chunks I found.”
Hindsight is used in production at enterprises and growing startups. These are the capabilities that matter when you move past prototypes.
Hindsight is open source and runs in under a minute. Start building agents with real memory today.