//Chapter structure
No headings found on page
Part 07

Memory in Agents

Over the past few parts, we’ve explored what makes agents act — from tools and RAG, to MCP and reasoning models.

Today, we shift gears to something that determines how well they act over time: memory.

Because here’s the baseline:
AI models don’t have memory inherently. They’re stateless by design. Every input is treated independently unless you architect memory into the system.

Why Memory Matters

image

Image Source: https://arxiv.org/html/2502.12110v1

If an agent is helping you draft emails, summarize long threads, or manage workflows over days or weeks — it needs to remember:

  • The email format

  • The user's name

  • The tone to use

Sure, you could pass that information again and again with every prompt…
But wouldn’t it be better if the agent could retrieve the right information on its own, at the right time, from an external database?

That’s exactly where memory comes in.

“Wait… isn’t this just like Agentic RAG (Day 4)?”

Fair question — and you’re not wrong. Managing memory often looks a lot like doing Agentic RAG.

You:

  1. Write structured or unstructured memories (facts, logs, past outputs)

  2. Store them with metadata, tags, or embeddings

  3. Retrieve the relevant slice when needed

  4. Ground the model’s next action using that context

The difference:

  • RAG → Helps answer questions with knowledge.

  • Memory → Helps agents behave coherently over time.

Two Types of Memory in Agents

When designing real-world agent systems, you typically deal with two kinds of memory.

image

Image Source: https://langchain-ai.github.io/langgraph/concepts/memory/#what-is-memory

1. Short-Term Memory

Scoped to a single session or task.

Includes:

  • The conversation so far

  • Tools used

  • Responses generated

  • Documents retrieved

Think of it as a raw log of user–agent conversations.

LangGraph, Autogen, and similar frameworks treat this as part of the agent’s state.
But state grows fast, and most agents perform poorly when buried under irrelevant history.

Strategies to manage short-term memory:

  • Trim stale messages

  • Summarize the past into key points

  • Filter based on what’s still relevant

It’s a balancing act: context length vs clarity vs cost.

2. Long-Term Memory

Lives across sessions, days, weeks — even forever.

Helps agents remember:

  • Who the user is

  • How they prefer to interact

  • What’s already been done

  • Important past context

Examples:

  • “User prefers neutral tone”

  • “User name is X and stays in city Y”

  • “Invoice #123 has already been escalated”

More data ≠ better by default — it’s about retrieving the right thing at the right time.

Types of Long-Term Memory to Consider

Borrowing from cognitive science:

  • Semantic Memory → Facts and info (objective)
    “User speaks English and prefers Excel files.”

  • Episodic Memory → Past actions
    “Agent already generated a summary yesterday.”

  • Procedural Memory → Preferences (subjective)
    “Avoid passive voice. Prioritize action items.”

Examples by use case:

  • User-facing chatbots → Semantic memory for personalization

  • Process automation agents → Episodic memory to avoid retries or loops

  • Adaptive assistants → Procedural memory to adjust prompts based on feedback

Key Design Questions

Before saying “we need memory,” ask:

  • What kind?

  • Why is it needed?

  • How will it be stored, retrieved, and kept fresh?

Managing Memory in Practice

Managing memory often feels like managing RAG.
The hard part? Deciding what to store and what to retrieve.

Stuffing more text into the agent input rarely helps — it often hurts performance.

You need to design memory intentionally, based on:

  • The agent’s job

  • What it needs to recall

  • When it should recall it

  • How to keep it useful over time

A Few Enterprise Examples

Customer Support Agent

  • Needs: recent support history, known bugs, user sentiment

  • Memory types: episodic + semantic

Sales Copilot

  • Needs: previous pitches, user objections, close status

  • Memory types: semantic + procedural

Compliance Auditor Agent

  • Needs: flagged items, prior exceptions, policy changes

  • Memory types: episodic

In all cases, it’s not about how much data you store — it’s about how relevant and structured it is.

And yes, I’ve said this painfully many times, but I’ll say it again:

Problem-first, always. The memory strategy, like tools or planning, depends entirely on the problem you’re solving.

Up Next

In the next part, we’ll talk about multi-agent systems — what they are, how they coordinate, and whether you actually need more than one agent at all.

© 2026 LevelUp Labs®. All rights reserved.

© 2026 LevelUp Labs®. All rights reserved.

Created by