3 Sessions, Context, and Memory

Managing State in Claude Code

Author

AI-Powered SE Tutorial

Published

June 21, 2026

Abstract

A Claude Code session is more than a chat — it’s a stateful execution environment bound to a working directory, accumulating prompts, tool calls, edits, and checkpoints. The context window is a finite array that must be managed like memory: what you load in determines what the agent can reason about. This chapter covers session lifecycle, context window mechanics, and the three layers of persistent memory — connecting directly to the context engineering principles from the Ralph framework.

3.1 Sessions

A Claude Code session is bound to a working directory. When you run claude from ~/projects/my-app, the session’s scope is that project. The agent reads files relative to that directory, runs commands in that shell context, and stores its state there.

Each session accumulates:

Prompts — your messages and the agent’s responses
Tool calls — every file read, edit, shell command, and search
Edits — the actual changes made to files
Checkpoints — snapshots the session can return to

3.1.1 Session lifecycle

flowchart TD
    S["claude (start)"] --> A["New Session"]
    A --> W["Work — prompts, tools, edits"]
    W --> E["End / Exit"]
    E --> C["--continue (latest)"]
    E --> R["--resume (pick from list)"]
    E --> F["--fork-session (branch)"]
    C --> W
    R --> W
    F --> W
    style S fill:#dbeafe,stroke:#1e40af,color:#1e40af
    style A fill:#dcfce7,stroke:#166534,color:#166534
    style W fill:#fef3c7,stroke:#92400e,color:#92400e
    style C fill:#f3e8ff,stroke:#6b21a8,color:#6b21a8
    style R fill:#f3e8ff,stroke:#6b21a8,color:#6b21a8
    style F fill:#f3e8ff,stroke:#6b21a8,color:#6b21a8

Session lifecycle — sessions are bound to a directory and accumulate state

Command	Behavior
`claude`	Start a fresh session — new context array
`claude --continue`	Resume the most recent session with full context
`claude --resume`	Pick from a list of past sessions to resume
`claude --fork-session`	Branch from a session to try an alternative approach without losing the original

When to continue vs. start fresh: Continue when you’re working on the same task and need accumulated context (files read, errors seen, decisions made). Start fresh when switching to a different task — this is the chat hygiene principle from the Ralph framework. Dragging one task’s context into another task contaminates the model’s reasoning.

3.2 The Context Window as Memory

The context window is not magic — it’s a finite array. Everything the agent knows during a session lives in this array:

Your prompts write data into it
Tool results (file contents, command output) append more data
The agent’s own responses consume space
There is no persistent server-side memory happening automatically

This means the context window is a managed resource, like RAM. You can fill it with useful information or waste it on irrelevant data.

\[C_{\text{usable}} = C_{\text{total}} - O_{\text{model}} - O_{\text{harness}} - O_{\text{tools}}\]

Where:

\(C_{\text{total}}\) is the model’s total context window size
\(O_{\text{model}}\) is overhead consumed by the model’s own system instructions
\(O_{\text{harness}}\) is overhead from Claude Code’s internal orchestration
\(O_{\text{tools}}\) is space consumed by tool definitions and schemas

The usable context is always smaller than the advertised window size. As the session accumulates history, earlier content may be compacted (lossy compression) to make room for new tokens.

3.2.1 Context rot and the dumb zone

Two failure modes emerge from context management:

Context rot — quality degrades as the context grows stale, noisy, or overly full. Early instructions slide further from the model’s attention. Irrelevant tool outputs dilute the signal. The agent’s reasoning quality drops even though it’s the same model.

The dumb zone — a region of high context utilization where model performance noticeably worsens. The model isn’t broken — it’s overwhelmed. Too much information competing for attention produces worse results than less information with better focus.

Compaction — when the context fills up, the system performs a lossy compression event that drops earlier context to make room for new tokens. This can remove critical details like the original specification or early design decisions.

flowchart LR
    A["Low Utilization — Good Focus"] --> B["Medium — Productive"]
    B --> C["High — Dumb Zone"]
    C --> D["Full — Compaction Event"]
    D --> E["Post-Compaction — Lost Context"]
    style A fill:#dcfce7,stroke:#166534,color:#166534
    style B fill:#dcfce7,stroke:#166534,color:#166534
    style C fill:#fef3c7,stroke:#92400e,color:#92400e
    style D fill:#fee2e2,stroke:#b91c1c,color:#b91c1c
    style E fill:#fee2e2,stroke:#b91c1c,color:#b91c1c

Context quality vs. utilization — performance degrades as context fills

3.2.2 Practical context management

Keep sessions focused on one task — loading too many tasks dilutes model focus, important specs slide out of view, priorities blur, and error rates rise
Start fresh for new tasks — treat a new chat as a new array; reusing chats across tasks drags prior assumptions into new objectives
Front-load critical context — put the most important information (spec, constraints, patterns to follow) at the beginning of the session where it gets the most attention
Prune verbose tool output — if a command produces 500 lines of output but only 10 matter, tell the agent to focus on the relevant section

3.3 Three Layers of Memory

Claude Code provides three mechanisms for persisting information beyond a single session:

Layer	Scope	Persistence	Content
CLAUDE.md	Project	Permanent (file in repo)	Coding standards, conventions, project-specific rules
Memory	User + Project	Persistent (auto-managed)	User preferences, past decisions, learned behaviors
Session history	Session	Until session ends or is resumed	Accumulated prompts, tool calls, and results

3.3.1 CLAUDE.md — the always-on layer

CLAUDE.md files are loaded at the start of every session. They contain stable, project-specific instructions:

# CLAUDE.md

## Build & Test
- Run tests: `npm test`
- Lint: `npm run lint`
- Build: `npm run build`

## Conventions
- Use TypeScript strict mode
- Prefer `const` over `let`
- Components in PascalCase, utilities in camelCase
- Tests co-located with source files (*.test.ts)

## Architecture
- API routes in src/api/
- Business logic in src/services/
- Database queries in src/db/

CLAUDE.md is to Claude Code what .editorconfig is to text editors — it ensures consistent behavior regardless of who (or what) is working in the codebase. Keep it short, concrete, and non-conflicting. It is NOT the place for large tutorials or API documentation.

3.3.2 `.claude/rules/` — scoped topic rules

For rules that only apply to specific areas, use .claude/rules/ with topic files:

.claude/rules/
  testing.md      # testing conventions and commands
  security.md     # security requirements and constraints
  frontend.md     # UI/component patterns

Rules files support path-scoping — a rule in security.md can apply only to files under src/auth/. This reduces global context noise: instead of loading all rules into every session, only relevant rules are loaded when the agent works in a matching area.

3.3.3 Auto Memory — the learned layer

Claude Code’s auto memory system stores patterns learned from working with you. Unlike CLAUDE.md (which you write explicitly), memory accumulates from interactions: “the user prefers functional components over class components,” “this project uses pnpm not npm.”

The first 200 lines of memory are loaded per session. Memory provides context, not hard policy — it informs the agent’s defaults but can be overridden by explicit instructions.

3.3.4 Session history — the working layer

Within a session, the full history of interactions serves as working memory. This is why --continue is powerful for multi-step tasks — the agent remembers what it tried, what failed, and why. Use /context to inspect current context usage and see how much window remains.

3.4 Key Takeaways

Sessions are bound to a working directory and accumulate prompts, tool calls, edits, and checkpoints
The context window is a finite array — manage it like RAM, not like infinite storage
Context rot and the dumb zone degrade performance as context fills; start fresh sessions for new tasks
\(C_{\text{usable}} = C_{\text{total}} - O_{\text{model}} - O_{\text{harness}} - O_{\text{tools}}\) — your usable context is always smaller than the advertised window
Three memory layers: CLAUDE.md (always-on rules), Memory (learned preferences), Session history (working context)
--continue resumes context; --fork-session branches it — use these deliberately, not as defaults

3.5 Concept Map

flowchart TD
    A["Session"] --> B["Context Window"]
    B --> C["Finite Array"]
    C --> D["Context Rot"]
    C --> E["Dumb Zone"]
    C --> F["Compaction"]
    A --> G["Three Memory Layers"]
    G --> H["CLAUDE.md — Permanent"]
    G --> I["Memory — Learned"]
    G --> J["Session History — Working"]
    B --> K["Chat Hygiene"]
    K --> L["One Task Per Session"]
    style A fill:#dbeafe,stroke:#1e40af,color:#1e40af
    style B fill:#dcfce7,stroke:#166534,color:#166534
    style C fill:#fef3c7,stroke:#92400e,color:#92400e
    style D fill:#fee2e2,stroke:#b91c1c,color:#b91c1c
    style E fill:#fee2e2,stroke:#b91c1c,color:#b91c1c
    style G fill:#f3e8ff,stroke:#6b21a8,color:#6b21a8
    style K fill:#fce7f3,stroke:#9d174d,color:#9d174d

Sessions and memory concepts