Session Memory

Session memory preserves context across conversations by compacting session history and storing it for future retrieval. This helps maintain continuity when you hit context limits or need to clear your session.

Why Not Just Use `/compact`?

Claude Code’s /compact has a fundamental design flaw: it needs context space to run, but you only think to use it when context is almost full.

The timing problem:

/compact runs inside the current context window
When your context is nearly exhausted, there’s no room left to run the compaction
You try /compact, it fails, and now you’re stuck with a full window and no recovery

This isn’t a bug - it’s an inherent limitation of in-context compaction. You have to remember to run it before you need it, which defeats the purpose.

The alternative is worse: /clear then manually copy-paste chunks of your chat history back in, hoping you grabbed the right parts, fighting token limits, losing formatting. It works, but it’s tedious and error-prone.

ctxloom automates this properly:

Works after exhaustion - /clear then /recover operates outside the full window
External processing - ctxloom reads the raw session transcript from disk (JSONL files)
Separate LLM - A dedicated model (configurable, default: Haiku) distills the content
Controlled compression - Extractive strategy preserves decisions, code, and next steps
Persistent storage - Distilled summaries are saved to .ctxloom/memory/
Reliable recovery - Sessions are read from disk at request time, so the previous session is always findable

The workflow is simple: when you hit context limits, /clear and /recover. No timing anxiety.

Overview

When working on long sessions, you’ll eventually approach context window limits. Session memory lets you:

Clear the context window when you hit limits
Recover context from the previous session after /clear
Browse session history to find and load specific sessions

For durable, cross-session work items (distinct from the agent’s ephemeral to-dos), see Sessions and Tasks.

Usage

Session memory is always enabled - no configuration required.

The `/recover` Command

When you hit context limits and need to clear:

/clear
/recover

The /recover skill:

Reads the previous session for this project from disk (read-time — no process tracking)
Distills the raw JSONL transcript using a separate LLM (default: Haiku)
Returns the essence so you can continue working

Behind the skill, two tools do the work: recover_session recovers the most-recent session, and get_previous_session returns the previous session for the current project.

Alternative Recovery

You can also recover naturally:

What were we working on before the clear?

The AI will use get_previous_session to find and distill your previous session.

Browsing Session History

To see recent sessions with short summaries, read the ctxloom://sessions/recent resource — or just ask:

Show me recent sessions

Then load a specific one to continue it:

Load the distilled session from this morning

How It Works

Session Tracking

ctxloom records each session on disk under the project, in a harp-named session directory. Recovery is read-time: when you ask to recover, ctxloom reads the previous (or most-recent) session for the current project straight from disk. No live process or PID tracking is involved, so recovery works even after the AI process has fully restarted across /clear — and without manually specifying session IDs.

Compaction

Session compaction happens outside your session using a separate LLM call. This is critical - the compaction LLM has access to the full raw transcript, not a degraded context window.

The compaction process:

Read transcript - ctxloom reads the raw JSONL session log from disk
Chunk - Large sessions are split (default: 8000 tokens per chunk)
Distill - A fast model (default: Haiku) extracts key information:
- Decisions made and why
- Context established
- Progress achieved
- Next steps planned
Store - Result saved to .ctxloom/memory/distilled/

The distilled output is typically 10-20% of the original size while preserving actionable information.

Storage

Memory is stored in:

.ctxloom/memory/
└── distilled/           # Compacted session summaries
    └── session-id.md

Cross-Agent Workflows

Distilled memory is portable across agents - it’s stored as plain markdown. Raw session history is currently backend-specific (Claude and Antigravity use different transcript formats).

# Morning: Write code with Claude
ctxloom run --llm claude-code "implement the auth module"
# When done, compact the session (or let it auto-compact on context limit)

# Afternoon: Review with Antigravity
ctxloom run --llm antigravity
"Load the distilled session from this morning"
# Antigravity loads the markdown summary, continues the work

Use cases:

Development → Review - Write with one model, review with another
Fast → Thorough - Draft with Haiku, refine with Opus
Specialist models - Use different models for different task types

The distilled markdown captures decisions, progress, and next steps - everything the next agent needs to continue the work.

MCP Tools

Session memory provides these MCP tools:

Tool	Description
`compact_session`	Compact current or specified session
`list_sessions`	List available sessions with compaction status
`load_session`	Distill and load a specific session by ID
`recover_session`	Recover the most-recent session’s context after `/clear`
`get_previous_session`	Get the previous session for this project (read-time)

Browsing recent sessions is a resource, not a tool — read ctxloom://sessions/recent.

Example: Manual Compaction

Session's getting long, compact it

Example: Load Specific Session

Use load_session to load session abc123def456

Example: Recover Previous Session

Use get_previous_session to recover what we were working on

Example: Browse History

Recent sessions are exposed as the ctxloom://sessions/recent resource:

Show me recent sessions

Advanced Configuration

Compaction settings live under llm.compaction:

llm:
  compaction:
    llm: claude-code   # LLM used for distillation
    model: haiku       # Model to use (fast + cheap)
    chunks: 8000       # Tokens per chunk

CLI Commands

Manual Compaction

ctxloom memory compact

Compacts the current session directly from the command line.

List Sessions

ctxloom memory list

Shows all sessions with their compaction status.

Best Practices

Just /clear when needed - Don’t overthink it; ctxloom tracks your session automatically
Use /recover after clearing - Distillation happens on-demand, no pre-saving required
Browse older sessions - Read ctxloom://sessions/recent (or ask “show me recent sessions”) when you need context from days ago
Review recovered content - Check that important details were captured

Troubleshooting

Recovery Shows “No Previous Session”

If recovery can’t find the previous session:

Ensure you started the session with ctxloom run (not raw claude), so the session was recorded
Browse ctxloom://sessions/recent (or ask “show me recent sessions”) to find and load a specific one

Compaction Fails

If compaction fails:

Check that the LLM is configured correctly
Ensure you have API access for the compaction model
Try with a smaller chunk_size if sessions are very large

Memory Not Loading on Start

In eager mode, if memory isn’t loading:

Check memory.load_on_start isn’t set to false
Verify a distilled session exists in .ctxloom/memory/distilled/
Check for errors in session start hooks