Memory System

Overview

Crewship has a 3-tier memory hierarchy that gives agents persistent knowledge across sessions and shared context within crews. Each agent has its own private memory the LLM can read and write, every crew has a lead-owned shared memory all members read, and an ephemeral session tier lives only in the live context window. The three tiers compose into a single retrieval call at agent-run time — the orchestrator picks chunks from whichever tier fits within the per-prompt budget, ranks them by relevance, and stitches them into the system prompt before the LLM ever sees the user’s message. Memory is local-first with zero external dependencies. No embedding APIs, no vector databases, no cloud services. Retrieval is powered by SQLite FTS5 (BM25 ranking) running sub-millisecond on thousands of chunks, with the hybrid retrieval uplift from PRs #211/#212 layering dense scoring on top via RRF (reciprocal rank fusion) when an in-process embedding model is configured. The whole memory subsystem fits in internal/memory/ plus the consolidation worker in internal/consolidate/, both running inside crewshipd — no extra container, no extra port. The point of three tiers (instead of one big pile) is to keep the who-owns-what model clear. An agent can freely write to its own tier without affecting peers. The crew lead is the single writer for crew-shared memory, which means the FTS5 index has zero write contention and crew-wide context evolves under one editorial voice. And session memory is intentionally not persisted — context that’s irrelevant the next time the agent is invoked stays out of the index, keeping retrieval fast as the deployment ages.

When to use it

Turn memory on for any agent or crew whose value compounds across sessions. The system is opt-in (memory_enabled=false by default) precisely because the budget cost in the system prompt is real — only pay it where it earns its keep:

An agent has long-running user-specific preferences. Dark-mode preference, preferred PR style, favourite framework, “use PostgreSQL 16 for new DBs” — anything the user shouldn’t have to repeat on every new chat. Persist to AGENT.md once, retrieve forever.
A crew needs a single source of truth for conventions and decisions. Architecture decisions, deployment rules, security policies, “we always use Tailwind, never inline styles” — the lead curates CREW.md and every member-agent reads it. Beats stuffing the same instructions into each agent’s system prompt by hand.
An agent is accumulating project context. A code-review agent reading the same repo every day shouldn’t relearn the module layout each session. Daily logs (daily/2026-05-14.md) capture session observations; FTS5 search surfaces them when the same module comes up again.
A single-agent crew that should specialise over time. Without crew peers the budget is 100% agent-tier — the agent essentially gets a persistent notebook. Useful for “my personal research assistant” or “the on-call bot that remembers last week’s incidents.”
You’re domain-mining knowledge with topics/. When the crew has bounded subject areas (deployment, security, billing), populate topics/<name>.md files. Hybrid search ranks them above the daily noise on relevant queries.

Skip memory if the agent is single-use, the conversation never references prior sessions, or the user actively wants a stateless interaction. The budget cost of an enabled-but-unused memory layer is non-zero — it eats characters that could carry richer per-turn context instead.

Key concepts

Term	What it means here
Memory tier	One of `agent`, `crew`, `session`. Each is a separate FTS5 index and a separate slice of the system-prompt budget. `agent` is per-agent private, `crew` is lead-owned shared, `session` is the live LLM context window (not persisted).
`AGENT.md`	The canonical long-term knowledge file for one agent. Lives at `/crew/agents/{slug}/.memory/AGENT.md` inside the container, `~/.memory/AGENT.md` from the agent’s POV. The agent is instructed to append rather than overwrite, and to only remove entries that are explicitly outdated.
`CREW.md`	The crew equivalent of `AGENT.md`. Lives at `/crew/shared/.memory/CREW.md`. Only the lead writes; every agent in the crew reads. Curated, single-voice — not a free-for-all.
`daily/`	Per-day session notes (`daily/2026-05-14.md`). Captures what was done, decisions, observations. Both `agent` and `crew` tiers have a `daily/` folder; the lead writes the crew one. Searchable via FTS5; consolidated into the evergreen `*.md` over time by the Consolidation worker.
`topics/`	Crew-only. Domain-specific knowledge files (`topics/deployment.md`, `topics/security.md`) that accrete bounded subject expertise. Hybrid retrieval ranks them above daily noise on on-topic queries.
FTS5 index	SQLite’s full-text-search-5 extension. Each tier has its own `index.sqlite` file managed by the sidecar. BM25 ranks results — relevance scoring used by virtually every modern search system since Elasticsearch.
Hybrid retrieval (RRF)	When an in-process embedding model is configured, dense vector scoring runs alongside BM25 and the two ranked lists are fused via reciprocal rank fusion. From PR #212; sub-millisecond on tens of thousands of chunks because nothing leaves the process.
Budget	The character allowance for memory content in the system prompt. Default 15,000 chars. The crew tier is capped at 40% of total; the agent tier reclaims the remainder. Empty tiers reclaim their entire share.
System-prompt injection	The orchestrator’s `BuildMemoryBlock` reads the top-K chunks at session start and prepends them between `[AGENT MEMORY]` / `[END AGENT MEMORY]` and `[CREW SHARED MEMORY]` / `[END CREW SHARED MEMORY]` markers. The LLM sees memory as part of its system prompt; the agent never has to ask for it.
Sidecar memory engine	Each agent container’s sidecar process (`localhost:9119`) owns the agent’s FTS5 index and proxies search to the lead’s sidecar for crew-tier queries. The lead sidecar reindexes the crew tier every 60 seconds.
`memory_enabled`	Per-agent flag. `false` by default — memory tiers are not read, written, or indexed for that agent. Flip via `crewship agent update --memory`.
Consolidation	The nightly worker in `internal/consolidate/` that promotes recurring observations from `daily/*.md` into `AGENT.md` / `CREW.md` and archives the original log. Keeps the evergreen tier from drifting stale and the daily tier from growing unboundedly.

Usage

The end-to-end loop from “memory off” to “agent recalls last week’s decision” is five steps. Each step links down to the deeper detail section below.

Enable memory on the agent

Memory is memory_enabled=false by default. Flip it on at create or update time:

crewship agent create --name "Researcher" --memory
# or for an existing agent:
crewship agent update researcher --memory

The next time the agent runs in a session, its sidecar provisions ~/.memory/AGENT.md (and the crew tier if the agent belongs to one) and starts the FTS5 index.

Let the agent write to its own memory

No special API needed — the agent writes by running shell or file-tool commands inside its container. The two common patterns:

# Append a long-term fact to AGENT.md
echo "## Preference\nUser likes terse PR descriptions." >> ~/.memory/AGENT.md

# Drop a session-specific note in today's daily log
echo "- Investigated auth/middleware refactor; chose option B." \
  >> ~/.memory/daily/$(date -I).md

The crew lead writes to /crew/shared/.memory/CREW.md and /crew/shared/.memory/topics/<name>.md the same way — see File Structure for the full layout.

Subsequent sessions retrieve memory automatically

When the orchestrator builds the system prompt for the agent’s next session, BuildMemoryBlock selects the top-K chunks within budget and injects them between [AGENT MEMORY] / [CREW SHARED MEMORY] markers. The agent doesn’t have to ask — it reads the markers as part of normal context. See Reading Memory for the exact prompt shape.

Search explicitly when you need to

For mid-session lookups outside the initial prompt budget, the agent calls its sidecar’s search endpoint:

curl -sX POST localhost:9119/memory/search \
  -d '{"query": "PostgreSQL migration strategy", "scope": "both", "limit": 5}'

scope is one of agent, crew, both. With both, BM25 (and dense scores, when an embedding model is configured) are RRF-merged across the two engines and each result carries its origin.

Let consolidation tend the index

The Consolidation worker runs nightly: recurring observations from daily/*.md get promoted into the evergreen AGENT.md / CREW.md, and the daily logs get archived. You don’t run it by hand for normal operation — the workspace operator can trigger one off-cycle via crewship consolidate run --crew=<crew> (optionally scoped with --since) if a tier is visibly drifting.For development without a running server, the crewship memory CLI lets you reindex and search on-disk memory directly — useful for poking at a backup or a frozen container.

Examples

A research-assistant agent that remembers user preferences

You spin up researcher, ask it five PR reviews. Over the course of a week the agent picks up that you write tight commit messages and prefer the changelog updated in the same PR rather than later. On day one it appends:

echo "## User preferences\n- Prefers terse commit subjects (≤70 chars)\n- Wants CHANGELOG entries in same PR, not follow-up" \
  >> ~/.memory/AGENT.md

On day eight, you ask for a review of an unrelated PR. The orchestrator’s BuildMemoryBlock injects AGENT.md into the system prompt before the LLM ever sees your message, so the agent’s first review comment already says “I noticed the CHANGELOG entry is missing from this PR” without having to be told.

A crew lead recording an architectural decision once

Your backend crew decides Postgres 16 is the new baseline. The lead writes it once:

echo "## Decision (2026-05-14): PostgreSQL 16 for all new DBs\n- driver: pgx v5\n- migrations: internal/database/migrate.go\n- ADR-007 in repo for full context" \
  >> /crew/shared/.memory/CREW.md

Every other agent in the crew (migrations, api, tests) reads that line on their next session via the crew-tier system-prompt block — the lead doesn’t have to update three system prompts, and a new agent joining the crew picks up the same context for free.

Bounded subject expertise via `topics/`

A support crew handles deployment, billing, and security questions. Rather than letting CREW.md grow into a sprawling FAQ, the lead carves it up:

/crew/shared/.memory/topics/deployment.md   # 4kB on Kubernetes + Helm conventions
/crew/shared/.memory/topics/billing.md      # 2kB on Stripe webhook handling
/crew/shared/.memory/topics/security.md     # 6kB on Keeper credential policies

When a question lands that mentions “deployment”, hybrid retrieval ranks topics/deployment.md chunks at the top — the agent gets the right slice and not all 12kB at once. CREW.md stays small and editorial; the long-tail expertise lives in dedicated topic files that grow without bloating the budget.

Memory Tiers

CREW SHARED MEMORY (Lead-owned)
  Container: /crew/shared/.memory/
  Who writes: Lead               Who reads: all agents in crew
  Budget: max 40% of system prompt
  FTS5: Lead sidecar owns index (single writer, zero contention)

AGENT MEMORY (per-agent, private)
  Container: /crew/agents/{slug}/.memory/
  Who writes: the agent itself   Who reads: only that agent
  Budget: remainder (60-100%)

SESSION MEMORY (ephemeral)
  Lives in the LLM context window, not persisted

Budget is dynamic. If crew memory is empty, the agent reclaims the full budget for the agent tier.

A workspace-scope memory tier (org-level strategy, cross-crew knowledge) is on the v0.2 roadmap.

Enabling Memory

Set memory_enabled to true when creating or updating an agent:

crewship agent create --name "My Agent" --memory

Memory is disabled by default. Each agent has its own isolated memory store — agents cannot read each other’s personal memory.

File Structure

Agent Memory

Lives in ~/.memory/ inside the agent’s container, mapped to /crew/agents/{slug}/.memory/ on the filesystem.

~/.memory/
  AGENT.md              # Long-term curated facts (evergreen)
  daily/
    2026-04-14.md       # Today's session notes
    2026-04-13.md       # Yesterday's notes
  index.sqlite          # FTS5 search index (sidecar manages)

AGENT.md is the agent’s primary knowledge base — identity, learned facts, user preferences, project context. The agent is instructed to add new information and only remove entries that are explicitly outdated. Daily logs capture session-specific notes: what was done, decisions made, observations.

Crew Shared Memory

Lives at /crew/shared/.memory/ inside every agent container in the crew. The Lead agent writes crew-wide knowledge; all agents can read it.

/crew/shared/.memory/
  CREW.md               # Crew conventions, decisions, policies
  daily/
    2026-04-14.md       # Crew daily log
  topics/
    deployment.md       # Domain-specific knowledge
    security.md
  index.sqlite          # FTS5 index (Lead sidecar owns)

CREW.md contains crew-level decisions, conventions, and shared context that all agents need. The Lead maintains this file. topics/ holds domain-specific knowledge files that grow over time as the crew learns.

The Lead’s sidecar owns the crew FTS5 index and reindexes it every 60 seconds automatically. Non-lead agents search crew memory via the sidecar HTTP API.

How It Works at Runtime

1. Writing Memory

Agents write memory by executing file operations inside their container — shell commands, file tools, or editors. No special API needed.

# Agent writes to its personal memory
echo "## New Fact\nUser prefers dark mode" >> ~/.memory/AGENT.md

# Lead writes to crew shared memory
echo "## Decision\nUse PostgreSQL 16 for all DBs" >> /crew/shared/.memory/CREW.md

The preferred write surface is the memory.write tool the sidecar exposes over MCP. It is durable and fail-closed:

A successful memory.write is fsync’d and atomically renamed into place before it returns — a write the tool reports as done is on stable storage, never merely in the page cache.
If the persist cannot complete, the tool returns an isError result (the model sees the failure and reports it) instead of a false “done”. Success therefore means durability.
The tool is health-gated: memory.write / memory.read are only advertised to the model when the memory sidecar is actually reachable. If the sink is down, the model doesn’t see a memory tool at all and degrades explicitly (“I couldn’t persist that”) rather than calling a dead endpoint and getting a phantom success. Because sidecars are container-scoped and persistent, any warm crew container keeps the tools available across runs.

Recall (the [AGENT MEMORY] injection below) reads the file directly and does not depend on the sidecar, so an agent always reads back its durable memory even on a path that skips the sidecar. Writing mid-session is what needs the sink — see Memory observability for the authoritative way to verify a write landed (and why memory hybrid can lag it).

2. Reading Memory (System Prompt Injection)

When a session starts, the orchestrator reads memory files and injects them into the system prompt:

[AGENT MEMORY]
--- AGENT.md (long-term memory) ---
...agent's personal knowledge...
--- Daily log: 2026-04-14 (today) ---
...today's session notes...
[END AGENT MEMORY]

[CREW SHARED MEMORY]
--- CREW.md (crew-wide knowledge) ---
...crew conventions and decisions...
[END CREW SHARED MEMORY]

[MEMORY INSTRUCTIONS]
...instructions on how to write memory...
[END MEMORY INSTRUCTIONS]

2a. Operator model (per-user)

Alongside the three memory tiers, Crewship distils a small operator model for each person who works with a crew. It captures how an operator likes to work — communication style, timezone, language, level of detail — as a hint, not as a fact about who they are or what they want. The model is keyed on (operator, workspace), independent of which agent is answering. Every agent in a crew reads the same model, stored crew-shared at:

/crew/shared/.memory/users/{user_slug}.md

user_slug is a one-way hash (sha256(user_id ‖ workspace_id)[:16]), so the filename never carries the operator’s identity into a directory listing or a stack trace. The same hash gives each workspace its own isolated model for the same person — a user who works in two workspaces gets two independent models. When a session opens, the orchestrator injects only the session opener’s model — never another operator’s, even if one exists on disk — as an [OPERATOR MODEL] block, placed before the per-agent [PEER CONTEXT] block so the broad working-style hint frames the narrower per-agent relationship hint:

[OPERATOR MODEL]
This operator has worked with the crew before. The following profile was
distilled and merged across prior sessions — treat it as a hint about how
they prefer to work, not as a fact about who they are or what they want.
- timezone: UTC+1
- language: Czech
- tone: terse, technical
[END OPERATOR MODEL]

Evolving, not overwriting. A daily background sweep refreshes the model from recent sessions. Refreshes merge: a field the latest session is silent about is preserved from the prior model rather than dropped, so a one-off session that only touches one aspect of how an operator works can’t erase the stable picture built up over time. A field the session does re-touch is updated to the newer value. The model is capped at 1.5 KB. Eligibility. An operator gets a model once their interaction with the crew crosses a threshold (≥10 messages or ≥5 minutes of session time). Below that, no model is written. Opt-out and deletion. The operator model reuses the same per-(operator, workspace) consent flag as peer cards: opting out of one opts out of both. When an operator is opted out, the next sweep purges any existing model from disk and its index row, and records the deletion in the audit log. Deleting the user cascades the index row away as well.

3. Searching Memory

Agents search memory via the sidecar HTTP API with a scope parameter:

# Search personal memory only
curl localhost:9119/memory/search \
  -d '{"query": "PostgreSQL", "scope": "agent"}'

# Search crew shared memory
curl localhost:9119/memory/search \
  -d '{"query": "deployment", "scope": "crew"}'

# Search both (merged by relevance)
curl localhost:9119/memory/search \
  -d '{"query": "database", "scope": "both"}'

Dynamic Budget Allocation

The orchestrator allocates a character budget (default 15,000 chars) across memory tiers. Empty tiers reclaim their budget for lower tiers.

Agent Role	Crew (max)	Agent (remainder)
Lead (in crew)	40%	rest
Agent (in crew)	40%	rest
Single-agent crew	—	100%

Budget examples

Lead with small crew memory:

Crew: 2,000 chars (actual content, under 40% cap)
Agent: 13,000 chars (reclaimed from crew)

Single-agent crew (no peers, empty crew memory):

Agent: 15,000 chars (full budget)

API reference

The memory subsystem is reachable on the agent’s sidecar at localhost:9119 inside the container. There is no public REST endpoint — these calls are intentionally container-local so an agent can only ever query its own tiers (and, via the lead’s sidecar proxy, the crew tier it’s a member of). The schemas below are the source of truth; for the database-side view of the underlying tables see /api-reference/journal (journal-backed retrieval) and the SQLite migration log under internal/database/migrate.go (v54/v55 introduced the importance + FTS5 columns these endpoints read).

POST /memory/search

Search indexed memory with scope control. Request:

{
  "query": "database migration strategy",
  "limit": 10,
  "scope": "both"
}

Field	Type	Required	Default	Description
`query`	string	Yes	—	Search terms (FTS5 syntax)
`limit`	int	No	10	Max results (capped at 50)
`scope`	string	No	`"agent"`	`"agent"`, `"crew"`, or `"both"`

Response (200):

{
  "results": [
    {
      "file": "AGENT.md",
      "snippet": "All DB migrations are in internal/database/migrate.go...",
      "score": -2.345,
      "source": "agent"
    },
    {
      "file": "CREW.md",
      "snippet": "Use PostgreSQL 16 for all new databases...",
      "score": -1.437,
      "source": "crew"
    }
  ],
  "count": 2
}

When scope=both, results from both engines are merged and sorted by BM25 score. Each result includes a source field indicating its origin.

Status	Condition
400	Missing query, invalid JSON, or invalid scope
503	Requested memory engine not available
500	Search execution failed (all scopes)

GET /memory/status?scope=agent|crew

Check the state of a memory index. Response (200):

{
  "total_files": 5,
  "total_chunks": 23,
  "indexed_at": "2026-04-14T14:30:00Z",
  "total_size_kb": 48,
  "search_ready": true
}

POST /memory/reindex?scope=agent|crew

Trigger a full reindex. Context-aware — responds to client disconnect and SIGTERM. Response (200): Returns status object after reindexing.

CLI (Development & Debugging)

The crewship memory command provides direct filesystem access to memory indexes without a running server.

# Reindex agent memory
crewship memory reindex -p /path/to/agents/lead/.memory -S agent

# Search crew shared memory
crewship memory search "deployment" -p /path/to/crew-root -S crew

# JSON output for scripting
crewship memory search "PostgreSQL" -p /path -S crew -F json

Flag	Short	Description
`--path`	`-p`	Base path (meaning depends on scope)
`--scope`	`-S`	`agent` or `crew` (workspace scope ships in v0.2)
`--limit`	`-l`	Max results (search only, default 10)
`--format`	`-F`	`table` or `json` (search only)

Access Control

Operation	Agent	Lead
Read own agent memory	Yes	Yes
Write own agent memory	Yes	Yes
Read crew shared memory	Yes	Yes
Write crew shared memory	No	Yes

Agents in one crew cannot access another crew’s shared memory. Container bind mounts enforce this isolation at the filesystem level.

Limits

Limit	Value	Description
Total memory size	10 MB per agent	Maximum `.memory/` directory size
Daily log size	100 KB per file	Maximum single `daily/*.md` file
Crew memory reindex	Every 60 seconds	Automatic periodic reindex by Lead sidecar

Load-time injection scan

Memory files are authored by prior agent runs, so a file on disk can carry an indirect-injection payload (text ingested from a web fetch, a peer message, or a file read, then persisted). Every memory tier is scanned again at prompt assembly time, immediately before its content is placed into the system prompt — independent of the write-path scan. The scan runs per section. If a section’s body trips the scanner, only that section’s body is replaced with a deterministic notice; clean sibling sections in the same block are untouched:

--- Daily log: 2026-06-09 (today) ---
[BLOCKED: possible prompt injection in Daily log: 2026-06-09 (today) — category=prompt_injection pattern=ignore_previous_instructions; operator can inspect the file directly]

What this means for operators:

The live file is never modified. The substitution happens only in the assembled prompt. The file on disk is left exactly as written so you can open it and judge the content yourself — the notice points you at the file rather than discarding anything.
The label is preserved so you can tell which tier and which file produced the hit, and the category / pattern map back to the scanner rule.
It is deterministic. The same file content always yields the same notice (first-hit, fixed rule order), so a blocked section won’t flicker between runs.

Memory write overflow guidance

The memory.write tool is a pure bounded store — each tier has a fixed byte cap and the store never silently evicts or rewrites entries for the agent. When a write would cross a cap, the tool now hands the agent enough to fix it within the same turn instead of just rejecting the call:

Hard cap exceeded (the write would push the tier past its cap): the result is an error, nothing is written, and the result metadata carries current_entries (the current on-disk body) and usage (e.g. 3900 of 4000 bytes, 97%). The message instructs the agent to consolidate the current entries — merge duplicates, drop stale lines, summarize — and retry the write in this turn with mode='replace' carrying the consolidated body.
Soft cap (80%) crossed (the write succeeds but is close to the cap): the same current_entries + usage are attached and the warning steers the agent to consolidate and rewrite the consolidated body in this turn, before the next append is rejected.

append and replace are at parity — both surface current_entries + usage on overflow. This keeps the store dumb (it does not consolidate for the agent) while giving the agent the material to self-curate without losing the turn.

Best Practices

Writing effective AGENT.md

Use clear headings (## Identity, ## Learned Facts, ## Preferences, ## Project Context)
Be specific. “Use driver name sqlite not sqlite3” beats “there’s a sqlite driver thing”
Keep it under 5 KB. Concise memory leaves budget for daily logs and crew context
Prune outdated facts. Stale memory is worse than no memory

Writing effective crew memory (CREW.md)

Lead curates CREW.md with crew-wide conventions, architecture decisions, and policies
Use topics/ for domains. topics/deployment.md, topics/security.md keep CREW.md focused
Agents don’t write to crew memory — they write personal notes to their own AGENT.md
Don’t duplicate facts across agent and crew memory. Crew memory is for shared knowledge only

Optimizing search

Use consistent terminology. FTS5 is keyword-based, not semantic
Trigger reindex after bulk writes. Agent memory reindexes on startup; crew memory reindexes every 60s
Use scope=both to search across personal and crew knowledge simultaneously

Common pitfalls

Memory is memory_enabled=false by default. The most common “my agent isn’t remembering anything” report is just an agent that was never opted in. Check crewship agent get <slug> and flip the flag if needed.

Memory files are not encrypted at rest. AGENT.md, CREW.md, daily logs, and the index.sqlite files live as plain on-disk bytes inside the container. Never write secrets, tokens, or PII into memory — credentials belong in Keeper, not in ~/.memory/.

Crew reindex lags up to 60 seconds. The lead sidecar reindexes the crew tier on a 60-second tick. Writes to CREW.md or topics/*.md are searchable on the next tick, not immediately — if you need instant visibility, hit POST /memory/reindex?scope=crew from the lead’s sidecar.
Agent-tier reindex after /memory/write is asynchronous. A successful POST /memory/write returns 201 the moment the bytes are durable on disk; the FTS5 reindex (and the memory.updated journal entry) then run on a single-worker background queue rather than blocking the response. The lag is sub-second in practice. The queue is strict FIFO, so consecutive writes to the same file reindex in write order — turn N always lands before turn N+1, never a stale earlier write winning a race. Writes are still searchable essentially immediately; if you need a hard barrier (e.g. write-then-search in a tight loop), the queue is drained on sidecar shutdown so nothing is lost, but for instant in-session visibility you can hit POST /memory/reindex?scope=agent.
Single-agent crews have no crew tier. The crew scope returns 503 for an agent that isn’t a member of a crew (or whose crew has only itself). The budget for the agent tier is 100% in that case; no crew context to retrieve.
Agents cannot read other agents’ personal memory — even within the same crew. Container bind mounts enforce this at the filesystem level, not by API check. If a workflow needs cross-agent state, the right surface is CREW.md or topics/, written by the lead.
Budget is measured in characters, not tokens. The default 15,000-character budget is ~3,500–4,500 tokens depending on language and content. Don’t reason about budget as if it were tokens — the orchestrator slices on character boundaries.
The 10 MB per-agent memory cap is a hard limit. Once a directory exceeds it, new writes fail rather than evicting old ones — consolidation isn’t automatic eviction. Curated facts only, not raw logs.
Don’t delete AGENT.md entries casually. Memory is append-mostly. Remove only entries that are explicitly outdated — pruning live knowledge is what makes an agent feel “dumber than yesterday”.
Migration v54/v55 must be applied for hybrid retrieval. Older databases that predate the Crew Journal era (PRs #211/#212) don’t have the importance_score, reference_count, or journal_entries_fts columns these endpoints read. Migrations apply automatically on crewship start, so start the upgraded binary once before depending on RRF scoring.
Docker network Internal: true blocks sidecar → embedding model. If the agents network is Internal: true, the sidecar can read its on-disk FTS5 index but cannot reach a remote embedding service if one is configured, silently degrading hybrid retrieval to BM25-only. The fix lives in internal/provider/docker/docker.go:ensureNetwork — the network must be Internal: false with a gateway.

Episodic memory — the hybrid retrieval (RRF) + importance/decay/reinforce uplift from PRs #211/#212 that this guide builds on.
Consolidate — the nightly worker that promotes recurring daily/*.md observations into AGENT.md / CREW.md and archives the original logs.
Crew Journal — the event log behind the FTS5 mirror and the journal_embeddings table memory reads from.
Keeper — where secrets belong instead of memory files (memory is not encrypted at rest).
Orchestration — the runtime that calls BuildMemoryBlock to inject memory into system prompts at agent-run time.
Skills — for agent-side packaged knowledge that doesn’t change per-user/per-crew; complementary to memory rather than overlapping.

​Memory System

​Overview

​When to use it

​Key concepts

​Usage

​Examples

​A research-assistant agent that remembers user preferences

​A crew lead recording an architectural decision once

​Bounded subject expertise via topics/

​Memory Tiers

​Enabling Memory

​File Structure

​Agent Memory

​Crew Shared Memory

​How It Works at Runtime

​1. Writing Memory

​2. Reading Memory (System Prompt Injection)

​2a. Operator model (per-user)

​3. Searching Memory

​Dynamic Budget Allocation

​API reference

​POST /memory/search

​GET /memory/status?scope=agent|crew

​POST /memory/reindex?scope=agent|crew

​CLI (Development & Debugging)

​Access Control

​Limits

​Load-time injection scan

​Memory write overflow guidance

​Best Practices

​Common pitfalls

​Related

Memory System

Overview

When to use it

Key concepts

Usage

Examples

A research-assistant agent that remembers user preferences

A crew lead recording an architectural decision once

Bounded subject expertise via `topics/`

Memory Tiers

Enabling Memory

File Structure

Agent Memory

Crew Shared Memory

How It Works at Runtime

1. Writing Memory

2. Reading Memory (System Prompt Injection)

2a. Operator model (per-user)

3. Searching Memory

Dynamic Budget Allocation

API reference

POST /memory/search

GET /memory/status?scope=agent|crew

POST /memory/reindex?scope=agent|crew

CLI (Development & Debugging)

Access Control

Limits

Load-time injection scan

Memory write overflow guidance

Best Practices

Common pitfalls

Related