Six pillars · Recall™ Agent OS

Pillar grid

Memory

Tiered, durable, agent-readable

User memory auto-loads every turn (~200 lines, ~30KB). Reference memory loads on demand. Session memory is scratch. Repo memory holds verified facts. A semantic vector brain handles fuzzy recall across tens of thousands of chunks.
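The tiering above can be pictured as a small loading policy. This is a minimal sketch, not the product's implementation: the `Tier` class, the `context_for_turn` function, and the choice to treat session and repo memory as on-demand are all assumptions made for illustration. The budget constants mirror the approximate figures quoted above.

```python
from dataclasses import dataclass

# Approximate budget from the text (~200 lines, ~30KB); exact limits are assumed.
USER_MEMORY_MAX_LINES = 200
USER_MEMORY_MAX_BYTES = 30 * 1024


@dataclass
class Tier:
    name: str
    text: str
    auto_load: bool  # True = injected on every turn (hypothetical flag)


def within_budget(text: str) -> bool:
    """Check the auto-load budget in both lines and UTF-8 bytes."""
    return (
        len(text.splitlines()) <= USER_MEMORY_MAX_LINES
        and len(text.encode("utf-8")) <= USER_MEMORY_MAX_BYTES
    )


def context_for_turn(tiers: list[Tier], requested: set[str]) -> list[str]:
    """Return the names of the tiers to load this turn, in declaration order."""
    loaded = []
    for tier in tiers:
        if tier.auto_load:
            if not within_budget(tier.text):
                raise ValueError(f"{tier.name} memory exceeds the auto-load budget")
            loaded.append(tier.name)
        elif tier.name in requested:
            loaded.append(tier.name)
    return loaded


# Illustrative tiers; the loading policy for session and repo memory is assumed.
TIERS = [
    Tier("user", "prefers tabs\n" * 10, auto_load=True),
    Tier("reference", "api notes", auto_load=False),
    Tier("session", "scratch", auto_load=False),
    Tier("repo", "verified facts", auto_load=False),
]
```

Calling `context_for_turn(TIERS, {"reference"})` loads user memory plus the one tier that was asked for, which is the point of keeping the always-on tier small.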

Local compute

Zero-cost compression and distillation

A local open-weights model on a consumer GPU compresses verbose memory, distills brain recalls, generates cold-start briefs, and writes rolling summaries that pre-empt host-side compaction — all without spending a token of paid context. The expensive model thinks; the free model summarizes.
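One way to picture the rolling summary: once the transcript passes a size threshold, the oldest turns are folded into a summary by the local model and only the recent turns stay verbatim. The sketch below assumes a hypothetical `summarize` callable standing in for the local model and measures size in characters rather than tokens. Neither detail comes from the product itself.

```python
from typing import Callable


def roll_up(
    turns: list[str],
    summary: str,
    max_chars: int,
    keep_recent: int,
    summarize: Callable[[str], str],  # hypothetical wrapper around the local model
) -> tuple[list[str], str]:
    """Fold older turns into the summary once the transcript grows too large."""
    total = len(summary) + sum(len(t) for t in turns)
    if total <= max_chars or len(turns) <= keep_recent:
        return turns, summary  # nothing to compress yet

    cut = len(turns) - keep_recent
    old, recent = turns[:cut], turns[cut:]
    # The previous summary is summarized again together with the old turns,
    # so the result stays a single rolling summary.
    new_summary = summarize("\n".join([summary, *old]).strip())
    return recent, new_summary
```

Because the summarizer is passed in, the paid model is never involved in this step; any local callable that maps text to shorter text will do.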

Vault

WORM-immutable backup, 7-year retention

Nightly object-storage sync of every workspace byte. Time-based immutability blocks any delete or overwrite, even with the master key. Data in the Cool tier moves to Archive automatically after 30 days. ~200 GB stored at well under a dollar a month.
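The two time-based rules above reduce to simple date arithmetic. The sketch below only models them for illustration. In practice the storage service enforces both rules, and the function names and the 365-day year are assumptions.

```python
from datetime import datetime, timedelta, timezone

# Seven years approximated as 7 * 365 days (leap days ignored; an assumption).
RETENTION = timedelta(days=7 * 365)
ARCHIVE_AFTER = timedelta(days=30)


def is_locked(stored_at: datetime, now: datetime) -> bool:
    """True while the object is inside its retention window (no delete/overwrite)."""
    return now < stored_at + RETENTION


def storage_tier(stored_at: datetime, now: datetime) -> str:
    """Cool for the first 30 days, Archive afterwards."""
    return "Archive" if now - stored_at >= ARCHIVE_AFTER else "Cool"


if __name__ == "__main__":
    stored = datetime(2024, 1, 1, tzinfo=timezone.utc)
    today = stored + timedelta(days=45)
    print(is_locked(stored, today), storage_tier(stored, today))
```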

Guardrails

Three layers, fail-closed

Kill-switch sentinel + spend-gate script + cloud budget alert. Every billable script must clear all three before touching money. The default spend ceiling is configurable. If any check fails, the script aborts; it never silently overspends.
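A fail-closed gate means a check that errors is treated the same as a check that says no. The following is a minimal sketch under assumptions: the sentinel convention (file present means stop), the `budget_alert_active` callable, and all function names are hypothetical, not the product's actual scripts.

```python
from pathlib import Path
from typing import Callable


class SpendBlocked(Exception):
    """Raised whenever any guardrail fails or cannot be evaluated."""


def kill_switch_clear(sentinel: Path) -> bool:
    # Assumed convention: the presence of the sentinel file means "stop".
    return not sentinel.exists()


def spend_gate_clear(estimated_cost: float, ceiling: float) -> bool:
    return estimated_cost <= ceiling


def clear_to_spend(
    sentinel: Path,
    estimated_cost: float,
    ceiling: float,
    budget_alert_active: Callable[[], bool],  # hypothetical cloud-alert lookup
) -> None:
    """Return normally only if all three layers pass; otherwise raise."""
    checks = [
        ("kill switch", lambda: kill_switch_clear(sentinel)),
        ("spend gate", lambda: spend_gate_clear(estimated_cost, ceiling)),
        ("budget alert", lambda: not budget_alert_active()),
    ]
    for name, check in checks:
        try:
            ok = check()
        except Exception as exc:
            # Fail closed: an unreadable guardrail blocks spending.
            raise SpendBlocked(f"{name} check errored: {exc}") from exc
        if ok is not True:
            raise SpendBlocked(f"{name} check failed")
```

A billable script would call `clear_to_spend(...)` as its first step and let `SpendBlocked` terminate the run, so no code path reaches a paid API without passing all three layers.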

Brain backup

Two cloud copies + one sealed local

Weekly snapshot to vault/brain/. Two newest kept hot. Plus a sealed .SEALED.zip on local disk with sentinel files preventing the agent from mistaking it for live data. Restoration: rename + expand, three steps.
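The "two newest kept hot" rule is a sort and a slice. The sketch below assumes snapshot names mapped to their dates and uses illustrative file names. It does not say what happens to older snapshots, since the text above leaves that open. The `.SEALED.zip` suffix check is one simple way an agent could avoid treating the sealed copy as live data; the actual sentinel files are not modeled here.

```python
from datetime import date
from pathlib import Path


def split_hot_and_older(
    snapshots: dict[str, date], keep: int = 2
) -> tuple[list[str], list[str]]:
    """Return (hot, older): the `keep` newest snapshot names, then the rest."""
    ordered = sorted(snapshots, key=lambda name: snapshots[name], reverse=True)
    return ordered[:keep], ordered[keep:]


def is_sealed_archive(path: Path) -> bool:
    """True for files following the .SEALED.zip naming used for the local copy."""
    return path.name.endswith(".SEALED.zip")


if __name__ == "__main__":
    # Hypothetical snapshot names, for illustration only.
    snaps = {
        "brain-week1.zip": date(2024, 1, 1),
        "brain-week2.zip": date(2024, 1, 8),
        "brain-week3.zip": date(2024, 1, 15),
    }
    print(split_hot_and_older(snaps))
```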

Continuity

Cold-start in 60 seconds

Per-session journals + a one-line audit trail + the local-model distiller produce a ~3KB curated brief on demand. The successor agent reads it instead of guessing from a lossy auto-summary. Context survives compaction, restarts, and model swaps.
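To make the ~3KB budget concrete, the sketch below packs the newest audit-trail lines into a brief until the byte budget is spent. It deliberately replaces the local-model distiller with a plain newest-first selection, so it illustrates the size constraint only, not the curation. The function name and input shape are assumptions.

```python
BRIEF_BUDGET_BYTES = 3 * 1024  # the ~3KB figure from the text


def build_brief(audit_lines: list[str], budget: int = BRIEF_BUDGET_BYTES) -> str:
    """Keep the newest lines that fit the byte budget, in chronological order."""
    chosen = []
    used = 0
    for line in reversed(audit_lines):  # input is oldest-first, so walk backwards
        cost = len(line.encode("utf-8")) + 1  # +1 for the joining newline
        if used + cost > budget:
            break
        chosen.append(line)
        used += cost
    return "\n".join(reversed(chosen))


if __name__ == "__main__":
    trail = [f"session {i}: decided something" for i in range(500)]
    brief = build_brief(trail)
    print(len(brief.encode("utf-8")), "bytes")
```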

How they compose

Each pillar can run alone — but the leverage comes from the combination:

Memory + Continuity means the next session knows what this one decided.
Local compute + Memory means memory stays compressed and queryable without burning paid context.
Vault + Brain backup means every byte captured by a nightly sync or weekly snapshot is restorable for seven years.
Guardrails + everything means none of the above can run away with your wallet.

For diagrams of how the pieces wire together, see the architecture page. For an honest take on what each hardware profile actually needs, see hardware reality.