Helix — Glossary¶

Shared vocabulary for the project. If a term here conflicts with code, the code is wrong — fix it (spec-first invariant, CLAUDE.md).

Term	Definition
Helix	The product: a local-first, portable, git-like memory layer for AI coding agents.
Strand	A user's whole memory as a single self-describing SQLite database; exported as a `.dna` file.
`.dna`	The portable strand artifact: signed (Ed25519), encrypted (XChaCha20-Poly1305), versioned.
Memory / Fact	One typed node in the graph (identity, preference, project, decision, entity, convention, snippet, fact).
Edge / Relation	A typed, weighted link between memories (e.g. `has_decision`, `depends_on`, `supersedes`).
Scope	Where a fact applies: `global` or `project:<id>`. Recall is scope-aware.
Ingestion	Accepting a user-routed slice (turn/code/note) for possible learning. No ambient capture.
Slice	A unit of routed input (a conversation turn, code block, or note) with metadata.
Redaction	Scrubbing secrets from a slice before any storage or model call (an invariant).
Heuristic gate	The cheap local check that decides whether a slice is worth extracting from — the main cost lever.
Extraction	Turning a slice into candidate typed facts (deterministic or LLM-backed).
Embedding	A vector representation of text used for semantic recall (local bge-small by default).
Consolidation	Deciding ADD / UPDATE / DELETE / NOOP for each candidate fact against existing memory.
Conflict resolution	Reconciling contradictory facts (recency > confidence > provenance; optional LLM tie-break).
Recall	Retrieving relevant memories for an agent (hybrid vector + graph + ranking).
Salience	A memory's current importance; decays over time unless reinforced.
Confidence	How sure Helix is a fact is true/durable.
Provenance	The record of what produced/changed a fact (slice, extractor, model, op) — the "why it believes this".
Decay / Reinforcement	Salience falling over time vs. rising when a fact is recalled/confirmed/re-seen.
Transfer	Moving memory between machines/agents/people via `export`/`import`/`merge`.
Merge	Combining two strands with consolidation + conflict resolution; reversible.
Rollback	Restoring a prior strand version from history.
Manifest	The plaintext, signed metadata of a `.dna` (schema, embedding space, counts, integrity root).
MCP	Model Context Protocol — the open interface through which agents read/write Helix memory.
LLM router	The component that picks/falls back across LLMs (free-tier-first) — or none at all.
Deterministic extractor	The no-LLM, rules+embeddings extractor; the $0 floor.
Daemon	The local Helix process hosting the engine for CLI, MCP server, and dashboard.
ADR	Architecture Decision Record — an entry in `DECISIONS.md`.
Episodic / semantic / procedural	The three long-term memory shapes Helix models (events / durable facts / skills), plus a cross-cutting entity graph.
Working memory	The live agent context window — used, but never Helix's system of record.
CLS (Complementary Learning Systems)	The fast-episodic-trains-slow-semantic model behind the two-stage write path.
Bi-temporal	Every fact carries valid-time (true in the world) and transaction-time (when Helix learned it); invalidation is append-only.
Importance	A write-time rating (1–10 / 0–1) of how significant a memory is; an input to salience.
Reflection	Periodic synthesis of higher-level insights from clusters of episodes, stored as new linked memories.
Sleep-time consolidation	A background, idle-time worker that consolidates/rewrites memory using a stronger model off the hot path.
Hybrid retrieval	Combining dense (embedding) and sparse (BM25) search.
RRF	Reciprocal Rank Fusion (k=60) — rank-only fusion of multiple result lists.
PPR	Personalized PageRank — lightweight graph-expansion retrieval (HippoRAG-style).
MMR	Maximal Marginal Relevance — diversity/dedup during result selection.
Matryoshka (MRL)	Embeddings whose dimensions can be truncated without re-embedding.
Quantization (int8/binary)	Compressing embedding vectors for smaller/faster storage (with optional float rescore).
Redaction	Tiered removal of secrets/PII (regex → detect-secrets/gitleaks → Presidio) before storage and before any LLM call.
Memory poisoning	An attack that plants false/persistent instructions into long-term memory; guarded by provenance + validation + review.
CRDT	Conflict-free Replicated Data Type — used for mechanical convergence during merge.
3-way semantic merge	Fact/field-level merge against a common ancestor for contradictory facts.
Daemon	The local Helix process (Streamable HTTP on 127.0.0.1) that serves the CLI, MCP shim, and dashboard.
Lethal trifecta	Private data + untrusted content + exfiltration ability — the agent-security risk a memory server must mitigate.