Helix — Glossary¶
Shared vocabulary for the project. If a term here conflicts with code, the code is wrong — fix it (spec-first invariant, CLAUDE.md).
| Term | Definition |
|---|---|
| Helix | The product: a local-first, portable, git-like memory layer for AI coding agents. |
| Strand | A user's whole memory as a single self-describing SQLite database; exported as a .dna file. |
.dna |
The portable strand artifact: signed (Ed25519), encrypted (XChaCha20-Poly1305), versioned. |
| Memory / Fact | One typed node in the graph (identity, preference, project, decision, entity, convention, snippet, fact). |
| Edge / Relation | A typed, weighted link between memories (e.g. has_decision, depends_on, supersedes). |
| Scope | Where a fact applies: global or project:<id>. Recall is scope-aware. |
| Ingestion | Accepting a user-routed slice (turn/code/note) for possible learning. No ambient capture. |
| Slice | A unit of routed input (a conversation turn, code block, or note) with metadata. |
| Redaction | Scrubbing secrets from a slice before any storage or model call (an invariant). |
| Heuristic gate | The cheap local check that decides whether a slice is worth extracting from — the main cost lever. |
| Extraction | Turning a slice into candidate typed facts (deterministic or LLM-backed). |
| Embedding | A vector representation of text used for semantic recall (local bge-small by default). |
| Consolidation | Deciding ADD / UPDATE / DELETE / NOOP for each candidate fact against existing memory. |
| Conflict resolution | Reconciling contradictory facts (recency > confidence > provenance; optional LLM tie-break). |
| Recall | Retrieving relevant memories for an agent (hybrid vector + graph + ranking). |
| Salience | A memory's current importance; decays over time unless reinforced. |
| Confidence | How sure Helix is a fact is true/durable. |
| Provenance | The record of what produced/changed a fact (slice, extractor, model, op) — the "why it believes this". |
| Decay / Reinforcement | Salience falling over time vs. rising when a fact is recalled/confirmed/re-seen. |
| Transfer | Moving memory between machines/agents/people via export/import/merge. |
| Merge | Combining two strands with consolidation + conflict resolution; reversible. |
| Rollback | Restoring a prior strand version from history. |
| Manifest | The plaintext, signed metadata of a .dna (schema, embedding space, counts, integrity root). |
| MCP | Model Context Protocol — the open interface through which agents read/write Helix memory. |
| LLM router | The component that picks/falls back across LLMs (free-tier-first) — or none at all. |
| Deterministic extractor | The no-LLM, rules+embeddings extractor; the $0 floor. |
| Daemon | The local Helix process hosting the engine for CLI, MCP server, and dashboard. |
| ADR | Architecture Decision Record — an entry in DECISIONS.md. |
| Episodic / semantic / procedural | The three long-term memory shapes Helix models (events / durable facts / skills), plus a cross-cutting entity graph. |
| Working memory | The live agent context window — used, but never Helix's system of record. |
| CLS (Complementary Learning Systems) | The fast-episodic-trains-slow-semantic model behind the two-stage write path. |
| Bi-temporal | Every fact carries valid-time (true in the world) and transaction-time (when Helix learned it); invalidation is append-only. |
| Importance | A write-time rating (1–10 / 0–1) of how significant a memory is; an input to salience. |
| Reflection | Periodic synthesis of higher-level insights from clusters of episodes, stored as new linked memories. |
| Sleep-time consolidation | A background, idle-time worker that consolidates/rewrites memory using a stronger model off the hot path. |
| Hybrid retrieval | Combining dense (embedding) and sparse (BM25) search. |
| RRF | Reciprocal Rank Fusion (k=60) — rank-only fusion of multiple result lists. |
| PPR | Personalized PageRank — lightweight graph-expansion retrieval (HippoRAG-style). |
| MMR | Maximal Marginal Relevance — diversity/dedup during result selection. |
| Matryoshka (MRL) | Embeddings whose dimensions can be truncated without re-embedding. |
| Quantization (int8/binary) | Compressing embedding vectors for smaller/faster storage (with optional float rescore). |
| Redaction | Tiered removal of secrets/PII (regex → detect-secrets/gitleaks → Presidio) before storage and before any LLM call. |
| Memory poisoning | An attack that plants false/persistent instructions into long-term memory; guarded by provenance + validation + review. |
| CRDT | Conflict-free Replicated Data Type — used for mechanical convergence during merge. |
| 3-way semantic merge | Fact/field-level merge against a common ancestor for contradictory facts. |
| Daemon | The local Helix process (Streamable HTTP on 127.0.0.1) that serves the CLI, MCP shim, and dashboard. |
| Lethal trifecta | Private data + untrusted content + exfiltration ability — the agent-security risk a memory server must mitigate. |