Install
openclaw skills install vememVisual entity memory — remember faces, objects, and places across sessions with persistent identity. Use when the user asks who is in an image, when you need to resolve an image to a specific known person/thing, when identifying someone from a photo, when labeling a new face for future recognition, or when maintaining knowledge (facts, events, relationships) keyed by appearance rather than by text. Bridges vision models and text LLMs by turning raw images into named entity references with attached context the text side can reason about.
openclaw skills install vememRead this first. This skill handles biometric data, so transparency up front beats surprises later.
The skill itself is instruction-only. It's the markdown you're reading — no scripts, no executables, no automatic installation of anything. Adding this skill to your ClawHub / Claude Code / Hermes / OpenClaw install does not by itself run code on your machine.
The skill instructs the agent to install and use the vemem Python package separately. That package is the component that actually reads images and writes to disk.
Local state it creates or reads:
~/.vemem/ (override with VEMEM_HOME) — the LanceDB store holding face embeddings, entity bindings, facts, and the event log. This is where biometric vectors live.~/.insightface/models/buffalo_l/ — InsightFace model weights (~200MB), downloaded from InsightFace's official distribution on the first face observation.observe_image / identify_image — either as base64 or as a filesystem path the library reads. vemem does not scan your disk for images on its own. It only sees bytes you hand it.Network activity vemem itself produces:
buffalo_l weights. After that, zero network activity from the library.vemem-mcp-server) uses stdio only — no network ports opened.vemem-openclaw-sidecar) binds to localhost only.What vemem does NOT do on its own:
references/examples.md show how to compose vemem with OpenAI / Anthropic APIs if you choose to. Those are your calls, with your API keys. If you stay local (Ollama etc.), nothing leaves your machine.observe_image is an explicit invocation by the agent or you.The ClawHub review correctly flagged that vemem has a first-party OpenClaw integration that can auto-process every image attachment. That integration is a separate install (vemem-openclaw-sidecar + registering a specific OpenClaw plugin) and is NOT enabled by adding this skill or by installing the base vemem package.
Enable it only if you understand you're granting an always-on face-recognition layer over every image your agent sees. Disable at any time by stopping the sidecar process.
main; pin a version in production (e.g. vemem==0.1.0) rather than tracking latestpip show -f vemem lists every file the install adds to your environmentlsof -p <pid> (open files + sockets) or strace -e trace=file,network -p <pid>forget() is test-verified to physically remove embeddings from LanceDB version history. Reproduce locally before trusting for regulated data.vemem stores biometric identifiers. If you deploy it to users other than yourself, YOU are the data controller under GDPR / BIPA / CCPA. The library provides primitives (forget / restrict / export) but does not enforce consent capture — that's your app's responsibility. Full deployer checklist: COMPLIANCE.md.
VEMEM_HOME path (e.g. /tmp/vemem-test) for your first session so you can inspect + delete the store wholesale.vemem is the identity layer that sits between a vision model (face/object detector) and a text LLM. It turns "an image of a person" into a named, stable entity ID — same person across sessions, same face across angles, same object across lighting.
It keeps track of facts, events, and relationships per entity — like Mem0 or mem0-style stores, but keyed on visual identity rather than on a user_id you have to know in advance.
Activate this skill when the user:
Do NOT activate for:
Run python -c "import vemem; print(vemem.__version__)". If that fails, install:
pip install vemem
# or with uv:
uv pip install vemem
First-time face encoding triggers a ~200MB InsightFace model download into ~/.insightface/. Warn the user if their network is constrained.
python -m vemem.mcp_server
This exposes 14 tools over stdio. Wire it into your host's MCP config. For Claude Desktop, the ready-to-paste config lives at docs/examples/claude_desktop_config.json in the vemem repo.
from vemem import Vemem
vem = Vemem() # LanceDB store at ~/.vemem, InsightFace encoder
Store path is overridable via VEMEM_HOME env var or Vemem(home="/path/to/store").
There are 13 operations. The ones you'll use most:
| Op | When |
|---|---|
observe(image_bytes) | A new image came in. Detect faces/objects, embed, persist. Returns a list of Observations, each with a stable content-hash id. |
label(observation_ids, name) | The user just told you who someone is. Creates the entity if new, binds those observations to it. This is the moment identity becomes permanent. |
remember(entity_id, fact) | Attach a fact to a known entity — "Charlie runs marathons", "the red mug lives in the kitchen". |
| Op | When |
|---|---|
identify(image_bytes, k=5) | Return candidate entities matching the image, ranked by similarity. Each candidate already includes attached facts — you don't need a separate recall call. |
recall(entity_id) | All known facts, events, and relationships for an entity. Use when the user references someone by name. |
| Op | When |
|---|---|
relabel(observation_id, new_name) | "That's not Charlie, that's Dana" — reassigns the observation and emits a negative binding so the clusterer won't re-attach it. |
merge(entity_ids) | Two entities turn out to be the same person. Folds them together, preserving facts with provenance. |
split(entity_id, groups) | One entity turns out to be two people. Separates them with cross-wise negative bindings. |
forget(entity_id) | Privacy delete — hard-removes observations, embeddings, bindings, facts. Physically prunes from storage version history (GDPR Art. 17 compliant). Not reversible. |
undo(event_id=None) | Undoes the most recent reversible op by you (within a 30-day window). Does not work on forget. |
observations = vem.observe(image_bytes)
candidates = vem.identify(image_bytes, k=3, min_confidence=0.4)
if candidates:
names = [f"{c.entity.name} (conf {c.confidence:.2f})" for c in candidates]
print(f"Visible: {', '.join(names)}")
else:
print(f"Unknown face(s) detected: {len(observations)}. Label with vem.label(obs_ids, name=...).")
observations = vem.observe(image_bytes)
charlie = vem.label([o.id for o in observations], name="Charlie")
vem.remember(charlie.id, "we met at the coffee shop on 2026-04-17")
candidates = vem.identify(image_bytes, k=3)
context_parts = []
for c in candidates:
fact_strs = "; ".join(f.content for f in c.facts)
context_parts.append(f"{c.entity.name} (conf {c.confidence:.2f}): {fact_strs}")
context = "People visible: " + " | ".join(context_parts) if context_parts else "No known faces."
# Feed `context` into your LLM's system message or context block.
# identify() said "Charlie" at 0.71 confidence, but user says it's Dana
candidates = vem.identify(image_bytes)
wrong_obs_id = candidates[0].matched_observation_ids[0]
vem.relabel(wrong_obs_id, "Dana")
# A negative binding against Charlie is now recorded — the clusterer won't re-assign.
# User says "forget everything about Sarah"
sarah = vem.store.find_entity_by_name("Sarah")
if sarah is not None:
counts = vem.forget(sarah.id)
print(f"Deleted: {counts}")
# Pruned from LanceDB version history — actually gone, GDPR-compliant.
# This is NOT reversible. Warn the user before calling.
label(..., name="Charlie") re-uses an existing Charlie entity by name — but renaming an entity doesn't merge it with another same-named one. If the user has two "Charlie"s, use merge() explicitly.forget() is irreversible. Ask for confirmation before calling. 30-day undo does not cover it.identify() with a different encoder than the one used when building the gallery, you get an empty result — not a false match. This is by design.entity_id as the user_id).entity_id vemem returns is a perfect user_id for Mem0 / Letta / Supermemory. They own text conversational memory; vemem owns visual identity.vemem stores biometric identifiers. If the host app is deployed to users, the deployer is the data controller under GDPR / BIPA / CCPA. Key primitives:
forget(entity_id) = Art. 17 erasure (with prune)restrict(entity_id) = Art. 18 restrictionexport(entity_id) = Art. 20 portabilityFull checklist: COMPLIANCE.md in the vemem repo.
label() to commit. This keeps the hot path deterministic.| Symptom | Cause | Fix |
|---|---|---|
identify() returns [] on a face you labeled earlier | Different encoder version, or the face isn't being detected | Check encoder.id hasn't changed; try min_confidence=0.2 to see raw scores |
RuntimeError: image pipeline unavailable | InsightFace weights not installed | First call downloads ~200MB from InsightFace to ~/.insightface/; ensure network access on first use |
ModalityMismatchError on merge | Trying to merge a face entity with an object entity | v0 keeps modalities separate; create an instance_of relationship instead |
OperationNotReversibleError on undo | Past 30 days, or op was forget | Not fixable — forget is deliberately irreversible; window is configurable via DEFAULT_UNDO_WINDOW |
Loaded on demand when the agent needs the detail — keep them out of the hot context path.
references/mcp-tools.md — every MCP tool's
exact input/output shape. Read when deciding parameter names for a
specific tool call.references/examples.md — copy-paste code
recipes for Ollama, OpenAI, and Claude; correction flows; privacy
flows; composition with Mem0 / Letta.references/troubleshooting.md —
expanded error matrix with diagnostic commands. Read when a tool
raises something unexpected.docs/spec/identity-semantics.mddocs/ARCHITECTURE.mddocs/examples/real_bridge.mddocs/examples/mcp_usage.mdCOMPLIANCE.md