Autonomous Agent Runtimes & Claws
Persistent AI entities that live on your machine and act for days
Claws sit above agents in Karpathy's stack: persistent, identity-stable, sandboxed runtimes. OpenClaw hit 350K+ stars in 60 days; founder Steinberger joined OpenAI Feb 14, 2026, the foundation took stewardship. SOUL.md + MEMORY.md are the identity primitives. NVIDIA NemoClaw and Trend Micro's threat model define the security frontier.
Research briefs like this — one per week. Validated sources, no filler.
Subscribe350K+
OpenClaw GitHub stars (Apr 2026 snapshot)
Star History / The New Stack
23+
LLM providers supported by SwarmClaw runtime
swarmclawai/swarmclaw
~4K vs ~434K
NanoClaw lines vs OpenClaw lines — the forkability tradeoff
Karpathy / nanoclaw.dev
What Is a Claw (and Why It's Not Just an Agent)
Karpathy's March 2026 framing on the No Priors podcast and follow-up X threads places claws as a distinct layer above LLM agents — the way agents sit above raw LLMs. A claw owns its sandbox, persists identity across sessions, schedules its own work, and communicates over messaging protocols rather than a chat UI. Where an agent executes a task, a claw resides on a machine. Karpathy's own claw "Dobby" runs his smart home over WhatsApp. The framing is still settling — "claw" was popularized by OpenClaw (originally Clawdbot, renamed January 2026 after a trademark complaint), and Karpathy adopted and abstracted the term.
LLM
BaseStateless next-token prediction
Agent
Layer 2Task-scoped, ephemeral execution with tools
Claw
Karpathy Mar 2026Persistent, identity-stable, sandboxed runtime with memory, scheduling, and messaging interface
Forkable claw
Design philosophyKarpathy's preference — minimal core (~4K LOC) plus skills the agent uses to modify its own repo
OpenClaw — The Anchor Project
Peter Steinberger (Austrian developer, ex-PSPDFKit) first published the project November 2025 as Clawdbot, renamed to OpenClaw in late January 2026. The architecture: a local-first Node.js daemon (the "Gateway") that maintains persistent connections to 20+ messaging platforms, routes channels to isolated per-agent workspaces, and reads two human-editable markdown files on every wake — SOUL.md (the agent's character spec / behavioral context) and MEMORY.md (an append-only continuity log). On Feb 14, 2026 Steinberger announced he was joining OpenAI to drive next-gen personal agents; the OpenClaw Foundation was established to hold stewardship and OpenAI committed to sponsoring it as MIT-licensed OSS. Star trajectory: 9K on launch day, 60K by day 3, 190K within two weeks, 250K by Mar 3, 2026, 350K+ by Apr 8 — faster than React, Linux, or Kubernetes.
SOUL.md
PatternIdentity primitive — plain markdown the agent injects into the system prompt on every wake. Personality, opinions, behavioral constants. Persistent context file, not a consciousness claim.
MEMORY.md
PatternAppend-only continuity log; human-readable, Git-trackable, ~100-line curated tier-1 context.
Messaging UI
InterfaceSignal, Telegram, WhatsApp, Discord, Slack, iMessage, Matrix, IRC, Teams (~20 total).
Pluggable LLM
RuntimeProvider-agnostic. Claude, GPT, DeepSeek, Gemini, OpenRouter, Ollama for local.
OpenClaw Foundation
GovernanceStewardship non-profit established Feb 2026; MIT license; OpenAI is named sponsor.
The Claw Ecosystem
The OpenClaw moment spawned a sub-ecosystem in weeks. Identity files split into specialized concerns; runtimes forked into lighter and heavier variants; vendors layered enterprise controls on top.
aaronjmars/soul.md
IdentityPersonality builder. Pipeline ingests blogs, podcasts, X posts and distills into SOUL.md + STYLE.md + MEMORY.md the agent embodies.
swarmclawai/swarmclaw
RuntimeSelf-hosted multi-agent runtime; 23+ LLM providers; MCP gateway with tool namespacing; npm and Docker install.
TechNickAI/openclaw-config
Skill packOpenClaw on Claude Code. Three-tier memory (MEMORY.md tier-1, daily journals tier-2, semantic vector store tier-3), /fleet command for multi-machine.
ClaudeClaw (Craddock)
PatternMedium playbook (Mar 2026) using `claude -p` headless mode + session resume to map messaging threads onto Claude Code processes.
NanoClaw
Minimal coreKarpathy-endorsed ~4K-LOC minimal claw. Skills modify the repo to add features rather than ship them.
NVIDIA NemoClaw
Vendor stackEnterprise security layer announced at GTC Mar 16, 2026. Kernel sandbox, Rust egress policy engine, privacy router that keeps sensitive context on-device.
memsearch (Zilliz/Milvus)
MemoryOpen-sourced Mar 12, 2026. Extracted OpenClaw's memory subsystem into a standalone library. Markdown logs + hybrid vector search + Git.
clawdocs.org
DocsCommunity docs. SOUL.md / IDENTITY.md / USER.md / AGENTS.md / TOOLS.md file-concern separation guide.
Adjacent: Autonomous Coding Agents
Claws share DNA with autonomous coding agents but the scope is different. A coding agent is task-bounded: open a PR, fix a bug, ship a feature, then exit. A claw resides — it answers messages on Sunday morning, runs cron jobs Tuesday night, edits its own MEMORY.md before sleep. Compare on persistence and interface, not just capability.
Claude Code + Agent SDK
Task-scopedAnthropic's CLI / SDK. Task-scoped by default but ClaudeClaw and openclaw-config wrap it into claw shape.
OpenHands (ex-OpenDevin)
Task-scopedOpen-source agent platform; ~72% SWE-bench Verified with Claude under CodeAct. Self-hostable.
Aider
InlinePair-programmer at the terminal. Inline, interactive — explicitly not autonomous.
Cursor Composer / Windsurf / Cline
IDEIDE-embedded agents. Human-in-the-loop per decision.
Cognition Devin
Task-scopedCommercial autonomous SWE; sandboxed browser/terminal/editor; ~$500/mo for teams.
Princeton SWE-agent
ResearchResearch framework optimized for benchmark eval, not daily use.
Replit Agent
IDECloud-native build agent inside the Replit IDE.
The SOUL.md Pattern as Architectural Primitive
The through-line across the ecosystem: agent state as plain human-editable markdown, version-controllable with Git, separated by concern. SOUL.md holds the character spec (what the agent embodies internally); IDENTITY.md holds the public-facing name/avatar (3-5 lines max); USER.md holds user facts; AGENTS.md holds rules and guardrails; TOOLS.md holds environment notes; MEMORY.md holds learned patterns and continuity. The pattern is load-bearing because it makes the agent's behavior auditable and debuggable by a human reading a file — not a vector store inspection ritual. memsearch then indexes those files into Milvus for semantic recall without ever becoming the source of truth. The files stay primary, the vectors are a derived index.
Risks & Security Posture
The same properties that make claws useful — persistence, messaging ingestion, tool access, autonomy — invert into the attack surface. Trend Micro's March 2026 "CISOs in a Pinch" analysis and the ISACA "Security Claw" piece (Richard Beck, April 2026) both flag the gap between deployment pace and control regimes. Documented vectors: the "Good Morning" attack (innocent-looking link contains hidden instructions the agent ingests as context), persistent memory poisoning (malicious prompt embedded in a benign email that triggers weeks later), emotional manipulation / guilt-tripping the model into self-sabotage, supply-chain compromise of skills, and exposed Gateway instances with RCE. Karpathy publicly noted hesitation about running OpenClaw specifically because of the "vibe-coded 400K lines being actively attacked at scale." NVIDIA's NemoClaw responds with kernel-level sandboxing and a Rust policy engine that lives outside the agent's reach — guardrails that prompt injection cannot rewrite.
Sovereign Angle — Your Claw on Your Silicon
A claw running on hardware you own, with memory in markdown files you control, talking to local models via Ollama, is the convergence point with the other three domains: home-ai-labs (the rig), ai-chips-silicon (the inference substrate), sovereign-ai-compute (the policy stack). The privacy router pattern NemoClaw introduced — keep sensitive context on-device, only reach for frontier models when policy allows — formalizes the hybrid. memsearch makes the memory layer portable. SOUL.md makes the identity layer portable. The claw becomes the user-facing surface of sovereign AI.
Key Findings
OpenClaw passed 250K GitHub stars by Mar 3, 2026 — beating React's 10-year record in 60 days; 350K+ by April.
Founder Peter Steinberger joined OpenAI on Feb 14, 2026; an MIT-licensed OpenClaw Foundation took stewardship with OpenAI as sponsor.
Karpathy's March 2026 framing positions claws as a layer above agents — persistent, identity-stable, sandboxed — but he publicly prefers minimal forkable cores (NanoClaw ~4K LOC) over the OpenClaw monolith (~434K LOC).
The SOUL.md + MEMORY.md pattern (plain markdown, Git-trackable, human-editable) is becoming the de facto identity primitive across the ecosystem — adopted by aaronjmars/soul.md, TechNickAI/openclaw-config, ClaudeClaw, and the clawdocs.org community docs.
Vendor consolidation moved within 60 days: NVIDIA NemoClaw (GTC Mar 16, 2026) for enterprise security; Zilliz memsearch (Mar 12, 2026) for the memory layer; SwarmClaw for multi-agent runtime with 23+ providers.
Trend Micro and ISACA both published 2026 threat assessments — "Good Morning" link-injection, persistent memory poisoning, emotional manipulation, and exposed Gateway RCE are the documented vectors. Karpathy himself flagged supply-chain risk in OpenClaw skills.
The category gap is persistence: Claude Code, OpenHands, Aider, Devin, Cursor Composer are task-scoped agents. A claw is what you get when an agent resides on a machine, holds identity across days, and uses messaging as the UI.
Research Transparency
Limitations
- •The field is six months old. Star counts, foundation governance, and vendor positioning are still moving weekly.
- •Terminology drift is unresolved. "Claw" was coined by Steinberger's project, abstracted by Karpathy, and is now applied to runtimes that share little architecture beyond a messaging UI and a persistent file.
- •Karpathy-vs-Steinberger framing ambiguity. Karpathy publicly prefers NanoClaw's minimal forkable core and has flagged security concerns about running OpenClaw specifically — the spec is still being contested by its most prominent commentators.
- •Many sources are vendor blogs, viral-moment journalism, or Medium pieces with measurable substance but limited peer review. Cross-referencing matters.
What We Don't Know
- ?Long-term reliability of self-editing markdown memory under months of autonomous append-only writes — does identity drift compound?
- ?How claws hold up under sustained adversarial inbox traffic. Trend Micro's threat model is theoretical-plus-demos; we lack longitudinal incident data.
- ?Jurisdictional and liability frameworks for autonomous-spending claws (payments, scheduling, contract signing). Foundation governance is set; legal regimes are not.
Frequently Asked Questions
An agent is task-scoped — it executes a job and exits. A claw is a persistent runtime that lives on a machine, holds identity and memory across sessions, schedules its own work, and uses messaging apps as the primary interface. Karpathy's March 2026 framing places claws as a distinct layer above agents in the AI capability stack.
Sources & References
23 validated sources · Last updated 2026-06-22