v1.0.2 RELEASED · 2026-05-05 Daemon · Swarm · Self-dev · Voice · Importer · Mermaid. Sub-50ms warm round-trip. See what's new →
v1.0.2 — The Open AI Agent OS

The AI that
actually does things.

Local-first AI agent OS for your machine. Daemon mode in sub-50ms, swarm runner across isolated git worktrees, sandboxed self-dev that only ships if tests pass, voice dictation, mermaid in the dashboard, and a one-shot importer that pulls your context out of Claude Code, Codex, or OpenCode. Tiered sandbox, signed plugins, multi-channel reach, Memory v2 — all built in. One product. One price. Yours forever.

⚡ Phantom v1.0.2 · Local AI Agent OS
Works with any OpenAI-compatible AI provider
Groq
NVIDIA NIM
Anthropic
Together AI
Ollama (local)
DeepSeek
Fireworks AI
Mistral
LM Studio
OpenAI
// v1.0.2 · just released · 2026-05-05

Ten new powers.
One unified version.

Phantom v1.0.2 ships in two tiers. Free covers chat, plugins, memory, MCP, bench, doctor — all forever, no card needed. Pro (₹999 lifetime) adds daemon mode, swarm runner, sandboxed self-dev, and voice dictation. Pro features are unlocked free for the first 14 days so you can try them. Existing licence holders just run /update; pre-gate users are grandfathered. Every future patch is free, on both tiers.

ONE-LINE INSTALL · Single command · One ₹999 lifetime licence
LINUX / MACOS
curl -fsSL https://phantom.aravindlabs.tech/install.sh | bash
WINDOWS POWERSHELL
irm https://phantom.aravindlabs.tech/install.ps1 | iex
Direct download: phantomcli-source.zip  ·  Manifest: version.json  ·  Existing users — just run /update.

Daemon Mode NEW

phantom serve opens a unix-socket daemon; phantom connect attaches in sub-50 ms. Closes the cold-start gap against Rust harnesses without a rewrite. macOS connect-before-accept race fixed with a warm-up retry ping.
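A minimal sketch of the two-terminal workflow described above — only `phantom serve` and `phantom connect` come from the copy; wrapping the attach in `time` is just one way to check the latency claim yourself:

```shell
# Terminal 1: start the long-lived daemon on a unix socket
phantom serve

# Terminal 2 (or any IDE/script): attach to the warm daemon.
# `time` lets you verify the warm round-trip on your own machine.
time phantom connect
```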

Swarm Runner NEW

phantom swarm "<goal>" --agents N fans out N subagents into isolated git worktrees, runs them in parallel, collects diffs, and flags file-level conflicts before merge. Try the same task three ways and pick the winner.
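The command shape and the --agents flag come from the copy above; the goal string is illustrative:

```shell
# Three subagents attack the same goal in parallel, each in its own
# git worktree; diffs are collected and conflicts flagged before merge
phantom swarm "add rate limiting to the API" --agents 3
```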

Sandboxed Self-Dev NEW

phantom self-dev "<change>" applies an edit in a worktree, runs your full test suite there, and only swaps onto your branch (with --swap) if everything stays green. Iterate without ever breaking main.
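A sketch of a single run — the change string is illustrative; --swap is the only flag named in the copy:

```shell
# Apply the edit in a throwaway worktree, run the full test suite there,
# and swap onto the current branch only if everything stays green
phantom self-dev "extract the retry logic into a helper" --swap
```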

Voice MVP NEW

phantom dictate records via sox/arecord/parecord and transcribes through Whisper. A stub backend ships for offline tests so CI doesn't need a microphone. It builds on the existing voice loop with VAD-driven STT/TTS and barge-in cancellation.

Cross-Harness Importer NEW

phantom memory import {claude-code,codex,opencode} pulls transcripts from other agents straight into Phantom's episodic memory — switch tools without losing context. Pair with phantom mcp import which slurps ~/.claude/mcp.json and ~/.codex/mcp.json in one command. No more re-pasting server configs.
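Both commands as named in the copy above:

```shell
# Move transcripts from another harness into Phantom's episodic memory
phantom memory import claude-code    # also: codex, opencode

# Pull MCP server configs from ~/.claude/mcp.json and ~/.codex/mcp.json
phantom mcp import
```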

Reproducible Benchmarks NEW

phantom bench prints cold-start time, daemon round-trip, RSS, turn-latency p50/p95, and scaling slope as JSON. Methodology baked in — anyone can verify the sub-50 ms claim on their own machine.
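One way to capture a run for later comparison — this assumes only that the output is a single JSON object (jq is not part of Phantom, and the field names are whatever phantom bench emits):

```shell
# Pretty-print the benchmark JSON and keep a copy tagged with the host
phantom bench | jq . | tee bench-$(hostname).json
```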

Custom OpenAI-Compatible Providers NEW

phantom config provider custom <name> --base-url --model --key-env registers vLLM, Ollama, LM Studio, or any OpenAI-shaped endpoint in one shot. No code edits, no JSON hand-editing.
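A sketch of registering a local Ollama instance — the flag names come from the copy; the endpoint, model name, and env-var name are illustrative (Ollama does expose an OpenAI-compatible API under /v1):

```shell
# Register a local Ollama endpoint as a named provider
phantom config provider custom ollama-local \
  --base-url http://localhost:11434/v1 \
  --model llama3.1 \
  --key-env OLLAMA_API_KEY
```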

Mermaid in Dashboard NEW

Fenced ```mermaid blocks render live in the web dashboard via the official renderer with securityLevel: 'strict' and a MutationObserver for streamed inserts. Architecture diagrams arrive drawn, not as ASCII.

Three New First-Party Plugins NEW

github-pr — gh CLI inspector for any PR. web-screenshot — Playwright PNG capture of any URL. code-review — pure-Python static lints over a unified diff. Demonstrates the Ed25519-signed plugin SDK end-to-end. Existing 5 (clock, weather, gh-search, code-search, todo) carry forward — eight first-party plugins total.

The complete platform — all built in.

Tiered sandbox (bubblewrap → firejail → unshare → docker on POSIX, passthrough on Windows v1.0), Ed25519-signed plugin SDK, multi-channel adapters (WebChat / Telegram / Discord / Slack / Matrix), MCP both-ways (client + server), ACP child-agent runtime with topological waves, Skills (SKILL.md bundles), Memory v2 (SQLite + FTS5 + TF-IDF reranker), realtime voice with barge-in, typed Canvas UI, PWA + service worker, i18n in 5 languages, KeyPool round-robin rotation, OTel-shaped metrics. Release date: 2026-05-05. ₹999 lifetime — every v1.0.2 capability included, every future patch included.

Existing users — your licence and install path are preserved. Run /update in your REPL. Both phantom (canonical) and python run.py (alternate entry point) keep working.

// capabilities

Everything you need.
Nothing you don't.

Designed to execute tasks on your local machine with real OS access, permanent memory, and remote control from your phone.

Dual-Engine AI Architecture

A heavy Main Engine handles reasoning, coding, and execution. A fast Router Engine classifies your prompt and shapeshifts into the right specialist — File Organizer, DevOps Agent, Code Writer, or Security Auditor. Works with any OpenAI-compatible provider.

God Mode Execution

Real bash commands on your machine. 48 catastrophic patterns permanently blocked. Trust levels 1–4. Level 4 requires your PHC license key to unlock — and auto-downgrades to Trust 3 after 30 minutes idle.

Telegram Remote Control

Full two-way chat with PhantomCLI from your phone. All /commands work. Separate trust level from local CLI. Owner-only security gate.

Media API Integrations

Image generation (FAL.ai, DALL-E, Stability, Replicate), voice TTS (ElevenLabs, OpenAI, PlayHT), speech-to-text (Deepgram). Configure once, use via natural language.

3-Device License

One purchase activates up to 3 machines. Hardware fingerprinting + server-side enforcement. Manage devices via python run.py setup.

JARVIS Sci-Fi TUI

Boot sequence, HUD panels, Claude Code-style thinking spinner. Local web dashboard at localhost:8080. Nothing hosted externally — your data stays on your machine.

Live Task Progress

Every tool call — shell, file write, search, edit — is shown as a live checklist (◻ → ◼ → ✔ / ✖) in both the terminal and the web dashboard. For multi-step jobs the model plans upfront, so you see exactly where it is.

ML & Data Pipelines

Hand it a dataset — it auto-installs pandas, scikit-learn, xgboost, plotly, transformers, whatever fits the task. Writes modular Python, trains the model, serves an interactive FastAPI / Flask / Streamlit dashboard and prints the URL.

9-Event Hook Lifecycle

PreToolUse, PostToolUse, UserPromptSubmit, Stop, SubagentStop, SessionStart/End, PreCompact, Notification. Shell hooks (stdin JSON) + programmatic Python hooks via @hook decorators. Non-zero exit blocks for binding events; fail-open on timeout.
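A sketch of a shell hook in the shape described above — event JSON arrives on stdin and a non-zero exit blocks the tool call for binding events. The JSON field names and the file path are illustrative, not Phantom's documented schema:

```shell
# Write a hypothetical PreToolUse hook that blocks destructive commands
cat > /tmp/block_rm.sh <<'EOF'
payload=$(cat)                     # Phantom pipes the event JSON to stdin
if printf '%s' "$payload" | grep -q 'rm -rf'; then
  echo "blocked: destructive command" >&2
  exit 2                           # non-zero exit blocks the tool call
fi
exit 0
EOF

# Simulate Phantom firing the hook
echo '{"event":"PreToolUse","command":"rm -rf /tmp/x"}' \
  | bash /tmp/block_rm.sh || echo "tool call blocked"
```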

Pattern Permissions + Audit Chain

Scoped allow/deny/ask: bash:git:*, write:~/projects/**, deny:bash:rm:*. Every decision hash-chained into ~/.phantom/audit.jsonl. Tamper-detectable with /audit verify.
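The hash-chain idea in miniature: each record's hash covers the previous record's hash, so editing any line breaks every hash after it — which is what /audit verify can detect. The field layout below is illustrative, not Phantom's actual audit.jsonl schema:

```shell
# Build a tiny two-entry hash chain the way a chained audit log would
: > /tmp/audit.jsonl
prev=genesis
for decision in '{"rule":"bash:git:*","verdict":"allow"}' \
                '{"rule":"deny:bash:rm:*","verdict":"deny"}'; do
  hash=$(printf '%s%s' "$prev" "$decision" | sha256sum | cut -d' ' -f1)
  printf '{"entry":%s,"prev":"%s","hash":"%s"}\n' "$decision" "$prev" "$hash" \
    >> /tmp/audit.jsonl
  prev=$hash                       # next entry chains off this hash
done
```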

CLAUDE.md Hierarchy — with Live Reload

Auto-loads CONTEXT.md from three scopes (user, project-root, local dir) into the system prompt on session start. Poll-based watcher picks up edits live — no restart needed.

Multi-Tier Model Routing

Classifier picks cheap tier (tool-arg turns), mid tier (default), or expensive tier (reasoning cues) per turn. Cuts token cost 40–60% vs single-model. Per-model price sheet + daily spend alerts via the Notification hook.

VS Code Extension

Dedicated activity bar with 4 tree views (Sessions, Memories, Hooks, Tools). 5 code actions on any selection: explain, refactor, fix, add tests, document. Quick Pick palette, rich status bar with live cost tile, workspace context injection (active file + open files + diagnostics), transcript export to Markdown, inline diff preview. Stdio bridge to the CLI.

MCP Resources + Reconnect

Model Context Protocol client with resources/list, resources/read, subscribe/unsubscribe, and auto-reconnect with exponential backoff. Every MCP tool call surfaces in the model's structured tool list.

Dual Streaming Parsers

Native Anthropic SSE (message_start / content_block_delta / message_stop) AND OpenAI-compatible chunks. UTF-8 chunk-boundary safety, partial-JSON repair for tool args truncated mid-stream. 60+ combined tests.

OpenTelemetry Observability

Every model call + tool dispatch + hook fire emits a phantom.* OTel span with latency, tokens, ok/error. Point OTEL_EXPORTER_OTLP_ENDPOINT at Honeycomb / Datadog / Jaeger. In-memory exporter for local debugging.
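Pointing the spans at a collector uses the standard OpenTelemetry environment variable; the Honeycomb endpoint below is illustrative:

```shell
# Standard OTLP env var; any OTLP-speaking backend works
export OTEL_EXPORTER_OTLP_ENDPOINT="https://api.honeycomb.io"
phantom serve   # phantom.* spans flow out as model calls and tools fire
```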

Self-Heal After Launch

When you ask Phantom to build and run an app, it HTTP-GETs the URL after launch, detects 5xx / tracebacks in the body, reads the log, auto-patches the source, relaunches — before claiming "live". Tested against real Flask apps.

Parallel Subagents + Scratchpad

Multi-agent orchestrator runs dependency-ordered waves in parallel. A SQLite-backed scratchpad lets upstream agents publish partial results that downstream agents read mid-run — faster than re-running prompts. 6 built-in agent types with tool allowlists + worktree isolation.

11 Slash Commands + User-Defined

/help /clear /model /memory /perm /hook /session /compact /cost /update /exit. Drop a ~/.phantom/commands/mycmd.md for your own templates with frontmatter description.
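A sketch of a user-defined command — the path and the frontmatter description field come from the copy; the body being a plain prompt template is an assumption:

```shell
# Create a custom /standup command (body format assumed to be a prompt)
mkdir -p ~/.phantom/commands
cat > ~/.phantom/commands/standup.md <<'EOF'
---
description: Summarise my uncommitted changes as a standup update
---
Look at `git status` and `git diff`, then write a three-bullet standup update.
EOF
```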

4-Layer Settings Hierarchy

System → user → project-shared → project-local, with locked_keys for enterprise. 128K-token context compaction with PreCompact hook + LLM-backed summariser. Session save/load with atomic writes.

Iterative Web Research

/web now does Claude-Code-style two-round research: round 1 scrapes the top 5 pages, the router model inspects the gaps and proposes 2 specific follow-up queries, round 2 scrapes 2 more pages each — ~9 pages total at 6,000 chars each. Output is clean narrative (no Step 1/2/3 spam, no inline citation noise). Run /sources after to see the URLs with first-line previews. ~30–60s wall time, dramatically deeper answers.

Refinement-Aware Project Router

Follow-ups like "ui is not good" or "also add login" stay in the active project instead of spawning a new project_xxxxxxxx. Two-tier router detects refinement cues + active project and continues silently. Every refinement appended to phantom_summary.md as a run-history entry. Hardened multi-agent fallback: if the planner returns malformed JSON, a 4-agent default skeleton (Fetcher, Backend, Frontend, Runner) always produces a working build.

Clean Self-Update

/update auto-exits the REPL on a successful upgrade so Python's sys.modules cache can't silently keep running stale code. Failed updates and "already-latest" cases stay in the REPL so you can retry. Backed by 4 dedicated tests + the full 775-Python-test suite (903 with TS + VS Code).

// quickstart

From zero to running
in 60 seconds.

1
Buy Once
Pay ₹999. License key + 3 personalised install commands sent to your email instantly.
₹999 · One-time
2
Install
One command on Linux, macOS, or Windows PowerShell. All dependencies auto-installed.
curl …install.sh?key=PHC-XXXX | bash
3
Launch
Run setup, open localhost:8080. All computation stays on your machine — zero cloud dependency.
python run.py dashboard
// comparison

Built different.
By design.

How PhantomCLI stacks up against other local AI agents.


Phantom v1.0.2 vs Claude Code, OpenClaw, AutoGPT, and AgentZero — everything below is built into Phantom:

Any OpenAI-compatible model
Local OS bash execution
9-event hook lifecycle
Pattern permissions (bash:git:*, write:~/**)
Hash-chained audit log
CLAUDE.md hierarchy with live reload
Multi-tier model routing (cheap/mid/expensive)
VS Code extension with activity bar + code actions
MCP resources + subscribe + reconnect
Anthropic SSE + OpenAI SSE streaming (Claude Code: Anthropic only)
Programmatic Python hooks (@hook decorator) (Claude Code: shell only)
Parallel multi-agent with shared scratchpad (Claude Code: sequential only)
Self-heal after launching built apps
OpenTelemetry spans + metrics
Telegram two-way bot
Persistent memory — episodic log + FTS5 RAG
Image / Video / Voice APIs
Permanent dangerous-cmd blocklist
Live task-progress checklist
Auto ML pipelines + dashboards
One-time payment, lifetime (Claude Code: Anthropic subscription)
// early feedback

What beta testers say.

★★★★★

"Getting local OS execution working was shockingly simple. It organised my entire Downloads folder in one sentence."

AR
Arjun R. · Backend Engineer · Bangalore
★★★★★

"Telegram integration is insane. I literally SSH into my VPS by texting my bot while commuting. Set it up in 10 minutes."

SK
Suresh K. · DevOps Engineer · Chennai
★★★★★

"Replaced 8 Python automation scripts with natural language. The God Mode terminator animation when you hit Level 4 is 🔥"

MR
Meera R. · Full-Stack Developer · Mumbai
// pricing

One price. Forever yours.

No monthly fees. No hidden charges. Buy once and own PhantomCLI for life.

₹999
One-time payment · Lifetime license · No subscription

Secured by Razorpay · License in 60 seconds · 3-device activation · Free lifetime updates · No cloud — runs locally
Already purchased?
// faq

Frequently asked questions.

Does PhantomCLI send my data to the cloud?
No. Everything runs locally on your machine. Conversations, files, and command history never leave your computer. The only external calls are to AI provider APIs you configure with your own keys, and license validation at activation time.

Which AI models can I use?
Any model that follows the OpenAI API format — just provide a Base URL, API Key, and Model name. This covers Groq, NVIDIA NIM, Together AI, Fireworks, DeepSeek, Mistral, Anthropic, and local runtimes like Ollama and LM Studio. If it has an OpenAI-compatible endpoint, PhantomCLI can use it.

Is the web dashboard hosted online?
No — it runs at localhost:8080 on your own machine. Aravind Labs servers are only involved in purchase and license key validation.

Does it work on Windows?
Yes. Open PowerShell and run the one-liner from your purchase email: irm "https://phantom.aravindlabs.tech/phantomcli/install.ps1?key=PHC-XXXX" | iex

How does daemon mode work?
New in v1.0.2. Run phantom serve in one terminal — it opens a unix-socket daemon. Run phantom connect in another (or any IDE/script) and you get a sub-50 ms warm round-trip instead of paying Python startup every time. The tiered sandbox (Trust 1–4) and 48-pattern dangerous-command blocklist still apply. Trust 4 ("God Mode") still requires your licence key and auto-downgrades to Trust 3 after 30 min idle.

Can I bring my context from other AI tools?
Yes. Run phantom memory import claude-code (or codex, opencode) and your transcripts move into Phantom's episodic memory. phantom mcp import grabs ~/.claude/mcp.json and ~/.codex/mcp.json in one shot. No re-pasting server configs.

What happens if I activate a 4th device?
The 4th activation is rejected. Deactivate a slot via python run.py setup → option 9. Need help? Email support@aravindlabs.tech.

Is ₹999 really a one-time payment?
No subscription, no recurring charge. ₹999 is a single one-time payment that unlocks Pro features (daemon mode, swarm, voice, self-dev) on up to 3 devices, with every future update included forever. The Free tier (chat + plugins + memory + MCP + bench + doctor) stays free for everyone, and Pro features are unlocked free for the first 14 days so you can try them before deciding.

Do I need to be comfortable with the command line?
Basic comfort with a terminal is helpful but not required. The setup wizard walks you through every step. Once running, you just type in plain English — "organise my Downloads folder", "check disk usage", "push my git changes" — and PhantomCLI handles the rest. The web dashboard at localhost:8080 needs no command-line knowledge at all.

How is this different from ChatGPT?
ChatGPT tells you what command to run. PhantomCLI actually runs it. It has direct access to your filesystem, can execute bash scripts, browse the web for live data, control your machine via Telegram, and remember context across sessions. It's the difference between an advisor and an employee.
Local-first · No cloud lock-in

Your machine is waiting.

v1.0.2 ships today. Daemon, swarm, self-dev, voice, importer — all included. ₹999 once, yours forever, every future patch free.

Get Phantom v1.0.2 Now