Architecture April 7, 2026 · 10 min read

From WhatsApp to Shipped PR: How I Automated My Entire Dev Workflow

Director @ Microsoft

Three forge stations connected by golden chains — amber blueprint scroll, blue memory orb, and purple message scrolls — with a smartphone beaming light into the central forge

I'm at dinner. My phone buzzes — a WhatsApp message from me to myself, earlier that day: "Build user profile API for my-api project. Include avatar upload, email verification, and role-based access."

By the time I get home, the feature is built. Tests pass. An independent review found zero drift violations. A PR is waiting. Every architectural decision is captured in semantic memory so the next feature starts smarter than this one did.

No, this isn't a demo. This is what happens when you wire three open-source tools together into a closed-loop system.

The Three Pillars

Each tool solves one critical problem in AI-assisted development:

// The Unified System

Plan Forge — The blueprint

What to build, how, and when to stop

│

OpenBrain — The memory

Why we decided, what we learned, what failed

│

OpenClaw — The nervous system

Always-on orchestration across every channel

Alone, each is useful. Plan Forge gives you guardrails but forgets between sessions. OpenBrain gives you memory but waits passively for queries. OpenClaw gives you reach (20+ messaging channels) but has no methodology or memory.

Together, they form something none of them can be alone: a system that plans, builds, remembers, and communicates across every surface you use.

The Full Lifecycle: Phone to PR

Here's what actually happens when I send that WhatsApp message.

1. Request Capture

I send a message to my OpenClaw gateway on WhatsApp: "Build user profile API for my-api project." OpenClaw's skill engine activates the plan-forge-orchestrator skill.

Before doing anything else, OpenClaw queries OpenBrain: "What do we already know about this project?" OpenBrain returns prior architectural decisions, active conventions, and patterns from past features. OpenClaw compiles this into a context briefing.

2. Plan Hardening

OpenClaw launches a Copilot CLI session via the Agent Client Protocol (ACP). The session loads all Plan Forge guardrails — auto-loading instruction files that enforce architecture, security, testing, and error handling standards based on what file is being edited.

The AI generates a hardened execution contract with the 6 mandatory blocks: scope contract, execution slices, validation gates, forbidden actions, rollback protocol, and completion criteria. It searches OpenBrain for prior decisions before locking anything down.

My phone buzzes: "Plan hardened. 5 slices. Approve?"

I reply: "Approved."

3. Slice-by-Slice Execution

A new Copilot CLI session starts (fresh context — the builder must not self-review). For each slice:

The AI writes the code, following Plan Forge guardrails
Build + test gates run — the slice must pass before proceeding
Every decision gets captured to OpenBrain with full provenance
My phone buzzes: "Slice 2 done. Tests pass. ✓"

If a slice fails, I get notified immediately. I can reply with guidance, or let the auto-escalation chain try a more capable model.

4. Independent Review

A third Copilot CLI session starts — fresh context again. The reviewer loads the hardened plan as reference, queries OpenBrain for all captured decisions, and audits the implementation against Plan Forge guardrails.

My phone buzzes: "Review complete. 0 drift violations. Ship it?"

I reply: "Ship it."

5. Ship + Learn

The code is committed, pushed, and a PR is created. OpenBrain captures a batch postmortem — every decision, pattern, and lesson from the entire feature. Next time an AI session works on this project, it starts with all of that context.

Update (v2.18): Plan Forge now includes a Context Fuel instruction file that teaches agents to manage their own context window — when to load which guardrails, how to recognize context degradation mid-session, and when to recommend a fresh session. Combined with OpenBrain’s persistent memory, this means agents don’t just have context — they know how to use it efficiently.

The loop closes. Each feature makes the next one smarter. Memory compounds. The system gets better without any manual intervention.

Why Not Just Use One Tool?

I get this question a lot. Here's the honest answer:

Plan Forge alone gives you excellent guardrails in VS Code. But it's session-isolated — great for preventing drift within a session, but no memory across features. And you can't interact with it from your phone.

OpenBrain alone gives you semantic memory. But it's passive — it captures and retrieves, but doesn't enforce anything. You have to remember to search it.

OpenClaw alone gives you reach across 20+ channels and an agent runtime. But it has no development methodology and no memory. It's a powerful conduit with nothing to enforce.

The magic is in the integration layer. OpenClaw routes work and manages sessions. Plan Forge provides the methodology and guardrails. OpenBrain makes sure nothing is ever forgotten. Each tool does one thing well. Together, they cover every gap.

The Architecture

// Unified Development System

Developer Surfaces

Phone (WhatsApp / Telegram / Slack)

Desktop (VS Code + Copilot)

Terminal (Copilot CLI)

Voice (OpenClaw Talk Mode)

↓ all routes converge ↓

OpenClaw Gateway

Skills engine · Session manager · Cron · Notifications

↓ orchestrates ↓

Plan Forge

Guardrails · 21 agents
4 stations · LiveGuard

OpenBrain

7 MCP tools · pgvector
Semantic search · Provenance

↓ executes via ↓

Copilot CLI (ACP)

Headless · Programmable · Your unlimited Copilot tokens

The key enabler is the Agent Client Protocol (ACP) — GitHub Copilot CLI can run as a programmable server that any external tool can control. OpenClaw uses ACP to launch sessions, send prompts, manage tool approvals, and stream results. No GUI needed. No manual intervention.

What This Means in Practice

Describe a feature from your phone — WhatsApp, Telegram, Slack, Discord, Signal, even iMessage
Get progress updates wherever you are — every slice, every gate, every review result
Approve or reject via chat — no need to open an IDE
Every decision is captured automatically — no manual memory management
Every future session starts smarter — the AI already knows your conventions, patterns, and lessons
Independent review is structural — fresh context, no sunk cost bias, full decision history
Uses your existing Copilot subscription — unlimited tokens, no separate API bills

Tools in a Toolbox

The unified system is the most powerful configuration, but each tool works standalone. Use one. Use two. Combine them however you want. They're tools in a toolbox, not a monolith.

Just want guardrails? — Plan Forge standalone in VS Code
Just want memory? — OpenBrain as a standalone MCP server
Just want reach? — OpenClaw as a standalone AI gateway
Want the full loop? — Wire all three together with the Unified System Architecture

All three are free. All three are MIT licensed. All three are self-hosted — no cloud dependencies unless you want them.

The future of development isn't AI writing code. It's AI writing code within a system that plans, enforces, remembers, and improves — automatically, across every surface you use.

← Previous: Quorum Mode Next: Seven Agents →