From WhatsApp to Shipped PR: How I Automated My Entire Dev Workflow
Director @ Microsoft
I'm at dinner. My phone buzzes — a WhatsApp message from me to myself, earlier that day: "Build user profile API for my-api project. Include avatar upload, email verification, and role-based access."
By the time I get home, the feature is built. Tests pass. An independent review found zero drift violations. A PR is waiting. Every architectural decision is captured in semantic memory so the next feature starts smarter than this one did.
No, this isn't a demo. This is what happens when you wire three open-source tools together into a closed-loop system.
The Three Pillars
Each tool solves one critical problem in AI-assisted development:
Alone, each is useful. Plan Forge gives you guardrails but forgets between sessions. OpenBrain gives you memory but waits passively for queries. OpenClaw gives you reach (20+ messaging channels) but has no methodology or memory.
Together, they form something none of them can be alone: a system that plans, builds, remembers, and communicates across every surface you use.
The Full Lifecycle: Phone to PR
Here's what actually happens when I send that WhatsApp message.
1. Request Capture
I send a message to my OpenClaw gateway on WhatsApp: "Build user profile API for my-api project." OpenClaw's skill engine activates the plan-forge-orchestrator skill.
Before doing anything else, OpenClaw queries OpenBrain: "What do we already know about this project?" OpenBrain returns prior architectural decisions, active conventions, and patterns from past features. OpenClaw compiles this into a context briefing.
2. Plan Hardening
OpenClaw launches a Copilot CLI session via the Agent Client Protocol (ACP). The session loads all Plan Forge guardrails — 18 instruction files that auto-enforce architecture, security, testing, and error handling standards.
The AI generates a hardened execution contract with the 6 mandatory blocks: scope contract, execution slices, validation gates, forbidden actions, rollback protocol, and completion criteria. It searches OpenBrain for prior decisions before locking anything down.
My phone buzzes: "Plan hardened. 5 slices. Approve?"
I reply: "Approved."
3. Slice-by-Slice Execution
A new Copilot CLI session starts (fresh context — the builder must not self-review). For each slice:
- The AI writes the code, following Plan Forge guardrails
- Build + test gates run — the slice must pass before proceeding
- Every decision gets captured to OpenBrain with full provenance
- My phone buzzes: "Slice 2 done. Tests pass. ✓"
If a slice fails, I get notified immediately. I can reply with guidance, or let the auto-escalation chain try a more capable model.
4. Independent Review
A third Copilot CLI session starts — fresh context again. The reviewer loads the hardened plan as reference, queries OpenBrain for all captured decisions, and audits the implementation against Plan Forge guardrails.
My phone buzzes: "Review complete. 0 drift violations. Ship it?"
I reply: "Ship it."
5. Ship + Learn
The code is committed, pushed, and a PR is created. OpenBrain captures a batch postmortem — every decision, pattern, and lesson from the entire feature. Next time an AI session works on this project, it starts with all of that context.
Update (v2.18): Plan Forge now includes a Context Fuel instruction file that teaches agents to manage their own context window — when to load which guardrails, how to recognize context degradation mid-session, and when to recommend a fresh session. Combined with OpenBrain’s persistent memory, this means agents don’t just have context — they know how to use it efficiently.
The loop closes. Each feature makes the next one smarter. Memory compounds. The system gets better without any manual intervention.
Why Not Just Use One Tool?
I get this question a lot. Here's the honest answer:
Plan Forge alone gives you excellent guardrails in VS Code. But it's session-isolated — great for preventing drift within a session, but no memory across features. And you can't interact with it from your phone.
OpenBrain alone gives you semantic memory. But it's passive — it captures and retrieves, but doesn't enforce anything. You have to remember to search it.
OpenClaw alone gives you reach across 20+ channels and an agent runtime. But it has no development methodology and no memory. It's a powerful conduit with nothing to enforce.
The magic is in the integration layer. OpenClaw routes work and manages sessions. Plan Forge provides the methodology and guardrails. OpenBrain makes sure nothing is ever forgotten. Each tool does one thing well. Together, they cover every gap.
The Architecture
7-step pipeline · Lifecycle hooks
Semantic search · Provenance
The key enabler is the Agent Client Protocol (ACP) — GitHub Copilot CLI can run as a programmable server that any external tool can control. OpenClaw uses ACP to launch sessions, send prompts, manage tool approvals, and stream results. No GUI needed. No manual intervention.
What This Means in Practice
- Describe a feature from your phone — WhatsApp, Telegram, Slack, Discord, Signal, even iMessage
- Get progress updates wherever you are — every slice, every gate, every review result
- Approve or reject via chat — no need to open an IDE
- Every decision is captured automatically — no manual memory management
- Every future session starts smarter — the AI already knows your conventions, patterns, and lessons
- Independent review is structural — fresh context, no sunk cost bias, full decision history
- Uses your existing Copilot subscription — unlimited tokens, no separate API bills
Tools in a Toolbox
The unified system is the most powerful configuration, but each tool works standalone. Use one. Use two. Combine them however you want. They're tools in a toolbox, not a monolith.
- Just want guardrails? — Plan Forge standalone in VS Code
- Just want memory? — OpenBrain as a standalone MCP server
- Just want reach? — OpenClaw as a standalone AI gateway
- Want the full loop? — Wire all three together with the Unified System Architecture
All three are free. All three are MIT licensed. All three are self-hosted — no cloud dependencies unless you want them.
The future of development isn't AI writing code. It's AI writing code within a system that plans, enforces, remembers, and improves — automatically, across every surface you use.