Review & Ship
Sweep, review, and ship in 10 minutes.
This covers Steps 4–6: the completeness sweep, independent review, and shipping. For deeper explanations of each step see Chapter 6: Your First Plan.
Step 4: Sweep for Deferred Work
The completeness sweep scans every code file for markers that indicate unfinished work: TODO, FIXME, HACK, stub, placeholder, mock data. For a health endpoint this should return zero.
pforge sweep
If the sweep finds any markers, resolve them before continuing. Deferred-work markers are how technical debt silently accumulates, this is where you catch them before they ship.
Step 5: Independent Review
Critical: start a brand-new chat session by clicking the + button. The reviewer must not carry context from the builder, context contamination is the most common source of missed errors.
- Select Agent Mode
- Attach
.github/prompts/step5-review-gate.prompt.md - Replace
<YOUR-HARDENED-PLAN>withPhase-1-HEALTH-ENDPOINT-PLANand send
The review agent checks every change against the Scope Contract: forbidden files not touched, no architecture violations, test coverage meets MUST criteria, no scope creep. For a simple health endpoint, expect a clean PASS.
Step 6: Ship
One final session (new or continued if context allows) to commit and close out the feature:
- Attach
.github/prompts/step6-ship.prompt.md - The agent commits using a conventional commit message:
feat(health): add GET /health endpoint - Updates
docs/plans/DEPLOYMENT-ROADMAP.mdto mark the phase complete - Captures a short postmortem for future sessions
# Stage everything and commit
git add -A
git commit -m "feat(health): add GET /health endpoint"
git push origin main
What Just Happened
You've completed all 4 sessions of the Plan Forge pipeline:
Session 1 (Specify & Plan) ✓ Described the feature; AI structured the plan
Session 2 (Execute) ✓ AI built it slice-by-slice with gates
Session 3 (Review) ✓ Fresh AI session audited for drift and errors
Session 4 (Ship) ✓ Committed, roadmap updated, postmortem captured
The four-session model is deliberate. Each session has a single responsibility and fresh context, the reviewer couldn't carry bias from the builder even if it wanted to. This is what makes the pipeline scale from a health endpoint to a 40-slice refactor.
What's Next
You've run the full pipeline end-to-end. The same process works for any feature, the pipeline scales with the work:
Deep dive on scope contracts, slices, validation gates, and stop conditions.
Live slice view, cost tracking, traces, and replay — your command desk.
Project Principles, profiles, and forge.json — make Plan Forge yours.
The Forge Shop, four stations, and why this architecture works.
Three worked case studies — the closed loop, the 99-vs-44 A/B test, the quorum run — for context on what just happened.