Chapter 6

The Dashboard

Nine tabs: real-time execution monitoring, cost tracking, session replay, and one-click actions.

Starting the Dashboard

The dashboard is part of the MCP server. Start it, then open your browser:

Terminal
# Full MCP server (stdio + HTTP + WebSocket)
node pforge-mcp/server.mjs

# Dashboard + REST API only (no MCP stdio)
node pforge-mcp/server.mjs --dashboard-only

Open http://localhost:3100/dashboard. The dashboard connects over WebSocket on port 3101 for real-time updates.

Auto-start: If you have .vscode/mcp.json configured (created during setup), the MCP server starts automatically when Copilot uses a forge tool. The dashboard is always available at port 3100 while the server runs.
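Both ports can be overridden (see the Port Reference at the end of this chapter). A small sketch of resolving the dashboard and WebSocket URLs from those environment variables, assuming only the documented defaults of 3100 and 3101 — the WebSocket path shown here is an assumption:

```javascript
// Resolve dashboard + WebSocket URLs from the documented env vars,
// falling back to the default ports (3100 HTTP, 3101 WebSocket).
// The bare ws:// URL (no path) is an assumption, not documented behavior.
function dashboardUrls(env = process.env) {
  const httpPort = Number(env.PORT ?? 3100);
  const wsPort = Number(env.WS_PORT ?? 3101);
  return {
    dashboard: `http://localhost:${httpPort}/dashboard`,
    events: `ws://localhost:${wsPort}`,
  };
}
```

With no overrides, `dashboardUrls({}).dashboard` resolves to the default `http://localhost:3100/dashboard`.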

Progress Tab

The default view during plan execution. This is where you watch your plan come to life — real-time slice status via WebSocket updates:

Slice 1 · Auth Models + Migration · ✅ passed · 42s · $0.08
Slice 2 · Repository Layer · ✅ passed · 39s · $0.07
Slice 3 · Service Layer · ⏳ executing...
Slice 4 · API Controller · ⏸ queued

Each card shows the slice title, status (queued → executing → passed/failed), duration, model used, token count, and cost. Cards update in real time as events arrive over the WebSocket.
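Each card line can be derived directly from the incoming events. A minimal sketch, assuming a hypothetical event shape with `slice`, `title`, `status`, `durationSec`, and `cost` fields — the actual WebSocket payloads are listed in EVENTS.md:

```javascript
const STATUS_ICONS = { queued: "⏸", executing: "⏳", passed: "✅", failed: "❌" };

// Render one Progress-tab card line from a slice event.
// Field names here are illustrative, not the documented payload schema.
function formatCard(ev) {
  const icon = STATUS_ICONS[ev.status] ?? "•";
  const parts = [`Slice ${ev.slice}`, ev.title, `${icon} ${ev.status}`];
  if (ev.status === "passed" || ev.status === "failed") {
    parts.push(`${ev.durationSec}s`, `$${ev.cost.toFixed(2)}`);
  }
  return parts.join(" · ");
}
```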

Dashboard Progress tab showing 3 slices passed, 1 executing, 2 queued, with plan browser, DAG view, and event log

Runs Tab

History of all plan executions. Each row shows:

Column     Content
Plan       Plan file path (clickable → shows slice detail)
Status     ✅ Complete, ❌ Failed, 🟡 Partial
Slices     Passed / Total count
Duration   Total wall-clock time
Cost       Total USD across all slices
Model      Primary model used
Date       Execution timestamp

Click any row to expand slice-by-slice detail: per-slice tokens, duration, model, and pass/fail status.

Dashboard Runs tab showing run history with plan names, slice counts, durations, and costs

Cost Tab

Two visualizations:

  • Doughnut chart — spend breakdown by model (which models cost the most)
  • Bar chart — monthly trend (cost over time, spot anomalies)

Data comes from .forge/cost-history.json, which is updated automatically after each run. The Cost tab includes a pricing table covering 23 models — Claude, GPT, Grok, Gemini, and custom API providers.
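The doughnut chart is just a per-model aggregation over that history. A minimal sketch, assuming each cost-history entry carries `model` and `cost` fields — the actual schema of .forge/cost-history.json may differ:

```javascript
// Sum spend per model, the way the doughnut chart groups it.
// The entry shape ({ model, cost }) is an assumption, not the documented schema.
function spendByModel(entries) {
  const totals = {};
  for (const { model, cost } of entries) {
    totals[model] = (totals[model] ?? 0) + cost;
  }
  return totals;
}
```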

Dashboard Cost tab showing total spend, doughnut chart by model, monthly spend bar chart, and model performance table with pass rates
Cost anomaly detection: If a run costs >2× the historical average, the Runs tab shows a warning badge. Use pforge run-plan --estimate to predict costs before executing.
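The >2× rule behind that warning badge is easy to reproduce. A sketch of the check as described, assuming the historical average is a plain mean of past run costs:

```javascript
// Flag a run whose cost exceeds twice the historical average,
// mirroring the dashboard's anomaly-warning rule. Using a plain
// mean as the baseline is an assumption about how it's computed.
function isCostAnomaly(runCost, pastCosts) {
  if (pastCosts.length === 0) return false; // no baseline yet
  const avg = pastCosts.reduce((a, b) => a + b, 0) / pastCosts.length;
  return runCost > 2 * avg;
}
```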

Actions Tab

One-click buttons for common operations — no terminal needed:

  • 🔨 Smith — Environment diagnostics
  • 🔍 Sweep — Find TODO/FIXME markers
  • 📊 Analyze — Consistency scoring
  • 📋 Status — Phase status from roadmap
  • ✅ Validate — Setup file validation
  • 🧩 Extensions — Browse extension catalog

Each button calls the corresponding REST endpoint (/api/smith, /api/sweep, etc.) and displays results inline.
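The button-to-endpoint mapping is mechanical: lowercase the action name and prefix /api/. A sketch of calling the same endpoints from a script — whether they take GET or POST, and what the response contains, isn't specified here, so treat the client below as hypothetical:

```javascript
// Build the REST path for a dashboard action, e.g. "Smith" -> "/api/smith".
function actionEndpoint(action) {
  return `/api/${action.toLowerCase()}`;
}

// Hypothetical client-side call; assumes GET and a JSON response,
// neither of which is confirmed by the docs.
async function runAction(action, base = "http://localhost:3100") {
  const res = await fetch(base + actionEndpoint(action));
  return res.json();
}
```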

Dashboard Actions tab showing one-click buttons for Smith, Sweep, Analyze, Status, Validate, and Extensions

Replay Tab

Browse agent session logs from past executions. Each run's .forge/runs/<timestamp>/ directory contains per-slice logs. The Replay tab renders them with:

  • Slice selector — pick which slice's log to view
  • Error highlighting — errors and warnings highlighted in red/amber
  • File filter — filter log entries by file path patterns
  • Search — free-text search within the session log

Use this to diagnose why a slice failed — the full agent conversation, including tool calls, is captured.
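The highlighting and filtering described above amount to plain-text passes over the log. A sketch assuming simple keyword and substring matching — the real viewer's matching rules aren't specified here:

```javascript
// Classify a log line for highlighting: "error" (red), "warn" (amber), or "info".
// Keyword matching is an assumed heuristic, not the viewer's actual rules.
function classifyLine(line) {
  const l = line.toLowerCase();
  if (l.includes("error")) return "error";
  if (l.includes("warn")) return "warn";
  return "info";
}

// Keep only lines mentioning a file-path fragment (the File filter).
function filterByPath(lines, fragment) {
  return lines.filter((line) => line.includes(fragment));
}
```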

Dashboard Replay tab showing session log viewer with slice selector, error highlighting, and search

Extensions Tab

Visual catalog browser with search. Shows all community extensions from extensions/catalog.json:

  • Extension name, version, author, category
  • What it provides (instruction files, agents, prompts)
  • Tags and Spec Kit compatibility
  • One-click install button

Equivalent to pforge ext search + pforge ext add, but with a visual interface.

Dashboard Extensions tab showing visual catalog browser with search, install buttons, and extension details

Config Tab

Visual editor for .forge.json. Edit without touching the file directly:

  • Model routing — set default, execution, and review models from a dropdown
  • Image generation model — Grok Aurora or DALL-E for plan art
  • Agent toggles — enable/disable specific reviewer agents
  • Quorum threshold — complexity level that triggers multi-model consensus
  • Escalation chain — model failover order

Changes save directly to .forge.json via the REST API.

Dashboard Config tab showing visual editor for .forge.json with model routing, agent toggles, and quorum settings

Traces Tab

OTLP trace waterfall view. Every plan execution emits OpenTelemetry spans:

Span            What It Captures
run (root)      Plan file, total duration, slice count, model
slice-N         Slice title, status, tokens in/out, cost, gate result
  └ gate        Gate command, exit code, output
  └ escalation  If a model failed and escalated to the next in chain

Click any span to expand: duration, resource attributes (project, version, preset), severity. Traces are stored in .forge/runs/<timestamp>/traces.json and can be exported to any OTLP-compatible backend (Jaeger, Grafana Tempo, etc.).
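The waterfall itself is just the span tree rendered with indentation. A sketch, assuming each stored span records `id`, `parentId`, `name`, and `durationMs` — real OTLP spans use different field names (spanId/parentSpanId, nanosecond timestamps), so treat this shape as illustrative:

```javascript
// Render spans as an indented waterfall, children nested under their parent.
// Field names are illustrative; OTLP uses spanId/parentSpanId and ns times.
function renderWaterfall(spans, parentId = null, depth = 0) {
  const lines = [];
  for (const s of spans.filter((x) => x.parentId === parentId)) {
    lines.push(`${"  ".repeat(depth)}${s.name} (${s.durationMs}ms)`);
    lines.push(...renderWaterfall(spans, s.id, depth + 1));
  }
  return lines;
}
```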

Dashboard Traces tab showing OTLP trace waterfall view

Skills Tab

Monitor skill executions triggered via forge_run_skill or /slash-command. Shows:

  • Recent skill runs — skill name, start time, status, duration
  • Step-level detail — each skill step with pass/fail, output, timing
  • Event log — WebSocket events (skill-started, skill-step-completed, skill-completed)

Dashboard Skills tab showing recent skill runs with step-level detail and event log

Port Reference

Port   Protocol    Purpose
3100   HTTP        Dashboard UI + REST API
3101   WebSocket   Real-time events (slice progress, run completion)
Port conflict? If another service uses 3100/3101, set the PORT and WS_PORT environment variables, or use the --port flag: node pforge-mcp/server.mjs --port 4100.

📄 Full reference: capabilities.md, EVENTS.md (WebSocket event types)