| C2 — LAN Multi-Viewer | REFUTED | Directly contradicts STRATEGY.md "not working on hosted multi-user backend"; annotation queue gap does not require real-time LAN sync. |
| C6 — Textual TUI as Primary Surface | REFUTED | Contradicts the locked stack (Gradio Blocks); Gradio gotchas cited are valid but do not justify a full stack pivot. |
| I4 — uvx No-Install Execution | DUPLICATE | Restatement of P4/XC3 with no additive basis; collapsed into idea #3. |
| I6 — Two-Run Diff as Default View | DUPLICATE | Same move as C7, subsumed by idea #4 which is the stronger expression. |
| A5 — gr.TraceTree Component Library | WEAK | Premature scope expansion — component library publication before any application code is written; community distribution bet that cannot be validated yet. |
| X1 — FDR Forensic Replay Mode | WEAK | Compelling analogy but "iterating signals replay" overstretches; timeline scrub is a different interaction contract requiring new infrastructure; too complex for greenfield Phase 1. |
| X6 — Failure Topology View | WEAK | Clustering failing paths requires multi-run store + path-indexing infrastructure not yet designed; long prerequisite chain makes it Phase 3+ work. |
| C1 — OS-Level Airgap Mode | WEAK | pf/nftables requires elevated permissions on most systems; overstates the "zero-leakage" commitment in STRATEGY.md; P8 (zero-outbound CI test) is the achievable version. |
| C4 — Framework-Agnostic Ingestion (OTel + LangChain adapters) | WEAK | "10x addressable market" claim is scope creep for greenfield; the L2 TypedDict schema (idea #7) captures the reversibility without the premature scope expansion. |
A3 — Own the deepeval inspect Pipe | WEAK | Depends on deepeval inspect producing a stable, parseable output — unverified; jq/delta analogy is apt in principle but upstream dependency is an assumption failure until verified. |
| XC4 — CLI Pipe + Framework-Agnostic Schema | WEAK | Inherits A3's unverified upstream assumption; otherwise sound in principle — revisit after deepeval inspect output format is confirmed stable. |
| XC6 — Component Library + CSS API + Community | WEAK | Inherits A5's premature-for-greenfield concern; CSS API and community theming are sound but the combined scope is too large for Phase 1. |
| XC7 — Airgap + LAN Multi-Viewer + .rvtrace | WEAK | C2 component is refuted (strategy contradiction); C1 component has permissions trap; .rvtrace element is sound and folded into idea #3. |
| P5 — Annotation Blindness Fix | WEAK | Solo flagging to JSONL is narrower than the confirmed annotation queue gap (which involves assigning to reviewers); the "flywheel" version of this is captured in idea #4. |
| A1 — Dashboard-to-Fixture Exporter | WEAK | Mechanism for converting a live dashboard selection to a pytest fixture is underspecified; implementation gap is large; worth revisiting after ideas #4 and #5 ship. |
| A7 — Self-Contained HTML Export | DUPLICATE | Superseded by the .rvtrace format in idea #3; Gradio Blocks HTML serialization is non-trivial and the simpler JSON envelope is the better sharing primitive. |
| I1, A6, C3 — Various pytest integration forms | DUPLICATE | All subsumed by idea #1 (XC2), which is the tightest combination of the three moves. |
| P1 · P2 · P3 · P7 · P8 · I2 · I3 · I5 · I7 · L1 · L3 · L4 · L7 · X2 · X3 · X4 · X5 · X7 (standalone) | SUBSUMED | Each is sound in isolation; all subsumed by the cross-cutting combinations (ideas #1–#4) or by the foundational ideas (#5–#7) that incorporate them. See raw-candidates.md for individual entries. |