CUA-verified / slack-clone (long-horizon PR #65) — multi-stage: codex build (81 steps) + shell correctness gates + opus-4-7 CUA UX rubric