Experiment: Copilot Smart Recap V2 — Suggested Action Items
Suggest follow-up action items at the end of every Copilot chat session (Drafting Agent for paid Copilot users, vanilla BizChat for non-Copilot users). Measures uptake on suggested actions and downstream effect on retention and upsell across Consumer + Commercial.
- Engagement (Commercial) +3.1% — stat-sig improvement on suggested-action acceptance rate.
- Upsell (Consumer) Copilot Pro upgrade flow conversion +1.8%, stat-sig.
- No regression on the primary North Star metric (chat satisfaction) in either segment.
- Engagement (Consumer) flat — variant did not move the consumer needle as anticipated.
- Daily session count slightly negative (−0.6%, not stat-sig) on Commercial.
- Ship-readiness scorecard: 9 of 12 guardrails passing — below the 11/12 historical bar.
- 7-day retention (Consumer) regression −0.42% is stat-sig and triggers a Fail guardrail — Ship gate engaged.
- Sample size on Consumer Pro segment is below recommended power (~62%).
- Holiday seasonality may be inflating Commercial engagement gain.
No Generative Insights to display. Pick a Ring outside SDF/MSIT and an Iteration not in {1, 3, 5, 7} to view results.
💡 Decision
⚠ Guardrail Override — required when shipping despite a failing guardrail
Supporting Evidence
- 🔗 Scorecard ↗ (Expedite · 12 metrics · refreshed 6h ago)
- 🔗 Experiment Portal ↗ (access may be restricted)
- 🔗 PM design doc ↗ (Smart Recap V2 — Suggested Actions)
- 🔗 Telemetry dashboard ↗ (Geneva · Engagement & Retention)
- 📋 Pre-launch eval set — Not linked
The Ring/Iteration combination you selected was not flighted, so no scorecard, guardrail metrics, or Copilot recommendation exists. Decision controls are disabled until you pick a slice with executed data.
No scorecard has been executed for the selected Ring × Iteration combination. Guardrail evaluation requires a scorecard run; pick a slice with executed data to view Consumer and Commercial guardrails.
📊 Consumer Guardrails
R-05 · R-08 · R-10📊 Commercial Guardrails
R-05 · R-08 · R-10🕒 Decision History
R-11 · R-20 · preview-
14 days agoPriya Subramanian — started experiment at 50/50 allocation.
-
3 days agoMarcus Hill (DS) — flagged 7-day retention regression in Consumer cohort.
-
todayCopilot — generated recommendation: Iterate.
-
30 days ago2026-04-13T16:42:08ZOverride David Lydston — shipped V1 progression against Copilot verdict.Actor:David Lydston <dalyds@microsoft.com> Timestamp (UTC):2026-04-13T16:42:08Z Original verdict:Iterate (Copilot recommendation at override time) Override decision:Ship Why acceptable:Build keynote demo dependency — narrow 5% audience, two-week sunset, no upmarket exposure. Mitigation:Daily metric review by DS partner; auto-rollback rule on any >1% retention regression. Followups:Bug 8423218 — rollback playbook; Workitem 8423219 — re-run with corrected sample size.
Decisions are recorded per scorecard. Since no scorecard exists for this Ring × Iteration, there are no historical decisions or overrides to show.