feat: surface A/B configs and verdicts on /admin/reviews #55
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "feat/admin-reviews-ab-display"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
The all-reviews list previously showed only the primary config's
verdict and model, hiding the existence of any shadow configs that
ran alongside. Admins running A/B comparisons (Sonnet primary + Opus
shadow, etc.) had to drill into each review to see whether the two
agreed.
Replace the single Verdict + Model columns with a Configs column
that lists every config that ran on the PR — primary first, then
shadows in timestamp order — each as a verdict-colored pill carrying
the display name and verdict label. Disagreement is now visible
inline: when configs return different verdicts, the pills' verdict
colors mismatch and the row stands out at a glance.
Adds tracker.ConfigOutcomesForGroups to bulk-fetch the
comparison-group siblings for the listed primaries in a single query
(no N+1). Falls back to a single-pill rendering using the stored
backend + verdict when a row has no comparison group, so legacy or
shadow-less reviews still render sensibly.
Co-Authored-By: Claude Opus 4.7 (1M context) noreply@anthropic.com
Automated review by pr-reviewer v0.23.0 | Safety Check | Claude Sonnet 4 | tracking id
r-f4988b-270d08This is an AI-generated review and may contain mistakes.
Status: ✅ Completed
✅ Verdict: LGTM — Clean feature addition that improves admin visibility into A/B testing configurations without introducing security risks or breaking changes.
Complexity: moderate
Safety Check Review
This PR adds A/B configuration display functionality to the admin reviews page. The changes appear safe:
Key Changes:
Safety Assessment:
The code follows existing patterns, handles errors appropriately, and maintains backward compatibility with legacy reviews that lack comparison groups.