Instruction: Describe how you would evaluate a support assistant before and after launch.
Context: Assesses whether the candidate can design a practical architecture and explain the main tradeoffs.
I would build the eval loop around the actual support workflow. Start with historical tickets, agent notes, escalation reasons, and resolution outcomes. Turn those into a benchmark with slices for direct resolution, clarification needed, escalation required, policy-sensitive cases, and cases where the assistant should keep its answer narrow and cite evidence.
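As a rough illustration of the sliced benchmark described above, here is a minimal Python sketch. It is not taken from the original answer: the `Ticket` fields, the slice identifiers, the action labels, and the exact-match scoring rule are all assumptions chosen for the example. A real setup would likely need richer grading, for instance checking cited evidence.

```python
from collections import defaultdict
from dataclasses import dataclass

# Slice names mirror the answer above; the identifiers themselves are hypothetical.
SLICES = (
    "direct_resolution",
    "clarification_needed",
    "escalation_required",
    "policy_sensitive",
    "narrow_with_evidence",
)


@dataclass(frozen=True)
class Ticket:
    ticket_id: str
    slice_name: str       # one of SLICES, assigned when labeling historical tickets
    expected_action: str  # what the human agent actually did (assumed label)


def build_benchmark(tickets):
    """Group labeled historical tickets by slice, rejecting unknown slices."""
    benchmark = defaultdict(list)
    for ticket in tickets:
        if ticket.slice_name not in SLICES:
            raise ValueError(f"unknown slice: {ticket.slice_name}")
        benchmark[ticket.slice_name].append(ticket)
    return dict(benchmark)


def score_by_slice(benchmark, predicted_actions):
    """Return, per slice, the fraction of tickets where the prediction matched."""
    scores = {}
    for name, items in benchmark.items():
        hits = sum(
            1 for t in items
            if predicted_actions.get(t.ticket_id) == t.expected_action
        )
        scores[name] = hits / len(items)
    return scores


# Toy data for illustration only; not real tickets or results.
tickets = [
    Ticket("t1", "direct_resolution", "resolve"),
    Ticket("t2", "direct_resolution", "resolve"),
    Ticket("t3", "escalation_required", "escalate"),
]
predictions = {"t1": "resolve", "t2": "escalate", "t3": "escalate"}
scores = score_by_slice(build_benchmark(tickets), predictions)
print(scores)
```

Reporting one score per slice, rather than a single aggregate, keeps a regression in a small but important slice (such as escalation) from being hidden by a large volume of easy direct-resolution tickets.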
Then I...
easy