Instruction: Answer this as a story about undoing a promising optimization after seeing the real impact.
Context: Evaluates whether the candidate can communicate judgment, collaboration, and ownership in a real setting. Answer this as a story about undoing a promising optimization after seeing the real impact.
Official answer available
Preview the opening of the answer, then unlock the full walkthrough.
One example I would use is this: We once pushed a prompt and routing optimization that reduced cost and improved median latency, but after rollout we started seeing more incomplete answers on a high-value workflow. The change looked fine in averages because...
easy
easy
easy
easy
easy
easy