Users like the new assistant more, but task completion got worse. How would you resolve the conflict?

Instruction: Explain how you would reason about a system that feels better but performs worse.

Context: Tests how the candidate diagnoses the problem, chooses the safest next step, and reasons through recovery.

I would first verify that the conflict is real and not a measurement artifact, then unpack what users mean by "like." They may prefer the assistant's tone, speed, or confidence even though it is less effective at finishing the task.
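One way to make the "is the conflict real" check concrete is a two-proportion z-test on task completion rates for the old and new assistant. The sketch below is illustrative only: the function name `two_proportion_z_test` and the counts in the usage example are hypothetical, and it assumes users were randomly assigned to each assistant and that "completion" was measured the same way in both groups.

```python
import math


def two_proportion_z_test(success_a, n_a, success_b, n_b):
    """Compare two completion rates.

    Returns (rate_b - rate_a, z statistic, two-sided p-value).
    Group A is the old assistant, group B the new one (an assumption
    made for this sketch).
    """
    if n_a <= 0 or n_b <= 0:
        raise ValueError("both groups need at least one observation")

    rate_a = success_a / n_a
    rate_b = success_b / n_b

    # Pooled rate under the null hypothesis that both assistants
    # complete tasks equally often.
    pooled = (success_a + success_b) / (n_a + n_b)
    std_err = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    if std_err == 0:
        raise ValueError("no variation in outcomes; the test is undefined")

    z = (rate_b - rate_a) / std_err
    # Two-sided p-value from the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return rate_b - rate_a, z, p_value


if __name__ == "__main__":
    # Hypothetical counts: 820 of 1000 tasks completed with the old
    # assistant, 770 of 1000 with the new one.
    diff, z, p = two_proportion_z_test(820, 1000, 770, 1000)
    print(f"difference={diff:+.3f}, z={z:.2f}, p={p:.4f}")
```

A small p-value only says the drop is unlikely to be sampling noise. It does not rule out other measurement artifacts, such as a changed definition of "completion" or a different mix of tasks between the two groups, so those still need to be checked separately.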

If the tradeoff is real,...
