An operations agent appears correct in logs, but user-visible outcomes are inconsistent. How would you trace the issue end to end?

Instruction: Explain how you would debug an agent whose internal traces look healthy while customers still see wrong outcomes.

Context: Tests how the candidate diagnoses the problem, chooses the safest next step, and reasons through recovery.

Answer: I would follow the trace all the way to the customer-visible output rather than stopping at the orchestrator log. The trace has to prove that the intended action...
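
One way to make that check concrete is to reconcile what the agent's trace claims against what the customer can actually see. The sketch below is illustrative only: `TraceRecord`, `find_mismatches`, the `request_id` correlation key, and the state strings are all hypothetical names, and it assumes the customer-visible state can be read back from some system of record into a plain dictionary.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class TraceRecord:
    """One line of the agent's own trace (hypothetical shape)."""
    request_id: str      # assumed correlation key shared with the customer-facing system
    intended_state: str  # the outcome the agent's trace says it produced


def find_mismatches(
    traces: list[TraceRecord],
    observed: dict[str, str],
) -> list[tuple[str, str, Optional[str]]]:
    """Return (request_id, intended, observed) for every trace whose
    claimed outcome does not match the customer-visible state."""
    mismatches = []
    for trace in traces:
        # None means the customer-facing system has no record at all.
        actual = observed.get(trace.request_id)
        if actual != trace.intended_state:
            mismatches.append((trace.request_id, trace.intended_state, actual))
    return mismatches


# Hypothetical data: the trace looks healthy for all three requests,
# but only r1 matches what the customer sees.
traces = [
    TraceRecord("r1", "refunded"),
    TraceRecord("r2", "refunded"),
    TraceRecord("r3", "cancelled"),
]
observed = {"r1": "refunded", "r2": "pending"}

for request_id, intended, actual in find_mismatches(traces, observed):
    print(f"{request_id}: trace says {intended!r}, customer sees {actual!r}")
```

Each mismatch gives a concrete request to follow hop by hop, which narrows the search to the first point where the logged intent and the real effect diverge.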
