An agent works in staging but fails in production after tool latency spikes. What would you change first?

Instruction: Explain the first production controls you would add when tool latency starts breaking an agent.

Context: Tests how the candidate diagnoses the problem, chooses the safest next step, and reasons through recovery. Explain the first production controls you would add when tool latency starts breaking an agent.

Official answer available

Preview the opening of the answer, then unlock the full walkthrough.

My first move would be to protect the user experience with tighter timeouts and a degraded path. Then I would...

Related Questions