Design a latency budget for a multi-step agent workflow.

Instruction: Describe how you would budget latency across planning, retrieval, tool use, and generation.

Context: Assesses whether the candidate can design a practical architecture and explain the main tradeoffs. Describe how you would budget latency across planning, retrieval, tool use, and generation.

Official answer available

Preview the opening of the answer, then unlock the full walkthrough.

I would budget latency by stage: request intake, retrieval, planning if any, tool execution, model turns, approval waits, and final rendering. Each stage should have a target and a known escalation or degradation policy when it exceeds budget.

I also separate hard real-time needs from...

Related Questions