Instruction: Describe how you would budget latency across planning, retrieval, tool use, and generation.
Context: Assesses whether the candidate can design a practical architecture and explain the main tradeoffs.
Official answer available
I would give each stage a latency budget and decide ahead of time what the system does when one stage...
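A minimal sketch of what a per-stage budget with a pre-decided fallback could look like. The preview above is cut off, so this assumes the case of interest is a stage overrunning its allotment. The budget values, stage callables, and helper names (`STAGE_BUDGETS`, `run_stage`, `run_pipeline`) are hypothetical and chosen only for illustration.

```python
import asyncio

# Hypothetical per-stage budgets in seconds (illustrative numbers, not from the source).
STAGE_BUDGETS = {
    "planning": 0.05,
    "retrieval": 0.05,
    "tool_use": 0.05,
    "generation": 0.05,
}

async def run_stage(work, budget_s, fallback):
    """Await work() for at most budget_s seconds; on overrun return the fallback."""
    try:
        result = await asyncio.wait_for(work(), timeout=budget_s)
        return result, False
    except asyncio.TimeoutError:
        return fallback, True

async def run_pipeline(stages, fallbacks, budgets=STAGE_BUDGETS):
    """Run stages in order; record each result and which stages overran."""
    results, overran = {}, []
    for name, work in stages.items():
        results[name], timed_out = await run_stage(work, budgets[name], fallbacks[name])
        if timed_out:
            overran.append(name)
    return results, overran

async def fast():
    return "ok"

async def slow():
    await asyncio.sleep(1.0)  # deliberately far over the 0.05 s budget
    return "late"

def demo():
    # Retrieval is the slow stage here; the others finish immediately.
    stages = {"planning": fast, "retrieval": slow, "tool_use": fast, "generation": fast}
    fallbacks = {name: "fallback" for name in stages}
    return asyncio.run(run_pipeline(stages, fallbacks))
```

Calling `demo()` runs the four stages in sequence; the retrieval stage is cut off at its budget and its fallback value is used instead, while the list of overrun stages can feed logging or later budget tuning. A fixed per-stage cap is only one possible policy; a shared deadline that lets later stages use time saved by earlier ones would be an equally reasonable design.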
easy