Design defense-in-depth for a browsing and tool-using agent.

Instruction: Explain how you would layer controls for an agent that reads the web and can take actions.

Context: Assesses whether the candidate can design a practical architecture and explain the main tradeoffs.

I would layer controls around the full path: treat browsed content as untrusted, sanitize and label retrieved material, enforce policy before tool use, require approvals for risky actions, and log the chain from source content to final action.
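As a rough illustration of how those layers could fit together, here is a minimal Python sketch. Everything in it is an assumption made for the example rather than part of the answer itself: the names `Retrieved`, `ToolCall`, `Gate`, and `label_untrusted`, the `ALLOWED_TOOLS` and `RISKY_TOOLS` tables, and the very simple sanitization rule (dropping non-printable characters). A real system would need far stronger sanitization and a richer policy.

```python
from dataclasses import dataclass, field
from typing import Callable

# Hypothetical policy tables, chosen only for this sketch.
ALLOWED_TOOLS = {"search", "read_page", "send_email"}
RISKY_TOOLS = {"send_email"}  # these need explicit approval


@dataclass(frozen=True)
class Retrieved:
    """Browsed content, labeled untrusted and tied to its source."""
    source_url: str
    text: str
    trusted: bool = False


def label_untrusted(source_url: str, raw: str) -> Retrieved:
    """Sanitize (here: drop non-printable characters) and label content."""
    clean = "".join(ch for ch in raw if ch.isprintable() or ch in "\n\t")
    return Retrieved(source_url=source_url, text=clean)


@dataclass
class ToolCall:
    name: str
    args: dict
    derived_from: list  # Retrieved items that influenced this call


@dataclass
class Gate:
    """Policy check, approval step, and audit log in front of every tool."""
    approve: Callable[[ToolCall], bool]
    audit_log: list = field(default_factory=list)

    def _decide(self, call: ToolCall) -> str:
        if call.name not in ALLOWED_TOOLS:
            return "denied_by_policy"
        if call.name in RISKY_TOOLS and not self.approve(call):
            return "denied_by_approver"
        return "allowed"

    def execute(self, call: ToolCall, tools: dict):
        decision = self._decide(call)
        # Log the chain from source content to the final decision.
        self.audit_log.append({
            "tool": call.name,
            "sources": [r.source_url for r in call.derived_from],
            "decision": decision,
        })
        if decision != "allowed":
            return None
        return tools[call.name](**call.args)
```

In this sketch the model never calls a tool directly: every call passes through `Gate.execute`, so a denied or unapproved action is still recorded alongside the URLs that influenced it. The `approve` callback stands in for whatever human or automated review step a deployment would use.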

I also want containment if one layer...
