A browsing agent combines information from several safe pages into one unsafe action. How would you defend against that?

Instruction: Explain how you would handle compositional safety risk across several individually safe inputs.

Context: Tests how the candidate diagnoses the problem, chooses the safest next step, and reasons through recovery. Explain how you would handle compositional safety risk across several individually safe inputs.

Official answer available

Preview the opening of the answer, then unlock the full walkthrough.

I would stop evaluating safety one page at a time. The real question is whether the final action is safe,...

Related Questions