Instruction: Answer this as a story about discovering a serious failure mode early.
Context: Evaluates whether the candidate can communicate judgment, collaboration, and ownership in a real setting. Answer this as a story about discovering a serious failure mode early.
Official answer available
Preview the opening of the answer, then unlock the full walkthrough.
One example I would use is this: In one review, we tested a workflow that looked harmless because each step individually was allowed. But once the assistant combined retrieved content with a write action and long-lived memory, it could carry forward more sensitive context than anyone had intended.
We caught it...
easy
easy
easy
easy
easy
easy