Instruction: Explain how you would respond when the leak comes from the tool layer rather than the conversational layer.
Context: Tests how the candidate diagnoses the problem, chooses the safest next step, and reasons through recovery. Explain how you would respond when the leak comes from the tool layer rather than the conversational layer.
Official answer available
Preview the opening of the answer, then unlock the full walkthrough.
The way I'd think about it is this: I change the tool and data layer first. If hidden sensitive fields are leaking through tool outputs, then the model’s polite surface behavior is not the real issue. The system is exposing data the model should not have...
easy
easy
easy
easy
easy
easy