Instruction: Describe how you would respond when prompt-only safety is clearly losing the race.
Context: Tests how the candidate diagnoses the problem, chooses the safest next step, and reasons through recovery. Describe how you would respond when prompt-only safety is clearly losing the race.
Official answer available
Preview the opening of the answer, then unlock the full walkthrough.
That pattern tells me the system is too dependent on the prompt. I would add stronger runtime controls and update...
easy
easy
easy
easy
easy
easy