Design an incident response process for prompt injection or data leakage.

Instruction: Describe how you would respond operationally to a serious AI safety incident.

Context: Assesses whether the candidate can design a practical architecture and explain the main tradeoffs.

I would run AI safety incidents like real production incidents: contain first, understand impact quickly, and turn the root cause...