The assistant is safe in English and much weaker in another language. How would you address it?

Instruction: Describe how you would handle language-specific safety gaps.

Context: Tests how the candidate diagnoses the problem, chooses the safest next step, and reasons through recovery. Describe how you would handle language-specific safety gaps.

Official answer available

Preview the opening of the answer, then unlock the full walkthrough.

I would treat it as a real safety gap, not as a localization detail. If protections hold only in English, the product is safer for one user group than another and the evaluation program is too narrow.

I would build multilingual safety slices, review...

Related Questions