A model change improves average score but increases policy violations. What do you do?

Instruction: Describe how you would respond when a model improves overall quality but hurts safety or policy compliance.

Context: Tests how the candidate diagnoses the problem, chooses the safest next step, and reasons through recovery.

Answer: I would stop treating the average score as the decision-maker. If the change increases policy violations, it fails a different...
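One way to picture the idea in the answer's opening is a release check where the average score cannot outvote a safety regression. The sketch below is only an illustration under stated assumptions: `EvalResult`, `release_decision`, the `max_violation_increase` tolerance, and the three outcome labels are all hypothetical names, and the rest of the answer (which is cut off above) may describe a different process.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EvalResult:
    """Hypothetical summary of one evaluation run."""
    avg_score: float       # mean quality score, higher is better
    violation_rate: float  # fraction of outputs that violate policy, lower is better


def release_decision(baseline: EvalResult,
                     candidate: EvalResult,
                     max_violation_increase: float = 0.0) -> str:
    """Return "block", "ship", or "hold" for a candidate model.

    Assumption: policy compliance is checked first and on its own, so a
    higher average score is never allowed to offset more violations.
    """
    violation_delta = candidate.violation_rate - baseline.violation_rate

    # Safety check comes first and is independent of the quality score.
    if violation_delta > max_violation_increase:
        return "block"

    # Only once the safety check passes does the average score matter.
    if candidate.avg_score > baseline.avg_score:
        return "ship"

    return "hold"


baseline = EvalResult(avg_score=0.80, violation_rate=0.010)
candidate = EvalResult(avg_score=0.85, violation_rate=0.020)
decision = release_decision(baseline, candidate)  # "block"
```

Here the candidate scores higher on average but doubles the violation rate, so the check blocks it. The default tolerance of zero is a deliberately strict assumption; a real team would pick a threshold that accounts for measurement noise in the violation metric.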