Instruction: Explain how you would improve human review when the model can unduly influence reviewers.
Context: Tests how the candidate diagnoses the problem, chooses the safest next step, and reasons through recovery. Explain how you would improve human review when the model can unduly influence reviewers.
Official answer available
Preview the opening of the answer, then unlock the full walkthrough.
I would redesign the approval surface so reviewers judge structured facts, not just fluent persuasion. If the model can talk a reviewer into a risky action, the approval object is probably too dependent on narrative framing and too weak on explicit evidence, consequences, and...
easy
easy
easy
easy
easy
easy