Instruction: Describe the difference between judging the model itself and judging whether the user task was completed.
Context: Checks whether the candidate can explain the core concept clearly and connect it to real production decisions. Describe the difference between judging the model itself and judging whether the user task was completed.
Official answer available
Preview the opening of the answer, then unlock the full walkthrough.
A model can produce a strong-looking answer and still fail the task. I think of model quality as one layer,...
easy
easy
easy
easy
easy
easy