Instruction: Describe the metrics and processes used for assessing NLP model quality.
Context: This question probes the candidate's familiarity with model evaluation techniques specific to NLP, ensuring they can not only develop but also rigorously test NLP systems.
Official answer available
Preview the opening of the answer, then unlock the full walkthrough.
The way I'd approach it in an interview is this: I evaluate NLP models at three levels: task metrics, failure patterns, and real-world usefulness. The exact metric depends on the task, such as F1 for extraction, BLEU or more modern alternatives for...
easy
medium
medium
medium
hard