Instruction: Explain how you would implement a system to safely rollback a failed ML model deployment.
Context: This question tests the candidate's foresight and planning skills in ensuring high availability and reliability of ML systems through safe deployment practices.
Official answer available
Preview the opening of the answer, then unlock the full walkthrough.
The way I'd approach it in an interview is this: I design rollback before rollout. That means keeping prior model versions deployable, preserving compatible feature and schema paths, and making traffic control explicit so the system can return to...