Design a rollout plan for moving high-volume traffic to a smaller model.

Instruction: Describe how you would migrate significant traffic to a cheaper model safely.

Context: Assesses whether the candidate can design a practical architecture and explain the main tradeoffs. Describe how you would migrate significant traffic to a cheaper model safely.

Official answer available

Preview the opening of the answer, then unlock the full walkthrough.

I would start with slice-based offline comparison, then shadow or canary the smaller model on real traffic before letting it carry meaningful volume. High-volume migration amplifies small quality differences quickly, so I want the first stages to be reversible and heavily observed.

I would also define where the...

Related Questions