Instruction: Describe how you would decide whether to use one large model or route traffic across models of different sizes.
Context: Checks whether the candidate can explain the core concept clearly and connect it to real production decisions. Describe how you would decide whether to use one large model or route traffic across models of different sizes.
Official answer available
Preview the opening of the answer, then unlock the full walkthrough.
If the workload has clear easy and hard paths, routing can save a lot of money. If the requests are...
easy
easy
easy
easy
easy
easy