A serving change improves throughput and makes incident diagnosis harder. How would you respond?

Instruction: Describe how you would handle a performance improvement that harms observability.

Context: Tests how the candidate diagnoses the problem, chooses the safest next step, and reasons through recovery. Describe how you would handle a performance improvement that harms observability.

Official answer available

Preview the opening of the answer, then unlock the full walkthrough.

I would not accept the observability loss as a harmless tradeoff. If the new serving path hides which model, route, or fallback produced a bad outcome, then it may be improving capacity while degrading operating safety.

I would either restore the missing diagnostic data in the...

Related Questions