Latency doubled after adding reranking and citations. What would you change first?

Instruction: Describe your first optimization move when RAG quality features add too much latency.

Context: Tests how the candidate diagnoses the problem, chooses the safest next step, and reasons through recovery. Describe your first optimization move when RAG quality features add too much latency.

Official answer available

Preview the opening of the answer, then unlock the full walkthrough.

I would avoid ripping out the expensive layer globally. First I would find the queries that need reranking and rich...

Related Questions