Instruction: Describe how you would use test results to guide an agent without letting it thrash.
Context: Assesses whether the candidate can design a practical architecture and explain the main tradeoffs. Describe how you would use test results to guide an agent without letting it thrash.
Official answer available
Preview the opening of the answer, then unlock the full walkthrough.
I would make the loop explicit: run targeted validation, classify failures, decide whether the failure points to bad grounding, bad implementation, or missing tests, then let the agent use that signal in a bounded next step. The loop should add information, not...
easy
easy
easy
easy
easy
easy