Design a testing strategy for tool contracts used by agents.

Instruction: Describe how you would test tool contracts that models depend on.

Context: Assesses whether the candidate can design a practical architecture and explain the main tradeoffs. Describe how you would test tool contracts that models depend on.

Official answer available

Preview the opening of the answer, then unlock the full walkthrough.

I would test at three levels: schema validity, deterministic contract behavior, and model-facing behavioral reliability. The first two are familiar engineering work. The third is where agent systems often fail even when the API tests are green.

So I want replay tests of representative model calls, change-detection...

Related Questions