Instruction: Explain how you would evaluate a high-autonomy workflow that can plan, act, and ask for approval.
Context: Assesses whether the candidate can design a practical architecture and explain the main tradeoffs. Explain how you would evaluate a high-autonomy workflow that can plan, act, and ask for approval.
Official answer available
Preview the opening of the answer, then unlock the full walkthrough.
I would evaluate this system at three levels: step quality, workflow quality, and business outcome. Step quality covers tool choice, parameter correctness, approval handling, and policy adherence. Workflow quality covers sequencing, recovery, stopping behavior, and whether the agent reached the goal efficiently. Business outcome covers...
easy
easy
easy
easy
easy
easy