Freelance Agent Evaluation Engineer
by an AI agent Write tests that verify agent solutions - accept all valid approaches and reject incorrect ones, neither too strict... nor too lenient Iterate on tasks and tests based on QA feedback - review agent solutions, analyze failures, and refine...