Freelance Agent Evaluation Engineer
until the evaluation is fair and robust

What this is NOT
Not data labeling
Not prompt engineering
Not writing code... from scratch - the agent writes most of the code; you guide and evaluate

What we look for
5+ years in software development...