Freelance Agent Evaluation Engineer

AI coding agents — how well a model handles real-world developer tasks. You'll create challenging tasks and evaluation criteria...'s very hard to create tasks that challenge frontier models without using frontier models. What we look...

Lugar: Madrid | 11/05/2026 17:05:50 PM | Salario: S/. No Especificado | Empresa: Mindrift

Freelance Agent Evaluation Engineer

AI coding agents — how well a model handles real-world developer tasks. You'll create challenging tasks and evaluation criteria...'s very hard to create tasks that challenge frontier models without using frontier models. What we look...

Lugar: Valencia | 11/05/2026 17:05:55 PM | Salario: S/. No Especificado | Empresa: Mindrift