Freelance Agent Evaluation Engineer

AI coding agents — how well a model handles real-world developer tasks. You'll create challenging tasks and evaluation criteria...'s very hard to create tasks that challenge frontier models without using frontier models. What we look...

Lugar: Madrid | 01/06/2026 17:06:27 PM | Salario: S/. No Especificado | Empresa: Mindrift

Freelance Agent Evaluation Engineer

AI coding agents — how well a model handles real-world developer tasks. You'll create challenging tasks and evaluation criteria...'s very hard to create tasks that challenge frontier models without using frontier models. What we look...

Lugar: Valencia | 01/06/2026 17:06:33 PM | Salario: S/. No Especificado | Empresa: Mindrift

Freelance Agent Evaluation Engineer

AI coding agents — how well a model handles real-world developer tasks. You'll create challenging tasks and evaluation criteria...'s very hard to create tasks that challenge frontier models without using frontier models. What we look...

Lugar: Barcelona | 01/06/2026 17:06:19 PM | Salario: S/. No Especificado | Empresa: Mindrift

Freelance Agent Evaluation Engineer

AI coding agents — how well a model handles real-world developer tasks. You'll create challenging tasks and evaluation criteria...'s very hard to create tasks that challenge frontier models without using frontier models. What we look...

Lugar: España | 01/06/2026 17:06:00 PM | Salario: S/. No Especificado | Empresa: Mindrift