Freelance Agent Evaluation Engineer

AI coding agents — how well a model handles real-world developer tasks. You'll create challenging tasks and evaluation criteria...'s very hard to create tasks that challenge frontier models without using frontier models. What we look...

Lugar: España | 11/05/2026 17:05:14 PM | Salario: S/. No Especificado | Empresa: Mindrift

Sagemaker DevOps Engineer - Europe

Sagemaker Lifecycle configurations Create CICD pipelines for end-users to deploy custom Docker images & Kernels in Sagemaker...

Lugar: España | 11/05/2026 17:05:12 PM | Salario: S/. No Especificado | Empresa: Xenon7

Freelance Agent Evaluation Engineer

AI coding agents — how well a model handles real-world developer tasks. You'll create challenging tasks and evaluation criteria...'s very hard to create tasks that challenge frontier models without using frontier models. What we look...

Lugar: Barcelona | 11/05/2026 17:05:59 PM | Salario: S/. No Especificado | Empresa: Mindrift

Freelance Agent Evaluation Engineer

AI coding agents — how well a model handles real-world developer tasks. You'll create challenging tasks and evaluation criteria...'s very hard to create tasks that challenge frontier models without using frontier models. What we look...

Lugar: Madrid | 11/05/2026 17:05:50 PM | Salario: S/. No Especificado | Empresa: Mindrift

Freelance Agent Evaluation Engineer

AI coding agents — how well a model handles real-world developer tasks. You'll create challenging tasks and evaluation criteria...'s very hard to create tasks that challenge frontier models without using frontier models. What we look...

Lugar: Valencia | 11/05/2026 17:05:55 PM | Salario: S/. No Especificado | Empresa: Mindrift