Freelance AI Evaluation Scenario Writer

human-performed tasks and define gold-standard behavior to compare agent actions against. You’ll work to ensure... structured test cases that simulate complex human workflows. Define gold-standard behavior and scoring logic to evaluate agent...

Lugar: México | 24/12/2025 18:12:01 PM | Salario: S/. No Especificado | Empresa: Mindrift

Evaluation Scenario Writer - AI Agent Testing Specialist

human-performed tasks and define gold-standard behavior to compare agent actions against. You’ll work to ensure... structured test cases that simulate complex human workflows. Define gold-standard behavior and scoring logic to evaluate agent...

Lugar: Guadalajara, Jal. | 24/12/2025 18:12:52 PM | Salario: S/. No Especificado | Empresa: Mindrift

Evaluation Scenario Writer - AI Agent Testing Specialist

human-performed tasks and define gold-standard behavior to compare agent actions against. You’ll work to ensure... structured test cases that simulate complex human workflows. Define gold-standard behavior and scoring logic to evaluate agent...

Lugar: México | 24/12/2025 18:12:29 PM | Salario: S/. No Especificado | Empresa: Mindrift
1