Freelance Agent Evaluation Engineer
...-performed tasks and define gold-standard behavior to compare agent actions against. You'll work to ensure each scenario... that simulate complex human workflows. Define gold-standard behavior and scoring logic to evaluate agent actions. Analyze agent...
Location: Veracruz, Ver. | 21/01/2026 18:01:52 | Salary: Not specified | Company: Mindrift