looking for someone who can design realistic and structured evaluation scenarios for LLM-based agents. You’ll create test cases that simulate.... Curious and open to working with AI-generated content, agent logs, and prompt-based behavior. Nice to Have Experience...
looking for someone who can design realistic and structured evaluation scenarios for LLM-based agents. You’ll create test cases that simulate.... Curious and open to working with AI-generated content, agent logs, and prompt-based behavior. Nice to Have Experience...
Lugar:
Berlin | 08/01/2026 18:01:23 PM | Salario: S/. No Especificado | Empresa:
Mindrift looking for someone who can design realistic and structured evaluation scenarios for LLM-based agents. You’ll create test cases that simulate.... Curious and open to working with AI-generated content, agent logs, and prompt-based behavior. Nice to Have Experience...
annotation Training and evaluation of large language models Benchmarking and agent-based code execution in sandboxed...
Lugar:
Hamburg | 08/01/2026 18:01:31 PM | Salario: S/. No Especificado | Empresa:
Mindrift? This is a flexible, project-based opportunity well-suited for: Analysts, researchers, or consultants with strong critical thinking...
Lugar:
Hamburg | 08/01/2026 18:01:07 PM | Salario: S/. No Especificado | Empresa:
Mindrift-time, remote position, collaborate from anywhere with your home studio or local recording facility. Project-based workload... (approximately 5+ hours per week per project, depending on assignments). Competitive piece-work compensation based on completed...
looking for someone who can design realistic and structured evaluation scenarios for LLM-based agents. You’ll create test cases that simulate.... Curious and open to working with AI-generated content, agent logs, and prompt-based behavior. Nice to Have Experience...
looking for someone who can design realistic and structured evaluation scenarios for LLM-based agents. You’ll create test cases that simulate.... Curious and open to working with AI-generated content, agent logs, and prompt-based behavior. Nice to Have Experience...
Lugar:
Hamburg | 08/01/2026 18:01:26 PM | Salario: S/. No Especificado | Empresa:
Mindrift criteria to evaluate the accuracy of the AI’s answers. Correct the model’s responses based on your domain-specific knowledge...
Lugar:
Berlin | 08/01/2026 18:01:34 PM | Salario: S/. No Especificado | Empresa:
Mindrift? This is a flexible, project-based opportunity well-suited for: Analysts, researchers, or consultants with strong critical thinking...
Lugar:
Berlin | 08/01/2026 18:01:52 PM | Salario: S/. No Especificado | Empresa:
Mindrift