Freelance Agent Evaluation Engineer
with development tools (terminal, CLI), MCP servers (repository, task tracker, messenger, documentation, etc.), and a real web...
with development tools (terminal, CLI), MCP servers (repository, task tracker, messenger, documentation, etc.), and a real web...
, or structured reasoning tasks (e.g., SWE-bench, Terminal-bench, or similar) Experience working with Docker (building images...