
Search by job, company or skills

About Turing:
Turing is one of the world's fastest-growing AI companies, accelerating the advancement and deployment of powerful AI systems. Turing helps customers in two ways: working with the world's leading AI labs to advance frontier model capabilities in thinking, reasoning, coding, agentic behavior, multimodality, multilinguality, STEM, and frontier knowledge; and leveraging that work to build real-world AI systems that solve mission-critical priorities for companies.
Role Overview:
We are looking for experienced SwarmBench Task Engineers — Code / SWE to design and build high-quality multi-agent benchmark tasks based on real-world software engineering workflows.
In this role, you will create tasks grounded in real open-source code changes such as bug fixes, migrations, and refactors. These tasks are used to evaluate how effectively AI agents can understand large codebases, apply precise modifications, and produce correct, testable outputs.
You will work within a structured evaluation framework (Harbor), define clear task instructions, design verification logic, and decompose complex engineering problems across multiple specialized agents.
What does day-to-day look like:
Requirements:
Offer Details:
Job ID: 148983965
Skills:
Django, Pytest, Javascript, Docker, Node.js, Flask, FastAPI, Python, AI coding benchmarks, Git workflows, unittest
Skills:
Python, Numpy, Scipy, Research, Docker, AI ML, Quantitative, Math, Llm, Reasoning
We don’t charge any money for job offers