
Role Overview
Help design and evaluate autonomous AI agents across multiple large language models (LLMs), spanning health, education, daily life, and other real-world domains; all of the work is coding-based. Shape the future of agentic AI systems by providing expert human feedback to leading AI organisations, and help train LLMs for complex, multi-step architectural workflows.
Key Responsibilities
AI Agent Evaluation
Technical Assessment
Project Workflow
Qualifications
Preferred (Nice to Have)
Compensation
Equal Opportunity Statement
Selection decisions are based solely on skills, qualifications, and project requirements. We are committed to inclusive and fair engagement practices and consider all qualified applicants without regard to legally protected characteristics.
Job ID: 148375225
Skills:
Schema Design, Python, Excel-compatible output generation, canonical data modeling, workflow orchestration, rule engines, testing traceability, document parsing, LLMs in structured and constrained roles, data-processing pipelines, Agentic workflows
Skills:
Machine Learning, Neural Networks, Node.js, Angular, TensorFlow, React, AR, PyTorch, JavaScript, Docker, VR, Azure, Kubernetes, Python, AWS, XR
Skills:
REST APIs, Python, S3, API Gateway, embedding pipelines, vector databases, Pinecone, Amazon Bedrock, Weaviate, OpenSearch, Step Functions, LLM architectures, RAG pipelines