
Role Overview
Help design and evaluate autonomous AI agents across multiple large language models (LLMs) in health, education, daily life, and other real-world domains; all of the work involves coding. Shape the future of agentic AI systems by providing expert human feedback to leading AI organisations. Help train LLMs for complex, multi-step architectural workflows.
Key Responsibilities
AI Agent Evaluation
Technical Assessment
Project Workflow
Qualifications
Preferred (Nice to Have)
Compensation
Equal Opportunity Statement
Selection decisions are based solely on skills, qualifications, and project requirements. We are committed to inclusive and fair engagement practices and consider all qualified applicants without regard to legally protected characteristics.
Job ID: 148344571
Skills:
Azure, TensorFlow, AWS, Distributed Systems, PyTorch, Python, Kubernetes, Docker, GCP, fine-tuning LLMs, model optimization, deployment strategies, vector databases, embeddings, Weaviate, Hugging Face Transformers, Pinecone, FAISS, prompt engineering
Skills:
SDK development, Python, API lifecycle management, AI Gateway operations, LLM APIs, OpenTelemetry, LangGraph ADK, Telemetry observability
Skills:
SDK development, Microsoft Azure, Python, LLM observability, AI coding assistants, Batch async API patterns, AI Gateway operations, LLM API design, API lifecycle management, LangGraph ADK, Agent frameworks, OpenTelemetry, Guardrails, Telemetry observability, GCP Vertex AI
Skills:
API Integration, Backend Development, Python, ETL middleware concepts, LLM evaluation and benchmarking, Error handling, Confidence scoring, Agent orchestration, Prompt engineering
Skills:
API Development, Machine Learning, Predictive Modeling, Python, Git, LangChain, Data Processing, Prompt Engineering, LLM APIs, Vector Databases, AutoGen, CrewAI, RAG Architecture