Search by job, company or skills

V

AGENTIC AI

Fresher
Save
new job description bg glownew job description bg glow
  • Posted a day ago
  • Be among the first 10 applicants
Early Applicant

Job Description

MODEL VALIDATOR

Eval set development - ability to benchmark agent performance through reasoning paths

Adversarial testing - ability to break agent by giving it conflicting instructions etc

Scholastic regression testing - Measure variance in agent behavior.

Tool call validations - Agent calls the correct external APIs and databases

Ability to review thought chains and identify where agent logic diverged from BRD

Must have knowledge of applying judge LLMs to grade outputs

Python and framework - Proficiency in DeepEval, Langsmith etc

Ability to do semantic debugging - Look at agent's thought trace

Screening Criteria

SDETs as they have coding background to testing - They can develop evals

Good knowledge of data / SQL based testing etc

Domain background is added advantage for such roles

More Info

Job Type:
Industry:
Employment Type:

About Company

Job ID: 147766727

Similar Jobs

Pune, India

Skills:

NumpyGitPandasDockerFlaskFastAPIPythonLangChainvector databasesPineconeSemantic KernelChromaasyncioFAISSGenerative AI LLMsWeaviateLlamaIndex

Delhi, India

Skills:

PythonLangChainAutoGen or equivalentLLM Application DevelopmentAgentic Multi-step AI WorkflowsREST API IntegrationRAG Retrieval Augmented GenerationAI Telemetry MonitoringPrompt Engineering

Hyderabad, India

Skills:

ApisAI MLGcpSystem DesignAzurePythonAWSLangChainLLMsGPT prompt engineeringAutoGenLLM-based applications

Gurugram, Gurugram, India

Skills:

ApisPython ProgrammingCrewAILangChainAgentic AI architecturesAI workflowsprompt engineeringcontext-aware reasoningAutoGendata modelsLLM-based NLPexternal toolchains

Noida, India

Skills:

PythonRest Apis