Deep understanding of Tools, Memory, and Planning in agent-based systems.
Strong grasp of vector databases and their internal architectures (e.g., ANN algorithms, HNSW, IVF).
Ability to justify vector DB usage over traditional RDBMS with technical and performance reasoning.
Proven experience designing complex AI/ML or GenAI architectures at scale.
Proficiency with Python and key AI libraries: LangChain, Transformers, LlamaIndex, etc.
Experience working with commercial and open-source LLMs (GPT-4, Mistral, LLaMA, Claude, etc.).
Desirable Skills
Experience with fine-tuning LLMs or working with proprietary/custom models.
Exposure to prompt engineering for diverse tasks (summarization, QA, classification, etc.).
Understanding of MLOps and deployment strategies for LLM pipelines.
Education Qualification
This is not a developer role. Candidates with only application-layer experience or limited understanding of LLM internals, vector DB architecture, or agentic workflows will not be a fit. We expect this architect to drive architectural decisions, not just write code.