About the Role
We are hiring a senior, hands-on Agentic RAG Engineer to design, build, and operate the Retrieval-Augmented Generation platforms that power our autonomous AI agents.
This is a lead-by-example role:
- You design the architecture
- You write the Python
- You ship to production
- You mentor engineers by building real systems
You will lead the technical direction of RAG and agent memory systems, while remaining deeply involved in implementation, observability, and operational readiness.
GCP is our primary platform, but all designs should be multi-cloud capable.
Key Responsibilities
RAG & Backend Engineering (Python-First)
- Design and build production-grade RAG pipelines
- Implement:
- Retrieval strategies
- Vector database integrations
- Agent memory and state management
- Prompt orchestration and chaining
- Build scalable Python services using FastAPI / Django / similar
- Integrate LLM APIs (OpenAI, Claude, Gemini) and open-source models (Llama, Mistral)
- Implement model/version rollout, rollback, and simulation testing
Agentic Systems & Workflow Design
- Build and operate multi-step agent workflows
- Enable:
- Tool calling
- Human-in-the-loop interventions
- Safe agent execution patterns
- Define patterns for:
- Prompt versioning
- Context management
- Token and cost control
- Collaborate closely with AgentOps to ensure production-safe execution
Full-Stack & Observability
- Design and contribute to internal UIs for:
- Agent execution monitoring
- Decision and reasoning audits
- Prompt testing and visualization
- Implement structured logging and telemetry for:
- Retrieval quality
- Agent decisions
- Token usage and latency
- Work with Prometheus, Grafana, OpenTelemetry, or ELK-based stacks
Cloud, DevOps & Production Readiness
- Own deployment pipelines for RAG and agent services
- Work hands-on with:
- Docker
- Kubernetes
- Terraform
- CI/CD pipelines
- Ensure secure API design, auth, sandboxing, and operational guardrails
- Optimise for scalability, performance, and cloud cost efficiency on GCP
Technical Leadership & Team Enablement
- Act as technical lead for Agentic RAG engineering
- Set architectural standards and best practices
- Review code and designs with a high bar
- Mentor engineers in:
- Pythonic system design
- RAG correctness and evaluation
- Production-grade GenAI systems
- Partner with Product and Platform leads on roadmap and delivery
Required Skills & Experience
Core Engineering
- 5+ years of strong Python engineering experience
- Proven backend or full-stack development background
- Experience with FastAPI, Django, Flask, or similar frameworks
- Comfort contributing to frontend systems (React / Next.js / Vue) when needed
RAG, LLMs & Agentic Systems
- Hands-on experience building RAG pipelines in production
- Strong understanding of:
- Vector databases and retrieval strategies
- Prompt chaining and context handling
- Agent workflows and tool invocation
- Experience with frameworks such as:
- LangChain
- LangGraph
- LlamaIndex
- AutoGen / CrewAI
Cloud & Platform Engineering
- Strong experience with GCP, including:
- Vertex AI
- GKE / Compute Engine
- Cloud Functions
- Cloud Storage, Pub/Sub
- Hands-on DevOps skills:
- Docker
- Kubernetes
- Terraform
- CI/CD tooling
- Understanding of secure APIs, auth, and sandboxing patterns
Nice to Have
- Multi-cloud experience (AWS, Azure)
- Experience with Responsible AI, guardrails, and eval frameworks
- Contributions to open-source AI or infrastructure projects
- Experience building internal tooling or monitoring dashboards for AI systems
What Success Looks Like
- RAG systems are accurate, observable, and cost-efficient
- Agent failures are explainable and debuggable
- Engineers follow clear, scalable RAG patterns you defined
- Product teams trust agent outputs in production
- You are the technical authority for RAG at BridgeAI
What This Role Is (and Is Not)
✔ Deeply hands-on technical leadership
✔ Python-first engineering
✔ Production-grade RAG ownership
✔ Mentorship through real code
✖ Not a research-only role
✖ Not a people-manager-only role
✖ Not a demo or prototype position