
Search by job, company or skills
About TrueFoundry
Every production AI system, whether it's powering customer support, writing code, analyzing financial data, or diagnosing medical conditions, needs the same foundational infrastructure.A way to route between models. A way to manage tools and integrate them securely. A way to orchestrate agents and enforce governance. A unified compute layer to run it all.
That infrastructure layer is being built right now.
We're TrueFoundry, and we're building it. We're looking for a Senior SRE/DevOps Engineer to join the team.
The Problem We're Solving
Companies are moving beyond simple chatbots to production agentic systems. These systems route between OpenAI, Anthropic, Google, and self-hosted models. They integrate dozens of tools via protocols like MCP. They orchestrate multi-agent workflows where agents coordinate with other agents.
The infrastructure to support this doesn't exist yet. You can't just duct-tape together a few API calls and call it production-ready.
You need a control plane that handles:
We've built two products to solve this:
AI Gateway is the control plane, five composable components (Prompts, LLM Gateway, MCP Gateway, Guardrails, Agent Gateway) that handle routing, orchestration, and governance.
AI Deploy is the compute layer, a Kubernetes-based platform that abstracts ML workloads as standard software primitives, so everything runs on unified infrastructure.
We're Series A, backed by Intel Capital and Sequoia. Companies like CVS, Mastercard, Siemens, Paytm, Synopsys, and Zscaler run production AI workloads on our platform.
Roles / Responsibilities:
Requirements
Experience with Golang or Python is must.
Benefits at TrueFoundry
Our Way Of Working
Job ID: 148678221
Skills:
Azure, Devops, Automation, Python, Terraform, Cloud security
Skills:
Java, Terraform, Ansible, Apache Cassandra, Prometheus, Aws Ec2, Grafana, Python
Skills:
Prometheus, Apache Tomcat, Grafana, Docker, Terraform, MySQL, Python, Java, Newrelic, Cortex, Datadog, Jenkins, Linux, Ansible, Splunk, Puppet, Azure, Kubernetes, Chef, Go, Groovy DSL, cfengine, Cloud Formation Templates, AKS, EKS
Skills:
Jenkins, Terraform, Ansible, Apache Kafka, Aws Ec2, Python, GitLab CI, GitHub Actions, Kafka Schema Registry, Kafka MirrorMaker
Skills:
Grafana, Aws Ec2, Ansible, Prometheus, Kubernetes, Python, Docker, Terraform, Jenkins, Apache Kafka, Elasticsearch, PostgreSQL, GitHub Actions, Kafka MirrorMaker, Kafka Schema Registry, GitLab CI, OpenSearch
We don’t charge any money for job offers