
Search by job, company or skills
Role Overview
We're looking for a Site Reliability Engineer (SRE) who thrives at the intersection of software engineering and systems operations. You will design, build, and maintain scalable, resilient infrastructure, automate operational processes, and ensure our platform remains fast, secure, and highly available.
Note: Apply only if you have these skill sets
Key Responsibilities
Monitor and improve system performance, reliability, and availability.
Detect, analyze, and resolve incidents using observability data and distributed-systems best practices.
Champion observability: metrics, logging, tracing, dashboards, and alerting Ability to understand and grok logs in multiple environments.
Diagnose issues across multi-tier, distributed architectures in a cloud environment (GCP preferred).
Collaborate closely with development teams to embed reliability into the software lifecycle.
Build automation for deployments, monitoring, scaling, and incident response.
Conduct root-cause analysis, write postmortems, and drive long-term remediation.
Ensure security, compliance, and configuration consistency across infrastructure.
Participate in on-call rotations and continuously improve incident management processes.
Knowledge using Symfony/Angular to debug application workflows.
What You Bring
5+ years in SRE, Production Engineering, or Reliability/Performance Engineering.
Strong scripting/programming (Python, Bash, or similar).
Hands-on experience with GCP (preferred) or other cloud providers.
Working knowledge of Kubernetes, CI/CD pipelines, and infrastructure-as-code.
Experience with monitoring/logging tools (ELK, Graylog, etc.).
Solid understanding of distributed systems, networking, and performance fundamentals.
Working familiarity with Symfony and Angular for debugging and integration.
Nice to Have
Experience in SaaS or high-availability, high-traffic environments.
Database performance tuning (MySQL preferred).
Experience with Kafka, offline/edge systems, or event-driven architectures.
Communication & Culture
Clear communicator capable of translating complex issues for technical and non-technical audiences.
Strong cross-functional collaboration, especially during incidents.
Continuous-improvement mindset and a bias for automation.
Comfortable in a fast-paced, product-led, agile environment.
Why Join Us
Join a product-led company with global impact.
Work on challenging multi-country deployments and diverse technologies (Kafka, offline databases, distributed systems).
A passionate, highly skilled team that values ownership, learning, and innovation.
Competitive compensation and a culture of continuous growth.
Job ID: 144362587