Search by job, company or skills

Kevit.io

DevOps Engineer

new job description bg glownew job description bg glownew job description bg svg
  • Posted a day ago
  • Be among the first 10 applicants
Early Applicant

Job Description

About The Role

We are looking for a Devops Engineer to join our platform team and own the cloud infrastructure behind our product. You'll design, run, and evolve our aws- and eks-based platform, spread across multiple aws accounts and regions. Your work will focus on reliability, automation, security, and developer productivity not just keeping the lights on. If you enjoy getting deep into kubernetes, networking, and infra-as-code rather than clicking around in the aws console, this role is for you.

About The Team

You'll be part of our platform team, responsible for the foundational cloud infrastructure that all product engineering teams build on. The team works closely with backend engineers (mostly node.js, with services in python and go) to provide stable, secure, and efficient environments from development through production. We own core building blocks like aws networking, kubernetes clusters, databases, ci/cd pipelines, and observability. The mandate is simple: make the platform reliable, repeatable, and boring so product teams can move fast without breaking the infra.

Key Responsibilities

  • Design, operate, and improve aws infrastructure across multiple accounts and regions, with a strong focus on high availability, security, and cost awareness.
  • Own day-to-day operations of our aws eks clusters, including upgrades, node pool management, autoscaling, ingress, and workload scheduling.
  • Build and maintain advanced vpc networking: route tables, nat gateways, public/private subnets, vpc peering, security groups, and network acls.
  • Implement and maintain infrastructure as code (iac) for all core components (vpc, eks, databases, albs, iam, etc.) to ensure environments are reproducible and reviewable.
  • Manage and harden managed data services (e.g. mongodb, postgresql, redis) at the infrastructure level: provisioning, backups, failover configurations, monitoring, and upgrades.
  • Own and improve ci/cd workflows using github actions, from build and test pipelines to automated deployments into multi-account/multi-region environments.
  • Work closely with engineering teams using github, node.js, python, and go to standardize build, test, packaging, and deployment practices.
  • Manage container image workflows with docker and aws ecr, including image build standards, tagging strategies, and cleanup policies.
  • Contribute to and improve observability across the stack (metrics, logs, alerts, dashboards) so incidents are detected early, and debugging is practical.
  • Participate in incident response, root cause analysis, and post-incident improvements to drive reliability across the platform.
  • Document infrastructure, standards, and runbooks so operational knowledge is shared and not locked in individuals heads.
  • Continuously seek opportunities to automate manual operational tasks and reduce toil for both the platform and product engineering teams.

What You Bring

  • Strong hands-on experience running production workloads on aws (networking, compute, storage, iam, managed services).
  • Solid experience with Kubernetes (preferably eks): deploying applications, managing clusters, working with ingress, services, config maps, secrets, and rbac.
  • Practical understanding of aws networking concepts: vpc design, subnets, route tables, nat gateways, vpc peering, and security group design.
  • Experience implementing and maintaining infrastructure as code for cloud resources.
  • Good grasp of ci/cd concepts and experience building pipelines (ideally github actions).
  • Comfort working with docker: building images, optimizing dockerfiles.
  • Working knowledge of s3, cloudfront, and alb.
  • Ability to collaborate closely with software engineers.
  • Clear, concise communication and strong documentation habits.

Good To Have

  • Experience with multi-account and multi-region aws setups.
  • Background with observability tools.
  • Experience tightening security in cloud environments.
  • Exposure to platform engineering concepts.
  • Ability to read and reason about application code in node.js, python, or go.
  • Experience in a product/SaaS environment.

What We Offer

  • Ownership of meaningful platform components.
  • A modern, cloud-native stack with real scaling and reliability challenges.
  • Collaboration with experienced engineers.
  • A culture valuing pragmatic, clear, and sustainable engineering.
  • Support for continuous learning and growth.

More Info

Job Type:
Industry:
Function:
Employment Type:

About Company

Job ID: 135875923