Search by job, company or skills

  • Posted 7 days ago
  • Be among the first 50 applicants
Early Applicant

Job Description

Key Responsibilities

  • Design, implement, and manage AWS cloud infrastructure with a focus on high availability, fault tolerance, and scalability.
  • Architect and operate containerized workloads using Amazon EKS and AWS Fargate.
  • Automate infrastructure provisioning and configuration using Ansible and Infrastructure-as-Code best practices.
  • Manage and optimize AWS services including MSK (Managed Kafka), ElastiCache, Route 53, and related networking components.
  • Design and implement multi-region, multi-AZ, and replication strategies to ensure resilience and disaster recovery.
  • Monitor system performance, reliability, and cost, and proactively drive optimizations.
  • Implement and enforce security best practices including IAM, network isolation, secrets management, and compliance standards.
  • Troubleshoot complex production issues across infrastructure, networking, and application layers.
  • Collaborate closely with application, platform, and security teams to support CI/CD pipelines and release automation.
  • Improve observability, alerting, and incident response processes to drive operational excellence.
  • Take end-to-end ownership of tasks and deliverables, ensuring timely and high-quality outcomes with minimal supervision.

Mandatory Skills

  • Strong hands-on experience with AWS services including EC2, VPC, IAM, ALB/NLB, Security Groups, and CloudWatch.
  • Deep expertise in Amazon EKS and AWS Fargate in production environments.
  • Proven experience with Ansible for configuration management and automation.
  • Experience working with Amazon MSK (Kafka), including cluster setup, scaling, and monitoring.
  • Hands-on experience with Amazon ElastiCache (Redis or Memcached).
  • Strong understanding of Route 53, DNS routing, health checks, and traffic management.
  • Experience implementing multi-AZ deployments, replication, and disaster recovery strategies.
  • Strong understanding of Linux systems, networking fundamentals, and container orchestration.
  • Experience with CI/CD tools and Git-based workflows.
  • Ability to work independently, prioritize effectively, and consistently meet delivery timelines.

Desirable Skills

  • Experience with Terraform or CloudFormation.
  • Knowledge of observability and monitoring tools such as Prometheus, Grafana, ELK, or OpenTelemetry.
  • Experience working on high-scale, high-availability production systems.
  • Exposure to security and compliance frameworks such as ISO, SOC2, or PCI-DSS.
  • Strong problem-solving, decision-making, communication, and collaboration skills.
  • Ownership mindset with accountability from design through production support.

More Info

Job Type:
Industry:
Employment Type:

About Company

Job ID: 142737595

Similar Jobs