About The Role
We are seeking a highly skilled and experienced engineer to design, build, and optimize cloud-native and high-performance computing (HPC) environments on AWS. The ideal candidate will have deep expertise in AWS cloud services, Kubernetes orchestration, and workload scheduling, with additional exposure to HPC environments being a strong plus. As a Lead/Senior Engineer, you will provide technical leadership, guide a team of engineers, and collaborate closely with cross-functional stakeholders to deliver scalable and reliable infrastructure solutions.
Key Responsibilities
- Design, implement, and optimize scalable AWS architectures leveraging services such as EC2, RDS, EKS, FSx, VPC, Lambda, IAM, and CloudWatch.
- Automate using Terraform, CloudFormation, and configuration management tools.
- Develop and manage CI/CD pipelines and cloud-native development tooling.
- Implement network, performance, and security best practices across workloads.
- Deploy and manage containerized applications using Docker on Slurm and Kubernetes.
- Troubleshoot distributed system issues including networking, compute, and storage bottlenecks.
- Participate in cost optimization, governance, and cloud operational excellence.
- Provide mentorship and technical guidance to team members.
- Collaborate with global teams, architects, and stakeholders to align cloud solutions with business priorities.
Required Skills & Qualifications
- 7 years of cloud engineering or DevOps experience with strong hands-on AWS.
- Solid expertise in AWS services including compute, networking, identity, and observability tools
- Strong experience with High Performance Computing (HPC) environments, including workload scheduling, job orchestration, and performance tuning.
- Proven experience with managing IaC pipelines at scale.