We are looking for a lead cloud engineer who has got good experience working with atleast 5 Plus years of experience in AWS and in building cloud components and preferably comes from a development background.
Roles and Responsibilities:
- You will be responsible for the development of cloud agnostic control tower/landing zone platform.
- You will be responsible for the development of terraform based reusable cloud native infra as a code platform.
- You will be responsible along with the team in rolling out continuous observability across different cloud providers.
- You will be responsible along with the team in setting up best in class security posture on the cloud.
- You will be responsible for the development of tools and code bases that contribute to the development of platform under platform engineering initiatives for different teams .
- You will be responsible for participating in Cloud and DevOps architecture connects.
- You will be responsible for mentoring junior resources with right coding best practices as an individual contributor.
- You will be responsible along with the team for delivering infra platforms across different projects.
- You will be responsible along with the team for documenting processes and best practices to ensure knowledge sharing across the team.
Must Have Skills:
- Experience: Around 8 years of total experience, including at least 5 years of hands-on experience in cloud architecture and infrastructure engineering specifically in building and implementing cloud environments, not merely deploying applications on the cloud.
- Expertise in DevOps & Cloud tools
- Hands-on experience with Infra as a code tools like terraform, ansible, puppet, chef, cloud formation , etc..
- Expertise in any Cloud (AWS)
- Good understand of version control (Git, Gitlab, GitHub)
- Hands-on experience in Container Infrastructure (Docker, Kubernetes)
- Ability to define container-based environment topology following principles of designing a well-architected framework.
- Ensuring availability, performance, security and scalability of production system
- Hands on implementation experience in AWS control tower and at least 2 implementations
- Hand on experience with Artifact repositories (Nexus, JFrog Artifactory)
- Hands on experience with CI/CD tools on-premises/cloud (Jenkins, CircleCI, etc. )
- Hands on experience with Monitoring, Logging, and Security (CloudWatch, cloud trail, log analytics, hosted tools such as ELK, EFK, Splunk, Datadog, Prometheus,)
- Hands-on experience with scripting languages like Python, Ant, Bash, and Shell
- Hands-on experience in designing pipelines & pipelines as code.
- Hands-on experience in end-to-end deployment process & strategy
- Hands-on experience of AWS with a good understanding of computing, networks, storage, IAM, Security, and integration services
- Capability to write complex code - e.g., automation of recurring/mundane tasks,
- Good understand of OS administration (CPU, memory, network performance troubleshooting), also demonstrates strong troubleshooting skills
- Demonstrates HA/DR design on Cloud platform as per SLAs/RTO/RPO
- You should have solid understanding of Terraform, understanding of usage and creation of terraform modules
- Good exposure on AWS CLI and SDK like boto3, azure SDK, etc..s
- Good understanding of AWS networking (VPC, security group, routing, TGW and vpc peering
- Good understanding services like RDS Mysql, MongoDB, Document DB, Postgres,
- Good understanding of Kubernetes self-managed clusters, Fargate, S3, NLB, ALB, API gateways, route 53, WAF, Advance shield, Lambda and KMS.
Qualification:
- BE/B.Tech or equivalent degree in Computer Science or related field.
Location: