Job Requirements
Senior Cloud Engineer is responsible for designing, creating, maintaining and monitoring secure scalable AWS Cloud Infrastructure needs for cloud-based Web application/Machine Learning workloads.
Key Responsibilities
- Cloud Infrastructure
- Provision and manage AWS resources: EKS/ECS clusters, VPCs, ALBs/NLBs, IAM roles, RDS, S3, AWS Managed Services like SageMaker, IOT-Core, Greengrass
- Implement secure networking (subnets, NACLs, security groups, flow logs).
- Configure compute services (EC2, Fargate, Auto Scaling).
- Infrastructure as Code
- Develop reproducible IaC templates using Terraform/CloudFormation, and AWS CDK.
- Authentication & Authorization
- Implement IAM policies, roles, and federated identity (OIDC/OAuth).
- Ensure least-privilege access and compliance across accounts.
- Monitoring & Operations
- Set up AWS CloudWatch, CloudTrail, and Systems Manager for secure operations.
- Build custom dashboards with AWS native services and Managed Grafana.
- Implement alerting, log management, and custom metrics for 24/7 health monitoring.
- Governance & Documentation
- Support multi-account strategies and organizational structures.
Work Experience
Required Skills (Technical Competency):
- Cloud: AWS (SAAS, PAAS, IAAS and Development), Containers/Docker
- Strong expertise in provisioning AWS services: EKS/ECS, VPC, ALB/NLB, IAM, RDS, S3, EC2.
- Proficiency in Terraform, CloudFormation, AWS CDK.
- Experience with AWS CloudWatch, CloudTrail, Grafana dashboards.
- Hands-on knowledge of networking and security (NACLs, SGs, routing).
- Familiarity to provision system with secured access (IAM Roles, ACM Cert Management).
- Experience in Shell Scripts to manage/debug plant devices
- Experience in building monitoring systems like log aggregation, analytics, distributed systems tracing, and alerting.
- Expertise with containerization and cluster management technologies like Docker and Kubernetes
- Experience in building monitoring systems like log aggregation, analytics, distributed systems tracing, and alerting.
- Able to build tools from scratch when needed.
- Knowhow of build/release systems, CI/CD systems, AWS DevOps Solutions, Jenkins
- Strong problem-solving skills.
Desired Skill Sets
- Experience with modern web services architecture.
- Ability to conduct workshops and training sessions.
- Experience with Kubernetes/EKS and serverless (Fargate).
- Experience in doing IOT set up and management between Cloud and OnPrem infrastructure
- Knowledge of AWS Organizations and multi-account governance.
- Awareness of CI/CD pipelines for IaC deployments.
- Strong documentation and diagramming skills.
- Ability to quickly learn new and existing technologies.
- Knowledge in configuration and management of continuous improvement tools
- Experience working with multicultural teams.
- Must be a team player and include others in the technical decision-making process.
- Excellent communication skills.