A System Engineer/Administrator with AWS experience is responsible for managing and maintaining Linux-based infrastructure and AWS cloud environments. The role involves deploying, monitoring, securing, and optimizing cloud and on-premises systems to ensure high availability, performance, and compliance.
Key Responsibilities:
- Cloud Infrastructure Management: Design, deploy, and maintain AWS services including EC2, S3, IAM, VPC, Route 53, Lambda, and EBS.
- Linux Administration: Install, configure, patch, and troubleshoot Linux servers (RHEL/CentOS).
- Automation & Configuration Management: Develop and maintain Ansible playbooks and Terraform configurations to automate infrastructure provisioning and administration.
- Security & Compliance: Perform OS hardening, vulnerability remediation, and support audit and compliance requirements.
- Monitoring & Troubleshooting: Monitor infrastructure using tools such as Grafana, Nagios, and Zabbix; resolve critical incidents and conduct root cause analysis (RCA).
- Backup & Disaster Recovery: Manage backup, restore, and high availability/disaster recovery (HA/DR) activities to ensure business continuity.
- Patching & Upgrades: Perform regular OS patching, application updates, and version upgrades.
- Documentation & ITIL Processes: Maintain SOPs and follow Incident, Change, and Service Request management processes.
- 24/7 Support: Participate in shift operations and on-call support to maintain system uptime.
Required Skills:
AWS, Linux Administration, Ansible, Terraform, Shell Scripting, Grafana, Nagios, Security Hardening, Patching, Backup & Disaster Recovery.