We are looking for a Linux & AWS Support Engineer to join our team and provide 24x7 monitoring, troubleshooting, and support for cloud-based and on-premise infrastructure. The ideal candidate should have strong expertise in Linux administration, AWS, Kubernetes, and monitoring tools.
Key Responsibilities
Infrastructure Monitoring & Troubleshooting
- Provide 24x7 monitoring and triaging of alerts/tickets, ensuring timely resolution.
- Troubleshoot issues related to Linux virtual servers, clusters, and DNS servers.
- Identify recurring incidents/events and escalate as per the escalation matrix.
- Ensure adherence to SLAs as per the customer agreement.
Cloud & System Administration
- Manage and support Linux-based environments with AWS and Kubernetes.
- Work on planned activities like patching, change implementation, and system updates.
- Provide support for clients in a high-availability environment.
- Automate tasks using scripting and automation tools (added advantage).
Collaboration & Process Compliance
- Follow defined processes for incident escalation and resolution.
- Coordinate with cross-functional teams to ensure smooth operations.
- Interact with diverse user communities and provide quick-response support.
Requirements
Mandatory Skills & Experience
- Strong communication and interpersonal skills.
- Proficiency in Linux administration and troubleshooting.
- Hands-on experience with AWS and/or Azure.
- Experience with monitoring tools like Dynatrace, Kibana, Grafana, and Site24x7.
- Ability to work in shifts (24x7 support environment).
Preferred Skills
- Knowledge of Kubernetes and cloud automation tools.
- Experience with scripting for automation.
- Open to learning new technologies and adapting to dynamic environments.
Additional Information
- Onsite work is mandatory during all business working days.