Key Responsibilities
Infrastructure Reliability & Operations
- Ensure high availability, reliability, scalability, and manageability of data storage and Unix infrastructure
- Provide L2/L3 technical support for infrastructure outages and service-level issues
- Coordinate with operations teams to resolve incidents and maintain system stability
System Support & Troubleshooting
- Diagnose and resolve complex configuration issues and performance bottlenecks
- Support backup and recovery procedures to ensure data integrity and business continuity
- Participate in change management and ensure smooth execution of scheduled system updates
Automation & Engineering
- Develop and maintain automation solutions for Unix infrastructure operations
- Work as part of the Core Automation Team for global infrastructure management
- Contribute to automation projects and large-scale global rollouts
Documentation & Process Management
- Create and maintain detailed documentation for infrastructure systems, configurations, and processes
- Define and improve operational procedures and system support workflows
Monitoring & Tooling
- Work with infrastructure management, monitoring, and observability tools such as Foreman, the ELK Stack, and Grafana
- Improve system visibility and detect issues proactively through monitoring solutions