Roles and Responsibilities:
- As a DevOps SRE Engineer, should have knowledge of SRE and ITIL Processes
- 3 to 6 years of software development experience
- Have worked on Incidents and Change Release handling in Production Environment
- Must be ready and available to work in shifts and Oncall during weekends
- Contribute to Proactive Monitoring and Automation
- Drive compliance and security efforts related to applications and tools
- Develop and maintain installation and configuration procedures, and drive automation around these areas
- Work towards advancing the DevOps discipline across the enterprise
- Work with the team on manual support evaluation and provide recommendations on DevOps tooling and automation, wherever possible
- Analytical and problem-solving skills to troubleshoot system problems
Secondary Duties:
- Maintain and improve DevOps methodology and pipeline that support CI/CD for on-prem and cloud hosted solutions
- Practical experience of Incident management, Change Management and Release Process
- Troubleshoot and resolve Job failures within SLA
- Work on security scans reports and try to fix vulnerabilities which are in scope
- Should have experience of Application/s Migration from On-Prem to Cloud
Required Skill Set:
- Bachelor's Degree required in Computer Science or Engineering or equivalent certifications
- Experience with any scripting language (PowerShell, Bash, Perl, Python, Ruby)
- Should be well versed in Terraform
- Hands On experience with any cloud platform
- Practical experience on cloud tech stack - K8s (GKE), Cloud Run, Cloud Functions, Platform, Load Balancers and Firewalls
- Understanding of Microsoft and Red Hat Linux server technologies (Active Directory, DNS, LDAP, IIS, basic administration and troubleshooting)
- Well versed with SSL Certificate Management
- Experience with file transfer protocols like FTP, FTPS, SFTP
- Knowledge of file transmission security and best practice:encryption technology, key exchange, internet private circuit
- Ability to bridge the gap between Development and Production Support team
Preferred Qualifications:
- Working experience and understanding of SRE practices and how to align that with a DevOps approach to CI/CD.
- Windows Servers Monitoring and Automation experience
- Security Management & Vulnerability Remediation
- Multiple OS management experience, Windows, Linux
- Experience working with Datadog Monitoring, and/or any monitoring tool experience
- Good to have knowledge ofJenkins/Octopus/Cloud Run, Ansible
- Knowledge of post deployment validation checks, manage cloud secrets, cloud connectivity hub, firewalls, proxies and security
- GCP - Cloud Run, Functions, Load Balancers, Firewalls, Workflows, Monitoring and Cost Optimization
Tools & Technologies:
- Good to have hands-on experience in below tools:
- Service Manager, Incident/Change Management Tool
- Monitoring Tools like Datadog, Logic Monitor, Thousand Eyes, Splunk
- Cloud Platform, preferable GCP
- CI/CD Configuration, Deployment and Management
- JIRA, Jenkins/Octopus, Harness
- Terraform, Ansible, GKE/Kubernetes
- Experience on any Scripting language - PowerShell, Bash, Perl, Python, Ruby