Job Description
Role: Site Reliability Engineer (SRE) / Monitoring Engineer
Responsible for maintaining site reliability through proactive monitoring, incident detection, and resolution. Ensures system uptime, performance, and resilience by designing and adjusting monitoring solutions, collaborating with cross‑functional teams, and automating repetitive tasks.
Skills / Products (Network, Common, Software Engineering)
Network: Palo Alto Firewall, Citrix NetScaler, Cisco ACI & Routers, VeloCloud & Aruba SD-WAN, Aruba Wireless, Meraki Wireless, Cisco RAS (VPN), NSX Firewall, Infoblox.
Common: Puppet, Ansible, vulnerability management best practices, ITSM, release management.
Software Engineering: GitHub, Python or another programming language.
Capabilities
Troubleshoot routing, latency, throughput, and device‑level issues.
Review and deploy automation manifests and playbooks (Puppet, Ansible).
Identify and prioritize vulnerability remediation.
Apply ITSM and release management practices.
Store and update code/scripts in GitHub.
Write automation code independently, leveraging AI if needed.