solvex solutions

Senior Operations Engineer

  • Posted 23 hours ago

Job Description

Role: Senior Operations Engineer

Team: Scientific Computing Platform

Location: Bangalore or Chennai (1-2 days/week onsite if near office)

Duration: ASAP – End of 2026

What You'll Do

  • Handle and fix system issues, find their root causes, and prevent recurrence
  • Set up monitoring, logging, and alerts to keep systems reliable and performant
  • Apply Site Reliability Engineering (SRE) practices to improve operations
  • Troubleshoot complex problems across applications, infrastructure, and user issues
  • Manage tools like Ansible, Vault, Consul, Prometheus, and Grafana
  • Automate deployments using GitOps and CI/CD pipelines
  • Support and mentor junior engineers

Must-Have Skills

  • Strong experience with Linux systems (admin, troubleshooting)
  • Knowledge of automation and Bash scripting
  • Experience working in DevOps and Agile environments
  • Good communication skills (able to explain technical topics to non-technical users)

Nice-to-Have Skills

  • Experience with HPC (High-Performance Computing) applications
  • Knowledge of SLURM workload manager
  • Experience with OpenStack
  • Experience with Infrastructure as Code tools (Ansible, Terraform, CloudFormation, CDK)
  • Cloud experience (especially AWS)
  • Familiarity with tools like EasyBuild or Spack
  • Experience with containers (Docker, Singularity, enroot)
  • Knowledge of HPC testing/benchmark tools like ReFrame

Job ID: 147322509