IBM

Software Engineer II - HashiCorp Platform DR

2-5 Years
  • Posted 18 days ago
  • Over 100 applicants

Job Description

At HashiCorp, we build the Infrastructure Cloud to help enterprises take a unified approach to reliability, disaster recovery, and operational resilience across cloud and enterprise environments. Our platform ensures the highest standards of availability, performance, and fault tolerance, enabling organizations to operate at scale with confidence.

Role Overview:

As a Disaster Recovery Engineer on the HashiCorp Disaster Recovery team, you will design, implement, and own solutions that strengthen disaster recovery (DR) governance and reliability across our cloud products. Your work will focus on system resilience, operational readiness, and high availability, directly impacting the reliability of our platform.

Key Responsibilities:

  • Design, implement, and optimize DR solutions to ensure high availability and fault tolerance across cloud products.
  • Develop and execute comprehensive DR testing strategies, identifying bottlenecks and failure points impacting Recovery Point Objectives (RPO) and Recovery Time Objectives (RTO).
  • Drive compliance and reliability initiatives by integrating DR best practices into system architecture and leveraging Chaos Engineering to validate failure scenarios.
  • Build scalable automation frameworks for testing, incident simulation, and recovery orchestration to reduce manual effort and improve operational efficiency.
  • Collaborate with cross-functional engineering, product, and infrastructure teams to embed operational readiness into development lifecycles.
  • Lead incident/DR response drills and chaos experiments, analyze test results, document findings, and implement proactive improvements.
  • Monitor system performance and availability, creating dashboards and observability tools to provide actionable insights for reliability improvements.
  • Mentor engineers and promote a culture of resilience, sharing best practices in system design, testing, and disaster recovery preparedness.

Required Education:

  • Bachelor's Degree in Computer Science, Engineering, or a related field.

Preferred Education:

  • Master's Degree.

Required Technical and Professional Expertise:

  • 3+ years of experience in software development, reliability engineering, systems engineering, or non-functional testing, with a focus on disaster recovery, backup, and cloud resilience.
  • Proficiency in Golang and hands-on experience with version control using Git and platforms such as GitLab.
  • Strong understanding of microservices architecture and resilient distributed system design in cloud environments.
  • Experience with CI/CD pipelines, automation, and quality/reliability in software delivery.
  • Exposure to cloud platforms (AWS, Azure, or GCP) and container orchestration technologies (Nomad, Kubernetes).
  • Strong collaboration and communication skills, able to articulate technical concepts across teams.
  • Customer-centric mindset with a focus on high-quality, scalable, and fault-tolerant solutions.

Preferred Technical and Professional Experience:

  • Hands-on experience with HashiCorp products (Terraform, Packer, Waypoint, Nomad, Vault, Boundary, Consul).
  • Prior experience in disaster recovery testing or working on product reliability and resilience.
  • Commitment to continuous learning in reliability engineering and DR strategy development.

More Info

Open to candidates from: Indian

About Company

At IBM, we do more than work. We create. We create as technologists, developers, and engineers. We create with our partners. We create with our competitors. If you're searching for ways to make the world work better through technology and infrastructure, software and consulting, then we want to work with you. We're here to help every creator turn their "what if" into what is. Let's create something that will change everything.

Job ID: 132904095
