
Triomics

Site Reliability Engineer

  • Posted a day ago

Job Description

About Triomics:

Triomics is building the modern technology stack for oncology trial sites and investigators that unifies the workflows of clinical care and clinical research, moving the healthcare industry closer to the vision of Clinical Research as a Care Option. Our platform, which is based on our proprietary oncology-focused large language model (OncoLLM™) co-developed with several leading cancer centers, eliminates the operational inefficiencies in patient recruitment, data curation, and other laborious tasks involved in clinical research, thus enabling the generation of high-quality data and speeding up clinical trials.

About the Role

We are looking for a DevOps / Site Reliability Engineer to help design, build, and maintain scalable, secure, and reliable infrastructure. In this role, you will work closely with engineering teams to streamline deployments, improve system reliability, and ensure our platforms run efficiently in production.

You will be responsible for building automation, managing cloud infrastructure, monitoring systems, and responding to incidents to maintain high availability and performance.

Key Responsibilities

  • Design, implement, and manage cloud-based infrastructure and deployment pipelines.
  • Build and maintain CI/CD pipelines to enable reliable and efficient software delivery.
  • Manage and optimize containerized environments using Kubernetes and Docker.
  • Automate infrastructure provisioning and configuration using Terraform and Helm.
  • Develop and maintain automation scripts using Python and Bash.
  • Monitor system health, performance, and reliability using logging and monitoring tools.
  • Troubleshoot production issues and participate in incident response and root cause analysis.
  • Ensure that infrastructure security, network configuration, and system hardening follow best practices.
  • Collaborate with development teams to improve reliability, scalability, and deployment processes.
  • Maintain clear documentation for infrastructure, processes, and operational procedures.

Requirements

  • 1+ years of experience in DevOps, Site Reliability Engineering, or a related role.
  • Hands-on experience with at least one cloud platform (AWS, Azure, or GCP).
  • Strong experience with Kubernetes, Docker, Jenkins, Terraform, and Helm.
  • Proficiency in Python and Bash scripting.
  • Solid understanding of Linux system administration, networking concepts, and security practices.
  • Experience with monitoring, logging, and incident response systems.
  • Strong communication skills and the ability to document technical processes effectively.
  • Software development experience is a plus.

Nice to Have

  • Experience deploying and scaling AI/ML workloads in production environments.
  • Familiarity with single-tenant deployment models.
  • Experience with MLOps/AIOps platforms such as SageMaker or Kubeflow.
  • Knowledge of chaos engineering and disaster recovery strategies.
  • Experience with cloud cost optimization strategies.
  • Relevant cloud certifications (AWS, Azure, or GCP).

Why Join Us

  • We are revolutionizing a unique industry that has the potential to impact and benefit patients from all over the world; you can create impact at scale.
  • We have had company-sponsored workations in Bali, Sri Lanka, and Manali, and we take pride in our hard-working yet super fun culture.
  • We are working on some of the most challenging problems in a highly regulated industry, which gives you the opportunity to solve some of the most interesting ones.
  • You will get a chance to work with experts from multiple industries and receive best-in-industry compensation.

Perks & Benefits:

  • Unlimited Leave Policy: take time off when you need it. We believe in trust, not tracking.
  • Lunch Provided at the Office: one less daily decision, one happier employee.
  • Flexible Working Hours: we care about output, not clock-ins.
  • Health Insurance: comprehensive coverage for you and your family.
  • Zomato Meal Benefit: breakfast and dinner can be ordered when you come in early or leave late, because effort deserves fuel.


Job ID: 144177597
