- Architect, implement, and manage a highly available Google Cloud environment.
- Design VPC, Cloud DNS, VPN, Cloud Interconnect, Cloud CDN, and IAM policies to enforce security standard processes.
- Implement robust security practices and enforce security policies using Identity and Access Management (IAM), VPC Service Controls, and Cloud Security Command Center.
- Architect solutions with cost optimization in mind using Google Cloud Billing and Cloud Cost Management tools.
Infrastructure as Code (IaC) & Automation
- Deploy and maintain infrastructure using Infrastructure as Code (IaC) and Site Reliability Engineering (SRE) principles, with tools like Terraform and Google Cloud Deployment Manager.
- Automate deployment, scaling, and monitoring using GCP-native tools & scripting.
- Implement and manage CI/CD pipelines for infrastructure and application deployments.
Cloud Security & Compliance
- Enforce standard methodologies in IAM, encryption, and network security.
- Ensure compliance with SOC2, ISO27001, and NIST standards.
- Implement Google Cloud Security Command Center, Cloud Armor, and Cloud IDS for threat detection and response.
Monitoring & Performance Optimization
- Set up Google Cloud Monitoring, Cloud Logging, Cloud Trace, and Cloud Profiler to enable proactive monitoring, trace analysis, and performance tuning of GCP resources.
- Implement autoscaling, Cloud Load Balancing, and caching strategies for performance optimization.
- Troubleshoot cloud infrastructure issues and conduct root cause analysis.
Collaboration & DevOps Practices
- Work closely with software engineers, SREs, and DevOps teams to support deployments.
- Maintain GitOps standard processes for cloud infrastructure versioning.
- Support the on-call rotation for high-priority cloud incidents.
What we expect of you
- We are all different, yet we all use our unique contributions to serve patients. This is a hands-on engineering role requiring deep expertise in Infrastructure as Code (IaC), automation, cloud networking, and security.
- Blending cloud engineering and operations expertise, the individual will ensure that our cloud environment runs efficiently and securely, and will be responsible for the day-to-day operational management, support, and maintenance of the cloud infrastructure.
Must-Have Skills:
- Deep hands-on experience with GCP (IAM, Compute Engine, Google Kubernetes Engine (GKE), Cloud Functions, Cloud Pub/Sub, BigQuery, Cloud SQL, Cloud Storage, Cloud Firestore, Cloud Load Balancing, VPC, etc.).
- Expertise in Terraform for GCP infrastructure automation.
- Strong knowledge of GCP networking (VPC, Cloud DNS, VPN, Cloud Interconnect, Cloud CDN).
- Experience with Linux administration, scripting (Python, Bash), and CI/CD tools (Jenkins, GitHub Actions, GitLab, etc.).
- Strong troubleshooting and debugging skills in cloud networking, storage, and security.
Good-to-Have Skills:
- Prior experience with containerization (Docker, Kubernetes) and serverless architectures is a plus.
- Familiarity with cloud CDK, Ansible, or Packer for cloud automation.
- Exposure to hybrid and multi-cloud environments (AWS, Azure).
- Familiarity with high-performance computing (HPC) and NVIDIA DGX Cloud.
Basic Qualifications:
- Bachelor's degree in computer science, IT, or a related field with 6-8 years of hands-on cloud experience.
Professional Certifications (preferred):
- Certifications in GCP (e.g., Google Cloud Certified Professional – Cloud Architect and Cloud DevOps Engineer) are a plus.
- Terraform Associate certification.
Soft Skills:
- Strong analytical and problem-solving skills.
- Ability to work effectively with global, virtual teams.
- Effective communication and collaboration with cross-functional teams.
- Ability to work in a fast-paced, cloud-first environment.