While technology is the heart of our business, a global and diverse culture is the heart of our success. We love our people and we take pride in catering them to a culture built on transparency, diversity, integrity, learning and growth.
If working in an environment that encourages you to innovate and excel, not just in professional but personal life, interests you- you would enjoy your career with Quantiphi!
Role: Senior Platform Engineer
Experience Level: 3+ Years
Location: Hyderabad/Bangalore
Overview: We are seeking a highly experienced and motivated Senior Platform Engineer to join our growing team. You will play a pivotal role in leading the technical design, implementation, and maintenance of our machine learning and application platforms. This role demands deep expertise in GCP services, automated CI/CD pipelines, and asynchronous messaging architectures to ensure our AI solutions meet the highest standards of scalability and reliability.
Roles and Responsibilities
- Lead Technical Design & Implementation: Take a leadership role in the technical design of the ML platform, ensuring it meets stringent performance and security requirements.
- Cloud-Native Deployment: Design and optimize deployment strategies on GCP, specifically utilizing Cloud Run for scalable, containerized applications and FastAPI for high-performance service layers.
- CI/CD Pipeline Ownership: Own end-to-end pipelines utilizing GitLab CI/CD to establish fully automated, robust, and efficient build, test, and deployment processes.
- Data & Messaging Orchestration: Lead the implementation of complex, interdependent workflows using Airflow and manage real-time data streams via Pub/Sub and BigQuery (BQ).
- Kubernetes & Container Mastery: Architect and manage highly available applications using Kubernetes and Cloud Run, focusing on autoscaling, load balancing, and resource management.
- Monitoring & Optimization: Proactively troubleshoot complex issues in real-time pipelines and implement comprehensive monitoring and alerting solutions.
- Infrastructure as Code (IaC): Lead the adoption of IaC principles (e.g., Terraform) for managing all aspects of the cloud and on-premises infrastructure.
- Collaboration & Mentorship: Partner with data scientists and developers to provide expert technical guidance and mentor junior team members.
Skill Set Needed
- Platform Experience (3+ Years): Proven experience in building and managing complex technical platforms with a focus on high-scale infrastructure.
- GCP Mastery: Hands-on expertise with Google Cloud Platform services, including Cloud Run, GCS, BigQuery (BQ), and Pub/Sub.
- Workflow Orchestration: Expert-level experience with Airflow (Cloud Composer) for designing and implementing complex model and data orchestration strategies.
- GitLab CI/CD Leadership: Expert experience building and maintaining automated pipelines specifically within GitLab CI/CD, including security and pipeline orchestration.
- Programming & API Development: Advanced proficiency in Python and FastAPI for building performant, scalable APIs.
- Kubernetes Architecture: Expert-level experience with Kubernetes-based deployments, ingress, service mesh, and cluster operations.
- On-Premises Knowledge: Familiarity with on-premises deployments leveraging frameworks like Ray for distributed computing.
- Analytical Problem-Solving: Exceptional skills in debugging complex performance bottlenecks and system stability issues.
Good to Have
- Experience with other major cloud platforms (AWS/Azure) and hybrid cloud architectures.
- Familiarity with MLOps tools beyond CI/CD, such as MLflow or DVC.
- Knowledge of security best practices for platform engineering, including data governance and compliance.
- Experience with performance and load testing tools like Locust.
If you like wild growth and working with happy, enthusiastic over-achievers, you'll enjoy your career with us!