PyjamaHR

Senior Platform Engineer

  • Posted a month ago

Job Description

About

QpiAI is a deep tech startup pioneering next-generation computing platforms, empowering enterprises to innovate and deploy AI solutions seamlessly across cloud and edge devices at scale. We're dedicated to making it easier to build meaningful AI-powered experiences.

Key Responsibilities

  • Take ownership of the infrastructure layer for on-premises and cloud deployments.
  • Assist development teams in designing scalable and portable applications.
  • Establish best practices within the organisation and help developers ship fast without breaking things.
  • Own the product and manage periodic reporting to management and senior leadership.
  • Mentor team members to adopt a platform-first mindset.

Ideal Profile

  • Experience in DevOps engineering with Docker, Kubernetes, and shell scripting.
  • Experience with distributed GPU systems and ML infrastructure.
  • Expertise with cloud platforms like AWS, Azure, and GCP.
  • Experience building robust infrastructure for training and serving machine learning models.
  • Ability to set up multi-node Kubernetes clusters and use managed Kubernetes services.

Nice to Have

  • Experience with ML orchestration services like Kubeflow, Flyte, and Prefect.
  • Knowledge of MLOps concepts like model and data versioning.
  • Experience with distributed computing frameworks like Ray.

Skills: Docker, Kubernetes, shell scripting, distributed GPU systems, ML infrastructure, Data Annotation, Data Curation, Model Registry, Model Serving, Workflow Orchestration, Retrieval Augmented Generation (RAG), Agentic Workflows, CUDA_ERROR_VERSION_MISMATCH, multi-node Kubernetes clusters, managed Kubernetes services, kube-native tooling, Ansible, Terraform, Pulumi, scalable data lakes, data processing pipelines, highly available services, databases, AWS, Azure, GCP, networking principles, load balancing, DNS configurations, proxies, integration and deployment pipelines, ML orchestration services, Kubeflow, Flyte, Prefect, distributed computing frameworks, Ray, Role-Based Access Control (RBAC) principles, MLOps concepts, model and data versioning, orchestration, model serving, modern distributed applications

Job ID: 116671679
