Search by job, company or skills

  • Posted 13 hours ago
  • Be among the first 10 applicants
Early Applicant

Job Description

Hiring for AI/ML Ops/GPU acceleration + AI inference/TensorRT/ONNX

Build and maintain containerized applications using OpenShift, OpenShift AI, Kubernetes, and Helm charts.

  1. Integrate and optimize inference engines such as Triton and vLLM for scalable model serving.
  2. Lead model deployment, monitoring, and lifecycle management in production environments.
  3. Implement monitoring and alerting solutions using Grafana and Prometheus.
  4. Collaborate on GenAI and LLM projects, including Agentic AI initiatives.
  5. Automate CI/CD pipelines and infrastructure using Jenkins, Ansible, Groovy, and Terraform.
  6. Develop automation scripts and tools in Python.
  7. Architect, deploy, and manage AI/ML solutions on AWS Cloud; experience with Bedrock and SageMaker is a plus.
  8. Build and enhance AI Platform ( both on premise and in public cloud).
  9. Make is scalable, high performance and resilient
  10. Contribute to future road map and key architecture decisions.

More Info

Job Type:
Industry:
Function:
Employment Type:

Job ID: 143831513