Role Purpose
Serve as a senior technical authority responsible for designing and delivering advanced AI and cloud platform architectures using Red Hat OpenShift, Red Hat AI Inference Server, OpenShift AI, and hybrid/multi-cloud environments. Enable on-prem model training, enterprise-grade inference, and automation-driven managed service platforms. Drive deep technical exploration, feasibility assessments, and end-to-end implementation patterns that position the organization as a next-gen AI-driven managed services provider.
Key Responsibilities
- Architect robust platforms leveraging Red Hat OpenShift, Red Hat AI Inference Server (vLLM-based), OpenShift AI, and associated Red Hat AI products.
- Design and implement AI inference and on-prem model training solutions using vLLM, KServe/ModelMesh, llm-d, GPU orchestration, and Red Hat's validated model catalog.
- Build hybrid and multi-cloud AI platforms across on-prem OpenShift, OpenShift on AWS (ROSA), and other hyperscaler environments.
- Lead identification, exploration, and implementation of AI/LLM use cases that leverage Red Hat inference and training capabilities.
- Develop cloud-agnostic architectures enabling distributed inference, model serving, GPU scaling, and seamless workload placement across clouds.
- Own and deliver PoCs, performance benchmarks, and architectural accelerators that validate solution feasibility.
- Implement automation using Ansible, Terraform, GitOps, and OpenShift-native tooling to standardize multi-cluster operations.
- Establish best practices for observability, cost optimization, governance, and platform security for AI workloads.
- Work with cybersecurity teams to incorporate zero-trust, threat detection, and compliance principles into platform blueprints.
- Serve as a senior technical mentor, guiding engineering teams in deep technical problem-solving, architectural simplification, and design reviews.
- Create reusable architecture patterns, operators, templates, and deployment assets that scale across hybrid/multi-cloud environments.
- Participate in technical governance, architectural due diligence, and customer-facing solution architecture discussions.