Search by job, company or skills

Tata Communications

Sr Manager - Platform Engineering

new job description bg glownew job description bg glownew job description bg svg
  • Posted 9 days ago
  • Be among the first 10 applicants
Early Applicant

Job Description

About The Company

Tata Communications Redefines Connectivity with Innovation and IntelligenceDriving the next level of intelligence powered by Cloud, Mobility, Internet of Things, Collaboration, Security, Media services and Network services, we at Tata Communications are envisaging a New World of Communications

Job Description

  • This is a lead role and is part of AI/HPC engineering team. The role specializes in Platform standardization initiatives, innovation, Testing and Optimization of different AI technologies. Specific role requires Installation, Administration, troubleshooting and analytical skills in the technology stacks covering Linux and Kubernetes. Exposure to Job Schedulers like SLURM and Provisioning tools like Nvidia Base command manager is preferred. Role will involve working with OpenSource Infrastructure and scripting tools like Ansible and similar DevOps tools.

Key Skills

Candidate should be B.E / B. Tech with over 6+ Years of experience in IT Infrastructure industry, 3 to 4 years in HPC and or AI technology with strong knowledge on Scripting and Linux with at least 1-2 years in Kubernetes.

Skills Required.

Good experience in Linux OS with scripting

Experienced in HPC and AI Platforms with GPU based Infrastructure.

Managing, Installing, Configuring, Deploying, Troubleshooting and administration of tools like Nvidia BCM, SLURM scheduler, Ansible Playbook/Tower.

Experience in ELK Log management system will be an advantage.

Exposure to Python Scripting

Devops Tools (Preferable but not mandatory)

Experience in deploying and managing tools like Jenkins, Git, SonarQube, Bugzilla, Harbor Registry

Good to know.( (Preferable but not mandatory)

Networking: VLAN, VXLAN, InfiniBand, IP Subnetting, Routing, Firewall

Storage: Exposure to Parallel FS based storage, Object storage and NFS

Infrastructure: HP/Dell/ rack servers /GPU

Management /Monitoring tools: Zabbix, Prometheus, Grafana and ServiceNow

More Info

Job Type:
Industry:
Employment Type:

About Company

Job ID: 133382723