About The Company
Tata Communications Redefines Connectivity with Innovation and Intelligence. Driving the next level of intelligence powered by Cloud, Mobility, Internet of Things, Collaboration, Security, Media services and Network services, we at Tata Communications are envisaging a New World of Communications.
Job Description
- This is a lead role within the AI/HPC engineering team. The role specializes in platform standardization initiatives, innovation, and the testing and optimization of different AI technologies. It requires installation, administration, troubleshooting and analytical skills across technology stacks covering Linux and Kubernetes. Exposure to job schedulers like SLURM and provisioning tools like Nvidia Base Command Manager is preferred. The role involves working with open-source infrastructure and scripting tools like Ansible and similar DevOps tools.
Key Skills
Candidate should hold a B.E./B.Tech with 6+ years of experience in the IT infrastructure industry, including 3 to 4 years in HPC and/or AI technology, strong knowledge of scripting and Linux, and at least 1-2 years in Kubernetes.
Skills Required.
Good experience in Linux OS with scripting
Experienced in HPC and AI Platforms with GPU based Infrastructure.
Managing, installing, configuring, deploying, troubleshooting and administering tools like Nvidia BCM, SLURM scheduler, Ansible Playbook/Tower.
Experience with the ELK log management stack will be an advantage.
Exposure to Python Scripting
DevOps Tools (Preferable but not mandatory)
Experience in deploying and managing tools like Jenkins, Git, SonarQube, Bugzilla, Harbor Registry
Good to Know (Preferable but not mandatory)
Networking: VLAN, VXLAN, InfiniBand, IP Subnetting, Routing, Firewall
Storage: Exposure to Parallel FS based storage, Object storage and NFS
Infrastructure: HP/Dell rack servers, GPU
Management /Monitoring tools: Zabbix, Prometheus, Grafana and ServiceNow