Search by job, company or skills

T

Sr Manager - Platform Engineering

new job description bg glownew job description bg glownew job description bg svg
  • Posted 9 days ago
  • Be among the first 10 applicants
Early Applicant

Job Description

Job Description

This is a lead role and is part of AI/HPC engineering team. The role specializes in Platform standardization initiatives, innovation, Testing and Optimization of different AI technologies. Specific role requires Installation, Administration, troubleshooting and analytical skills in the technology stacks covering Linux and Kubernetes. Exposure to Job Schedulers like SLURM and Provisioning tools like Nvidia Base command manager is preferred. Role will involve working with OpenSource Infrastructure and scripting tools like Ansible and similar DevOps tools.

Key Skills

Candidate should be B.E / B. Tech with over 6+ Years of experience in IT Infrastructure industry, 3 to 4 years in HPC and or AI technology with strong knowledge on Scripting and Linux with at least 1-2 years in Kubernetes.

Skills required.

----------------------------

Good experience in Linux OS with scripting

Experienced in HPC and AI Platforms with GPU based Infrastructure.

Managing, Installing, Configuring, Deploying, Troubleshooting and administration of tools like Nvidia BCM, SLURM scheduler, Ansible Playbook/Tower.

Experience in ELK Log management system will be an advantage.

Exposure to Python Scripting

Devops Tools (Preferable but not mandatory)

---------------

Experience in deploying and managing tools like Jenkins, Git, SonarQube, Bugzilla, Harbor Registry

Good to know.( (Preferable but not mandatory)

---------------

Networking: VLAN, VXLAN, InfiniBand, IP Subnetting, Routing, Firewall

Storage: Exposure to Parallel FS based storage, Object storage and NFS

Infrastructure: HP/Dell/ rack servers /GPU

Management /Monitoring tools: Zabbix, Prometheus, Grafana and ServiceNow

More Info

Job Type:
Industry:
Employment Type:

About Company

Tata Communications is a digital ecosystem enabler that powers today&#8217&#x3B;s fast-growing digital economy. We enable the digital transformation of enterprises globally, including 300 of the Fortune 500. We carry around 30% of the world&#8217&#x3B;s internet routes and connects businesses to 60% of the world&#8217&#x3B;s cloud giants.
We have been a part of the rich heritage of the internet in India. Over the last 25 years, enterprise-enabled services have been essential to the adoption of digital services in the country. Connectivity is an essential fabric of sustenance for the economy. We are committed to enabling Industry leaders in this New World of Communications&#8482&#x3B;, with our unique promise of delivering secure connected digital experiences.
In 2020, we announced the launch of &#8216&#x3B;Secure Connected Digital Experience&#8217&#x3B; (SCDx), a proposition intended to meet this growing, worldwide demand for new ways of operating, which includes far higher levels of working from home, rising security risks, a shift to digital commerce, and more contactless experiences. It will help companies currently relying on short-term fixes by providing holistic, secure, enterprise-level digital solutions that address current challenges and are fit for the long term.

Job ID: 133480911