About The Company
Tata Communications Redefines Connectivity with Innovation and Intelligence
Driving the next level of intelligence powered by Cloud, Mobility, Internet of Things, Collaboration, Security, Media services and Network services, we at Tata Communications are envisaging a New World of Communications.
Key Responsibilities / Accountabilities
The L3 OpenStack Administrator is responsible for ensuring reliable and secure cloud operations by managing advanced troubleshooting, performance tuning, upgrades, and incident resolution across all OpenStack services. The role focuses on maintaining high availability, optimizing infrastructure components, automating day-2 operations, and supporting complex tenant and platform needs in a large-scale OpenStack environment.
The ideal candidate should have deep knowledge of RHEL, CentOS, Ubuntu, SUSE, Oracle Linux, along with cloud Linux workloads (AWS, GCP, Azure, OCI), containerization (Docker, Kubernetes, OpenShift), and automation (Ansible, Terraform, Python, Bash).
Major Duties & Responsibilities
- Design and implement highly available and scalable architecture using Red Hat OpenStack Platform.
- Manage and operate the production OpenStack environment across compute, storage, and networking services.
- Lead deployment, configuration, and lifecycle management of OpenStack clusters (including major/minor upgrades and patching).
- Experience with Cisco ESC (Elastic Services Controller) for end-to-end VNF lifecycle management, including deployment, monitoring, scaling, healing, and termination on OpenStack environments.
- Perform capacity planning for compute, storage, and networking resources
- Act as L3 escalation point for critical production incidents (P1/P2)
- Conduct deep troubleshooting of Nova, Neutron, Cinder, Glance, and Keystone services
- Diagnose and resolve complex issues related to RabbitMQ, MariaDB Galera, HAProxy, and Pacemaker clusters
- Optimize performance (CPU pinning, NUMA tuning, hugepages, IO/network optimization) on Red Hat Enterprise Linux
- Manage and troubleshoot storage backends including Red Hat Ceph Storage, iSCSI, and NFS
- Troubleshoot advanced networking issues (OVS/OVN, VLAN, VXLAN, LACP, bonding)
- Implement automation using Ansible and scripting (Bash/Python)
- Ensure high availability, disaster recovery readiness, and backup strategies
- Perform root cause analysis (RCA) and provide preventive action plans
- Define security hardening standards and implement RBAC policies
- Review and improve operational SOPs and documentation
- Experience with virtualization platforms such as KVM, Proxmox, and VMware.
- Mentor L1/L2 engineers and lead technical discussions during major incident bridges
Required Knowledge, Skills And Abilities
- Strong expertise in Red Hat Linux and core OpenStack services with advanced troubleshooting and performance tuning skills.
- Hands-on experience with KVM virtualization, OVS/OVN networking, VLAN/VXLAN, and Ceph or similar storage.
- Proficiency in automation and scripting using Ansible, Bash, Python (Terraform/PowerShell optional).
- Working knowledge of monitoring and logging tools such as Prometheus, Grafana, or ELK.
- Solid understanding of cloud security, IAM, multi-tenant operations, and high-availability architectures.
- Strong analytical and problem-solving abilities for mission-critical environments.
Preferred Additional Skills And Abilities
- Experience with Linux-based Kubernetes clusters (EKS, AKS, GKE, OpenShift, Rancher).
- Understanding of CI/CD pipelines and DevOps tools (Jenkins, Git, GitLab, ArgoCD, Helm).
- Knowledge of big data, logging, and analytics tools (Splunk, ELK Stack, Kafka, Hadoop).
- Familiarity with database management on Linux (MySQL, PostgreSQL, MariaDB, MongoDB, Redis).
Qualifications And Experience
The key skills and experience expected of the candidate are:
- Bachelor's degree in Computer Science, Information Technology, or a related field (or equivalent experience).
- 7+ years of experience in Linux administration with strong exposure to OpenStack operations in production environments.
- Proven experience with virtualization, cloud infrastructure, and networking in large-scale, mission-critical setups.
- Hands-on experience with automation, monitoring, and infrastructure troubleshooting at an L3 level.
Certifications
(Preferred But Not Mandatory)
- Red Hat Certified Engineer (RHCE)
- OpenStack certifications such as COA (Certified OpenStack Administrator)
- Linux certifications (RHCSA, LFCS, or equivalent)
- Cloud or virtualization certifications (AWS/Azure/GCP Associate; VMware VCP is a plus)
- Automation/scripting certifications (Ansible or Terraform are an added advantage)