We are looking for NOC Team Lead, responsible for handling the team supporting the network & infrastructure at Nference. NOC Team Lead will be in charge of monitoring critical network elements and engaging in proactive network systems monitoring and provide 24/7 proactive operational monitoring and support.
This position is responsible for technical support and issues that come into the NOC via customer and/or monitoring software.
Responsibilities
- Responsible for managing and coordinating the NOC team.
- Point of contact for all NOC escalations, both external and internal.
- Drives incident management, monitoring, tracking, and ensuring that SLAs are met.
- Develops and Implements new solutions, strategies, and processes to support the NOC's standard operating procedures.
- Sets work schedules for 24x7x365 coverage.
- Oversees the work of the team to ensure that system requirements have been properly implemented and procedures carefully followed.
- Provides input to improve stability, security, efficiency, and scalability of systems.
- Responsible for monitoring production, staging and development environments for a large number of applications in an agile, fast paced organization.
- Monitoring the performance and capacity of computer systems in real time and responding to alerts in near real-time.
- Server build and installs, application upgrades, network equipment build and installation. Maintaining hardware audits.
- Performing regular checks on the production stack to ensure the systems and services are running in an optimal fashion.
- Analyzing production network issues to suggest corrective action.
- Provide critical service outage notification and escalate issues for timely resolution; notify appropriate Company personnel as appropriate
- Effectively communicates with team members and trains them in technical aspects.
- Maintenance of WIKI and technical documentation (for NOC) of processes and procedures used throughout normal operations.
Requirements
- 5-7 years of experience working in high pressure, dynamic, 24x7x365 environments.
- At least 1 year of experience handling a NOC team.
- People leadership and direction experience. For example: shift scheduling, disciplinary, hiring, etc
- Experience with Datacenter Technologies including Public Cloud (AWS/GCP)
- Application support and Deployment experience using tools such as Git, Puppet, etc.
- Familiarity with scripting tools such as Python and Bash.
- Experience using infrastructure automation tools such as Terraform
- Excellent troubleshooter, utilizing a systematic problem-solving approach spanning code, systems, and network theory & protocols (TCP/IP, UDP, ICMP) ability to read a packet capture/TCPdump, etc
- Good network diagnostic skills.
- Basic Linux CLI and sysadmin skills.
- Work well in a busy team, being quick to learn and able to deal with a wide range of issues.
- Strong analytical skills and able to collate and interpret data from various sources.
- Ability to assess and prioritise faults and respond or escalate accordingly.
- Ability to work independently and accommodate various shifts in a 24x7x365 environment
- Capable of multitasking, good time management and prioritisation of workload.
- Willing to learn and develop new skills.
- Strong interpersonal/communication skills.
Skills: linux