Lead Systems Engineer

4-8 years
2 months ago 1 Applied
Job Description


Design, build, manage and operate infrastructure and configuration of all platform environments with a focus on automation and infrastructure as code.

Design, build, manage and operate the infrastructure as a service layer (hosted and cloud-based platforms) that supports the different platform services.

Develop a log analytics solution to provide logging-as-a-service to hosted applications based on open-source solutions, to speed up the debugging process.

Evaluate performance trends and expected changes in demand and capacity and establish the appropriate scalability plans.

Identify and troubleshoot any availability and performance issues at multiple layers of deployment, from hardware, operating environment, network and application.

Recommend and maintain technology related policies and procedures.

Identify and suggest various opportunities to improve efficiency and functionality.

Implement data security and protection.


Administrating services status in Kubernetes Master and Minions

Administrating Pods, Docker images, services, replication controller

Scaling/Descaling Pods

Administrating configuration changes in the Kubernetes files

Microservices Deployment

Troubleshooting issues encountered during deployment

Provide Support to L1 L2 team in creating RCAs resolving issues in completing daily assigned activities including the training

Provide on-call support

Good knowledge in shell scripting

Conforming to client compliances and expectations

Istio or Service Mesh knowledge would be a plus.

Profile required

Should have minimum 4-8 years working experience in Docker/K8s and certified as Kubernetes administrator.

Preferably from a Linux Systems Admin or DevOps Engineering background.

The candidates must have experience in larger cluster administration.

Monitor system events to ensure health, maximum system availability and service quality

Manage the container platform ecosystem (installation, upgrade, patching, monitoring)

Troubleshoot complex technical issues and assist development team as necessary to resolve issues related to K8s.

Have experience to Perform system application patching

Provide on call support

Answer user s query and service requests

Solid experience in K8s Application troubleshooting

Should have the basic knowledge about the CI/CD process

Should have knowledge in Python programming

Good to have knowledge in Ansible





Career Advice to Find Better