Job Description
About LetitbexAI
LetitbexAI is a fast-growing AI-driven technology company focused on building intelligent, scalable, and enterprise-grade solutions. We work at the intersection of AI, data engineering, cloud, and business transformation, helping organizations unlock real value from artificial intelligence.
Position: Infra Support Engineer
Experience: 3-5 Years
Notice Period: 15 to 30 Days
Role Overview
We are seeking a highly skilled Infrastructure Support Engineer (L2/L3) to join the Managed Services team. The ideal candidate will be responsible for supporting, maintaining, and optimizing cloud-native and on-premises infrastructure environments. This role requires strong expertise in Kubernetes, cloud platforms, Linux administration, and security practices, along with the ability to troubleshoot complex infrastructure issues.
Key Responsibilities
Infrastructure & Cloud Operations
Provide L2/L3 support for infrastructure incidents, service requests, and problem management.
Manage and maintain Kubernetes clusters (deployment, scaling, troubleshooting).
Handle cloud infrastructure operations in AWS and Azure environments.
Monitor system performance, availability, and reliability across environments.
Perform root cause analysis (RCA) and implement preventive measures.
CI/CD & Automation
Manage and maintain CI/CD pipelines (Git-based workflows preferred).
Automate infrastructure provisioning using Terraform (Infrastructure as Code).
Use Ansible for configuration management and automation tasks.
Support deployment processes using Helm charts and containerized applications.
System Administration
Administer Linux environments including Red Hat, Ubuntu, and CentOS 7.
Perform system upgrades, patching, and performance tuning.
Manage user access, system configurations, and OS-level troubleshooting.
Networking & Security
Configure and troubleshoot networking components including firewalls, routing, and traffic policies.
Handle Kubernetes networking and internal cluster communication issues.
Ensure infrastructure security by managing container vulnerabilities, network threats, and security configurations and compliance.
Work with Keycloak for identity and access management.
Data Management & Integration
Manage data transfers using SFTP, AWS native services, and CLI tools.
Support integration and data workflows via REST APIs.
Assist in managing workflows using Apache Airflow (Nextflow is a plus).
Development & Scripting
Deploy and troubleshoot Java-based applications.
Develop scripts/tools to improve operational efficiency.
Documentation & Collaboration
Create and maintain technical documentation, SOPs, and runbooks.
Collaborate with DevOps, Development, and Security teams.
Participate in on-call rotations and incident response.
Required Skills
Core Technical Skills
Kubernetes & Helm (Hands-on cluster management)
CI/CD tools (Git-based pipelines preferred)
Cloud Platforms: AWS & Azure
Terraform (IaC)
Ansible (Automation)
Linux Administration (Red Hat, Ubuntu, CentOS)
Networking fundamentals (firewalls, routing, cloud networking)
Security Engineering (container & network security)
Programming & Tools
Java (basic to intermediate)
REST API integration
Apache Airflow (Nextflow – nice to have)
Data transfer tools (SFTP, CLI, AWS services)
Other Skills
Strong troubleshooting and analytical skills
Experience in L2/L3 support environments
Excellent documentation and communication skills