System Administrator - Linux & Low-Latency Systems
Location
Bengaluru
Employment Type
Full-Time
Experience Required
3-8 Years
Education
BE/B.Tech - Computer Science, Information Technology, Electronics, or related field
Preferred Certifications
RHCE, RHCSA, LPIC, CCNA, CCNP
Job Summary
We are looking for a highly skilled System Administrator with strong expertise in Linux systems, networking, and low-latency infrastructure environments. The ideal candidate will be responsible for deploying, managing, monitoring, and optimizing mission-critical infrastructure with a focus on performance, reliability, and high availability.
The role involves working closely with infrastructure, networking, and engineering teams to maintain ultra-low latency environments, troubleshoot complex production issues, and continuously improve system performance.
Key Responsibilities
Linux System Administration
- Deploy, configure, and maintain Linux servers across Ubuntu, Red Hat, and CentOS environments.
- Perform Linux system tuning including CPU pinning, NUMA optimization, IRQ affinity, huge pages configuration, and kernel parameter tuning.
- Manage OS-level scheduling policies, real-time priorities, and CPU isolation to improve system performance.
- Handle system hardening, patch management, firmware upgrades, and OS lifecycle management.
- Maintain infrastructure documentation, SOPs, and operational runbooks.
Low-Latency Infrastructure Management
- Manage and optimize bare-metal Linux infrastructure for high-performance environments.
- Support infrastructure related to market data, order management, and execution systems.
- Monitor and optimize latency, jitter, and packet loss across the infrastructure stack.
- Work on high-speed network hardware and ultra-low latency networking environments.
- Support exchange connectivity and colocation infrastructure operations.
Network & Data Center Operations
- Manage L2/L3 networking including VLANs, BGP, OSPF, multicast, and high-availability configurations.
- Configure and manage Cisco Nexus or similar data center switches.
- Coordinate with data center teams for rack management, cabling, power, and cooling activities.
- Ensure redundancy, failover readiness, and proactive infrastructure capacity planning.
Monitoring & Automation
- Build and maintain monitoring and alerting systems using Prometheus and Grafana.
- Automate routine operational tasks using Bash and Python scripting.
- Perform infrastructure health checks, deployment automation, log management, and troubleshooting.
- Collaborate with engineering teams to resolve infrastructure bottlenecks and production incidents.
Required Technical Skills
Core Linux
- Ubuntu
- Red Hat
- CentOS
- Linux internals
- Kernel tuning
- CPU pinning
- NUMA optimization
- BIOS/hardware tuning
Networking
- TCP/IP
- UDP
- Multicast
- VLANs
- BGP
- OSPF
- L2/L3 networking
Low-Latency & Infrastructure
- High-speed networking (10/25/40/100GbE)
- Mellanox / NVIDIA NICs
- Solarflare NICs
- Cisco Nexus switches
- DPDK / OpenOnload exposure
- NVMe / high-performance storage systems
Monitoring & Automation
- Bash scripting
- Python scripting
- Prometheus
- Grafana
- Data center operations
Preferred Qualifications
- Experience working in low-latency, HFT, trading, financial services, or high-performance computing environments.
- Strong troubleshooting skills in mission-critical production infrastructure.
- Exposure to colocation environments and high-speed trading infrastructure is an added advantage.
What We're Looking For
- Strong Linux administration and troubleshooting skills
- Excellent understanding of networking fundamentals
- Experience in performance tuning and system optimization
- Ability to work in fast-paced production environments
- Strong analytical and problem-solving capabilities
- Good communication and coordination skills