Job Summary :
We are looking for a Senior Middleware Engineer to administer and operate production Enterprise Linux infrastructure and the middleware services that support critical business applications. You'll ensure the platform is highly available, secure, performant, and observable, and you'll provide end-to-end L3 troubleshooting across OS, networking, and middleware layers while partnering with development teams to improve reliability and delivery quality.
Your primary focus will be keeping existing middleware systems stable, secure, and up to date, while contributing to design improvements and modernization initiatives as the platform evolves.
What You Will Do
- Administer and operate production Linux infrastructure ensuring high availability, security, and performance
- Deploy and operate middleware and integration components for distributed and microservices architectures.
- Design and maintain traffic routing and service exposure layers (HAProxy, Nginx)
- Operate and maintain messaging platforms (RabbitMQ, Kafka)
- Automate infrastructure and middleware configuration using Ansible
- Provide L3 support for international customers, performing end-to-end troubleshooting across OS, networking, and middleware layers
- Lead and contribute to critical production incident resolution and post-incident improvements (RCA, action items, runbooks)
- Work closely with development teams to improve platform reliability, scalability, and delivery quality
Required Skills & Experience
- Strong, proven experience with Enterprise Linux distributions (e.g., RHEL, Rocky Linux, AlmaLinux, CentOS, Oracle Linux). Experience with any of these is acceptable due to their compatibility.
- Networking fundamentals for troubleshooting (TCP/IP basics, HTTPS, connectivity, firewalls, routing concepts, DNS, ports).
- Automation: practical knowledge of Ansible to automate system and middleware deployments and configurations.
- Linux scripting: proficiency in Bash and common Unix tools (sed, awk, grep, etc.).
- Reverse proxy / load balancing concepts with hands-on experience in Nginx and/or HAProxy.
- Experience operating messaging platforms such as RabbitMQ and/or Kafka in production is strongly preferred.
- Red Hat certifications are a plus (RHCSA may be too basic; RHCE is preferred, even if expired).
Nice to Have
- Solid Linux system administration background (RHEL/OEL/Rocky/CentOS), including systemd services, networking, and patching/upgrades
- Experience with NGINX/HAProxy reverse proxy and load balancing concepts
- Experience operating RabbitMQ and Redis in HA setups (or equivalent messaging/caching platforms)
- Scripting and automation skills (Bash/Python) and exposure to configuration management (e.g., Ansible)
- Strong troubleshooting skills, ability to write clear documentation and runbooks
- Experience with service discovery and configuration templating (e.g., Consul)
- Containers and orchestration (e.g., Docker, Kubernetes)
- Infrastructure as Code (e.g., Terraform)
- Cloud and platform exposure (e.g., AWS, VMware vCenter)
- Observability and logging (e.g., Elasticsearch / Logstash / Kibana)
- Experience with Mesos/Marathon and related ecosystem tools
- Version control and collaboration (e.g., Git)
What We Value
End-to-End Ownership
You take ownership of production services and follow through on fixes until they're resolved.
Strong Hands-On Engineering
We expect engineers who dive deep into root cause analysis, automate repetitive work, and drive preventive improvements.
Risk-Aware Change Management
You manage risk in live environments through careful change planning and methodical execution.
Clear Communication
You communicate clearly during incidents, document what you learn, and produce runbooks others can follow.
Continuous Improvement
Blameless postmortems, automation over manual processes, standardisation opportunities, and learning from failure.
What Success Looks Like
- Middleware reliability sits at the centre of application availability — outages are reduced and recovery times improve
- Upgrades and migrations execute smoothly with minimal service disruption
- Teams ship changes safely because the platform is predictable and well-operated
- Operational work is increasingly automated and configurations are standardised
- Documentation and runbooks are current, clear, and actionable
Career Growth
You'll have the opportunity to deepen your expertise across middleware, Linux platform operations, and reliability practices, and to lead stabilisation and modernisation initiatives that improve how the platform is delivered and supported.
Used Technologies (Current Environment)
Linux (RHEL), Bash, Ansible, Nginx, HAProxy, RabbitMQ, Kafka, Docker, Kubernetes, Terraform, AWS, Mesos/Marathon, Consul, VMware vCenter, Elasticsearch / Logstash / Kibana, Git, TCP/IP
How to Apply
Please submit your CV and a short note answering:
- Which components have you operated in production (e.g., Nginx, HAProxy, RabbitMQ, Kafka or similar)
- Which Enterprise Linux distributions have you supported (RHEL/Rocky/Alma/CentOS/OEL), and what types of upgrades/migrations have you performed
- An example of an end-to-end incident you investigated (OS + network + middleware) and how you prevented recurrence.
- How have you used Ansible (or similar) to standardise and automate deployments/configuration
- Use of AI tools in daily life: describe how you have embedded AI into your workflows to improve efficiency and productivity.