Exp: 3+ years
Location: Bangalore/Pune
Shift timing: 1pm-9pm(Hybrid, Cab facility available)
- Strong hands-on experience with Apache Kafka (administration & operations)
- Experience managing Kafka in both on-premises and cloud environments
- Solid expertise in Ansible and Terraform
- Experience with monitoring tools (Grafana, Prometheus, Datadog, ELK, etc.)
- Good understanding of Linux, networking, and distributed systems
- Proven ability to troubleshoot complex Kafka and infrastructure issues
Key Responsibilities:
- Deploy and manage Kafka clusters on on-prem and cloud platforms (AWS/Azure/GCP)
- Configure, optimize, and maintain Kafka components, including:
- Kafka Brokers
- Zookeeper / KRaft
- Schema Registry
- Kafka Connect
- Automate provisioning and configuration of Kafka infrastructure using Ansible
- Build and manage scalable cloud infrastructure using Terraform
- Create and maintain monitoring dashboards (Datadog, Grafana, Prometheus)
- Implement alerting for cluster health, consumer lag, throughput, storage, latency, and broker performance
- Implement security best practices (authentication and authorization)
- Manage upgrades, patching, scaling, and disaster recovery processe