Job Description
Job Description- Database Site Reliability Engineer
Job Description: Database Site Reliability Engineer Experience: 3-5 Years Job Summary We are looking for a Database Site Reliability Engineer - I to manage, scale, and ensure high availability of open-source database systems. The role focuses on reliability, performance, monitoring, and cloud-native deployments. Key Responsibilities
Manage and maintain open-source SQL and NoSQL databases such as StarRocks, ClickHouse, Druid, and Cassandra.
Design and implement active-active architectures to ensure high availability and fault tolerance.
Handle database backup and restoration processes to ensure data safety. Integrate databases with Prometheus and Grafana for monitoring and alerting. Work with Kafka, EMR, and Kubernetes-based deployments to support data
platforms. Troubleshoot issues and ensure database reliability and performance.
Required Skills
Proven expertise in managing and setting up open-source SQL and NoSQL databases (StarRocks, ClickHouse, Druid, Cassandra, etc.).
Deep understanding of active architectures for high availability. Strong knowledge of database backup and restoration processes. Experience with Prometheus and Grafana for monitoring and observability. Hands-on experience with Kafka, EMR, and Kubernetes-based deployments.