Core Responsibilities
- Database Management: Install, configure, and maintain Distributed database systems.
- Data ingestion: Manage database Pipelines for streaming and load utility for batch data ingestion.
- Performance Tuning: Optimize queries, indexes, and database design for speed and scalability.
- Monitoring & Troubleshooting: Set up monitoring tools, diagnose performance issues, and resolve database-related problems.
- Backup & Recovery: Implement and test disaster recovery strategies and backup plans. Security: Apply database security best practices, manage user roles, and ensure compliance.
- Automation: Develop scripts to automate routine DBA tasks. Collaboration: Work with developers, DevOps, and infrastructure teams to ensure smooth integration.
Required Skills
- Distributed databaseExpertise: Good understanding of Distributed databases (or other columnar stores, such as ClickHouse, Snowflake) distributed architecture, pipelines, and query engine.
- Knowledge of columnstore vs rowstore optimizations: and how to balance them in Single
- StoreStrong SQL proficiency: writing complex queries, stored procedures, and optimizing query performance.
- Scripting like Python and Unix (Shell) skills for automation is desired.
Preferred Databases: Redshift, Singlestore, Greenplum, Clickhouse