Manage and maintain cloudera data platform (CDP) focusing on installation, configuration, security (Kerberos, Ranger), performance tuning (HDFS, YARN,Spark, Kafka), upgrades, monitoring (Cloudera Manager), and automation (Shell/Python scripting) to ensure platform stability, scalability, and optimal performance for data teams, requiring Linux/Unix skills and strong troubleshooting abilities
Data lifecycle management to ensure optimal storage optimization and cost optimization
Monitor cluster health and performance using Cloudera Manager; perform capacity planning, resource tuning, and proactive issue resolution
Perform upgrades, patching, and backup/recovery operations across Cloudera components, ensuring minimal downtime and compliance with SLAs
Troubleshoot and resolve cluster-level issues including service failures, job errors, and performance bottlenecks using logs, metrics, and diagnostic too
Manage and lead team of 6 people for platform administration and operation
Coordinate and collaborate with technology and security for platform audit review and adherence to guidelines
Strong understanding of data integration techniques and best practices
Familiarity with big data technologies and frameworks
Troubleshoot and optimize workflows, provide infrastructure recommendation, managing Development, Production and DR setup
Experience
8 to 12 years mandatory hands-on experience in Cloudera data platform administration, operation, monitoring, performance tuning and management
Experience in managing large cluster
Experience in leading technical team of 5 to 6 member in cloudera data platform
Automation of cloudera platform operation through scripts, performance tuning experience, resolving issues proactively