Cloud and AWS Expertise:
- In-depth knowledge of AWS services related to data engineering: EC2, S3, RDS, DynamoDB, Redshift, Glue, Lambda, Step Functions, Kinesis, Iceberg, EMR, and Athena.
- Strong understanding of cloud architecture and best practices for high availability and fault tolerance.
Data Engineering Concepts:
- Expertise in ETL/ELT processes, data modeling, and data warehousing.
- Knowledge of data lakes, data warehouses, and big data processing frameworks like Apache Hadoop and Spark.
- Proficiency in handling structured and unstructured data.
Programming and Scripting:
- Proficiency in Python, Pyspark and SQLfor data manipulation and pipeline development.
Expertise in working with data warehousing solutions like Redshift.