
Search by job, company or skills
Profile Summary:
We are seeking an experienced professional who apart from the required mathematical and statistical expertise also
possesses the natural curiosity and creative mind to ask questions, connect the dots, and uncover opportunities that lie
hidden with the ultimate goal of realizing the data's full potential.
Roles and Responsibilities:
Developing Modern Data Warehouse solutions using Databricks and Azure Stack
Ability to provide solutions that are forward-thinking in data engineering and analytics space
Collaboration with DW/BI leads to understanding new ETL pipeline development requirements.
Triage issues to find gaps in existing pipelines and fix the issues
Work with business to understand the need in reporting layer and develop data model to fulfill reporting needs
Help joiner team members to resolve issues and technical challenges.
Drive technical discussion with client architect and team members
Orchestrate the data pipelines in scheduler via Airflow
Skills and Qualifications:
Bachelor's and/or master's degree in computer science or equivalent experience.
Must have total 12+ yrs. of IT experience and 3+ years experience in Data warehouse/ETL projects.
Deep understanding of Star and Snowflake dimensional modelling.
Strong knowledge of Data Management principles
Experience of Databricks Data & AI platform and Databricks Delta Lake Architecture
Should have hands-on experience in SQL, Python and Spark (PySpark)
Candidate must have experience in Azure stack
Desirable to have ETL with batch and streaming (Kinesis).
Experience in building ETL / data warehouse transformation processes
Experience with Apache Kafka for use with streaming data / event-based data
Experience with other Open-Source big data products Hadoop (incl. Hive, Pig, Impala)
Experience with Open Source non-relational / NoSQL data repositories (incl. MongoDB, Cassandra, Neo4J)
Experience working with structured and unstructured data including imaging & geospatial data.
Experience working in a Dev/Ops environment with tools such as Terraform, CircleCI, GIT.
Proficiency in RDBMS, complex SQL, PL/SQL, Unix Shell Scripting, performance tuning and troubleshoot
Databricks Certified Data Engineer Associate/Professional Certification (Desirable).
Should have experience working in Agile methodology.
Job ID: 137601363