As a Data Engineer on the R&D Precision Medicine team, you will be responsible for the end-to-end development of an enterprise analytics and data mastering solution using Databricks and Power BI. This role requires expertise in both data architecture and analytics to create scalable and reliable solutions that support research and advanced research pipelines. You will be a key player in creating unified repositories of human data by integrating information from multiple sources.
Roles & Responsibilities
- Solution Design & Development: Design and build scalable enterprise analytics solutions using Databricks, Power BI, and other modern data tools. You will leverage data virtualization, ETL, and semantic layers to unify data while reducing data proliferation.
- Data Modeling & Integration: Develop advanced SQL queries to profile and unify data. You will design robust data models and processing layers to support analytical and operational reporting needs, ensuring seamless data flows across platforms.
- Power BI & Reporting: Develop and maintain Power BI solutions, including models and reporting packages, ensuring they are optimized for performance and scalability.
- Governance & Best Practices: Design and develop solutions based on best practices for data governance, security, and compliance within Databricks and Power BI environments. You will also create robust documentation from data analysis and profiling.
- Collaboration & Innovation: Collaborate with customers, product teams, and other IT teams to define data requirements and project goals. You will continuously evaluate and adopt new technologies to enhance the architecture and performance of data solutions.
Qualifications
- A Master's degree with 1-3 years of experience, a Bachelor's degree with 3-5 years of experience, or a Diploma with 7-9 years of experience in Data Engineering.
- Minimum of 3 years of hands-on experience with BI solutions (Power BI preferred), including report development, dashboard creation, and optimization.
- Minimum of 3 years of hands-on experience building Change-Data-Capture (CDC) ETL pipelines, data warehouse design, and enterprise-level data management.
- Hands-on experience with Databricks, including data engineering, optimization, and analytics workloads.
- Deep understanding of Power BI, including model design, DAX, and Power Query.
- Proven experience designing and implementing data mastering solutions and data governance frameworks.
- Expertise in cloud platforms (AWS), data lakes, and data warehouses.
- Certifications such as SAFe Agile Practitioner (6.0), Microsoft Certified: Data Analyst Associate (Power BI), and Databricks Certified Professional are preferred.
Soft Skills
- Problem-Solving: Excellent analytical and troubleshooting skills, with a deep intellectual curiosity and the ability to learn quickly.
- Communication: Strong verbal and written communication skills, with the ability to present complex technical topics to varied audiences.
- Leadership & Initiative: A high degree of initiative and self-motivation, with the confidence to act as a technical leader.
- Collaboration: The ability to work effectively with global, remote teams and successfully handle multiple priorities.
- Domain Knowledge: Experience with human healthcare data, laboratory testing, and clinical trial data management is a plus.