Major accountabilities:
- Responsible for data engineering: developing, testing and maintaining production-grade Foundry data pipelines.
- Work closely with the Preclinical pipeline lead, tech lead and engineering team on data engineering requirements. Actively participate in agile work practices.
- Evaluate and validate new Foundry platform features and align with the pipeline team to realize / enable tech spikes on the Foundry platform
- Collaborate with data scientists, data analysts and technology teams to gather requirements and implement solutions that are tested and documented.
- Coordinate with the Preclinical pipeline team to ensure that quality controls, naming conventions and best practices are followed.
- Participate in PoC development to deliver products to address business needs.
- Understanding of the Foundry platform landscape/roadmap (preferable).
Key performance indicators:
- Delivery of data pipeline engineering activities in a timely manner for the program.
- Execute CI/CD DevOps principles and maintain technical documentation of any new development.
- Apply Quality Engineering principles to ensure high quality delivery.
Job Dimensions:
Impact on the organization: Responsible for the development and maintenance of the Preclinical Data Pipeline and for delivering high-quality PoCs that integrate with business needs and help drive data-driven insights.
Minimum Requirements:
Education: Bachelor's/Master's degree in Computer Science, Applied Mathematics, Engineering, or any other technology-related field; equivalent work experience may also be accepted.
Work Experience:
- 6+ years of IT experience, 4+ years of experience in data engineering on big data platforms.
- Able to design and implement data integration of different data modalities.
- Hands-on programming experience, primarily with Python, PySpark and Spark.
- Hands-on experience with Git workflows and strong knowledge of DevOps (CI/CD and agile frameworks).
- Hands-on experience working with JIRA/Confluence for technical documentation.
- Strong analytical thinking and problem-solving skills.
- Experience building scalable solutions and pipelines on big data platforms.
- Hands-on experience with the Palantir Foundry platform, i.e. the components used to develop data pipelines, such as Code Repository, Code Workbook and Data Connection (preferable).
- Knowledge of AI/ML concepts with hands-on experience will be valuable.
- Knowledge of preclinical in vivo study data, e.g. the CDISC SEND standard, will be desirable.
Skills:
- Back-End Development.
- Code Analysis.
- Big Data Platforms.
- Data Wrangling.
- Software Documentation.
- Software/Data Engineering.
- Software/Data Testing.
- Analytical thinking.
- CDISC SEND.
- Palantir Foundry.
- Unit Testing.