Help move products through the development lifecycle, from proof of concept to final, production-ready product.
Engineer, build, and maintain scalable, automated data pipelines.
Implement best practices in data management, including data processing, data quality, and lineage.
Manage code repositories and code deployments using Git.
Automate, maintain, and manage systems to ensure the availability, performance, and scalability of the product.
Support regular and ad-hoc data querying and analysis.
Qualifications:
3-5 years of experience with Python and data-focused packages (e.g., Pandas, NumPy), including working with JSON, XML, CSV, and TSV formatted data.
3-5 years of experience with Snowflake and SQL.
2-3 years of experience with dbt.
2-3 years of experience with version control tools such as Git and Bitbucket, and with Linux environments.