
Search by job, company or skills
. In this Senior data engineer role, you will be responsible for designing and implementing complex data pipelines, optimizing data transformations, and ensuring the scalability and performance of our data infrastructure .
. The ideal candidate will have a deep technical background and a proven track record in managing data engineering projects and collaborating with cross-functional teams to drive data-driven decisions. This opportunity will provide you with experience working with cloud data platforms such as Snowflake and data transformation tools like dbt. You will gain exposure to AWS infrastructure for building innovative and efficient data solutions and work with various data sources, including Oracle, MS SQL Server, and Postgres.
Basic Qualifications
. Bachelor's or Master's degree in Computer Science, Engineering, Data Science, or related field.
. 6+ years of experience in data engineering, with a proven track record of working in large-scale data initiatives.
. Deep expertise in Python, PySpark.
. Strong hands-on experience with Databricks (Spark, Delta Lake, Workflows)
. Strong experience with AWS (S3, IAM, Textract, Bedrock or equivalent)
. Experience with design and implement scalable document ingestion pipelines using Databricks Auto Loader and AWS S3.
. Understanding of vector embeddings and semantic search
. Strong understanding of data governance, privacy, and compliance in regulated industries (healthcare, life sciences).
Good To have :
. Advanced knowledge of data modeling, lakehouse/lake/warehouse design, and performance optimization.
. Familiarity with generative AI platforms and use cases.
. Contributions to open-source projects or thought leadership in data engineering/architecture.
. Experience with Agile methodologies, CI/CD, and DevOps practices.
. Exposure to FastAPI, or API-based ML services
. Experience evaluating LLM output quality
Key Responsibilities :
. Design, develop, and optimize complex data pipelines and transformation processes using Snowflake, dbt, and AWS services.
. Develop and maintain scalable data models and schemas in Snowflake, ensuring they meet performance and business requirements.
. Monitor and fine-tune the performance of data pipelines, queries, and data models to ensure optimal efficiency and cost-effectiveness.
. Utilize Snowflake's features, such as Time Travel, Zero-Copy Cloning, and Data Sharing, to enhance data management and performance.
. Leverage AWS services, such as AWS Lambda, S3, and Glue, to build and manage serverless data processing workflows and data storage solutions.
. Implement data security measures and ensure compliance with data privacy regulations and organizational policies.
. Troubleshoot and resolve complex data issues, including data sync errors, performance bottlenecks, and integration challenges.
. Provide support for data-related incidents and ensure effective resolution of production issues.
. Collaborate with data analysts, and other stakeholders to understand data needs and deliver effective solutions.
. Document data processes, models, and workflows, ensuring clear communication and knowledge sharing across teams.
. Independently assess situations, apply sound judgment and discretion, and make decisions on matters of significant impact without direct supervision
Perks and Benefits for Irisians
Iris provides world-class benefits for a personalized employee experience. These benefits are designed to support financial, health and well-being needs of Irisians for a holistic professional and personal growth. Click to view the benefits.
A strategic partner that transformational leaders can trust to realize the full potential of technology-enabled transformation.As a trusted technology partner, we focus our highly-experienced talent and rightsized teams to develop complex, mission-critical applications and solutions for leading enterprise across financial services, life sciences, including pharmaceutical, CROs and medical devices, manufacturing & logistics and educational services.
Job ID: 142581495